tenstorrent / tt-metal

:metal: TT-NN operator library, and TT-Metalium low level kernel programming model.
Apache License 2.0

[On Hold] Tests with input dataset #4351

Open boris-drazic opened 6 months ago

boris-drazic commented 6 months ago

Make model tests that run each model on an appropriate validation dataset (e.g., SQuAD, HellaSwag) and compare its output to the expected outputs from that dataset. These tests should be organized in a similar way to models/experimental/mistral/tests/test_perf_accuracy_mistral.py. Here is the list of models to make tests for:

Priority:

Rest:

Don't do:
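
As a rough illustration of the kind of test described above, here is a minimal sketch. It does not reproduce test_perf_accuracy_mistral.py; the names `exact_match_accuracy`, `toy_model`, and `TOY_VALIDATION_SET`, as well as the 0.6 threshold, are hypothetical placeholders. A real test would load the actual model and a real validation split instead.

```python
from typing import Callable, Iterable, Tuple


def exact_match_accuracy(
    predict: Callable[[str], str],
    samples: Iterable[Tuple[str, str]],
) -> float:
    """Return the fraction of samples whose prediction matches the expected answer.

    Comparison ignores case and surrounding whitespace.
    """
    total = 0
    correct = 0
    for prompt, expected in samples:
        total += 1
        if predict(prompt).strip().lower() == expected.strip().lower():
            correct += 1
    if total == 0:
        raise ValueError("validation set is empty")
    return correct / total


# Hypothetical stand-in for a validation dataset: (input, expected output) pairs.
TOY_VALIDATION_SET = [
    ("2+2", "4"),
    ("capital of France", "Paris"),
    ("opposite of hot", "cold"),
]


def toy_model(prompt: str) -> str:
    """Hypothetical stand-in for a model under test; gets one answer wrong."""
    answers = {
        "2+2": "4",
        "capital of France": "paris",
        "opposite of hot": "warm",
    }
    return answers.get(prompt, "")


def test_accuracy_meets_threshold():
    # Assumed threshold; a real test would use the model's reference accuracy.
    accuracy = exact_match_accuracy(toy_model, TOY_VALIDATION_SET)
    assert accuracy >= 0.6
```

Saved in a test file, this would be collected by pytest like any other test function. Exact match is only one possible metric; tasks such as SQuAD are usually also scored with F1.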

Sudharsan-V commented 6 months ago

Please find the independent tickets/issues for the models below:

bloom - #4374
t5 - #4407
whisper - #4427
distilbert - #4482
mnist - #4431
ssd - #4506
tr-ocr - #4486
Roberta - #4556
Swin - #4557
Diet - #4609
EfficientNet - #4610
ViT - #4611
LeNet - #4623
Yolov3 - #4622
Yolov5 - #4668
llama - #4717

saichandax commented 5 months ago

Currently, only 5 models are still pending, at different stages of completion.

Work will resume once the Stable Diffusion tasks (#4765) are done.