Make tests for models that will use appropriate validation data set (e.g., SQUAD, hellaswag) to run model and compare output to expected outputs from data set.
This tests should be organized in a similar way to models/experimental/mistral/tests/test_perf_accuracy_mistral.py.
Here is list of models to make tests for
Make tests for models that will use appropriate validation data set (e.g., SQUAD, hellaswag) to run model and compare output to expected outputs from data set. This tests should be organized in a similar way to
models/experimental/mistral/tests/test_perf_accuracy_mistral.py
. Here is list of models to make tests forPriority:
Rest:
Don't do: