mlcommons / inference

Reference implementations of MLPerf™ inference benchmarks
https://mlcommons.org/en/groups/inference
Apache License 2.0

Question about Llama2-70b #1818

Open amine-ammor opened 3 months ago

amine-ammor commented 3 months ago

Hello, when downloading the processed dataset for llama2-70b with rclone, as specified in the "Get Dataset" section of "language/llama2-70b/README.md", I noticed the file "mlperf_log_accuracy.json" within the folder. Is this the accuracy file produced by running the benchmark on a DGX-H100 with FP32 precision? If not, is that file publicly accessible? Thank you, Regards,
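For context, the download step described above can be sketched with rclone. The remote name and bucket path below are placeholders, not the real ones; the exact `rclone config create` and `rclone copy` commands are listed in the README's "Get Dataset" section:

```shell
# Hypothetical sketch: the real remote name, endpoint, and bucket path
# come from language/llama2-70b/README.md ("Get Dataset" section).
rclone copy mlc-inference:<bucket>/<processed-dataset-path> ./processed-data -P

# Inspect what was downloaded; mlperf_log_accuracy.json ships
# alongside the processed dataset files.
ls processed-data
```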

Oseltamivir commented 2 weeks ago

Hi @amine-ammor, the mlperf_log_accuracy.json file comes from the accuracy run.

From https://github.com/mlcommons/inference_policies/blob/master/inference_rules.adoc:

For each benchmark, MLPerf will provide pointers to:

- An accuracy data set, to be used to determine whether a submission meets the quality target, and used as a validation set
- A speed/performance data set that is a subset of the accuracy data set to be used to measure performance

If you would like to change the precision of your model and you are running the NVIDIA implementation at https://github.com/mlcommons/inference_results_v4.0/tree/main/closed/NVIDIA, you can change it in the custom.py config file.
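As a rough illustration of where the precision setting lives, a custom.py entry in the NVIDIA implementation is a Python config class. The class name, decorator arguments, base class, and system name below are all hypothetical; match them against the generated custom.py and the existing configs in the repo before using them:

```python
# Hypothetical sketch of a custom.py system config entry for the
# NVIDIA implementation; names are illustrative, not authoritative.
from . import *  # the generated custom.py imports the benchmark's base configs

@ConfigRegistry.register(HarnessType.Custom, AccuracyTarget.k_99, PowerSetting.MaxP)
class MY_CUSTOM_SYSTEM(OfflineGPUBaseConfig):
    system = KnownSystem.my_custom_system
    # Precision is set per benchmark/system here; supported values
    # depend on the benchmark and hardware (e.g. "fp32", "fp16", "fp8").
    precision = "fp16"
```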

If this resolves your question, you can close the issue.