Closed: Anindyadeep closed 2 months ago
This PR brings in all the changes from PR https://github.com/premAI-io/benchmarks/pull/167 and applies them to ONNX Runtime with HF Optimum. The ONNX Runtime LLM README now has a quality checks table for both Llama 2 Chat and Mistral Instruct.
LGTM. @Anindyadeep, just resolve the conflicts.
Done