aws-samples / foundation-model-benchmarking-tool

Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.
https://aws-samples.github.io/foundation-model-benchmarking-tool/
MIT No Attribution
182 stars 27 forks source link

support for llama3.1 on neuron + use_messages_api metadata addition #164

Closed madhurprash closed 1 month ago

madhurprash commented 1 month ago

Contains the following files and changes:

  1. Config files for llama3-8b, 70 instruct on Neuron on an ml.inf2.48xl
  2. use_messages_api addition