Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.
182
stars
27
forks
source link
support for llama3.1 on neuron + use_messages_api metadata addition #164
Closed
madhurprash closed 1 month ago
Contains the following files and changes: