nod-ai / SHARK-Studio

SHARK Studio -- Web UI for SHARK+IREE High Performance Machine Learning Distribution
Apache License 2.0
1.42k stars 171 forks source link

Enable llama2 benchmarking with Turbine #2050

Open kuhar opened 10 months ago

kuhar commented 10 months ago

This is extension of the main Turbine refactoring work: https://github.com/nod-ai/SHARK/issues/1931. To enable future performance-related work, we should recreate the 1.0 benchmarking mode from vicuna.py:

Enablement

Correctness

Performance

kuhar commented 10 months ago

cc: @antiagainst @harsh-nod