google / gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.
Apache License 2.0
5.76k stars 487 forks source link

Use benchmark_helper in py bindings (adds BOS) #282

Closed copybara-service[bot] closed 6 days ago

copybara-service[bot] commented 6 days ago

Use benchmark_helper in py bindings (adds BOS)

Also remove thread clamp (OK to be zero or large).