google / gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.
Apache License 2.0
5.94k stars 502 forks source link

Extends Transformer() to prepare for batched processing. #238

Closed copybara-service[bot] closed 3 months ago

copybara-service[bot] commented 3 months ago

Extends Transformer() to prepare for batched processing.