google / gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.
Apache License 2.0
5.76k stars 487 forks

Add config for att/final cap, skip max-subtract. Fixes #278 #279

Closed copybara-service[bot] closed 1 week ago

copybara-service[bot] commented 1 week ago

Add config for att/final cap, skip max-subtract. Fixes #278

Also update includes/deps for backprop/.
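
The description above is terse, so here is a rough C++ sketch of what a configurable cap and a skipped max-subtract might look like. It assumes that "cap" means tanh soft-capping of logits and that "max-subtract" means the usual subtraction of the maximum before exp in softmax; this PR text confirms neither. CapConfig, SoftCap, and Softmax are hypothetical names for illustration, not gemma.cpp's actual API.

```cpp
#include <algorithm>
#include <cmath>
#include <vector>

// Hypothetical config; field names are illustrative, not taken from gemma.cpp.
struct CapConfig {
  float att_cap = 0.0f;    // cap for attention logits; 0 disables capping
  float final_cap = 0.0f;  // cap for final (output) logits; 0 disables capping
};

// Assumed meaning of "cap": squash x smoothly into (-cap, cap) via tanh.
// A cap <= 0 leaves x unchanged.
inline float SoftCap(float x, float cap) {
  if (cap <= 0.0f) return x;
  return cap * std::tanh(x / cap);
}

// In-place softmax. When skip_max_subtract is true, the maximum is not
// subtracted before exp; this is only safe if the caller guarantees the
// inputs are bounded (e.g. already soft-capped) so exp cannot overflow.
inline void Softmax(std::vector<float>& logits, bool skip_max_subtract) {
  if (logits.empty()) return;
  float max = 0.0f;
  if (!skip_max_subtract) {
    max = *std::max_element(logits.begin(), logits.end());
  }
  float sum = 0.0f;
  for (float& v : logits) {
    v = std::exp(v - max);
    sum += v;
  }
  for (float& v : logits) v /= sum;
}
```

The reasoning behind the sketch: once logits are confined to (-cap, cap) with a moderate cap, exp stays well within float range, so the max subtraction that normally guards against overflow becomes optional. Whether the PR relies on exactly this argument is an assumption.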