google / gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.
Apache License 2.0
5.76k stars 487 forks source link

7x compile time speedup: shard gemma.cc #288

Closed copybara-service[bot] closed 5 days ago

copybara-service[bot] commented 5 days ago

7x compile time speedup: shard gemma.cc

Use overloaded functions defined in gemma/instantiations. Also split out activations.h.