google / gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.
Apache License 2.0
5.8k stars 491 forks source link

Simplifications: remove GemmaInterface and GemmaImpl #209

Closed copybara-service[bot] closed 1 month ago

copybara-service[bot] commented 1 month ago

Simplifications: remove GemmaInterface and GemmaImpl

Split common and weights into separate lib Remove common-inl (does not have to be SIMD code), activations.cc Centralize switch(Model) to avoid duplication Move CompressWeightsT to compress_weights.cc Move LoadWeights to weights.cc