issues
search
google
/
gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
Apache License 2.0
5.94k
stars
502
forks
source link
Code cleanup
#264
Closed
copybara-service[bot]
closed
3 months ago
copybara-service[bot]
commented
3 months ago
Code cleanup
Simplify template arg list, enable deduction
missing hn:: on " Lanes"
1.0f suffix
move RMSNormBatched into ops.h
static constexpr -> constexpr
concrete type instead of LayerT, WeightArrayT
inline GetWeights
remove if (runtime_config.verbosity
merge AllocatePrefill and AllocateDecode
remove bf_ffw_hidden
Code cleanup