issues
search
google
/
gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
Apache License 2.0
5.76k
stars
487
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Implement a missing (bf16, f32) tiled MatMul kernel.
#245
copybara-service[bot]
closed
3 weeks ago
0
Internal change.
#244
copybara-service[bot]
closed
3 weeks ago
1
Integrate matmul into FFW: 4.3x prefill speedup
#243
copybara-service[bot]
closed
3 weeks ago
0
Reduce duplication in Config* by inheriting no-SSM
#242
copybara-service[bot]
closed
3 weeks ago
0
Added MatMul_4x4_Batch which is MatMul_4x4, but with the first template arg moved to the first function arg, so the batch size (num A rows) can be variable at run-time.
#241
copybara-service[bot]
closed
3 weeks ago
0
Major duplicated code reduction in test/benchmarks
#240
copybara-service[bot]
closed
3 weeks ago
0
Tiny cleanup: distinguish between "ids" and "pieces" in argument names when encoding.
#239
copybara-service[bot]
closed
3 weeks ago
0
Extends Transformer() to prepare for batched processing.
#238
copybara-service[bot]
closed
3 weeks ago
0
Support mixed (bf16, sfp) tiled MatMul. Same sfp-decompress strategy as in (f32,
#237
copybara-service[bot]
closed
3 weeks ago
1
Fix numerical issue in Softcap by subtracting max.
#236
copybara-service[bot]
closed
3 weeks ago
1
Fix numerical issue in Softcap by subtracting max.
#235
copybara-service[bot]
closed
3 weeks ago
0
Add benchmark dependency to cmake build.
#234
szabadka
closed
3 weeks ago
0
Increase parallelism in ops_test
#233
copybara-service[bot]
closed
3 weeks ago
1
Add internal initialization code to debug_prompt.
#232
copybara-service[bot]
closed
3 weeks ago
0
Implement float * SfpStream matmul by decompressing 4 * kColsA_RowsB -sized chunks of the second matrix.
#231
copybara-service[bot]
closed
3 weeks ago
1
Update AssertClose for large matrices and add large matrix test
#230
copybara-service[bot]
closed
3 weeks ago
0
Updated benchmarks.cc to recent changes to Gemma API.
#229
copybara-service[bot]
closed
3 weeks ago
1
Add compression/ comments, especially on SFP range
#228
copybara-service[bot]
closed
3 weeks ago
0
Use Loader/AppArgs to construct gemma_test model, simplify AcceptFunc
#227
copybara-service[bot]
closed
3 weeks ago
0
Adds simple-loop versions of missing batched functions.
#226
copybara-service[bot]
closed
3 weeks ago
0
Update benchmark with internal init
#225
copybara-service[bot]
closed
4 weeks ago
0
Use CompressedWeights<TConfig<float>> in backpropagation.
#224
szabadka
closed
4 weeks ago
0
Fix for transpose matrix creation and additional tests
#223
copybara-service[bot]
closed
4 weeks ago
0
Add CPU output, error if not C++17, simplify tokenizer ctor
#222
copybara-service[bot]
closed
4 weeks ago
0
OFF Topic, Request for Open-Sourcing Google Gemini Flash
#221
0wwafa
opened
1 month ago
20
Shifting large matrix init to heap in ops_test.cc
#220
copybara-service[bot]
closed
1 month ago
0
Support all weight types in a single binary.
#219
copybara-service[bot]
closed
1 month ago
0
Small code cleanup suggestions while reading the code.
#218
copybara-service[bot]
closed
1 month ago
0
Add support for custom sampling function to runtime config.
#217
szabadka
closed
1 month ago
0
Fix fix for weight type define, refs #198
#216
copybara-service[bot]
closed
1 month ago
0
Fix reference to GEMMA_WEIGHT_T. Refs #198
#215
copybara-service[bot]
closed
1 month ago
0
Toward only using compressed weights:
#214
copybara-service[bot]
closed
1 month ago
0
Fix Softmax on SVE
#213
copybara-service[bot]
closed
1 month ago
0
Add Adam optimizer.
#212
szabadka
closed
1 month ago
0
Internal experiment
#211
copybara-service[bot]
closed
4 weeks ago
0
Implement mixed mode matmul: f32 * bf16
#210
copybara-service[bot]
closed
1 month ago
1
Simplifications: remove GemmaInterface and GemmaImpl
#209
copybara-service[bot]
closed
1 month ago
0
Remove no longer required stats.h - use Highway version instead
#208
copybara-service[bot]
closed
1 month ago
0
revert back to HWY_ASSERT for lane constraints, qualify hn::Add
#207
copybara-service[bot]
closed
1 month ago
0
Fix for GenerateZeroMat call in TestTiledMatMul
#206
copybara-service[bot]
closed
1 month ago
0
Add bf16 matmul support, update naming+test
#205
copybara-service[bot]
closed
1 month ago
0
Use system topology to pin threads across clusters.
#204
copybara-service[bot]
closed
1 month ago
0
Add first version of backpropagation support.
#203
szabadka
closed
1 month ago
1
Refactor GemmaImpl dispatch to use Highway 1.2's HWY_DYNAMIC_DISPATCH_T
#202
copybara-service[bot]
closed
1 month ago
0
Update to Highway 1.2 for topology/VQSelect
#201
copybara-service[bot]
closed
1 month ago
0
static_assert shape constraints in MatMul 4x4
#200
copybara-service[bot]
closed
1 month ago
0
Unrolled / tiled 4x4 MatMul
#199
copybara-service[bot]
closed
1 month ago
0
Gemma.cpp hangs on a Gemma 7B model that was finetuned using huggingface peft(QLoRA)
#198
webbigdata-jp
opened
1 month ago
13
Compilation fails for raspberry pi
#197
EphemeralSapient
closed
1 month ago
2
gemma.cc:1322: Failed to load model weight
#196
ordentid
closed
1 month ago
6
Previous
Next