-
**Qwen2**
warning: not compiled with GPU offload support, --n-gpu-layers option will be ignored
warning: see main README.md for information on enabling GPU BLAS support
Log start
main: build = 2…
-
### Description
When tokenizing a text and decoding these tokens, one can see that tokenization now (as of version 0.14.0) adds one additional starting space to `text` for every call of `Context.Toke…
-
Hi Daniel,
Thank you so much for releasing such an awesome VLM fine-tuning notebook to the public!
I was really excited, tried the notebook out and found the following error:
![image](https:/…
-
David+Whitney
Alexis+Anacona
Tracey+Sazare
Atlas+Siluca
Damar+Aishela
Worapoj+Lalrinkimi
Hector+Harmonica
Takeshi+Kasumi
Wayne+Esmeralda
Arjen+Ketifa
Kanatbek+Lamia
Kazuhiro+Shinobu
Tugsts…
-
It's effectively being used broader than the VNG/GEMMA & Zaakgericht Werken context - it really is aimed at JSON-based OpenAPI 3 driven services.
Proposal: `oas3-client` (boring but carries the wei…
-
I tried some of the `web-ai-demos` on https://chrome.dev/, such as https://chrome.dev/web-ai-demos/perf-client-side-gemma-worker/
Some demos say that the model will take about 30s or 1 minute to lo…
-
The cookbook aims to provide a comprehensive guide for researchers and practitioners interested in fine-tuning the Gemma model from Google on a mental health assistant dataset.
Key components of th…
-
### 🚀 The feature, motivation and pitch
Gemma-2 and new Ministral models use alternating sliding window and full attention layers to reduce the size of the KV cache.
The KV cache is a huge inferen…
-
**Describe the bug**
git lfs pull --include gemma-2-9b-it-Q8_0_L.gguf
vs
git lfs pull gemma-2-9b-it-Q8_0_L.gguf (typed accidentally)
does not make it very clear how many files, or how much data …
-
Error (Xcode): Undefined symbols:
Error (Xcode): Linker command failed with exit code 1 (use -v to see invocation)