-
### Have I written custom code (as opposed to using a stock example script provided in MediaPipe)
Yes
### OS Platform and Distribution
Android 14
### Mobile device if the issue happens on …
-
I get the following error when I try to do a search (especially when in Expert mode):
```
500: {"error":"json: cannot unmarshal string into Go struct field ChatRequest.messages of type api.ToolCallF…
```
-
Version: llama-cpp-python==0.2.82
Model: "bartowski/gemma-2-9b-it-GGUF/gemma-2-9b-it-Q8_0.gguf"
When I load the gemma2 model with temperature=0, and run a simple prompt, it always gives the same o…
-
### What is the issue?
After upgrading ollama from 0.20 to 0.27, gemma 2 9b runs very slowly. I don't think the OS is out of VRAM, since gemma 2 only uses 6.8G (q_4_0) of VRAM while my lapto…
-
### Which API Provider are you using?
OpenAI Compatible
### Which Model are you using?
llama-3.2-3b-preview to gemma2-9b-it
### What happened?
I just changed from llama-3.2-3b-preview to gem…
-
## Description
I am encountering a timeout error when running the following code on macOS. The error occurs approximately 10 seconds after the request is made. I would like to know if there is a wa…
-
Hello, how should I set the decoding parameters (e.g., temperature) for Gemma-2? My result is about 50.0, far from the benchmark of 76.
-
## Describe the bug
When running this command RUST_BACKTRACE=full CUDA_LAUNCH_BLOCKING=1 target/release/mistralrs-server -i --isq Q4K -n "1:16;2:16;3:10" --no-paged-attn plain -m google/gemma-2-9b-i…
-
### System Info
python version: 3.11.9
transformers version: 4.44.2
accelerate version: 0.33.0
torch version: 2.4.0+cu121
### Who can help?
@gante
### Information
- [X] The official example sc…