-
### What happened?
Garbled output like "Mh giàu され rodas reliablyacheteurδε Są" appears when using a quantized K cache on CUDA with Gemma 2. Here's how to reproduce:
./llama-server -m "Gemma-2-9B-It-SPPO-I…
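For reference, a sketch of the kind of invocation involved (the model filename and quantization type below are assumptions, not the exact command above, which is truncated): llama.cpp's `-ctk`/`--cache-type-k` flag is what selects a quantized K cache.

```shell
# Hypothetical reproduction sketch -- model path and cache type are
# placeholders, not the reporter's exact values.
MODEL="gemma-2-9b-it.gguf"                      # hypothetical filename
CMD="./llama-server -m $MODEL -ngl 99 -ctk q4_0"  # -ctk quantizes the K cache
echo "$CMD"
```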
-
Hey there! Super cool project. I thought I'd add some of the (yet to be documented) steps I took to get the application working on my MacBook Pro with an M1 chip.
I did not use the Docker image …
-
### System Info
Name: transformers
Version: 4.45.0.dev0
Name: trl
Version: 0.8.6
### Information
- [ ] The official example scripts
- [X] My own modified scripts
### Tasks
- [ ] A…
-
Having the ability to use the API with paid services is cute and all,
but can we have a local-only option?
Nobody wants to pay for these services anymore, especially as Llama 3.1 blew them away with costly tie…
-
### Self Checks
- [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general).
- [X] I have s…
-
Here on GitHub it says that it uses GPT-4o, but when testing the tool, it responds as GPT-3.5 Turbo. Do I need to configure something to use 4o? Thank you very much!
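I don't know this tool's internals, but if it wraps the OpenAI chat completions API, the model actually used is whatever the `model` field of the request says — a config default of `gpt-3.5-turbo` somewhere would override what the README claims. A minimal sketch (the helper name and default are assumptions, not the tool's real code):

```python
# Sketch: with the OpenAI chat completions API, the model served is taken
# from the "model" field of each request, regardless of what docs say.
def build_chat_request(prompt: str, model: str = "gpt-4o") -> dict:
    """Build a chat-completions payload pinned to a specific model."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }

req = build_chat_request("Which model are you?")
print(req["model"])  # -> gpt-4o
```

So the thing to check is where the tool's configuration sets this field.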
-
I am trying to run it on an Ubuntu system with local Ollama installed, but I'm facing three issues:
1. The code is unable to create/pull the Docker image.
2. It is using only the CPU (not the GPU…
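On the GPU point, containers only see NVIDIA GPUs when Docker is started with `--gpus all` and the NVIDIA Container Toolkit is installed on the host. On the Ollama side, a minimal stdlib sketch of the payload its `/api/generate` endpoint expects (host and model tag are assumptions; nothing is actually sent here):

```python
import json

# Sketch of a request body for Ollama's /api/generate endpoint.
# Host and model tag are assumptions about the local setup.
OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default port

payload = {
    "model": "llama3.1",   # hypothetical local model tag
    "prompt": "Say hello.",
    "stream": False,       # ask for a single JSON response, not a stream
}
body = json.dumps(payload).encode("utf-8")
print(body.decode("utf-8"))
```

If a plain request like this works outside Docker but not inside it, the container likely cannot reach the host's Ollama port.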
-
I am using AutoModelForSequenceClassification to do classification with a large model. Can I use this library, and how should I use it?
Additionally, if my output is only one token and I do batch inference, w…
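On the batching point: a sequence-classification model returns one logit vector per sequence, so batching only requires padding the sequences to a common length and supplying an attention mask so the pad positions are ignored. A minimal stdlib sketch of that padding step (token ids and pad id are made up for illustration):

```python
# Sketch: right-pad token-id sequences to a common length and build the
# attention mask a batched classifier forward pass expects.
def pad_batch(seqs, pad_id=0):
    max_len = max(len(s) for s in seqs)
    input_ids = [s + [pad_id] * (max_len - len(s)) for s in seqs]
    attention_mask = [[1] * len(s) + [0] * (max_len - len(s)) for s in seqs]
    return input_ids, attention_mask

ids, mask = pad_batch([[5, 6, 7], [8, 9], [10]])
print(ids)   # [[5, 6, 7], [8, 9, 0], [10, 0, 0]]
print(mask)  # [[1, 1, 1], [1, 1, 0], [1, 0, 0]]
```

In practice the tokenizer's `padding=True` option does this for you; the sketch just shows what the batch looks like.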
-
### Prerequisites
- [X] I have read the [documentation](https://hf.co/docs/autotrain).
- [X] I have checked other issues for similar problems.
### Backend
Local
### Interface Used
CLI
### CLI Co…
-
Hello,
While running chat-ui and trying some models, I had no problems with Phi-3 and Llama, but when I run Gemma 2 in vLLM I'm not able to make any successful API request.
in env.local:
{
"name": "google/g…