-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/axolotl-ai-cloud/axolotl/labels/bug) and didn't find any similar reports.
###…
-
I am performing a mega-merge with LLaMA 3.2 3B, combining the base model with a fine-tuned/instruction-tuned variant using the DARE linear method. Following the successful completion of the initial merge, I encoun…
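A merge like the one described above could be sketched as a mergekit configuration, assuming mergekit's `dare_linear` merge method is the one in use. The model paths, weights, and densities below are illustrative placeholders, not values from this report:

```yaml
# Hypothetical mergekit config for a DARE linear merge of a LLaMA 3.2 3B
# base model with an instruction-tuned variant. All paths and parameter
# values are placeholders.
models:
  - model: meta-llama/Llama-3.2-3B            # base model
  - model: meta-llama/Llama-3.2-3B-Instruct   # fine-tuned/instruction-tuned model
    parameters:
      weight: 0.5      # contribution of this model's delta to the merge
      density: 0.5     # fraction of delta parameters kept (rest are dropped and rescaled)
merge_method: dare_linear
base_model: meta-llama/Llama-3.2-3B
dtype: bfloat16
```

Run with `mergekit-yaml config.yml ./merged-model` if using the mergekit CLI.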
-
### System Info
- `transformers` version: 4.44.2
- Platform: Linux-5.10.220-209.869.amzn2.x86_64-x86_64-with-glibc2.26
- Python version: 3.10.14
- Huggingface_hub version: 0.25.1
- Safetensors ve…
-
### Your current environment
vllm version: 0.6.3.post1
### Model Input Dumps
_No response_
### 🐛 Describe the bug
I see on the official Gemma page, https://huggingface.co/google/gemma-2b, cont…
-
Env: torch 2.4, CUDA 12.4, unsloth main.
Below is the code that errored:
```python
from unsloth import FastLanguageModel
import torch

model_id = "unsloth/gemma-2-2b-it-bnb-4bit"
model, tokenizer = FastLanguageM…
```
-
Hello. Please add support for Google's open source Gemma AI, which was launched this week.
It comes in two sizes, 2b and 7b.
Both are great, and the 2b version can easily run on both mobile and desktop with…
-
Running lightrag_ollama_demo.py with gemma:2b or lightrag_openai_compatible_demo.py with qwen2.5-3b-instruct will emit a lot of this:
```
Exception ignored in:
Traceback (most recent call last):
…
```
-
Hello, I have tried your method on the gemma-7b model. I found that it works on the GSM-8K dataset but fails on WikiText-2. This is my training log:
```
[WARNING|logging.py:329] 2…
```
-
### What happened?
Model: https://huggingface.co/bartowski/gemma-2-27b-it-GGUF
AMD GPU: RX 7600 XT + RX 7600 (full offload)
With IQ3_M I get about 10 t/s, while IQ4_XS reaches nearly 15 t/s.
I thought …
-
# Fix for gemma-2-9b - run with bfloat16
![image](https://github.com/ObrienlabsDev/machine-learning/assets/24765473/4e149bf2-e84e-48a8-b3bc-1939d1543f66)
https://huggingface.co/google/gemma…
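A plausible reason bfloat16 fixes gemma-2-9b here (my interpretation, not stated in the report): float16 has a narrow exponent range and overflows to `inf` for large activation values, while bfloat16 shares float32's 8-bit exponent and can represent them at reduced precision. A minimal sketch of the difference; the value `1e5` is just an illustrative magnitude:

```python
import torch

# float16 tops out near 65504, so larger values overflow to inf;
# bfloat16 keeps float32's exponent width and stays finite.
x = torch.tensor(1e5)
print(x.to(torch.float16))   # inf: overflows float16's range
print(x.to(torch.bfloat16))  # finite: fits bfloat16's range
```

In practice this is why `torch_dtype=torch.bfloat16` (rather than float16) is the commonly recommended dtype when loading gemma-2 models.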