-
I am using Gemma-2B and it is not saving checkpoints at all. It hangs (no error, just waits forever). I am using 4 GPUs, and even though memory usage is very low (5 GB out of the 24 GB available per GPU)…
-
I am unable to reproduce the performance of the llama3 and gemma2 models implemented by Keras Hub on the GSM8K benchmark.
Paper ref: https://arxiv.org/pdf/2407.21783 and http…
-
@danielhanchen Hi Daniel, thanks for your work!
I'm getting an error just like the one in issue #275, but this time while trying to save a tuned version of unsloth/gemma-2-9b-it-bnb-4bit.
>> model.save_p…
-
When using model_worker with transformers to run the Gemma 2 9B model, it does not work correctly: the conversation template applied to the Gemma 2 model causes it to continue generating responses until model_worker is kil…
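A likely cause of this kind of runaway generation is a missing or mismatched end-of-turn stop token. As a minimal sketch (assuming the standard Gemma 2 chat markers `<start_of_turn>`/`<end_of_turn>`; the helper names here are illustrative, not part of model_worker):

```python
# Gemma 2's chat template delimits turns with <start_of_turn>/<end_of_turn>.
# If the serving layer applies the wrong template or never checks for the
# end-of-turn marker, the model keeps generating until the worker is killed.

END_OF_TURN = "<end_of_turn>"

def build_gemma2_prompt(user_message: str) -> str:
    """Format a single-turn Gemma 2 chat prompt (illustrative helper)."""
    return (
        f"<start_of_turn>user\n{user_message}{END_OF_TURN}\n"
        f"<start_of_turn>model\n"
    )

def truncate_at_stop(generated: str) -> str:
    """Cut the raw model output at the first end-of-turn marker, if any."""
    return generated.split(END_OF_TURN, 1)[0]
```

When generating with transformers directly, the equivalent fix is usually to pass the id of `<end_of_turn>` (via `tokenizer.convert_tokens_to_ids`) as `eos_token_id` to `generate` so decoding stops at the turn boundary.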
-
### Describe the bug
Not sure if this is a widespread issue, but as @osanseviero reported, sharing in https://huggingface.co/spaces/gokaygokay/Gemma-2-llamacpp is broken.
> I tried https://hugging…
-
Hi. Raising this issue as I am experiencing much slower inference times with Gemma-1 models.
> Environment:
> - xformers 0.0.26.post1 pypi_0 pypi
> - unsloth …
-
Hello again,
I am trying to generate the required files with generate-ds and train-supervised.
When I execute generate-ds, the coqa-related files are not generated, and with train-supervised I a…
-
# Problem
I have encountered many issues with incorrect model default settings (wrong prompt template, missing stop words, etc.).
e.g., comments in the Jan 0.5.7 Release Sign Off janhq/jan#3818…
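One way to catch this class of bug early is a sanity check over each model's default settings before release. A hypothetical sketch (the field names `prompt_template` and `stop` are illustrative, not Jan's actual config schema):

```python
# Illustrative validator for per-model default settings. It flags the two
# failure modes described above: a missing/empty prompt template and
# missing/empty stop words.

def validate_model_settings(settings: dict) -> list[str]:
    """Return a list of problems found in a model's default settings."""
    problems = []
    if not settings.get("prompt_template"):
        problems.append("prompt template missing or empty")
    if not settings.get("stop"):
        problems.append("stop words missing")
    return problems
```

Running a check like this over every bundled model config would surface bad defaults as a release-blocking list instead of user-reported conversation bugs.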
-
```python
from edsl import Model
import time
models_list = [['Austism/chronos-hermes-13b-v2', 'deep_infra', 0], ['BAAI/bge-base-en-v1.5', 'together', 1], ['BAAI/bge-large-en-v1.5', 'together', …
-
I have been experimenting with different models in fllama, specifically Gemma, Phi3, and QWEN 2, and I noticed significant differences in performance and response quality across these models:
Gemma…