-
### What happened?
I am attempting to measure the perplexity of the gemma-2-9b-it-Q4_K_M.gguf model using llama.cpp. However, I encounter an issue where the process gets stuck at the "tokenizing th…
-
### Feature Request
Gemma2 support
👉👉👉[My Bilibili channel](https://space.bilibili.com/3493277319825652)
👉👉👉[My YouTube channel](https://www.youtube.com/@AIsuperdomain)
### Motivation
Gemma2 support
👉👉👉[My Bilibili chann…
-
Hi @hadley, thanks for sharing this, really exciting.
Very nice to see support for open models via ollama. I wonder if you would consider adding support for VLLM-hosted models as well, e.g. see ht…
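For context, vLLM serves models behind an OpenAI-compatible HTTP API, so a client mostly needs a configurable base URL; a minimal sketch with the `openai` Python client (the server URL, API key, and model name are placeholders):

```python
from openai import OpenAI

# A vLLM server (started with `vllm serve <model>`) exposes an OpenAI-compatible
# endpoint, so the stock client works once pointed at the server's base URL.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")  # placeholders

response = client.chat.completions.create(
    model="meta-llama/Meta-Llama-3-8B-Instruct",  # placeholder model name
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
```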
-
### What happened?
When using llama.cpp models (e.g., granite-code and llama3) with Nvidia GPU acceleration (nvidia/cuda:12.6.1-devel-ubi9 and RTX 3080 10GB VRAM), the models occasionally return nons…
-
### Feature request
add gemma2
### Motivation
_No response_
### Other
_No response_
-
### What is the issue?
Hi,
Error: cudaMalloc failed: out of memory
### OS
Windows
### GPU
Nvidia
### CPU
Intel
### Ollama version
0.3.8
-
Why does initialization succeed when running the compressed gemma2-2b-it model on Android, but then fail with org.apache.tvm.Base$TVMError: TVMError: Assert fail: rotary_mode_code == 0, gemma2_q4f16_1_ bat…
-
### What happened?
The `lm_head` layer for a [Gemma2](https://huggingface.co/google/gemma-2-2b) LoRA adapter is not converted by `convert_lora_to_gguf.py`, and therefore not applied at inference (r…
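As a quick sanity check before digging into the converter, one can inspect the PEFT adapter config to see whether `lm_head` is listed at all; a minimal sketch (the adapter path is a placeholder):

```python
import json
from pathlib import Path

adapter_dir = Path("./gemma2-lora-adapter")  # placeholder path to the PEFT adapter
config = json.loads((adapter_dir / "adapter_config.json").read_text())

# LoRA-trained layers appear in target_modules; fully fine-tuned layers such as
# lm_head are sometimes listed in modules_to_save instead.
targets = config.get("target_modules") or []
saved = config.get("modules_to_save") or []
modules = (targets if isinstance(targets, list) else [targets]) + \
          (saved if isinstance(saved, list) else [saved])
print("adapter modules:", modules)
print("lm_head present:", any("lm_head" in str(m) for m in modules))
```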
-
### System Info
cuda: 12.6
transformers: 4.44.0
OS: Windows 10
python: 3.11.4
ollama: 0.3.8 & 0.2.3
Configuration: RTX 3090, 12700KF
### Who can help?
_No response_
### Information…
-
### Description
With certain language models, the output contains stray Markdown code-fence markers (```); a minimal post-processing sketch follows the environment details below.
### Environment
- **Operating System:** Fedora Workstation 40
- **Node.js Version:** …
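A common client-side workaround is to strip a wrapping fence before rendering the response; a minimal sketch of the idea (the fence pattern is an assumption about the observed output, and the affected project is Node.js-based, so this is only illustrative):

```python
import re

def strip_wrapping_fence(text: str) -> str:
    """Remove a single Markdown code fence (``` or ```lang) wrapping the whole text."""
    match = re.match(r"^```[^\n]*\n(.*?)\n?```\s*$", text, flags=re.DOTALL)
    return match.group(1) if match else text

# Example: a model reply wrapped in an unwanted fence
print(strip_wrapping_fence("```json\n{\"ok\": true}\n```"))
```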