-
Hugging Face hub login successful
Used the gemma2-27b LLM for testing:
```
cargo run --release -- -m "google/gemma-2-27b-it" -c
Finished release [optimized] target(s) in 0.03s
Running `target/re…
```
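For context, a minimal sketch of the hub login step (assuming the `huggingface_hub` Python client; the token value is a placeholder), which has to succeed before a gated model like google/gemma-2-27b-it can be downloaded:

```python
from huggingface_hub import login

# Gated repos such as google/gemma-2-27b-it require authentication before
# download; the token below is a placeholder, not a real credential.
login(token="hf_xxx")
```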
-
What is the maximum input token count for [bge-multilingual-gemma2](https://huggingface.co/BAAI/bge-multilingual-gemma2) and [bge-reranker-v2.5-gemma2-lightweight](https://huggingface.co/BAAI/bge-reranker-v2.5-gemma2-lightweight)?
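A minimal sketch to check this programmatically (assuming both repos expose a standard config with `max_position_embeddings`; `trust_remote_code` is needed for the reranker's custom modeling code):

```python
from transformers import AutoConfig

for repo in ("BAAI/bge-multilingual-gemma2",
             "BAAI/bge-reranker-v2.5-gemma2-lightweight"):
    cfg = AutoConfig.from_pretrained(repo, trust_remote_code=True)
    # Gemma-2-based models report their context window here.
    print(repo, cfg.max_position_embeddings)
```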
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as…
-
### Your current environment
- vLLM (CPU): v0.6.0
- Hardware: Intel(R) Xeon(R) Platinum 8480+ CPU
- Model: google/gemma-2-2b
### 🐛 Describe the bug
vLLM v0.6.0 (CPU) is throwing the below erro…
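Since the traceback is cut off, here is a minimal repro sketch (assuming the standard offline inference API; the prompt and sampling values are illustrative):

```python
from vllm import LLM, SamplingParams

# Assumed invocation on the CPU build of vLLM v0.6.0; the original error
# output is truncated above, so this only shows what triggers it.
llm = LLM(model="google/gemma-2-2b")
outputs = llm.generate(["Hello"], SamplingParams(max_tokens=16))
print(outputs[0].outputs[0].text)
```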
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTo…
```
-
### What happened?
If you pass the `tfs_z` param to the server, it sometimes crashes.
Starting the server:
```
~/test/llama.cpp/llama-server -m /opt/models/text/gemma-2-27b-it-Q8_0.gguf --verbose
`…
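For reference, a hypothetical client call that exercises the parameter (the payload shape is an assumption based on llama-server's `/completion` endpoint; port 8080 is the default):

```python
import requests

# Send tfs_z to the llama-server instance started above.
resp = requests.post(
    "http://127.0.0.1:8080/completion",
    json={"prompt": "Hello", "n_predict": 16, "tfs_z": 0.95},
)
print(resp.status_code, resp.json())
```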
-
Hi.
I'm an early adopter of unsloth, and my recent experiments with the library produced unexpected latency results.
I followed the official notebooks and got the following results while fine-tuning…
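The results themselves are truncated above; for reference, a minimal latency-measurement sketch in the spirit of those notebooks (the checkpoint and sequence length are assumptions, not the exact notebook settings):

```python
import time
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/gemma-2-9b-bnb-4bit",  # assumed checkpoint
    max_seq_length=2048,
    load_in_4bit=True,
)
FastLanguageModel.for_inference(model)  # enable unsloth's fast inference path

inputs = tokenizer("Hello, world", return_tensors="pt").to(model.device)
start = time.perf_counter()
model.generate(**inputs, max_new_tokens=64)
print(f"generation latency: {time.perf_counter() - start:.2f}s")
```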
-
### Model description
bge-reranker-v2.5-gemma2-lightweight performs better than bge-m3 :)
Please support this model.
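For illustration, a loading sketch (the repo ships custom modeling code, so `trust_remote_code` is required; the full lightweight-reranking scoring recipe is on the model card):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "BAAI/bge-reranker-v2.5-gemma2-lightweight"
tokenizer = AutoTokenizer.from_pretrained(repo, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(repo, trust_remote_code=True)
```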
### Open source status
- [ ] The model implementation is available
- [X] The …
-
### System Info
Ubuntu 22.04.4 LTS
python 3.10
transformers 4.43.0
cuda 12.0
torch 2.3.0
vllm 0.4.3
### Running Xinference with Docker?
- [ ] docker / docke…
-
### Feature request
Hi,
Is it possible to enable flash attention for PaliGemma models?
### Motivation
This feature is required to speed up inference with PaliGemma VLMs.
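A minimal sketch of what this could look like (requesting FlashAttention-2 via `attn_implementation`; the checkpoint is assumed for illustration, and whether PaliGemma accepts this flag depends on the installed transformers version):

```python
import torch
from transformers import PaliGemmaForConditionalGeneration

model = PaliGemmaForConditionalGeneration.from_pretrained(
    "google/paligemma-3b-pt-224",  # assumed checkpoint
    torch_dtype=torch.bfloat16,
    attn_implementation="flash_attention_2",
).to("cuda")
```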
### Your contribution
…