-
🔍 **Problem Description**:
It is a simple project for predicting text and quotes in which ideas from the Lord God suggest methods to overcome the problems of our daily life. It will produce NLP sentences …
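For illustration, a minimal sketch of how such quote/sentence generation could be wired up with the Hugging Face `pipeline` API; the model name and prompt below are placeholders, not part of the original project:

```python
from transformers import pipeline

# Placeholder model: any causal LM fine-tuned on quote/devotional text could be substituted.
generator = pipeline("text-generation", model="gpt2")

prompt = "A thought to help overcome worry in daily life:"
outputs = generator(prompt, max_new_tokens=40, num_return_sequences=1)
print(outputs[0]["generated_text"])
```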
-
### What is the issue?
gemma2:9b-instruct-**q6_K** : gemma2:9b-instruct-**q8_0** = **21** t/s : **25** t/s
mistral-nemo:12b-instruct-2407-**q6_K** : mistral-nemo:12b-instruct-2407-**q8_0** = **17** t/s…
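For context, tokens-per-second figures like these can be derived from Ollama's `/api/generate` response fields (`eval_count`, `eval_duration`); a rough sketch, assuming a default local Ollama install and one of the model tags above:

```python
import requests

# Assumes the Ollama server is listening on the default port 11434.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "gemma2:9b-instruct-q8_0", "prompt": "Hello", "stream": False},
).json()

# eval_duration is reported in nanoseconds.
tokens_per_second = resp["eval_count"] / (resp["eval_duration"] / 1e9)
print(f"{tokens_per_second:.1f} t/s")
```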
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as…
-
Recent changes in Hugging Face Transformers (https://github.com/huggingface/transformers/commit/cdee5285cade176631f4f2ed3193a0ff57132d8b and https://github.com/huggingface/transformers/commit/4a3f1a686…
-
### System Info
`transformers==4.46.1`
`python==3.10.14`
### Who can help?
@muellerzr @SunMarc @ArthurZucker
### Information
- [X] The official example scripts
- [ ] My own modified scripts
##…
-
Thank you for releasing a great project.
I measured [`genai-perf`](https://docs.nvidia.com/deeplearning/triton-inference-server/user-guide/docs/perf_analyzer/genai-perf/docs/tutorial.html#profile-g…
-
### 🚀 The feature, motivation and pitch
How to run the int4 quantized version of the gemma2-27b model
### Alternatives
_No response_
### Additional context
_No response_
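Not an official answer, but a minimal sketch of one common way to load the model in 4-bit via bitsandbytes; the checkpoint name `google/gemma-2-27b-it` and the NF4 settings are assumptions, not confirmed by this issue:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# Assumption: bitsandbytes NF4 quantization is acceptable as the "int4" variant.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model_id = "google/gemma-2-27b-it"  # example checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
)

inputs = tokenizer("Why is the sky blue?", return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```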
-
### System Info / 系統信息
Ubuntu 22.04.4 LTS
python 3.10
transformers 4.43.0
cuda 12.0
torch 2.3.0
vllm 0.4.3
### Running Xinference with Docker? / 是否使用 Docker 运行 Xinference?
- [ ] docker / docke…
-
Just opening this to add support for all models following #34184
Let's bring support to all models! 🤗
- [x] Llama
It would be great to add support for more architectures, such as
- [ ] Qwe…
-
Hi @danielhanchen
I am trying to fine-tune gemma2-2b for my task following the Unsloth guidelines for continued fine-tuning. However, I am facing OOM while doing so. My intent is to train gemm…
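A rough sketch of the memory-reducing options typically tried first in this situation (4-bit loading, LoRA, gradient checkpointing); the checkpoint name and hyperparameters below are assumptions for illustration, not values from this issue:

```python
from unsloth import FastLanguageModel

# Assumed checkpoint name; loading in 4-bit keeps the base weights small.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/gemma-2-2b",  # placeholder; adjust to the actual checkpoint
    max_seq_length=2048,
    load_in_4bit=True,
)

# LoRA adapters plus Unsloth's gradient checkpointing to cut activation memory.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,                                  # smaller LoRA rank uses less memory
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    use_gradient_checkpointing="unsloth",
)
```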