-
Hi, I fine-tuned MGM-2B on COCO, but I got this warning:
`{'loss': 6.9221, 'grad_norm': tensor(18.7422, device='cuda:0', dtype=torch.float64), 'learning_rate': 9.203084832904885e-06, 'epoch': 0.01}…
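If you post-process these logs, note that `grad_norm` here is a 0-dim float64 tensor rather than a plain Python number. A minimal sketch (plain PyTorch, nothing MGM-specific; the values are copied from the log above) of coercing it to a float before serializing or plotting:

```python
import torch

# grad_norm as it appears in the log: a 0-dim float64 tensor.
grad_norm = torch.tensor(18.7422, dtype=torch.float64)

# Convert to a plain Python float so it serializes like the other log values.
log_entry = {"loss": 6.9221, "grad_norm": float(grad_norm.detach().cpu())}
print(log_entry)  # {'loss': 6.9221, 'grad_norm': 18.7422}
```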
-
### Feature request
Generalize the functionality in [processing_llava.py](https://github.com/huggingface/transformers/blob/main/src/transformers/models/llava/processing_llava.py) to include other t…
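For context, a rough sketch of the processor call pattern this request would generalize, using the existing LLaVA processor from transformers; the checkpoint name, image, and prompt below are placeholders, not part of the original request:

```python
from transformers import AutoProcessor
from PIL import Image

# LlavaProcessor bundles a tokenizer and an image processor behind a single call;
# the request is to make this pattern reusable for other multimodal model types.
processor = AutoProcessor.from_pretrained("llava-hf/llava-1.5-7b-hf")

image = Image.new("RGB", (336, 336))  # placeholder image
inputs = processor(
    text="USER: <image>\nDescribe the image. ASSISTANT:",
    images=image,
    return_tensors="pt",
)
print(inputs.keys())  # input_ids, attention_mask, pixel_values
```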
-
### Enhancement Request - Support for Additional LLM Types
#### Description:
After reviewing the [MediaPipe documentation](https://developers.google.com/mediapipe/solutions/genai/llm_inference) an…
-
The instruction code for MPT-7B works fine on the older version 20240123, but after updating to the latest branch and using the new code, it always hits an OOM error with multiple GPUs, even when using 8*A1…
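Not a fix for the regression itself, but as a point of comparison, here is a minimal sketch of sharding MPT-7B across the visible GPUs with `device_map="auto"`; the checkpoint name and dtype are assumptions, not taken from the original instruction code:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "mosaicml/mpt-7b"  # assumed checkpoint name

tokenizer = AutoTokenizer.from_pretrained(model_name)

# device_map="auto" (requires the accelerate package) splits the layers across
# all visible GPUs instead of replicating the full model on each one.
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)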
-
C/C++: CMake Error: CMake was unable to find a build program corresponding to "Ninja". CMAKE_MAKE_PROGRAM is not set. You probably need to select a different build tool.
-
### System Info
Name: transformers
Version: 4.45.0.dev0
Name: trl
Version: 0.8.6
### Information
- [ ] The official example scripts
- [X] My own modified scripts
### Tasks
- [ ] A…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
Windows 11
Python 3.12
LLaMA-Factory installed from the latest source code today
### Reproduction
```yaml
### model
model_na…
-
### What is the issue?
I see this issue has been partially reported before, but none of the previous reports thoroughly test the possible ways of setting this option.
The problem:
Ol…
-
### Your current environment
```text
The output of `python collect_env.py`
```
### 🐛 Describe the bug
On the Tesla T4, the model "hangs" after loading (the VRAM usage spikes normal…
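A minimal sketch of what I would try on a T4 (which has no bfloat16 support), using vLLM's offline `LLM` API; the model name and memory fraction are placeholders, not the actual setup from this report:

```python
from vllm import LLM, SamplingParams

# T4s (compute capability 7.5) lack bfloat16, so force fp16 and leave some VRAM
# headroom to rule out the load-time memory spike as the cause of the hang.
llm = LLM(
    model="facebook/opt-125m",   # placeholder model
    dtype="half",                # fp16 instead of the checkpoint's default dtype
    gpu_memory_utilization=0.80,
)

outputs = llm.generate(["Hello"], SamplingParams(max_tokens=16))
print(outputs[0].outputs[0].text)
```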
-
### URL
https://python.langchain.com/docs/tutorials/sql_qa/
### Checklist
- [x] I added a very descriptive title to this issue.
- [X] I included a link to the documentation page I am referring to (…
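For reference, a minimal sketch of the pattern that tutorial page covers, assuming `langchain_community` and `langchain_openai` are installed; the database URI, question, and LLM below are placeholders rather than the exact tutorial code:

```python
from langchain_community.utilities import SQLDatabase
from langchain.chains import create_sql_query_chain
from langchain_openai import ChatOpenAI  # placeholder LLM

# Connect to a local SQLite database (the tutorial uses the Chinook sample DB).
db = SQLDatabase.from_uri("sqlite:///Chinook.db")

llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)

# create_sql_query_chain builds a runnable that turns a question into a SQL query.
chain = create_sql_query_chain(llm, db)
query = chain.invoke({"question": "How many employees are there?"})
print(query)
```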