-
Thank you for developing this!
## Context
Due to lengthy computation times, and in order to speed things up, I thought about using `flash_attention_2` and smaller floating-point precision (`torch.float16`)…
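For reference, this is roughly what I mean — a minimal sketch of loading a model with both options via `transformers` (the model id and helper name here are placeholders; `flash_attention_2` additionally requires a CUDA GPU and the `flash-attn` package to be installed):

```python
import torch
from transformers import AutoModelForCausalLM


def load_fast(model_id: str):
    """Hypothetical helper: load a causal LM in fp16 with FlashAttention-2.

    fp16 halves activation/weight memory and speeds up matmuls on GPU;
    attn_implementation="flash_attention_2" swaps in the fused attention kernel.
    """
    return AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.float16,
        attn_implementation="flash_attention_2",
        device_map="auto",  # place weights on the available GPU(s)
    )
```

If `flash-attn` is not installed (or the GPU does not support it), dropping the `attn_implementation` argument falls back to the default attention while keeping the fp16 speedup.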
-
Trace -
` File "/home/ec2-user/SageMaker/mistral-finetune-unsloth/multi-run-compare/run_model_qwen.py", line 1, in
from unsloth import FastLanguageModel
File "/home/ec2-user/anaconda3/en…
-
**The bug**
It seems that many models loaded with `models.Transformers()` error out with:
`AssertionError: The passed tokenizer does have a byte_decoder property and using a standard gpt2 byte_d…
-
### System Info
transformers==4.45.2
### Who can help?
@ArthurZucker
### Information
- [ ] The official example scripts
- [X] My own modified scripts
### Tasks
- [ ] An officially supported tas…
-
### Checklist
- [ ] The issue exists after disabling all extensions
- [X] The issue exists on a clean installation of webui
- [ ] The issue is caused by an extension, but I believe it is caused by a …
-
The code from https://huggingface.co/Salesforce/codegen25-7b-multi_P#causal-sampling-code-autocompletion and https://github.com/salesforce/CodeGen/tree/main/codegen25#sampling does not work currently.…
-
I get this error:
```
Traceback (most recent call last):
File "/home/denis/Documents/ai/unsloth/llama3-chat-template.py", line 20, in
model, tokenizer = FastLanguageModel.from_pretrained(…
-
### System Info
ubuntu 22.04
torch 2.5.0
cuda 12.4
running on a single GPU with `CUDA_VISIBLE_DEVICES=1`
![image](https://github.com/user-attachments/assets/30134067-427a-4421-94d1-8d958ec628f5)
…
-
### System Info
not relevant here
### Who can help?
@stevhliu
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially suppo…
-
Hello, I failed to convert Llama 3.2 3B for TensorRT-LLM when I tried to run convert_checkpoint.py.
(similar to this issue - https://github.com/NVIDIA/TensorRT-LLM/issues/2339)
I want to know if Llama3.2 3B model con…