-
## Bug Description
Using `transformer-engine[pytorch]==1.11` with `flash-attn>=2.5.7` results in the following error:
```
> output = func(
query_layer,
…
-
I get this error:
```
Traceback (most recent call last):
File "/home/denis/Documents/ai/unsloth/llama3-chat-template.py", line 20, in
model, tokenizer = FastLanguageModel.from_pretrained(…
-
Thank you for taking the time to review my question.
Before I proceed, I would like to mention that I am a beginner, and I would appreciate your consideration of this fact.
I am seeking assistan…
-
In `pytorch_borzoi_example_eqtl_chr10_116952944_T_C.ipynb`, when I load the converted torch weights into the model, I see this missing keys error:
`Missing key(s) in state_dict: "transformer.8.0.fn.…
-
Traceback (most recent call last):
File "/home/justin/Desktop/code/python_project/pytorch-Tutorial-2nd/chapter-9/c_transformer/inference_transformer.py", line 115, in
main()
File "/home/ju…
-
## Resource
- examples: https://github.com/pytorch/examples, the fork: https://github.com/cosdt/pytorch-examples/tree/npu
- benchmark: https://github.com/pytorch/benchmark, models: https://github.…
shink updated
16 hours ago
-
I reinstall `pip install flash-attn==2.6.1` in NGC pytorch docker image 24.06.
When I run train job, I got follow error:
```
Traceback (most recent call last):
File "/data1/nfs15/nfs/bigdata/zha…
-
### System Info
transformer version: 4.46.0 - install from master
pytorch version: 1.9
python version: 3.8
### Who can help?
_No response_
### Information
- [ ] The official example scripts…
-
when I run code according to README.md:
cd MobileUNETR
cd experiments/isic_2016/exp_2_dice_b8_a2/
# the default gpu device is set to cuda:0 (you can change it)
CUDA_VISIBLE_DEVICES="0" accelerate …
-
### System Info
- `transformers` version: 4.45.1
- Platform: Linux-5.4.247-162.350.amzn2.x86_64-x86_64-with-glibc2.26
- Python version: 3.10.12
- Huggingface_hub version: 0.24.0
- Safetensors v…