-
I'm currently working on distributed training of a large language model and I'm using opt-1.3B with layers from ``fairscale.nn.model_parallel.layers`` and split checkpoints for loading. However, I'm e…
-
### Describe the bug
Bug occurs when using Wav2Vec2 and unfreezing the last 2 layers.
This leads to the following error:
`Traceback (most recent call last):
File "train_with_wav2vec.py", line 36…
-
Hello. When I run `bash run.sh`, it had an error. The error details are as follows:
```
2021-07-14 02:40:06.707607: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opene…
-
Got this error running "./scripts/run_finetune_with_lora.sh", seems like a dependency issue, anybody got the same error here?
Traceback (most recent call last):
File "/opt/conda/lib/python3.8/…
-
## Environment info
- `transformers` version: 4.9.1
- Platform: Google Colab
- Python version: 3.7 & 3.8
- PyTorch version (GPU?): 1.8
- Tensorflow version (GPU?): N/A
- Using GPU in script?…
-
----------------Environment: the same as :https://huggingface.co/edbeeching/gpt-neo-125M-imdb-lora
Transformers 4.27.0.dev0
Pytorch 1.13.1+cuda116
Datasets 2.9.0
Tokenizers 0.13.2
trl 0.4.1.dev0/…
-
### System Info
- `transformers` version: 4.28.1
- Platform: Linux-4.19.0-23-cloud-amd64-x86_64-with-glibc2.28
- Python version: 3.9.0
- Huggingface_hub version: 0.13.3
- Safetensors version: not…
-
### System Info
When I use transformers' OPTModel to load the opt-13b model for training with Pytorch FSDP, I found that the whole training is limited by batch_size. Although FSDP has the ability to …
-
### Search before asking
- [X] I have searched the YOLOv5 [issues](https://github.com/ultralytics/yolov5/issues) and [discussions](https://github.com/ultralytics/yolov5/discussions) and found no simi…
-
Even though `aten::_softmax_backward_data` is apparently supported, I am getting this runtime error with the code below.
`def train(model, train_dataloader, loss_function, optimizer):
model…