-
Hi I am trying out the step 1 of deepspeed-chat with the default example. I have one A100-40G, with torch1.12.1-cuda11.3-cudnn8 and deepspeed==0.9.2 in my local environment. I ran into a CUDA OOM erro…
-
### System Info
transformers
- 4.17.1
torch
- 1.10.1
sagemaker
- 2.112.2
### Who can help?
@Narsil
@patrickvonplaten
@anton-l
### Information
- [X] The official example scripts
- [ …
-
I think pytorch should add Windows support.
Other deep learning frameworks, like tensorflow, theano and mxnet, all support Windows.
I only use Windows in my work. So I want to know whether pytorch…
-
```
GPU available: True (cuda), used: True
TPU available: False, using: 0 TPU cores
IPU available: False, using: 0 IPUs
HPU available: False, using: 0 HPUs
Total 23 examples, average length 28.41…
-
### System Info
- `transformers` version: 4.23.1
- Platform: Linux-4.15.0-193-generic-x86_64-with-glibc2.27
- Python version: 3.10.6
- Huggingface_hub version: 0.10.1
- PyTorch version (GPU?): …
-
Can I training a bart model from scratch by transformers?
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### YOLOv8 Component
Training
### B…
-
### Describe the bug
I'm trying to follow the librispeech ASR recipe (/recipes/LibriSpeech/ASR/CTC) and turn on auto_mix_precision (AMP). However, I run into an assertion error saying "No inf checks …
-
### Feature request
I want to train model in the order in which the data are stored.
For example, if there are 100 data, then I want to feed 1st, 2nd data together(because I set batch_size=2 in …
-
### System Info
- `transformers` version: 4.20.1
- Platform: macOS-12.4-arm64-arm-64bit
- Python version: 3.9.10
- Huggingface_hub version: 0.8.1
- PyTorch version (GPU?): 1.13.0.dev20220709 (Fal…