Closed avi-otterai closed 3 years ago
I am unable to reproduce the problem (also, you seem to have made changes to the run_clm script, since it does not accept these arguments: --method range --source fi.json --from_scratch). In general, though, PyTorch discourages the use of DataParallel for multi-GPU training, so could you check whether DistributedDataParallel (by launching the script with torch.distributed.launch) works better?
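For reference, here is a rough sketch of what the suggested launch might look like, written as a small Python helper that only assembles the command line. The helper name build_ddp_launch_command, the file train.txt, and the output directory out are placeholders for illustration and are not taken from the original report. The custom flags mentioned above are left out because the stock script does not accept them. Newer PyTorch releases also provide torchrun as a replacement for torch.distributed.launch, so adjust to your installed version.

```python
import sys


def build_ddp_launch_command(script, num_gpus, script_args):
    """Build an argv list that starts one training process per GPU.

    Hypothetical helper for illustration: it only assembles the command,
    it does not run anything or import torch.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return [
        sys.executable, "-m", "torch.distributed.launch",
        f"--nproc_per_node={num_gpus}",  # one process per GPU on this machine
        script,
        *script_args,
    ]


cmd = build_ddp_launch_command(
    "run_clm.py",
    num_gpus=2,
    script_args=[
        "--model_name_or_path", "openai-gpt",
        "--train_file", "train.txt",  # placeholder path to the plain-text dataset
        "--do_train",
        "--output_dir", "out",        # placeholder output directory
    ],
)
print(" ".join(cmd))
```

The resulting list can be passed to subprocess.run(cmd, check=True), or the printed line can be pasted into a shell. When launched this way, the example script should wrap the model in DistributedDataParallel instead of DataParallel.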
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed, please comment on this thread.
Please note that issues that do not follow the contributing guidelines are likely to be ignored.
Environment info
Who can help
Models:
Library:
Examples:
Information
Model I am using (Bert, XLNet ...): openai-gpt
The problem arises when using:
The task I am working on is:
My dataset is a simple text file with strings for causal language modelling.
To reproduce
Error Log:
Expected behavior
The same as run_clm.py with a single GPU.