kanyun-inc / fairseq-gec

Source code for paper: Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data
Other
246 stars 67 forks source link

when single node with multi gpus , it does not run . #40

Open hl312 opened 3 years ago

hl312 commented 3 years ago

hi , question descirption :
states = torch.load(args.pretrained_model)['model'] --> error line error message: RuntimeError: CUDA error: all CUDA-capable devices are busy or unavailable please give me a hand .