THUDM / GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Apache License 2.0
7.65k stars 611 forks source link

RuntimeError: CUDA error: invalid device ordinal #181

Open TranscenderNing opened 1 year ago

TranscenderNing commented 1 year ago

在运行器件报错:Traceback (most recent call last): File "/home/ldn/ChatGLM130Dir/GLM-130B/generate.py", line 218, in args = initialize(extra_args_provider=add_generation_specific_args) File "/home/ldn/ChatGLM130Dir/GLM-130B/initialize.py", line 51, in initialize args = get_args(args_list) File "/home/ldn/anaconda3/envs/glm1301/lib/python3.9/site-packages/SwissArmyTransformer/arguments.py", line 385, in get_args initialize_distributed(args) File "/home/ldn/anaconda3/envs/glm1301/lib/python3.9/site-packages/SwissArmyTransformer/arguments.py", line 414, in initialize_distributed torch.cuda.set_device(args.device) File "/home/ldn/anaconda3/envs/glm1301/lib/python3.9/site-packages/torch/cuda/init.py", line 314, in set_device torch._C._cuda_setDevice(device) RuntimeError: CUDA error: invalid device ordinal CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1.

alexhmyang commented 1 year ago

same