PaddlePaddle / Paddle

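PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (core framework of PaddlePaddle 『飞桨』: high-performance single-machine and distributed training, and cross-platform deployment, for deep learning & machine learning)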
http://www.paddlepaddle.org/
Apache License 2.0
22.05k stars 5.54k forks

Multiple GPUs are not detected #53812

Closed ericosmic closed 3 months ago

ericosmic commented 1 year ago

Describe the Bug

With paddle-gpu 2.3.2, only the first GPU is detected; the other GPUs are not. The server has 3 GPUs in total: (image) But Paddle detects only the first GPU: (image)

Additional Supplementary Information

Running export CUDA_VISIBLE_DEVICE=0,1,2 did not solve the problem either.
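A side note on the command above: the environment variable that the CUDA runtime reads is `CUDA_VISIBLE_DEVICES` (plural), so a variable named `CUDA_VISIBLE_DEVICE` would be ignored. Whether that is the cause here is not confirmed in this thread. A minimal sketch for checking the environment with the Python standard library follows; the helper name `visible_gpu_ids` is hypothetical and not part of Paddle.

```python
import os


def visible_gpu_ids(env=None):
    """Return the entries of CUDA_VISIBLE_DEVICES as strings, or None if it is unset.

    `env` defaults to the real process environment; passing a dict makes the
    helper easy to try out without touching the shell.
    """
    env = os.environ if env is None else env

    # Warn about the singular spelling, which CUDA does not read.
    if "CUDA_VISIBLE_DEVICE" in env:
        print("warning: CUDA_VISIBLE_DEVICE is set, but CUDA reads CUDA_VISIBLE_DEVICES")

    value = env.get("CUDA_VISIBLE_DEVICES")
    if value is None:
        return None  # unset: no restriction is applied through this variable

    # Entries may be indices or UUIDs, so keep them as strings.
    return [item.strip() for item in value.split(",") if item.strip()]


if __name__ == "__main__":
    print(visible_gpu_ids())
```

Running the script in the same shell that launches training shows whether the restriction is actually in effect for that process.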

ghostxsl commented 1 year ago

I suggest reading the official documentation: https://www.paddlepaddle.org.cn/documentation/docs/zh/api/paddle/device/get_device_cn.html#get-device
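For context, the linked page documents `paddle.device.get_device()`, which returns the device the current process is set to use (for example `gpu:0`), not a list of every GPU on the machine. A hedged sketch of the difference, assuming a Paddle build that provides `paddle.device.cuda.device_count()`; the wrapper name `describe_gpus` is hypothetical, and it returns `None` when Paddle is not installed.

```python
def describe_gpus():
    """Report the current Paddle device and the number of GPUs Paddle can see.

    Returns None if Paddle cannot be imported.
    """
    try:
        import paddle
    except ImportError:
        return None

    return {
        # The single device this process currently targets.
        "current_device": paddle.device.get_device(),
        # How many GPUs are visible to Paddle in this process.
        "gpu_count": paddle.device.cuda.device_count(),
    }


if __name__ == "__main__":
    print(describe_gpus())
```

If the reported count is lower than the number of cards shown by `nvidia-smi`, the cause is more likely the environment (visible-device settings, driver, or CUDA setup) than the device query itself.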

ericosmic commented 1 year ago

@ghostxsl I used the command below for multi-GPU training, and it also fails with an error saying that gpu 1 cannot be found:

python3 -m paddle.distributed.launch --gpus '0,1' tools/train.py -c configs/rec/PP-OCRv3/ch_PP-OCRv3_rec_distillation.yml -o Global.pretrained_model=ppocr_models/pretrain/ch_PP-OCRv3_rec_train/best_accuracy

How can this be solved? Do I need to set a global device?

ghostxsl commented 1 year ago

I suggest trying the official Docker image released by Paddle.

paddle-bot[bot] commented 3 months ago

Since you haven't replied for more than a year, we have closed this issue/pr. If the problem is not solved or there is a follow-up one, please reopen it at any time and we will continue to follow up.