Closed by xuguozhi 1 year ago
You can use a Docker image that already includes BMTrain. I use wangguojim/deepspeed-bmtrain-cuda10.2_torch1.10:20220702
Thanks.
You can use a Docker image that already includes BMTrain. I use wangguojim/deepspeed-bmtrain-cuda10.2_torch1.10:20220702
This is the real fix!!
Yes, I solved it with this same image, but you still need to take the rank issue into account.
When running text_generation, a BMTrain import error appeared:
ImportError: /home/CPM-Bee/BMTrain/bmtrain/optim/_cuda.cpython-39-x86_64-linux-gnu.so: undefined symbol: _ZN2at4_ops9new_zeros4callERKNS_6TensorEN3c108ArrayRefINS5_6SymIntEEENS5_8optionalINS5_10ScalarTypeEEENS9_INS5_6LayoutEEENS9_INS5_6DeviceEEENS9_IbEE
Can you please provide a conda env or a Dockerfile?
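For anyone reading the error above: the missing symbol starts with `_ZN2at`, which is how the Itanium C++ ABI encodes a name in the `at` namespace used by PyTorch. That suggests, though the thread does not confirm it, that the compiled BMTrain extension was built against a different PyTorch version than the one installed at runtime. The sketch below is only a rough diagnostic: the helper names `leading_qualified_name` and `explain_import_error` are hypothetical, and it decodes just the leading nested name of the symbol rather than the full signature.

```python
import re


def leading_qualified_name(symbol: str) -> str:
    """Decode only the leading nested name of an Itanium-mangled symbol.

    Handles the plain `_ZN<len><name>...E` form; anything else returns "".
    """
    if not symbol.startswith("_ZN"):
        return ""
    pos = 3
    parts = []
    # Each component is a decimal length followed by that many characters.
    while pos < len(symbol) and symbol[pos].isdigit():
        end = pos
        while end < len(symbol) and symbol[end].isdigit():
            end += 1
        length = int(symbol[pos:end])
        parts.append(symbol[end:end + length])
        pos = end + length
    return "::".join(parts)


def explain_import_error(message: str) -> str:
    """Give a hedged guess about an 'undefined symbol' ImportError message."""
    match = re.search(r"undefined symbol: (\S+)", message)
    if match is None:
        return "not an undefined-symbol error"
    name = leading_qualified_name(match.group(1))
    # Assumption: at::, c10:: and torch:: names come from PyTorch's libraries.
    if name.startswith(("at::", "c10::", "torch::")):
        return (
            f"missing PyTorch symbol {name}: the extension was probably "
            "built against a different torch version than the one installed"
        )
    return f"missing symbol {name or match.group(1)}"


if __name__ == "__main__":
    msg = (
        "ImportError: _cuda.cpython-39-x86_64-linux-gnu.so: undefined symbol: "
        "_ZN2at4_ops9new_zeros4callERKNS_6TensorE"
    )
    print(explain_import_error(msg))
```

If the guess applies, the usual options are to rebuild BMTrain in the same environment as the installed torch, or to use a prebuilt image such as the one mentioned in this thread, where the two already match.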