OpenBMB / CPM-Bee

A bilingual Chinese-English base large language model with ten billion parameters
2.69k stars 214 forks

environment issue #31

Closed: xuguozhi closed this issue 1 year ago

xuguozhi commented 1 year ago

When running text_generation, a BMTrain import error appeared: ImportError: /home/CPM-Bee/BMTrain/bmtrain/optim/_cuda.cpython-39-x86_64-linux-gnu.so: undefined symbol: _ZN2at4_ops9new_zeros4callERKNS_6TensorEN3c108ArrayRefINS5_6SymIntEEENS5_8optionalINS5_10ScalarTypeEEENS9_INS5_6LayoutEEENS9_INS5_6DeviceEEENS9_IbEE. Could you please provide a conda environment or a Dockerfile?

qingfengcss commented 1 year ago

You can use an image that ships with BMTrain. I am using wangguojim/deepspeed-bmtrain-cuda10.2_torch1.10:20220702

xuguozhi commented 1 year ago

wangguojim/deepspeed-bmtrain-cuda10.2_torch1.10:20220702

Thanks

XuJianzhi commented 1 year ago

You can use an image that ships with BMTrain. I am using wangguojim/deepspeed-bmtrain-cuda10.2_torch1.10:20220702

This is the real fix!!

xuguozhi commented 1 year ago

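You can use an image that ships with BMTrain. I am using wangguojim/deepspeed-bmtrain-cuda10.2_torch1.10:20220702

This is the real fix!!

Yes, I solved the problem with this image, but you need to take the rank issue into account.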
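
For readers hitting the same ImportError: an "undefined symbol" raised while loading a compiled extension usually suggests that the extension was built against a different PyTorch build than the one installed at runtime, which would explain why a prebuilt image with matching versions helps. The following is only a minimal diagnostic sketch, not part of CPM-Bee or BMTrain; the helper names diagnose_import and looks_like_abi_mismatch are hypothetical, and the string check is a heuristic assumption rather than an official detection method.

```python
import importlib


def looks_like_abi_mismatch(message: str) -> bool:
    """Heuristic (assumption): the dynamic loader reported a missing symbol."""
    return "undefined symbol" in message


def diagnose_import(module_name: str) -> dict:
    """Try to import a module and summarize what happened."""
    try:
        module = importlib.import_module(module_name)
    except ImportError as exc:
        # ModuleNotFoundError is a subclass of ImportError, so a missing
        # package and a broken compiled extension both end up here.
        return {
            "module": module_name,
            "ok": False,
            "version": None,
            "abi_mismatch": looks_like_abi_mismatch(str(exc)),
            "error": str(exc),
        }
    return {
        "module": module_name,
        "ok": True,
        # Not every module defines __version__, so fall back to None.
        "version": getattr(module, "__version__", None),
        "abi_mismatch": False,
        "error": None,
    }


if __name__ == "__main__":
    # Report the state of both packages in the current environment.
    for name in ("torch", "bmtrain"):
        print(diagnose_import(name))
```

If the report for bmtrain shows abi_mismatch as True, rebuilding BMTrain against the installed PyTorch, or switching to an image where the two already match (such as the one mentioned in this thread), is the likely remedy.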