apache / mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
https://mxnet.apache.org
Apache License 2.0

Your software broke my system #20613

Closed by Yuri6037 2 years ago

Yuri6037 commented 2 years ago

Description

By requiring NCCL to be installed, you broke my system: nvidia-smi now refuses to run and fails with "Failed to initialize NVML: Driver/library version mismatch".

Please make your library stop requiring NCCL, which wrecks the NVIDIA installation every time you try to install it!

EDIT: I'm trying to repair the NVIDIA installation by running sudo apt install cuda-10-2. If this does not work, you will have made me lose a day of work, because I will have to reinstall the entire system!

EDIT2: NVIDIA is fixed now

Yuri6037 commented 2 years ago

Re-running apt install libnccl2 after apt install cuda-10-2 seems to have fixed the problem.
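
For anyone hitting the same "Driver/library version mismatch" error, here is a minimal sketch of how the mismatch could be confirmed before and after reinstalling. It assumes the loaded kernel module reports its version in /proc/driver/nvidia/version and that the user-space NVML library is installed under a versioned name such as libnvidia-ml.so.440.33.01. The helper names and the sample version numbers are hypothetical and only for illustration; they were not taken from the affected machine.

```python
import re

# Matches dotted driver versions such as 440.33.01 (illustrative format).
_VERSION = r"(\d+\.\d+(?:\.\d+)*)"


def kernel_module_version(proc_text):
    """Return the driver version found on the 'NVRM version:' line, or None.

    proc_text is assumed to be the content of /proc/driver/nvidia/version.
    """
    match = re.search(r"NVRM version:.*?" + _VERSION, proc_text)
    return match.group(1) if match else None


def library_version(filename):
    """Return the version suffix of a name like libnvidia-ml.so.440.33.01, or None."""
    match = re.search(r"libnvidia-ml\.so\." + _VERSION + r"$", filename)
    return match.group(1) if match else None


def is_mismatch(proc_text, filename):
    """True only when both versions are known and they differ."""
    kernel = kernel_module_version(proc_text)
    library = library_version(filename)
    return kernel is not None and library is not None and kernel != library


if __name__ == "__main__":
    # Hypothetical sample data, not captured from a real system.
    sample_proc = (
        "NVRM version: NVIDIA UNIX x86_64 Kernel Module  440.33.01  "
        "Wed Nov 13 00:00:22 UTC 2019\n"
    )
    print(is_mismatch(sample_proc, "libnvidia-ml.so.470.57.02"))
    print(is_mismatch(sample_proc, "libnvidia-ml.so.440.33.01"))
```

If the two versions differ, the user-space libraries were probably upgraded or replaced while the older kernel module stayed loaded, which would be consistent with the reinstall fixing things here.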

This might be a problem with the default installation on Tencent Cloud machines.