Closed apeforest closed 4 years ago
@apeforest assign [@apeforest ]
This fixed it for me. You basically need to update the OpenBLAS libs to latest with a the dynamic architecture flag ...
@mjsML Thanks. Issue fixed.
@apeforest what platform are you on? Ideally the installation instructions should be updated based on this issue
@leezu I was using the latest AWS DLAMI. I will verify once more.
There are both 16.04 and 18.04 versions of DLAMI. Based on your kernel version, you are using 16.04. 18.04 works fine for me.
If 16.04 ships with problematic openblas, let's add a Note to the install guide.
This issue happens on 18.04 too (this is my OS at the moment) ... I think it depends which subversion of ubuntu you have ... This is a very old install of 18.04 (probably 18.04.00 or the likes) ... and given I have not updated OpenBLAS explicitly it happened on 18.04 too .. according to the build guide from OpenBLAS the core issue is some symbols don't make it if you don't compile with the dynamicarch flag (so even if you have the most recent build but you didn't use this flag you'd get the same error.)
Which branch is this? master or 1.6.x?
@larroy master ... a similar issue also occurs when you chose MKL as BLAS too (the linker complains on another symbol tho) ...
what's the root cause of this? This is something we should add to CI if users are facing these difficulties.
agreed ... as I mentioned above it's a well documented issue with OpenBLAS and debian based distros.
Debian and Ubuntu LTS versions provide OpenBLAS package which is not updated after initial release, and under circumstances one might want to use more recent version of OpenBLAS e.g. to get support for newer CPUs
Ubuntu and Debian provides 'alternatives' mechanism to comfortably replace BLAS and LAPACK libraries systemwide.
After successful build of OpenBLAS (with DYNAMIC_ARCH set to 1)
$ make clean
$ make DYNAMIC_ARCH=1
$ sudo make DYNAMIC_ARCH=1 install
Basically the symbol mentioned above is missing from the default binaries that ships with debian based distros, so OpenBLAS needs to be rebuilt from latest source with this flag to force the compiler to output the symbol that the linker here is looking for(in this instance the old "gotoblas" symbol).
@apeforest @mjsML please provide the output of your cmake
commands.
It's unclear whether MKL is found during your compilation attempt based on the information provided.
I can't reproduce this issue both when linking against MKL as well as linking against OpenBLAS on a blank Ubuntu 18.04 system.
Are you compiling inside conda?
@mjsML we don't require updating OpenBLAS. This is unrelated.
Okay if I follow all the instructions on the web to apt update all required packages, I no longer face the issue. I guess it's probably I messed up cmake in my old enviroment. @mjsML if you also don't see this issue any more, I guess we can close it. Thanks @leezu for investigation.
I'll close this issue for now, please reopen @mjsML if you can still reproduce this.
Description
Following the steps at https://mxnet.apache.org/get_started/ubuntu_setup I got the following error:
Error Message
To Reproduce
then
Environment
We recommend using our script for collecting the diagnositc information. Run the following command and paste the outputs below: