apache / mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
https://mxnet.apache.org
Apache License 2.0
20.77k stars 6.79k forks source link

MXNet static library build results in error in centos, oracle linux and similar distros #14332

Open ashutosh-dwivedi-e3502 opened 5 years ago

ashutosh-dwivedi-e3502 commented 5 years ago

Note: Providing complete information in the most concise form is the best way to get help. This issue template serves as the checklist for essential information to most of the technical issues and bug reports. For non-technical issues and feature requests, feel free to present the information in what you believe is the best form.

For Q & A and discussion, please start a discussion thread at https://discuss.mxnet.io

Description

MXNet static library build results in error in centos, oracle linux and similar distros

Environment info (Required)

What to do:
1. Download the diagnosis script from https://raw.githubusercontent.com/apache/incubator-mxnet/master/tools/diagnose.py
2. Run the script using `python diagnose.py` and paste its output here.

Package used (Python/R/Scala/Julia): (I'm using ...)

For Scala user, please provide:

  1. Java version: (java -version)
  2. Maven version: (mvn -version)
  3. Scala runtime if applicable: (scala -version)

For R user, please provide R sessionInfo():

Build info (Required if built from source)

Compiler (gcc/clang/mingw/visual studio):

MXNet commit hash: (Paste the output of git rev-parse HEAD here.)

Build config: (Paste the content of config.mk, or the build command.)

Error Message:

a - build/src/cg++: error: /root/mxnet/staticdeps/lib/libzmq.a: No such file or directory make: [lib/libmxnet.so] Error 1 make: Waiting for unfinished jobs.... _api/c_predict_api.o a - build/src/c_api/c_api_function.o a - build/src/c_api/c_api.o a - build/src/c_api/c_api_error.o a - build/src/executor/onnx_to_tensorrt.o a - build/src/executor/inplace_addto_detect_pass.o a - build/src/executor/infer_graph_attr_pass.o a - build/src/executor/graph_executor.o a - build/src/executor/trt_graph_executor.o a - build/src/executor/tensorrt_pass.o a - build/src/executor/attach_op_execs_pass.o a - build/src/executor/attach_op_resource_pass.o a - build/src/kvstore/gradient_compression.o a - build/src/kvstore/kvstore_utils.o a - build/src/kvstore/kvstore.o a - build/src/resource.o a - build/src/libinfo.o a - build/src/initialize.o make: *** wait: No child processes. Stop.

Minimum reproducible example

(If you are using your own code, please provide a short script that reproduces the error. Otherwise, please provide link to the existing example.)

Steps to reproduce

(Paste the commands you ran that produced the error.)

  1. Build the docker image from this Dockerfile : https://gist.github.com/aashudwivedi/9566b544c4784aed7b86080f53ba170d which results in the the error : https://gist.github.com/aashudwivedi/ad943c79f1f2907f5c7bb32fd502d80b

What have you tried to solve it?

1. 2.

mxnet-label-bot commented 5 years ago

Hey, this is the MXNet Label Bot. Thank you for submitting the issue! I will try and suggest some labels so that the appropriate MXNet community members can help resolve it.

piyushghai commented 5 years ago

@mxnet-label-bot Add [Build]

@lanking520 , @zachgk Can you guys have a look at this issue perhaps ?

lanking520 commented 5 years ago

@aashudwivedi Nice try and thanks for working on this. I also find similar issue when trying Cent OS 7. The libzmq.a file is living in lib64 other than the lib under staticdeps. You should change the LD_LIBRARY_PATH to resolve the issue or copying the content from lib64 to lib.

Currently we are using Ubuntu 14.04 to run this script, you can do with Ubuntu 16.04 as well if you want.

cc: @szha as a known problem in Cent OS 7 build