apache / mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
https://mxnet.apache.org
Apache License 2.0
20.77k stars 6.8k forks source link

CI fails for unix cpu Static build CPU 14.04 #14174

Open ChaiBapchya opened 5 years ago

ChaiBapchya commented 5 years ago

In an unrelated PR https://github.com/apache/incubator-mxnet/pull/14168 (fixes spelling), Unix CPU in CI failed for the test - Static build CPU 14.04 Scala

Mostly the error is caused here

To install the library, you can run "make PREFIX=/path/to/your/installation install".

++++ make --quiet -j 72 PREFIX=/work/mxnet/staticdeps install
make[1]: warning: -jN forced in submake: disabling jobserver mode.
Generating openblas_config.h in /work/mxnet/staticdeps/include
Generating f77blas.h in /work/mxnet/staticdeps/include
Generating cblas.h in /work/mxnet/staticdeps/include
Copying LAPACKE header files to /work/mxnet/staticdeps/include
Copying the static library to /work/mxnet/staticdeps/lib
Copying the shared library to /work/mxnet/staticdeps/lib
install: cannot stat 'libopenblasp-r0.3.3.so': No such file or directory
make[1]: *** [install] Error 1
make: *** [install] Error 2

Log can be found here - http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-cpu/detail/PR-14168/1/pipeline @lanking520 can you weigh in? Any thoughts?

mxnet-label-bot commented 5 years ago

Hey, this is the MXNet Label Bot. Thank you for submitting the issue! I will try and suggest some labels so that the appropriate MXNet community members can help resolve it. Here are my recommended labels: Scala, Build

ChaiBapchya commented 5 years ago

@mxnet-label-bot add [Scala, Build]

ChaiBapchya commented 5 years ago

Yet another instance - Refer PR https://github.com/apache/incubator-mxnet/pull/14247 Jenkins - http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-cpu/detail/PR-14247/1/

@lanking520 Looks more than a random error

lanking520 commented 5 years ago

@marcoabreu not quite sure if it caused by some network issue or timeout from Jenkins. It's rare to see this issue, but it did happened.

ChaiBapchya commented 5 years ago

Yet again - Refer PR #14303 Jenkins - http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-cpu/detail/PR-14303/2/pipeline

piyushghai commented 5 years ago

It's not related to Scala. The errors on the Jenkins stage are at : Static Build CPU 14.04 Python, more specifically this :

build.py: 2019-02-25 06:41:31,028Z CRITICAL Execution of ['/work/runtime_functions.sh', 'build_static_python_mkl'] failed with status: 2
piyushghai commented 5 years ago

@mxnet-label-bot Update [Python, Build, MKL]

lanking520 commented 5 years ago

@piyushghai Well, it both happened.

lanking520 commented 5 years ago

https://github.com/apache/incubator-mxnet/issues/14414

lanking520 commented 5 years ago

http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-cpu/detail/master/416/pipeline

perdasilva commented 5 years ago

Seeing it again: http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-cpu/detail/master/440/pipeline

ChaiBapchya commented 5 years ago

Again : http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-cpu/detail/PR-14243/4/pipeline

For unrelated #14243

ChaiBapchya commented 5 years ago

For unrelated #15606

http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-cpu/detail/PR-15606/2/pipeline

ChaiBapchya commented 5 years ago

Again #15941 http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-cpu/detail/PR-15941/9/pipeline