Open Zha0q1 opened 3 years ago
Hi @barry-jin, would you help identify the issue with the docs CI?
For the GPU CI, I think libcudnn 8 is missing.
Hi @Zha0q1, the gluon-nlp v0.x CI runs on the master branch's Docker image, which uses nvidia/cuda:10.2-cudnn7-devel-ubuntu18.04 as its base. It looks like mxnet-cu102==1.8.0.post0 is forced to use libcudnn.so.8, which a cuDNN 7 base image would not be expected to provide. I have created a workaround to try to fix this in #1553.
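For anyone debugging the same mismatch, here is a minimal sketch (not the workaround in #1553) that checks which cuDNN shared libraries the dynamic loader can open inside the CI container. The helper name `can_load` and the list of sonames probed are assumptions for illustration only.

```python
import ctypes


def can_load(soname: str) -> bool:
    """Return True if the dynamic loader can open the given shared library."""
    try:
        ctypes.CDLL(soname)
    except OSError:
        # The library (or one of its dependencies) could not be found or loaded.
        return False
    return True


if __name__ == "__main__":
    # Sonames are assumed here: cuDNN 7 and cuDNN 8 on Linux.
    for soname in ("libcudnn.so.7", "libcudnn.so.8"):
        status = "loadable" if can_load(soname) else "not found"
        print(f"{soname}: {status}")
```

If the script is run inside the image and only libcudnn.so.7 is loadable, that would be consistent with the error seen in the GPU CI.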
Thanks! I think the other two pipelines also have some issues
Yes. The website-build pipeline submits a job to AWS Batch to run the script compile_notebooks.sh, so make sure to update your ci folder so that this script is present in ci/batch. For the unittest pipeline, there appear to be some memory-related issues; more investigation is needed.
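A quick local check before re-running the website-build pipeline could look like the sketch below. The path ci/batch/compile_notebooks.sh comes from the comment above; the helper name `has_batch_script` and the `repo_root` argument are hypothetical.

```python
from pathlib import Path


def has_batch_script(repo_root: str, name: str = "compile_notebooks.sh") -> bool:
    """Return True if <repo_root>/ci/batch/<name> exists as a regular file."""
    return (Path(repo_root) / "ci" / "batch" / name).is_file()


if __name__ == "__main__":
    # Assumes the script is run from the root of the checked-out branch.
    if has_batch_script("."):
        print("ci/batch/compile_notebooks.sh is present")
    else:
        print("ci/batch/compile_notebooks.sh is missing; merge the latest v0.x")
```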
Thanks! I merged with v0.x
The documentation website for preview: http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR1556/d4caa0b65b3733fc5fe6fc516bf47894b6482b8c/index.html
Hi @Zha0q1. The v0.x CI has been fixed. Feel free to merge with the current v0.x and re-run the tests. Thanks!
@barry-jin thanks!
Adds a new notebook that runs a transformer model with onnxruntime. The ONNX export feature will be added in MXNet 1.9.
@szha @sxjscience @sandeep-krishnamurthy