Closed KingICCrab closed 1 week ago
I think the problem is that GitHub was down or you don't have an access to it.
Can you please try git clone https://github.com/mlcommons/ck mlcommons@ck
in some temp directory to check if it works and then restart the cm command when it's working? Please tell us if it helps! Thanks!
@KingICCrab - did you try again to see if it works? I believe it's a network issue - it happens with GitHub from time to time ;) ...
Thank you for your consideration! I‘m sorry. I temporarily give up reproducing it, because I know about docker little.
Thank you for your consideration! I‘m sorry. I temporarily give up reproducing it, because I know about docker little.
No problem. What I meant is that may I ask you to retry the same CM command and see if it works now:
cm docker script --tags=build,nvidia,inference,server
When there is a network issue, CM should restart building Docker container at the place it failed ... Thanks!
After I run the command, the error is following.
(These words are red!)
Cloning into 'repo'...
error: RPC failed; curl 28 Failed to connect to github.com port 443: Connection timed out
fatal: the remote end hung up unexpectedly
Traceback (most recent call last):
File "/home/cmuser/.local/bin/cm", line 8, in
Interesting. Thank you very much again for your feedback @KingICCrab - we didn't encounter such case before and will need to CM support to handle it in a better way! I will keep this ticket open to check it when we have time ... Thanks again!
I improved handling of broken CM repositories (when, for example, GitHub fails): https://github.com/mlcommons/ck/commit/c39caa38ec470e1e75ddb6679e56e5b1a079e34e . It should be available in the next CM release v2.0.3 ...
I believe it's fixed.
I want to reproduce nvidia-bert https://github.com/mlcommons/ck/blob/master/docs/mlperf/inference/bert/README_nvidia.md#build-nvidia-docker-container-from-31-inference-round when I run "cm docker script --tags=build,nvidia,inference,server", I encounter some problems. => ERROR [10/12] RUN cm pull repo mlcommons@ck 104.6s
CM error: Portable CM script failed (name = build-docker-image, return code = 256)