I wrote this DockerFile to compose the environment:

FROM nvidia/cuda:12.0.0-cudnn8-runtime-ubuntu20.04 RUN apt-get update && apt-get install -y git python3 python3-pip python3-dev libsndfile1 RUN git clone https://github.com/alibaba-damo-academy/FunASR-APP.git WORKDIR /FunASR-APP RUN pip3 install "modelscope[audio_asr]" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html RUN pip3 install -r ClipVideo/requirments.txt RUN pip3 install torch torchvision torchaudio EXPOSE 7860 CMD ["python3", "ClipVideo/clipvideo/gradio_service.py"]

When the container ran, it stuck at ' loading punctuation model from /root/.cache/modelscope/hub/damo/punc_ct-transformer_zh-cn-common-vocab272727-pytorch'. Output following:

========== == CUDA ==

CUDA Version 12.0.0

This container image and its contents are governed by the NVIDIA Deep Learning Container License. By pulling and using the container, you accept the terms and conditions of this license: https://developer.nvidia.com/ngc/nvidia-deep-learning-container-license

A copy of this license is made available in this container at /NGC-DL-CONTAINER-LICENSE for your convenience.

2023-07-23 16:35:47,819 - modelscope - INFO - PyTorch version 2.0.1 Found. 2023-07-23 16:35:47,819 - modelscope - INFO - Loading ast index from /root/.cache/modelscope/ast_indexer 2023-07-23 16:35:47,840 - modelscope - INFO - Loading done! Current index file version is 1.7.1, with md5 25e329753dff2bc06f1af41a30e22094 and a total number of 861 components indexed 2023-07-23 16:35:48,718 - modelscope - INFO - Model revision not specified, use the latest revision: v1.2.4 2023-07-23 16:35:49,087 - modelscope - INFO - initiate model from /root/.cache/modelscope/hub/damo/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch 2023-07-23 16:35:49,087 - modelscope - INFO - initiate model from location /root/.cache/modelscope/hub/damo/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch. 2023-07-23 16:35:49,087 - modelscope - INFO - initialize model from /root/.cache/modelscope/hub/damo/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch 2023-07-23 16:35:49,089 - modelscope - WARNING - No preprocessor field found in cfg. 2023-07-23 16:35:49,089 - modelscope - WARNING - No val key and type key found in preprocessor domain of configuration.json file. 2023-07-23 16:35:49,089 - modelscope - WARNING - Cannot find available config to build preprocessor at mode inference, current config: {'model_dir': '/root/.cache/modelscope/hub/damo/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch'}. trying to build by task and model information. 2023-07-23 16:35:49,089 - modelscope - WARNING - No preprocessor key ('generic-asr', 'auto-speech-recognition') found in PREPROCESSOR_MAP, skip building preprocessor. 2023-07-23 16:35:49,689 - modelscope - INFO - Model revision not specified, use the latest revision: v1.2.0 2023-07-23 16:35:49,893 - modelscope - INFO - loading vad model from /root/.cache/modelscope/hub/damo/speech_fsmn_vad_zh-cn-16k-common-pytorch ... 2023-07-23 16:35:50,090 - modelscope - INFO - Model revision not specified, use the latest revision: v1.1.7 2023-07-23 16:35:50,423 - modelscope - INFO - loading punctuation model from /root/.cache/modelscope/hub/damo/punc_ct-transformer_zh-cn-common-vocab272727-pytorch ...

modelscope / FunClip

Stuck at 'loading punctuation model' #4

========== == CUDA ==