mosaicml / llm-foundry

LLM training code for Databricks foundation models
https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
Apache License 2.0

unable to do 'apt update' in mosaicml/llm-foundry:2.0.1_cu118-latest #473

Closed guoyejun closed 1 year ago

guoyejun commented 1 year ago

The command used to launch the container is:

docker run --gpus all -it --name yjguo_mpt --ipc=host --privileged --network=host -e http_proxy=$http_proxy -e https_proxy=$https_proxy -v /home/yejun/yjguo/mpt:/workspace mosaicml/llm-foundry:2.0.1_cu118-latest

After logging in to the container, I ran apt-get update, but it failed for every repository:

root@skyocean:/# apt-get update
Get:1 http://archive.ubuntu.com/ubuntu focal InRelease [265 kB]
Err:1 http://archive.ubuntu.com/ubuntu focal InRelease
  At least one invalid signature was encountered.
Get:2 http://archive.ubuntu.com/ubuntu focal-updates InRelease [114 kB]
Err:2 http://archive.ubuntu.com/ubuntu focal-updates InRelease
  At least one invalid signature was encountered.
Get:3 http://security.ubuntu.com/ubuntu focal-security InRelease [114 kB]
Get:4 http://ppa.launchpad.net/deadsnakes/ppa/ubuntu focal InRelease [18.1 kB]
Err:3 http://security.ubuntu.com/ubuntu focal-security InRelease
  At least one invalid signature was encountered.
Err:4 http://ppa.launchpad.net/deadsnakes/ppa/ubuntu focal InRelease
  At least one invalid signature was encountered.
Get:5 http://archive.ubuntu.com/ubuntu focal-backports InRelease [108 kB]
Err:5 http://archive.ubuntu.com/ubuntu focal-backports InRelease
  At least one invalid signature was encountered.
Get:6 https://deb.nodesource.com/node_18.x focal InRelease [4583 B]
Err:6 https://deb.nodesource.com/node_18.x focal InRelease
  At least one invalid signature was encountered.
Get:7 http://ppa.launchpad.net/git-core/ppa/ubuntu focal InRelease [23.8 kB]
Err:7 http://ppa.launchpad.net/git-core/ppa/ubuntu focal InRelease
  At least one invalid signature was encountered.
Get:8 https://developer.download.nvidia.cn/compute/cuda/repos/ubuntu2004/x86_64  InRelease [1581 B]
Err:8 https://developer.download.nvidia.cn/compute/cuda/repos/ubuntu2004/x86_64  InRelease
  At least one invalid signature was encountered.
Reading package lists... Done
W: GPG error: http://archive.ubuntu.com/ubuntu focal InRelease: At least one invalid signature was encountered.
E: The repository 'http://archive.ubuntu.com/ubuntu focal InRelease' is not signed.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
W: GPG error: http://archive.ubuntu.com/ubuntu focal-updates InRelease: At least one invalid signature was encountered.
E: The repository 'http://archive.ubuntu.com/ubuntu focal-updates InRelease' is not signed.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
W: GPG error: http://security.ubuntu.com/ubuntu focal-security InRelease: At least one invalid signature was encountered.
E: The repository 'http://security.ubuntu.com/ubuntu focal-security InRelease' is not signed.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
W: GPG error: http://ppa.launchpad.net/deadsnakes/ppa/ubuntu focal InRelease: At least one invalid signature was encountered.
E: The repository 'http://ppa.launchpad.net/deadsnakes/ppa/ubuntu focal InRelease' is not signed.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
W: GPG error: http://archive.ubuntu.com/ubuntu focal-backports InRelease: At least one invalid signature was encountered.
E: The repository 'http://archive.ubuntu.com/ubuntu focal-backports InRelease' is not signed.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
W: GPG error: https://deb.nodesource.com/node_18.x focal InRelease: At least one invalid signature was encountered.
E: The repository 'https://deb.nodesource.com/node_18.x focal InRelease' is not signed.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
W: GPG error: http://ppa.launchpad.net/git-core/ppa/ubuntu focal InRelease: At least one invalid signature was encountered.
E: The repository 'http://ppa.launchpad.net/git-core/ppa/ubuntu focal InRelease' is not signed.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
W: GPG error: https://developer.download.nvidia.cn/compute/cuda/repos/ubuntu2004/x86_64  InRelease: At least one invalid signature was encountered.
E: The repository 'https://developer.download.nvidia.cn/compute/cuda/repos/ubuntu2004/x86_64  InRelease' is not signed.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
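
The log above is long and repetitive, so a short summary of which repositories failed signature verification can make it easier to read. The following is a minimal Python sketch, not part of the original report; the function name `failing_repos` and the regular expression are assumptions written to match the `W: GPG error:` lines shown in this log. It only summarizes the output and does not diagnose or fix the underlying cause.

```python
import re

# Matches apt warning lines of the form seen in the log above, e.g.
# "W: GPG error: http://archive.ubuntu.com/ubuntu focal InRelease: At least one ..."
# The suite (e.g. "focal") may be empty, as in the NVIDIA CUDA repository line.
GPG_ERROR = re.compile(
    r"^W: GPG error: (?P<repo>\S+)\s+(?P<suite>.*?)\s*InRelease: (?P<reason>.+)$"
)


def failing_repos(apt_output: str) -> list[tuple[str, str, str]]:
    """Return (repository URL, suite, reason) for each GPG warning in apt output."""
    found = []
    for line in apt_output.splitlines():
        match = GPG_ERROR.match(line.strip())
        if match:
            found.append((match["repo"], match["suite"], match["reason"]))
    return found


if __name__ == "__main__":
    sample = (
        "W: GPG error: http://archive.ubuntu.com/ubuntu focal InRelease: "
        "At least one invalid signature was encountered.\n"
        "E: The repository 'http://archive.ubuntu.com/ubuntu focal InRelease' is not signed.\n"
    )
    for repo, suite, reason in failing_repos(sample):
        print(f"{repo} [{suite or 'no suite'}]: {reason}")
```

Running the function over the full output of apt-get update shows at a glance whether the signature failure is limited to one repository or, as in this report, affects all of them.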

kashifmunir92 commented 1 year ago

requirements.txt

accelerate==0.19.0
aiohttp==3.8.4
aiosignal==1.3.1
antlr4-python3-runtime==4.9.3
apache-libcloud==3.7.0
appdirs==1.4.4
argcomplete==3.1.1
arrow==1.2.3
asttokens==2.2.1
async-timeout==4.0.2
attrs==23.1.0
azure-core==1.27.1
azure-storage-blob==12.16.0
backcall==0.2.0
backoff==2.2.1
bcrypt==4.0.1
boto3==1.28.1
botocore==1.31.1
Brotli==1.0.9
certifi==2023.5.7
cffi==1.15.1
chardet==3.0.4
charset-normalizer==3.2.0
circuitbreaker==1.4.0
click==8.1.4
cmake==3.26.3
coloredlogs==15.0.1
composer==0.15.1
contourpy==1.1.0
coolname==2.2.0
cryptography==39.0.2
cxxfilt==0.3.0
cycler==0.11.0
datasets==2.10.1
dbus-python==1.2.16
decorator==5.1.1
deepspeed==0.9.3
dill==0.3.6
docker==6.1.3
docker-pycreds==0.4.0
einops==0.5.0
exceptiongroup==1.1.2
executing==1.2.0
filelock==3.12.2
flatbuffers==23.5.26
flit_core==3.9.0
fonttools==4.40.0
frozenlist==1.3.3
fsspec==2023.6.0
gitdb==4.0.10
GitPython==3.1.31
gql==3.4.1
graphql-core==3.2.3
hjson==3.1.0
huggingface-hub==0.16.4
humanfriendly==10.0
idna==2.8
importlib-metadata==6.8.0
iniconfig==2.0.0
ipython==8.14.0
isodate==0.6.1
jedi==0.18.2
Jinja2==3.1.2
jmespath==1.0.1
kiwisolver==1.4.4
lit==16.0.6
markdown-it-py==3.0.0
MarkupSafe==2.1.3
matplotlib==3.7.2
matplotlib-inline==0.1.6
mdurl==0.1.2
mosaicml==0.15.1
mosaicml-cli==0.4.15
mosaicml-streaming==0.5.1
mpmath==1.3.0
multidict==6.0.4
multiprocess==0.70.14
networkx==3.1
ninja==1.11.1
numpy==1.24.4
oci==2.105.0
omegaconf==2.3.0
onnx==1.14.0
onnxruntime==1.15.1
packaging==22.0
pandas==2.0.3
paramiko==3.2.0
parso==0.8.3
pathtools==0.1.2
pexpect==4.8.0
pickleshare==0.7.5
pillow

Pillow-SIMD==9.0.0.post1

pluggy==1.2.0
prompt-toolkit==3.0.39
protobuf==4.23.4
psutil==5.9.5
ptyprocess==0.7.0
pure-eval==0.2.2
py-cpuinfo==9.0.0
pyarrow==12.0.1
pycparser==2.21
pydantic==1.10.12
Pygments==2.15.1
PyGObject==3.36.0
PyNaCl==1.5.0
pyOpenSSL==23.2.0
pyparsing==3.0.9
pytest==7.4.0
python-dateutil==2.8.2
python-snappy==0.6.1
pytorch-ranger==0.1.1
pytz==2023.3
PyYAML==6.0
questionary==1.10.0
regex==2023.6.3
requests==2.31.0
requests-unixsocket==0.2.0
responses==0.18.0
rich==13.4.2
ruamel.yaml==0.17.32
ruamel.yaml.clib==0.2.7
s3transfer==0.6.1
safetensors==0.3.1
sentencepiece==0.1.97
sentry-sdk==1.27.1
setproctitle==1.3.2
six==1.14.0
slack-sdk==3.21.3
smmap==5.0.0
stack-data==0.6.2
sympy==1.12
tabulate==0.9.0
tokenizers==0.13.3
tomli==2.0.1

torch==2.0.1+cu118

torch-optimizer==0.3.0
torchdata==0.6.1
torchmetrics==0.11.4

torchtext==0.15.2+cpu

torchvision==0.15.2+cu118

tqdm==4.65.0
traitlets==5.9.0
transformers==4.30.2
triton==2.0.0
typing_extensions==4.6.3
tzdata==2023.3
urllib3==1.26.16
validators==0.20.0
wandb==0.15.5
wcwidth==0.2.6
websocket-client==1.6.1
websockets==10.4
xxhash==3.2.0
yarl==1.9.2
zipp==3.15.0
zstandard==0.21.0
zstd==1.5.5.1

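# Create and activate a fresh conda environment with Python 3.10
conda create -n mpt python=3.10
conda activate mpt
# Install the pinned dependencies listed above
pip install -r requirements.txt
# Clone the repository and install llm-foundry with pip
git clone https://github.com/mosaicml/llm-foundry.git
cd llm-foundry
pip install llm-foundry
# Install CUDA Toolkit 11.8 inside the conda environment
conda install -c "nvidia/label/cuda-11.8.0" cuda-toolkit
# Install the cloned repository in editable mode with the GPU extras
pip install -e ".[gpu]"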

I took the dependencies above from the Docker image and installed them. If some of them fail to install, you can comment them out; I commented out a few while installing, and it caused no problems for the environment. I then installed llm-foundry through pip, installed CUDA Toolkit 11.8 inside the conda environment, and ran pip install -e ".[gpu]". Now it works like a charm. Hope it's helpful. Cheers!
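
The advice above about commenting out dependencies that fail to install can be sketched as a small Python helper. This is an illustrative sketch, not something from the thread: the function name `normalize_requirements` and its `skip` parameter are hypothetical, and it assumes each pin contains no spaces and that the input has no comments or pip options of its own. It writes one requirement per line, which is the layout pip expects in a requirements file, and prefixes the chosen packages with `#`.

```python
import re


def normalize_requirements(text: str, skip: frozenset[str] = frozenset()) -> str:
    """Put one requirement per line and comment out the packages named in `skip`.

    `skip` holds lowercase package names. Pins may be separated by spaces or
    newlines in the input; the result is the same either way.
    """
    lines = []
    for spec in text.split():
        # The package name is everything before the first version operator.
        name = re.split(r"[=<>!~]", spec, maxsplit=1)[0]
        if name.lower() in skip:
            lines.append(f"# {spec}")
        else:
            lines.append(spec)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    pasted = "accelerate==0.19.0 aiohttp==3.8.4 dbus-python==1.2.16"
    # Which packages to skip is a choice made here purely for illustration.
    print(normalize_requirements(pasted, skip=frozenset({"dbus-python"})), end="")
```

The output can be saved as requirements.txt before running pip install -r requirements.txt; which packages to skip depends on what actually fails on a given machine.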

dakinggg commented 1 year ago

Hey, were you able to resolve this?

dakinggg commented 1 year ago

Closing due to inactivity. Please feel free to open a new issue if you are still encountering problems.