Closed guoyejun closed 1 year ago
accelerate==0.19.0 aiohttp==3.8.4 aiosignal==1.3.1 antlr4-python3-runtime==4.9.3 apache-libcloud==3.7.0 appdirs==1.4.4 argcomplete==3.1.1 arrow==1.2.3 asttokens==2.2.1 async-timeout==4.0.2 attrs==23.1.0 azure-core==1.27.1 azure-storage-blob==12.16.0 backcall==0.2.0 backoff==2.2.1 bcrypt==4.0.1 boto3==1.28.1 botocore==1.31.1 Brotli==1.0.9 certifi==2023.5.7 cffi==1.15.1 chardet==3.0.4 charset-normalizer==3.2.0 circuitbreaker==1.4.0 click==8.1.4 cmake==3.26.3 coloredlogs==15.0.1 composer==0.15.1 contourpy==1.1.0 coolname==2.2.0 cryptography==39.0.2 cxxfilt==0.3.0 cycler==0.11.0 datasets==2.10.1 dbus-python==1.2.16 decorator==5.1.1 deepspeed==0.9.3 dill==0.3.6 docker==6.1.3 docker-pycreds==0.4.0 einops==0.5.0 exceptiongroup==1.1.2 executing==1.2.0 filelock==3.12.2 flatbuffers==23.5.26 flit_core==3.9.0 fonttools==4.40.0 frozenlist==1.3.3 fsspec==2023.6.0 gitdb==4.0.10 GitPython==3.1.31 gql==3.4.1 graphql-core==3.2.3 hjson==3.1.0 huggingface-hub==0.16.4 humanfriendly==10.0 idna==2.8 importlib-metadata==6.8.0 iniconfig==2.0.0 ipython==8.14.0 isodate==0.6.1 jedi==0.18.2 Jinja2==3.1.2 jmespath==1.0.1 kiwisolver==1.4.4 lit==16.0.6 markdown-it-py==3.0.0 MarkupSafe==2.1.3 matplotlib==3.7.2 matplotlib-inline==0.1.6 mdurl==0.1.2 mosaicml==0.15.1 mosaicml-cli==0.4.15 mosaicml-streaming==0.5.1 mpmath==1.3.0 multidict==6.0.4 multiprocess==0.70.14 networkx==3.1 ninja==1.11.1 numpy==1.24.4 oci==2.105.0 omegaconf==2.3.0 onnx==1.14.0 onnxruntime==1.15.1 packaging==22.0 pandas==2.0.3 paramiko==3.2.0 parso==0.8.3 pathtools==0.1.2 pexpect==4.8.0 pickleshare==0.7.5 pillow
pluggy==1.2.0 prompt-toolkit==3.0.39 protobuf==4.23.4 psutil==5.9.5 ptyprocess==0.7.0 pure-eval==0.2.2 py-cpuinfo==9.0.0 pyarrow==12.0.1 pycparser==2.21 pydantic==1.10.12 Pygments==2.15.1 PyGObject==3.36.0 PyNaCl==1.5.0 pyOpenSSL==23.2.0 pyparsing==3.0.9 pytest==7.4.0 python-dateutil==2.8.2 python-snappy==0.6.1 pytorch-ranger==0.1.1 pytz==2023.3 PyYAML==6.0 questionary==1.10.0 regex==2023.6.3 requests==2.31.0 requests-unixsocket==0.2.0 responses==0.18.0 rich==13.4.2 ruamel.yaml==0.17.32 ruamel.yaml.clib==0.2.7 s3transfer==0.6.1 safetensors==0.3.1 sentencepiece==0.1.97 sentry-sdk==1.27.1 setproctitle==1.3.2 six==1.14.0 slack-sdk==3.21.3 smmap==5.0.0 stack-data==0.6.2 sympy==1.12 tabulate==0.9.0 tokenizers==0.13.3 tomli==2.0.1
torch-optimizer==0.3.0 torchdata==0.6.1 torchmetrics==0.11.4
tqdm==4.65.0 traitlets==5.9.0 transformers==4.30.2 triton==2.0.0 typing_extensions==4.6.3 tzdata==2023.3 urllib3==1.26.16 validators==0.20.0 wandb==0.15.5 wcwidth==0.2.6 websocket-client==1.6.1 websockets==10.4 xxhash==3.2.0 yarl==1.9.2 zipp==3.15.0 zstandard==0.21.0 zstd==1.5.5.1
conda create -n mpt python=3.10
conda activate mpt
pip install -r requirements.txt
git clone https://github.com/mosaicml/llm-foundry.git
cd llm-foundry
pip install llm-foundry
conda install -c "nvidia/label/cuda-11.8.0" cuda-toolkit
pip install -e ".[gpu]"
I got the above dependencies from the docker image, installed these, you may comment out some dependencies if you cant install them and get error, i did comment out some while installing but it was no issue and had nothing to do with the environment, then installed llm-foundry through pip, install Cuda toolkit 11.8 inside conda environment and thenpip install -e ".[gpu]
, now it works like charm. Hope its helpful. Cheers!
Hey, were you able to resolve this?
Closing due to inactivity. Please feel free to open a new issue if you are still encountering problems.
the command to launch the container is:
docker run --gpus all -it --name yjguo_mpt --ipc=host --privileged --network=host -e http_proxy=$http_proxy -e https_proxy=$https_proxy -v /home/yejun/yjguo/mpt:/workspace mosaicml/llm-foundry:2.0.1_cu118-latest
after login, I tried apt update but failed: