Open jianwang-ntu opened 1 month ago
The high frequently system library is cudatoolkit, anaconda, and some python library like pip install pytorch, transformers, datasets
python -m torch.utils.collect_env
- if possible, please share a tutorial script in the user home folder, on how to finetune with Lora in a toy dataset.
Explain why current not satisfy
Lots of AI developers focus on GPU tasks and want to easily train their jobs, such as simply modifying little lines of code and starting it to train.
So, quick and easy using is essential. I suggest an end-to-end docker image that includes some high-rated libraries in this docker image, the user does not need to install it by themselves.
For example, The installation of Cudatoolkit and Anaconda took a long time (40 minutes), this should be avoided if integrated into a docker image rather than an os image.
How to reproduce
Error tips
Recommendation resolving idea
please build the docker image from nvidia/cuda:12.4.1-cudnn-devel-ubuntu22.04