jinz2014 closed this 1 week ago
The installation is finicky. I had to mess around with it for a while in a fresh env to make it work. Good luck.
What is your version of torchtune? The module that is failing to import was moved to torchtune.training
a couple releases ago, so you may have an outdated version of the library.
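A quick way to answer the version question is to ask the active environment directly. The snippet below is only a sketch: the helper names `installed_version` and `has_attribute` are made up for illustration and are not part of torchtune. It relies solely on the standard library, so it runs even when torchtune is missing or broken.

```python
import importlib
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def has_attribute(module_name: str, attribute: str) -> bool:
    """Return True if the module imports cleanly and exposes the attribute."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # Covers ModuleNotFoundError as well as import failures inside the package.
        return False
    return hasattr(module, attribute)


if __name__ == "__main__":
    print("torchtune version:", installed_version("torchtune"))
    # The thread says the checkpointer now lives in torchtune.training.
    print(
        "torchtune.training.FullModelHFCheckpointer available:",
        has_attribute("torchtune.training", "FullModelHFCheckpointer"),
    )
```

If the second line prints False while a recent version is reported, the environment running the command is probably not the one that pip installed into.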
Since you've installed the rocm version of pytorch you may need to do something similar for torchtune:
pip install --pre torchtune --extra-index-url https://download.pytorch.org/whl/nightly/rocm6.2 --no-cache-dir
Thank you for your comments and suggestions.
pip uninstall torchtune
Found existing installation: torchtune 0.3.1
Uninstalling torchtune-0.3.1:
pip install --pre torchtune --extra-index-url https://download.pytorch.org/whl/nightly/rocm6.2 --no-cache-dir
Successfully installed torchtune-0.4.0.dev20241112+rocm6.2
I created a fresh env, but the error is still there.
Hey @jinz2014 - could you share the config you're using? checkpointer._component_ in your config should be torchtune.training.FullModelHFCheckpointer.
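To make the suggested change concrete, here is a minimal sketch that applies it to a config already loaded as a plain dict. The function name `fix_checkpointer_component` and the sample dict are hypothetical; the only facts taken from this thread are the old path from the error message (torchtune.utils.FullModelHFCheckpointer) and the new one (torchtune.training.FullModelHFCheckpointer). It deliberately touches only the checkpointer entry, since nothing here says that every torchtune.utils name was moved.

```python
import copy

OLD_PREFIX = "torchtune.utils."
NEW_PREFIX = "torchtune.training."


def fix_checkpointer_component(config: dict) -> dict:
    """Return a copy of config whose checkpointer._component_ uses the new module path."""
    updated = copy.deepcopy(config)  # leave the caller's dict untouched
    checkpointer = updated.get("checkpointer", {})
    component = checkpointer.get("_component_", "")
    if component.startswith(OLD_PREFIX):
        checkpointer["_component_"] = NEW_PREFIX + component[len(OLD_PREFIX):]
    return updated


# Hypothetical config fragment, reduced to the one key discussed in this thread.
old_config = {
    "checkpointer": {"_component_": "torchtune.utils.FullModelHFCheckpointer"}
}
new_config = fix_checkpointer_component(old_config)
print(new_config["checkpointer"]["_component_"])
```

In practice, editing that one line in the YAML config by hand does the same thing; the code only spells out which key changes and how.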
@SalmanMohammadi Thanks for the solution.
The command to install the tools:
pip3 install --pre --upgrade torch torchvision torchaudio torchao --index-url https://download.pytorch.org/whl/nightly/rocm6.2
Running torchtune shows the following error:
ModuleNotFoundError: No module named 'torchtune.utils.FullModelHFCheckpointer'