feizc / FluxMusic

Text-to-Music Generation with Rectified Flow Transformers
Other
1.61k stars 122 forks source link

requirements.txt #15

Open eftSharptooth opened 2 months ago

eftSharptooth commented 2 months ago

I think it should be considered to have a requirements file, as a couple libraries need to be pinned to specific versions to function (mostly for training so far), whereas the current method installs (sometimes incompatible) newest versions. The one that specifically brings this to mind is librosa, which must be pinned around 0.9.2 instead of 0.10 due to the changing of the number of required arguments in 0.10. This causes the training to fail to start.

pip install librosa==0.9.2 fixes the error, just putting it on here for anyone else running into the issue.

eftSharptooth commented 2 months ago

So far I have as requirements the following: nvcc 12.1 or better for windows.

requirements.txt: torch>=2.0.0 torchvision einops fire>=0.6.0 huggingface-hub safetensors sentencepiece transformers tokenizers protobuf requests invisible-watermark diffusers einops diffusers transformers scipy numpy regex tqdm accelerate soundfile unidecode phonemizer torchlibrosa ftfy pandas timm matplotlib thop flash-attn==2.6.3 sentencepiece Pillow #Only required for creating your own datasets progressbar librosa==0.9.2 #required for audiollm, which apparently does not use torchlibrosa

EnzoVLLC commented 2 months ago

respeto maximo :P