autonomousvision / stylegan-xl

[SIGGRAPH'22] StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets
MIT License
964 stars 112 forks source link

ResolvePackageNotFound on Windows #10

Closed Renaldas111 closed 2 years ago

Renaldas111 commented 2 years ago

As I read, all these not found packages should go under pip section with different formatting, it is some kind of a bug, I understand (https://github.com/datitran/object_detector_app/issues/41). This would be very time consuming to rename and reformat all these packages manually. Is there a way to make a requirements file for all platforms?

conda env create -f environment.yml Collecting package metadata (repodata.json): done Solving environment: failed

ResolvePackageNotFound:

xl-sr commented 2 years ago

I added a minimal environment, you can give it a try again

Renaldas111 commented 2 years ago

Ok, thanks! This time the environment installs with no problem, but the next problem was AssertionError: Torch not compiled with CUDA enabled. Solved it by conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch . Remember spending hours and getting the same problem when trying to install Stylegan and ProjectedGAN, so this time solved the problem in minutes. Would be nice if the proper version would be installed straight out of environment file. Finally, tried to run python train.py --outdir=I:/Out --cfg=stylegan3-t --data=I:/data256.zip --gpus 1 --batch=4 --mirror=1 --snap 10 --kimg 10000 --stem --syn_layers 10 --metrics=none The process stuck at Setting up PyTorch plugin "filtered_lrelu_plugin"... Maybe I was not patient enough, but waited for more than 5 minutes seemed for me reasonable amount of time to conclude something is wrong.

Finally, I just ran Stylegan XL from ProjectedGAN env I used for many months. It asked for several additional modules to install, and at least now I am at:

Setting up augmentation... Distributing across 1 GPUs... Setting up training phases... Exporting sample images... Initializing logs... Skipping tfevents export: No module named 'tensorboard' Training for 10000 kimg...

Setting up PyTorch plugin "upfirdn2d_plugin"... Done. tick 0 kimg 0.0 time 2m 08s sec/tick 12.8 sec/kimg 3199.28 maintenance 115.1 cpumem 4.98 gpumem 6.60 reserved 8.08 augment 0.000

xl-sr commented 2 years ago

great that it works for you now :) it's always tricky to make environments work everywhere. I'll leave the environment as is now and close this issue, but I'll refer to it in the README for Windows users.

huvers commented 2 years ago

Windows still gave me an issue with 'cudatoolkit=11.1', but adding 'conda-forge' to the channels in environment.yml solves it.

rimless commented 1 year ago

Ok, thanks! This time the environment installs with no problem, but the next problem was AssertionError: Torch not compiled with CUDA enabled. Solved it by conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch . Remember spending hours and getting the same problem when trying to install Stylegan and ProjectedGAN, so this time solved the problem in minutes. Would be nice if the proper version would be installed straight out of environment file. Finally, tried to run python train.py --outdir=I:/Out --cfg=stylegan3-t --data=I:/data256.zip --gpus 1 --batch=4 --mirror=1 --snap 10 --kimg 10000 --stem --syn_layers 10 --metrics=none The process stuck at Setting up PyTorch plugin "filtered_lrelu_plugin"... Maybe I was not patient enough, but waited for more than 5 minutes seemed for me reasonable amount of time to conclude something is wrong.

Finally, I just ran Stylegan XL from ProjectedGAN env I used for many months. It asked for several additional modules to install, and at least now I am at:

Setting up augmentation... Distributing across 1 GPUs... Setting up training phases... Exporting sample images... Initializing logs... Skipping tfevents export: No module named 'tensorboard' Training for 10000 kimg...

Setting up PyTorch plugin "upfirdn2d_plugin"... Done. tick 0 kimg 0.0 time 2m 08s sec/tick 12.8 sec/kimg 3199.28 maintenance 115.1 cpumem 4.98 gpumem 6.60 reserved 8.08 augment 0.000

I met the same problem, and don't know why. It will stuck for 30 mins and quit automatically since the timeout.