Closed Renaldas111 closed 2 years ago
I added a minimal environment, you can give it a try again
Ok, thanks! This time the environment installs with no problem, but the next problem was AssertionError: Torch not compiled with CUDA enabled. Solved it by conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch . Remember spending hours and getting the same problem when trying to install Stylegan and ProjectedGAN, so this time solved the problem in minutes. Would be nice if the proper version would be installed straight out of environment file. Finally, tried to run python train.py --outdir=I:/Out --cfg=stylegan3-t --data=I:/data256.zip --gpus 1 --batch=4 --mirror=1 --snap 10 --kimg 10000 --stem --syn_layers 10 --metrics=none The process stuck at Setting up PyTorch plugin "filtered_lrelu_plugin"... Maybe I was not patient enough, but waited for more than 5 minutes seemed for me reasonable amount of time to conclude something is wrong.
Finally, I just ran Stylegan XL from ProjectedGAN env I used for many months. It asked for several additional modules to install, and at least now I am at:
Setting up augmentation... Distributing across 1 GPUs... Setting up training phases... Exporting sample images... Initializing logs... Skipping tfevents export: No module named 'tensorboard' Training for 10000 kimg...
Setting up PyTorch plugin "upfirdn2d_plugin"... Done. tick 0 kimg 0.0 time 2m 08s sec/tick 12.8 sec/kimg 3199.28 maintenance 115.1 cpumem 4.98 gpumem 6.60 reserved 8.08 augment 0.000
great that it works for you now :) it's always tricky to make environments work everywhere. I'll leave the environment as is now and close this issue, but I'll refer to it in the README for Windows users.
Windows still gave me an issue with 'cudatoolkit=11.1', but adding 'conda-forge' to the channels in environment.yml solves it.
Ok, thanks! This time the environment installs with no problem, but the next problem was AssertionError: Torch not compiled with CUDA enabled. Solved it by conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch . Remember spending hours and getting the same problem when trying to install Stylegan and ProjectedGAN, so this time solved the problem in minutes. Would be nice if the proper version would be installed straight out of environment file. Finally, tried to run python train.py --outdir=I:/Out --cfg=stylegan3-t --data=I:/data256.zip --gpus 1 --batch=4 --mirror=1 --snap 10 --kimg 10000 --stem --syn_layers 10 --metrics=none The process stuck at Setting up PyTorch plugin "filtered_lrelu_plugin"... Maybe I was not patient enough, but waited for more than 5 minutes seemed for me reasonable amount of time to conclude something is wrong.
Finally, I just ran Stylegan XL from ProjectedGAN env I used for many months. It asked for several additional modules to install, and at least now I am at:
Setting up augmentation... Distributing across 1 GPUs... Setting up training phases... Exporting sample images... Initializing logs... Skipping tfevents export: No module named 'tensorboard' Training for 10000 kimg...
Setting up PyTorch plugin "upfirdn2d_plugin"... Done. tick 0 kimg 0.0 time 2m 08s sec/tick 12.8 sec/kimg 3199.28 maintenance 115.1 cpumem 4.98 gpumem 6.60 reserved 8.08 augment 0.000
I met the same problem, and don't know why. It will stuck for 30 mins and quit automatically since the timeout.
As I read, all these not found packages should go under pip section with different formatting, it is some kind of a bug, I understand (https://github.com/datitran/object_detector_app/issues/41). This would be very time consuming to rename and reformat all these packages manually. Is there a way to make a requirements file for all platforms?
conda env create -f environment.yml Collecting package metadata (repodata.json): done Solving environment: failed
ResolvePackageNotFound: