Open IEWbgfnYDwHRoRRSKtkdyMDUzgdwuBYgDKtDJWd opened 4 years ago
can you post the error you‘re getting.
Seeing faces for your first fake is correct. This is using transfer learning (you can look it up on my youtube page and learn more about the technique there). Those faces get erased completely after a handful of ticks.
Thank you very much for the great tutorial my GPU is running out of memory and failing in the beginning, appreciate if you take a look at the error bellow
her is my error `2020-05-20 11:18:44.904340: W tensorflow/core/common_runtime/bfc_allocator.cc:314] Allocator (GPU_0_bfc) ran out of memory trying to allocate 36.00MiB (rounded to 37748736). Current allocation summary follows. 2020-05-20 11:18:44.914558: W tensorflow/core/common_runtime/bfc_allocator.cc:319] ****x*x 2020-05-20 11:18:44.919358: W tensorflow/core/framework/op_kernel.cc:1502] OP_REQUIRES failed at transpose_op.cc:199 : Resource exhausted: OOM when allocating tensor with shape[3,3,512,4,512] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc Traceback (most recent call last): File "C:\ProgramData\Anaconda3\envs\old_tensorflow\lib\site-packages\tensorflow\python\client\session.py", line 1356, in _do_call return fn(args) File "C:\ProgramData\Anaconda3\envs\old_tensorflow\lib\site-packages\tensorflow\python\client\session.py", line 1341, in _run_fn options, feed_dict, fetch_list, target_list, run_metadata) File "C:\ProgramData\Anaconda3\envs\old_tensorflow\lib\site-packages\tensorflow\python\client\session.py", line 1429, in _call_tf_sessionrun run_metadata) tensorflow.python.framework.errors_impl.ResourceExhaustedError: OOM when allocating tensor with shape[4,3,3,128,128] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc [[{{node GPU0/G_loss/G/G_synthesis/128x128/Conv1/Square}}]] Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "run_training.py", line 192, in
Errors may have originated from an input operation. Input Source operations connected to node GPU0/G_loss/G/G_synthesis/128x128/Conv1/Square: GPU0/G_loss/G/G_synthesis/128x128/Conv1/mul_3 (defined at C:\Users\USER6459\Documents\python\stylegan2\training\networks_stylegan2.py:100)
Original stack trace for 'GPU0/G_loss/G/G_synthesis/128x128/Conv1/Square':
File "run_training.py", line 192, in
can you tell me what GPU you’re using, the CUDA version you’re running and what NVIDIA driver is running?
I use workstation RTX 2080 Ti 11GB X1 and Ram is 128GM CUDA version is 10.0 and even if I reduce batch size it still fails due to space lack my dataset is 500 pictures 1024x1024 i tried bigger data set and smaller dataset and tried png, jpg, I tried all config avilable (a , b , c ,d ,e , f) I tried running it on another computer with GTX 1060 and cuda fails I tried 512 by 512 image dataset I tried 256 x 256 image dataset
< ERROR CUDA RUN OUT OF MEMORY > please note, I am trying to build my own models on building shapes not human faces thank you in advance
First of all, much thanks and appreciation for your repo, your script for GCP setup worked like a charm.
Only issue is when I try to train a new model using a custom dataset, it errors about 20 minutes after the first tick. also seems to have initial sample fake outputs as human faces (my dataset isnt faces). Unsure if this is normal or if I am doing something wrong.