da03 / Attention-OCR

Visual Attention based OCR
MIT License
1.11k stars 362 forks source link

Issue with GPU training (Windows) #84

Open gagan144 opened 6 years ago

gagan144 commented 6 years ago

Hi, I am training on GPU and the loss seems to suddenly shoot up when reaching to certain steps and it then never decreases. However, if I train it only on CPU only, the training is perfectly fine with nicely decreasing loss values. I have repeated this several times both on GPU & CPU and surprisingly the loss always shoots at the same step no ~800.

My hardware specs are: Windows 10 (64-bit) GPU: Nvidia GTX 1050 - 4GB; Also tried on Nvidia GTX 1060 with max-Q design - 6GB RAM: 16 GB Processor: i7 7th Generation Tensorflow: tensorflow-gpu==1.8.0

Following are the loss graphs: With GPU aocr_gpu

With CPU acor_cpu

I have tried:

It is something to do with OS. Is this code compatible with linux only as far as GPU is concerned?

Thanks!