ZJULearning / pixel_link

Implementation of our paper 'PixelLink: Detecting Scene Text via Instance Segmentation' in AAAI2018
MIT License
768 stars 254 forks source link

Train on RCTW dataset #84

Open Jacqueline720 opened 5 years ago

Jacqueline720 commented 5 years ago

Has anybody trained pixel_link on the RCTW-17 from scratch? When I trained this dataset on pixel_link,the model can not converge:when I adjust learning rate as 1e-2,the loss goes to nan,while the loss goes to 0 when learning rate = 1e-3. Hope anybody have any idea about this problem~

HoracceFeng commented 5 years ago

Hi @Jacqueline720, have you solved this problem? In the source code, the image will be cropped to 512 while the images in RCTW dataset are dramatically larger, like 2000-3000 per edge. I have done some modification to make the whole image resize and put into training. But then I get the 'nan loss' once lr=1e-3.

Hi author @dengdan, would you give me some suggestions? Thx

HoracceFeng commented 5 years ago

Problem Solved. Many Questions are actually because of the TFrecords converter and mismatch of tf.summary.image(bounding_box). Will do a pipeline code for text segmentation later.

yhpwsid commented 5 years ago

Hi @Jacqueline720, have you solved this problem? In the source code, the image will be cropped to 512 while the images in RCTW dataset are dramatically larger, like 2000-3000 per edge. I have done some modification to make the whole image resize and put into training. But then I get the 'nan loss' once lr=1e-3.

Hi author @dengdan, would you give me some suggestions? Thx does the loss have anything to do with tf.summary.image? how did you solve the loss problem? thanks

xiiiiiiii commented 5 years ago

Has anybody trained pixel_link on the RCTW-17 from scratch? When I trained this dataset on pixel_link,the model can not converge:when I adjust learning rate as 1e-2,the loss goes to nan,while the loss goes to 0 when learning rate = 1e-3. Hope anybody have any idea about this problem~

你好,想问一下你在RCTW-17数据集上准确率召回率是多少,表现如何