the train was so slowly with my own dataset

MhLiao / TextBoxes_plusplus

TextBoxes++: A Single-Shot Oriented Scene Text Detector

Other

956 stars 281 forks source link

the train was so slowly with my own dataset #74

Open qianxuyidian-2018 opened 6 years ago

qianxuyidian-2018 commented 6 years ago

hi, @MhLiao I prepared my own dataset, there were 500 720*1280 photos, I marked these pictures, and then trained. I used 6 NVIDIA K80 GPUs, but the training speed is very slow, 10 iterations take an hour, I don't Know what the reason is, can you give me some suggestion?

The annotation format is as follows:

The log of the training is as follows:

qianxuyidian-2018 commented 6 years ago

Well, if i only use one GPU (gpus="0"), the training speed is normal, 10 iterations take 1 minute. If I set (gpus = "0,1,2,3,4,5") or (gpus = "2,3") or other combinations, the speed will become very slow. I donot know what wrong with muti-GPU.

MhLiao commented 6 years ago

The images in ICDAR 2015 are the same resolution as yours. Have you tried it?

qianxuyidian-2018 commented 6 years ago

Thank you for your reply, I solved this problem I reinstalled the nvidia k80 driver, if I set gpus=“0,2” or gpus=“0,2,4”, the speed is normal, if I set gpus = "0,1" or gpus="2,3" or something else, the speed becomes very slow, I think it may be because each K80 card has two cores. I have three K80 cards, so i can see 6 GPU IDs through nvidia-smi. If I use two GPU ID on one K80, like "2,3".., the speed will be slower.

425183525 commented 6 years ago

hello，could you tell me how use my prepared dataset. @qianxuyidian-2018 .please,tell me details.Thank you

qianxuyidian-2018 commented 6 years ago

you can perpare you dataset as PASCAL VOC format. you can find some overview of PASCAL VOC on network first.Then train you dataset as described in README.md. Besides,You should make some changes to these scripts to suit your project.,so,you should understand these scripts. Because I am in a closed work environment, although I can browse the web,but I can't send the code and scripts directly outside.

425183525 commented 5 years ago

Hello, I trained 1600 images with the size of 1280 x 720. With 1 GPU (GTX 1080), it took 2 hours to iterate 100 times. I've tried multiple gpus, different combinations, and GPU training is still slow. Can you tell me your thoughts on using multiple gpus to speed up training? Thank you very much. And my gpu-util is always 0, but occupies GPU memory.

cainiaojy commented 5 years ago

I have the same problem@425183525 , do you solve it?

mrlihellohorld commented 4 years ago

I have the same problem@425183525 , do you solve it?

I have the same problem,did you solve it?