GXYM / TextBPN-Plus-Plus

Arbitrary Shape Text Detection via Boundary Transformer;The paper at: https://arxiv.org/abs/2205.05320, which has been accepted by IEEE Transactions on Multimedia (T-MM 2023).
172 stars 37 forks source link

About the training time #16

Closed lcy0604 closed 1 year ago

lcy0604 commented 1 year ago

Thanks for your excellent work. I have trained your TextBPN++ on my scene text dataset with a single 3090, but I found the training time is too long. Our data is about 9000 and the batch size is set to 8. I adjust the input size to 800. How much time do you spend on training?

GXYM commented 1 year ago

Thanks for your excellent work. I have trained your TextBPN++ on my scene text dataset with a single 3090, but I found the training time is too long. Our data is about 9000 and the batch size is set to 8. I adjust the input size to 800. How much time do you spend on training?

In training, I set the the input size to 640 with batch size is set to 12, which can make training time is shorter. But in this way, it also takes about a week of training on MLT17 (9000 imgs) with a single 3090. The training time is mainly limited by the computing ability of the CPU, because of the data augmentation algorithm. Therefore, it is recommended to use stronger cups and as many threads as possible, as memory allows.

lcy0604 commented 1 year ago

Thanks for your excellent work. I have trained your TextBPN++ on my scene text dataset with a single 3090, but I found the training time is too long. Our data is about 9000 and the batch size is set to 8. I adjust the input size to 800. How much time do you spend on training?

In training, I set the the input size to 640 with batch size is set to 12, which can make training time is shorter. But in this way, it also takes about a week of training on MLT17 (9000 imgs) with a single 3090. The training time is mainly limited by the computing ability of the CPU, because of the data augmentation algorithm. Therefore, it is recommended to use stronger cups and as many threads as possible, as memory allows.

Thank you for your reply

123cjjjj commented 1 year ago

Thanks for your excellent work. I have trained your TextBPN++ on my scene text dataset with a single 3090, but I found the training time is too long. Our data is about 9000 and the batch size is set to 8. I adjust the input size to 800. How much time do you spend on training?

In training, I set the the input size to 640 with batch size is set to 12, which can make training time is shorter. But in this way, it also takes about a week of training on MLT17 (9000 imgs) with a single 3090. The training time is mainly limited by the computing ability of the CPU, because of the data augmentation algorithm. Therefore, it is recommended to use stronger cups and as many threads as possible, as memory allows.

Thank you for your reply

您好,3090环境下需要修改项目中的哪些参数啊?与3080环境训练的结果相差较大