PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
https://paddlepaddle.github.io/PaddleOCR/
Apache License 2.0
44.26k stars, 7.82k forks

Training with the RARE model converges very slowly, and prediction accuracy is very low #9535

Closed: kano201 closed this issue 1 year ago

kano201 commented 1 year ago

I am training the RARE model for Mongolian text recognition. The only parts of the config file I modified are:

Optimizer:
  name: Adam
  beta1: 0.9
  beta2: 0.999
  lr:
    learning_rate: 0.0005
  regularizer:
    name: 'L2'
    factor: 0.00001

Architecture:
  model_type: rec
  algorithm: RARE
  Transform:
    name: TPS
    num_fiducial: 20
    loc_lr: 0.1
    model_name: small
  Backbone:
    name: MobileNetV3
    scale: 0.5
    model_name: large
  Neck:
    name: SequenceEncoder
    encoder_type: rnn
    hidden_size: 96
  Head:
    name: AttentionHead
    hidden_size: 96

Loss:
  name: AttentionLoss

PostProcess:
  name: AttnLabelDecode

Metric:
  name: RecMetric
  main_indicator: acc

Train:
  dataset:
    name: SimpleDataSet
    data_dir: ./train_data/
    label_file_list:

Eval:
  dataset:
    name: SimpleDataSet
    data_dir: ./train_data/
    label_file_list:

an1018 commented 1 year ago

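You changed the image width to 2000. With a larger input size, training gets slower and the results get worse. Check whether the images become too blurry when stretched to this size, [3, 32, 2000], and whether the image size used during inference was changed to match.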
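The two checks suggested above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it assumes the image is stretched directly to the target shape, as the comment describes, and it assumes the inference shape is given as a comma-separated string such as "3, 32, 2000". The helper names `stretch_report` and `shapes_match` and the sample image size of 400x64 are hypothetical and do not come from PaddleOCR.

```python
def stretch_report(orig_w, orig_h, target_shape=(3, 32, 2000)):
    """Estimate how a direct resize to target_shape (C, H, W) scales an image."""
    _, target_h, target_w = target_shape
    scale_h = target_h / orig_h  # vertical scaling factor
    scale_w = target_w / orig_w  # horizontal scaling factor
    # A ratio far from 1 means the text is stretched unevenly,
    # which is one way the resized image can end up looking blurry.
    return {
        "scale_h": scale_h,
        "scale_w": scale_w,
        "distortion": scale_w / scale_h,
    }


def shapes_match(train_shape, infer_shape_arg):
    """Compare the training shape with a comma-separated inference shape string."""
    infer_shape = [int(v) for v in infer_shape_arg.split(",")]
    return list(train_shape) == infer_shape


# Hypothetical sample: a 400x64 crop resized to [3, 32, 2000].
report = stretch_report(orig_w=400, orig_h=64)
print(report)

# Hypothetical check that inference uses the same shape as training.
print(shapes_match((3, 32, 2000), "3, 32, 2000"))
print(shapes_match((3, 32, 2000), "3, 32, 100"))
```

Running `stretch_report` over a sample of the training images gives a rough sense of how many of them are upscaled heavily in width, and `shapes_match` flags the case where inference still uses a narrower shape than training.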

github-actions[bot] commented 1 year ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.