PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
https://paddlepaddle.github.io/PaddleOCR/
Apache License 2.0
43.55k stars 7.77k forks source link

训练V3版本的模型ch_PP-OCRv3_rec.yml报错,acc一直是0 #11902

Closed hds1999 closed 6 months ago

hds1999 commented 6 months ago

请提供下述完整信息以便快速定位问题/Please provide the following information to quickly locate the problem

Optimizer: name: Adam beta1: 0.9 beta2: 0.999 lr: name: Cosine learning_rate: 0.0001 warmup_epoch: 5 regularizer: name: L2 factor: 3.0e-05

Architecture: model_type: rec algorithm: SVTR_LCNet Transform: Backbone: name: MobileNetV1Enhance scale: 0.5 last_conv_stride: [1, 2] last_pool_type: avg last_pool_kernel_size: [2, 2] Head: name: MultiHead head_list:

Loss: name: MultiLoss loss_config_list:

PostProcess:
name: CTCLabelDecode

Metric: name: RecMetric main_indicator: acc ignore_space: False

Train: dataset: name: SimpleDataSet data_dir: ./train_data/ch_rec/ ext_op_transform_idx: 1 label_file_list:

GreatV commented 6 months ago

是不是把预训练模型载入错了

hds1999 commented 6 months ago

是不是把预训练模型载入错了 用的配置文件: https://github.com/PaddlePaddle/PaddleOCR/blob/main/configs/rec/PP-OCRv3/ch_PP-OCRv3_rec.yml 预训练模型: https://paddleocr.bj.bcebos.com/PP-OCRv3/chinese/ch_PP-OCRv3_rec_train.tar

TingquanGao commented 6 months ago

使用 https://github.com/PaddlePaddle/PaddleOCR/blob/main/configs/rec/PP-OCRv3/ch_PP-OCRv3_rec_distillation.yml 训练配置,或者: 参考文档 https://github.com/PaddlePaddle/PaddleOCR/blob/main/doc/doc_ch/knowledge_distillation.md#215-%E8%92%B8%E9%A6%8F%E6%A8%A1%E5%9E%8B%E5%BE%AE%E8%B0%83https://paddleocr.bj.bcebos.com/PP-OCRv3/chinese/ch_PP-OCRv3_rec_train.tar 中的student参数提取出来。

TingquanGao commented 6 months ago

长时间未回复,该issue已关闭,如仍有问题可以reopen或新开issue。

nissansz commented 4 months ago

使用 https://github.com/PaddlePaddle/PaddleOCR/blob/main/configs/rec/PP-OCRv3/ch_PP-OCRv3_rec_distillation.yml 训练配置,或者: 参考文档 https://github.com/PaddlePaddle/PaddleOCR/blob/main/doc/doc_ch/knowledge_distillation.md#215-%E8%92%B8%E9%A6%8F%E6%A8%A1%E5%9E%8B%E5%BE%AE%E8%B0%83https://paddleocr.bj.bcebos.com/PP-OCRv3/chinese/ch_PP-OCRv3_rec_train.tar 中的student参数提取出来。

哪个svtr 配置文件速度最快,准确率最高?