PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
https://paddlepaddle.github.io/PaddleOCR/
Apache License 2.0
43.55k stars 7.77k forks source link

ocr识别,单一的ctc-head输出,使用MultiScaleSampler,精度一直为0,loss为nanxxx,使用SimpleDataSet精度正常 #10936

Closed zhangxiaopang88 closed 4 months ago

zhangxiaopang88 commented 1 year ago

Architecture: model_type: rec algorithm: SVTR Transform: Backbone: name: PPLCNetV3 scale: 0.95 Neck: name: SequenceEncoder encoder_type: reshape Head: name: CTCHead

fc_decay: 0.00001

Loss: name: CTCLoss

PostProcess:
name: CTCLabelDecode

Metric: name: RecMetric main_indicator: acc

Train: dataset: name: MultiScaleDataSet ds_width: false data_dir: ./ ext_op_transform_idx: 1 label_file_list:

log一直是这样 ppocr INFO: epoch: [1/200], global_step: 1580, lr: 0.000004, acc: 0.000000, norm_edit_dis: 0.000000, loss: nanxxx, avg_reader_cost: 0.00013 s, avg_batch_cost: 0.32655 s, avg_samples: 147.2, ips: 450.77714 samples/s, eta: 6 days, 4:14:47

IbrarBabar009 commented 1 year ago

is your problem solved?