Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
KeepKeys:
keep_keys: ['image', 'label'] # dataloader will return list in this order
loader:
shuffle: True
batch_size_per_card: 32 # [*] 512
drop_last: True
num_workers: 0
KeepKeys:
keep_keys: ['image', 'label'] # dataloader will return list in this order
loader:
shuffle: False
drop_last: False
batch_size_per_card: 32 # [*] 512
num_workers: 4
请提供下述完整信息以便快速定位问题/Please provide the following information to quickly locate the problem
系统环境/System Environment:linux, CUDA12.1, paddlepaddle-gpu 2.6.0
版本号/Version:Paddle: PaddleOCR: 问题相关组件/Related components:
What can I do for this problem? My data size -> trainimg: 270000, evalimg: 32 <- Is it too unbalanced?
cls.yml
Architecture: model_type: cls algorithm: CLS Transform: Backbone: name: MobileNetV3 scale: 0.35 model_name: small Neck: Head: name: ClsHead class_dim: 2
Loss: name: ClsLoss
Optimizer: name: Adam beta1: 0.9 beta2: 0.999 lr: name: Cosine learning_rate: 0.001 regularizer: name: 'L2' factor: 0
PostProcess: name: ClsPostProcess
Metric: name: ClsMetric main_indicator: acc
Train: dataset: name: SimpleDataSet data_dir: ./train_data/cls label_file_list:
Eval: dataset: name: SimpleDataSet data_dir: ./train_data/cls label_file_list: