breezedeus / CnSTD

CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包
https://www.breezedeus.com/article/cnocr
Apache License 2.0
653 stars 101 forks source link

layout yolov7 support #63

Open Shadow-Alex opened 1 year ago

Shadow-Alex commented 1 year ago

I am excited to share that I have trained a yolov7 model on the CDLA dataset, adding a small amount of self-labeled documents. I have adjusted the default training input to 1280x1280, as opposed to the standard 640. This modification has improved the model's performance on standard documents, particularly when the document's output dpi is set above 150. I am thrilled to contribute this model to your open-source community and hope it will be a valuable addition to your model zoo.

breezedeus commented 1 year ago

Thanks. That's great. Would you mind submitting a PR? Maybe need some text to describe what kind of pictures and scenes your model is good at.

Ontheroad123 commented 10 months ago

I am excited to share that I have trained a yolov7 model on the CDLA dataset, adding a small amount of self-labeled documents. I have adjusted the default training input to 1280x1280, as opposed to the standard 640. This modification has improved the model's performance on standard documents, particularly when the document's output dpi is set above 150. I am thrilled to contribute this model to your open-source community and hope it will be a valuable addition to your model zoo.

Could you please contribute yolov7 model to this open-source community, your model's input size may be more suit for my tasks.