Layout-Parser / layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis
https://layout-parser.github.io/
Apache License 2.0
4.78k stars 459 forks source link

Can we use Yolov5 or other model to do DLA? #40

Closed dagongji10 closed 3 years ago

dagongji10 commented 3 years ago

LayoutParser used Faster-RCNN and MaskRCNN to do DLA, but the model size are over 300M, and hard to do inference in android. Maybe DLA is similar with object-detection, so can we use yolov5/tiny-yolo to replace Faster-RCNN/MaskRCNN?

lolipopshock commented 3 years ago

54 this is an even better version of yolo, though it is based on paddledetection - You might need to install extra dependencies to have it running. And I'll have it updated in the next few PRs.