johnning2333 / M2Doc

28 stars 0 forks source link

yolov8 #3

Closed sharonjunjun closed 3 months ago

sharonjunjun commented 4 months ago

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

johnning2333 commented 4 months ago

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector.

Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

carlos-vinicios commented 4 months ago

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector.

Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

And the pretrained weights for the best model. You will provide too?

johnning2333 commented 4 months ago

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector. Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

And the pretrained weights for the best model. You will provide too?

yes, we will also provide the pretrained weight of DINO and Cascade Mask R-CNN with m2doc on DocLayNet.

sharonjunjun commented 4 months ago

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector.

Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

ok. if i want to use DocLayNet dataset training, i should download DocLayNet_core and DocLayNet_extra get the object boundingbox and text boundingbox?thanks.

johnning2333 commented 4 months ago

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector. Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

ok. if i want to use DocLayNet dataset training, i should download DocLayNet_core and DocLayNet_extra get the object boundingbox and text boundingbox?thanks.

yes

sharonjunjun commented 4 months ago

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector. Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

ok. if i want to use DocLayNet dataset training, i should download DocLayNet_core and DocLayNet_extra get the object boundingbox and text boundingbox?thanks.

yes

ok,the inference script by onnx model will release and how long is the inference time? Can the model be deployed on triton?

johnning2333 commented 4 months ago

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector. Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

ok. if i want to use DocLayNet dataset training, i should download DocLayNet_core and DocLayNet_extra get the object boundingbox and text boundingbox?thanks.

yes

ok,the inference script by onnx model will release and how long is the inference time? Can the model be deployed on triton?

Thank you for your interest in our work. We currently do not have plans to release versions for ONNX or Triton

johnning2333 commented 3 months ago

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector.

Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

Sorry about the delay of the dataset format converting script. We are now provide the ocr_anno_convert.py to format doclaynet ocr annotations, and we upload 3 test samples for illustration.