Open inkarar opened 2 years ago
@inkarar The best way to approach training the cascadetabnet is to go through mmdetection framework where we treat each table as an object. So, once you are able to get the correct training annotation format, it's pretty straightforward for training/inference here for table detection.
I haven't actually tried benchmarking it w.r.t the original paper (didn't get time to do it) but had done custom training with the custom dataset I had. It was reasonably good for detection (table only, no cells). The vanilla configuration did struggle with tables with very small heights (for instance, tables with a single row). So, I had to change the anchor box scale in the config and it worked. In fact, I used the exact change of scale to train the header region and it was pretty good.
So, I recommend you try mmdet first for training/inference with its train detector. After that it's pretty straightforward.
Need better and detailed step by step documentation on how to TRAIN CascadeTabNet on custom datasets to replicate the results produced by CascadeTabNet There are tons of issues related to training this model and mmdetection and the relative paths to config & datasets mentioned in code are a huge mess. Tree/Directory structure should have been mentioned and file names/paths should have been clearly specified.
Here's what I did:
_1. Annotated custom datasets using labeleme and converted these annotations to COCOjson and VOC format
Here's what I'm asking:
_1. Please include a readme file mentioned exact steps to follow to replicate results on custome datasets
The code written is neat and documented and the results achieved are commendable.
Help me out to replicate the results on custom dataset.
@DevashishPrasad @AyanGadpal @kshitijkapadni @ManishDV @francescoperessini @mhmd-azeez @akadirpamukcu @MrZilinXiao @mfproto @iiLaurens @NISH1001