The model architecture of Table-MASTER is different from the official solution report on arxiv.

JiaquanYe / TableMASTER-mmocr

2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.

Apache License 2.0

410 stars 100 forks source link

The model architecture of Table-MASTER is different from the official solution report on arxiv. #67

Open amish1706 opened 1 year ago

amish1706 commented 1 year ago

According to arxiv.org/pdf/2105.01848.pdf, the shared decoder has only one layer and then the bbox and structure decoders have "n" layers. It specifically mentions (as shown below) that they split the decoders after the first layer itself rather than the last layer as in the code here.

Correct me if I'm looking at the wrong code file.

delveintodetail commented 7 months ago

Only a slight difference, all we tried, the results were similar.