Layout-Parser / layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis
https://layout-parser.github.io/
Apache License 2.0
4.75k stars 456 forks source link

Is there any way to preserve heading and content together. #92

Open SAIVENKATARAJU opened 2 years ago

SAIVENKATARAJU commented 2 years ago

Hi,

Thanks for your library. I have below screenshot with headers and text, is there any way to get the heading together with text. output

lolipopshock commented 2 years ago

Hi @SAIVENKATARAJU !

Could you please provide more context to your questions?

  1. Is it a PDF or just scanned document?
  2. What do you mean by "get the heading together with text"? You mean you'd like to get structured data like the following?
    [
        {
            "heading": "..",
            "text": "..."
        }, ...
    ]
SAIVENKATARAJU commented 2 years ago

Hi @lolipopshock Thanks for your reply. My documents are PDF's. and yes I just want like you mentioned above.

ciepielajan commented 2 years ago

@SAIVENKATARAJU I have the same case :) Somebody solve this problem ? @lolipopshock