jpWang / LiLT

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
MIT License
335 stars, 40 forks

How can we use it for unstructured data? #38

Closed: Ajithbalakrishnan closed this issue 3 months ago

Ajithbalakrishnan commented 1 year ago

Hi Team, I have a requirement to extract information from unstructured data. I have seen how the model performs on structured data. How can we use it for unstructured data?

jpWang commented 3 months ago

Hi, the purpose of LiLT is to enhance the text model by utilizing layout information. If no structural/layout information is available, it is recommended to use a pre-trained plain-text model directly.
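
As a rough illustration of that recommendation, the sketch below routes a document to a layout-aware path only when per-word bounding boxes are actually present, and otherwise falls back to plain text. It is not part of the LiLT codebase: `choose_inputs` and `normalize_box` are hypothetical helper names, and the 0-1000 coordinate scaling is an assumption based on the convention commonly used by LayoutLM-style models, so check the documentation of the model you load.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1) in pixels


def normalize_box(box: Box, page_width: int, page_height: int) -> List[int]:
    """Scale a pixel box to a 0-1000 grid (assumed convention, see lead-in)."""
    x0, y0, x1, y1 = box
    return [
        int(1000 * x0 / page_width),
        int(1000 * y0 / page_height),
        int(1000 * x1 / page_width),
        int(1000 * y1 / page_height),
    ]


def choose_inputs(
    words: Sequence[str],
    boxes: Optional[Sequence[Box]] = None,
    page_size: Optional[Tuple[int, int]] = None,
) -> Dict[str, object]:
    """Pick the layout-aware route only if every word has a box.

    Hypothetical helper: returns a dict describing which kind of model
    the inputs are suitable for, plus the prepared inputs.
    """
    has_layout = (
        boxes is not None
        and page_size is not None
        and len(boxes) == len(words)
        and len(words) > 0
    )
    if has_layout:
        width, height = page_size
        return {
            "route": "layout",  # e.g. feed words + boxes to a layout-aware model
            "words": list(words),
            "boxes": [normalize_box(b, width, height) for b in boxes],
        }
    # No usable layout information: treat the document as plain text.
    return {"route": "text", "text": " ".join(words)}


if __name__ == "__main__":
    with_layout = choose_inputs(
        ["Invoice", "42"],
        boxes=[(0, 0, 100, 50), (100, 0, 200, 50)],
        page_size=(200, 100),
    )
    print(with_layout["route"], with_layout["boxes"])

    without_layout = choose_inputs(["plain", "running", "text"])
    print(without_layout["route"], without_layout["text"])
```

In the `"text"` case the joined string can be passed to any pre-trained plain-text model and its tokenizer; in the `"layout"` case the words and boxes would go to a layout-aware model such as LiLT.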