shabie / docformer

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)
MIT License
253 stars 40 forks source link

Organized the structure for using DocFormer #47

Closed uakarsh closed 1 year ago

uakarsh commented 1 year ago
  1. Added the pre-training task's code for IDL Dataset (https://github.com/furkanbiten/idl_data)
  2. Organized the folder for using DocFormer in examples folder