Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)
@shabie , I am planning to do it :). Quite excited to do it. as far as I see, more of the details is in pre-training, so to release weights, there would be need of some compute resources
Hi @shabie! Are there any plans to release the DocFormerv2 soon? Great work! Thanks!