shabie / docformer

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer-based architecture for the task of Visual Document Understanding (VDU)
MIT License
255 stars, 40 forks

DocFormerv2 #56

Open. shubhamagarwal92 opened this issue 8 months ago

shubhamagarwal92 commented 8 months ago

Hi @shabie! Are there any plans to release DocFormerv2 soon? Great work! Thanks!

uakarsh commented 8 months ago

@shabie, I am planning to do it :). Quite excited about it. As far as I can see, most of the details are in the pre-training, so releasing weights would require some compute resources.