can you provide visual question answering task code

shabie / docformer

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

MIT License

253 stars 40 forks source link

can you provide visual question answering task code #37

Open mayankpathaklumiq opened 2 years ago

uakarsh commented 2 years ago

Hi there, Sorry for the late reply. I am planning to implement DocFormer for Token Classification and Visual Question Answering. So, would be releasing it soon

Regards, Akarsh