Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)
MIT License
253
stars
40
forks
source link
can you provide visual question answering task code #37
Hi there,
Sorry for the late reply. I am planning to implement DocFormer for Token Classification and Visual Question Answering. So, would be releasing it soon
Hi there, Sorry for the late reply. I am planning to implement DocFormer for Token Classification and Visual Question Answering. So, would be releasing it soon
Regards, Akarsh