facebookresearch / nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents
https://facebookresearch.github.io/nougat/
MIT License
8.98k stars 567 forks source link

pictures #73

Closed shuishu closed 1 year ago

shuishu commented 1 year ago

Can I keep pictures after conversion?

lukas-blecher commented 1 year ago

Figures are generally ignored. However you can detect them using pdffigures2 and then match the caption to the one transcribed by nougat.