facebookresearch / nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents
https://facebookresearch.github.io/nougat/
MIT License
8.43k stars 541 forks source link

Test dataset for benchmark evaluation #41

Open jzyustc opened 10 months ago

jzyustc commented 10 months ago

Thanks for your wonderful works!

As noted in your paper, there seems to be a lack of public benchmarks for academic documents. Would you kindly consider releasing your test dataset as a benchmark, allowing for comparative analysis?

jzyustc commented 10 months ago

Since you have graciously shared code for generating datasets from PDFs, I think it would be sufficient to release only the metadata, such as the URLs of your arXiv test set.