facebookresearch / nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents
https://facebookresearch.github.io/nougat/
MIT License
8.81k stars, 560 forks

Issue with Partial Detection of Pages by Nougat OCR #244

Open SaimonDahal-02 opened 1 week ago

SaimonDahal-02 commented 1 week ago

Some pages are not being fully detected by the Nougat OCR model. In many cases, only half of the content on a page is detected, while the rest is skipped. However, for other pages, the detection works perfectly fine.

Steps to Reproduce:

Answers Snippets to Papers_page-0008

 ```
 ## Answers (LC2020 HL, P2):
 1. \(0\); \(A\), \(B\) and \(C\) are collinear [0, 4, 7, 11, 15]
 2. \(33\cdot 435^{\circ}\)[0, 4, 7, 11, 15]
 3. \(9\)[0, 4, 7, 11, 15]
 4. \(x^{2}+y^{2}+4x-21=0\), \(x^{2}+y^{2}-8x-9=0\)[0, 4, 7, 11, 15]
 5. \(6\cdot 44\) m [0, 4, 7, 11, 15]
 6. \(k=9\)[0, 4, 7, 11, 15]
 7. \(\frac{5\pi}{3}\), \
 ```
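The snippet above stops in the middle of item 7, ending on a dangling backslash. As a rough way to find which pages are affected, the sketch below flags page outputs that appear to end mid-expression. It is only a heuristic under stated assumptions: `looks_truncated` is a hypothetical helper, not part of Nougat, and it assumes the page text uses `\(` ... `\)` for inline math, as in the output shown here.

```python
def looks_truncated(page_text: str) -> bool:
    """Heuristic check (hypothetical helper, not part of Nougat):
    return True if a page's output appears to stop mid-expression."""
    text = page_text.rstrip()
    if not text:
        # Assumption: an empty page output is treated as suspicious.
        return True

    # Inline math delimiters \( and \) should come in matching pairs.
    opens = text.count("\\(")
    closes = text.count("\\)")
    if opens != closes:
        return True

    # An odd number of trailing backslashes means a command was cut off
    # (an even number could be a legitimate LaTeX line break "\\").
    trailing = len(text) - len(text.rstrip("\\"))
    return trailing % 2 == 1


if __name__ == "__main__":
    last_line = "7. \\(\\frac{5\\pi}{3}\\), \\"   # final line of the snippet above
    complete_line = "6. \\(k=9\\)"
    print(looks_truncated(last_line))
    print(looks_truncated(complete_line))
```

Running this over each page's output would give a list of pages to compare against the source PDF. It cannot detect pages where whole, well-formed sections were skipped, so a clean result does not guarantee that a page is complete.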