facebookresearch / nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents
https://facebookresearch.github.io/nougat/
MIT License
8.81k stars 561 forks source link

How to create custom data set for nougat training? #131

Open imrankh46 opened 1 year ago

imrankh46 commented 1 year ago

Hi, try different approach but not be able to create a custom dataset for training. Can some one give me example notebook.

HarshadDolas07 commented 8 months ago

Hi, try different approach but not be able to create a custom dataset for training. Can some one give me example notebook.

hello, did find anything about how to prepare data?

imrankh46 commented 8 months ago

Hi, try different approach but not be able to create a custom dataset for training. Can some one give me example notebook.

hello, did find anything about how to prepare data?

Nop🤢

AhmadHakami commented 7 months ago

Hi, try different approach but not be able to create a custom dataset for training. Can some one give me example notebook.

you can try many tools for generating data for finetuning just search for ocr synthetic tools to find many relevant options i have tried some of these tools and think TRDG is good enough for this task especially in latin languages

good luck :)