how to add new lang + how do you put it on web without html,css,js ?

oke hear me out:

Add the language with the correct abbreviation in the app.py, for example dutch language: 'Dutch': 'nld', is added on line 28. you can find the correct combinations at https://tesseract-ocr.github.io/tessdoc/Data-Files-in-different-versions.html
Then download the datafile for the language (dutch in my case) from the following github page: https://github.com/tesseract-ocr/tessdata/blob/4.1.0/nld.traineddata and put it in /usr/share/tesseract/tessdata:

ls /usr/share/tesseract/tessdata/
configs  eng.traineddata  nld.traineddata  tessconfigs

Done :)

nainiayoub / pdf-text-data-extractor