Open Anish-M-code opened 2 years ago
@Anish-M-code I can do it, but I think I might need your help!
Sure @pravincoder, feel free to contribute and open a pull request. I can provide guidance if you need any.
Hi! The package currently uses Tesseract for OCR. For multi-language support, could I introduce an ONNX model as an alternative to Tesseract? With the models bundled in the package, it would work on both Windows and Linux without requiring Tesseract to be installed on the system.
Sure @chirag4862, we can try other models as well.
Currently Pdftotext only supports English. Potential contributors may try to add non-English languages, simplify installation and uninstallation of additional language packs, and add code to support these features on both Linux and Windows.
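
For anyone picking this up, here is a rough sketch of one possible starting point, not how Pdftotext currently works. It assumes the Tesseract command-line tool stays as the OCR backend and relies only on its `--list-langs` and `-l` options. The function names (`parse_list_langs`, `installed_languages`, `build_ocr_command`) are made up for illustration and do not exist in the project.

```python
import shutil
import subprocess


def parse_list_langs(output):
    """Extract language codes from the text printed by `tesseract --list-langs`."""
    langs = []
    for line in output.splitlines():
        line = line.strip()
        # Skip blank lines and the "List of available languages ..." header.
        if not line or line.lower().startswith("list of"):
            continue
        langs.append(line)
    return langs


def installed_languages():
    """Return the language packs Tesseract reports, or [] if it is not on PATH."""
    # shutil.which resolves the executable on both Linux and Windows.
    exe = shutil.which("tesseract")
    if exe is None:
        return []
    result = subprocess.run(
        [exe, "--list-langs"], capture_output=True, text=True, check=False
    )
    # Some Tesseract versions print the list to stderr instead of stdout.
    return parse_list_langs(result.stdout or result.stderr)


def build_ocr_command(image_path, output_base, langs, available):
    """Build a Tesseract command line for one or more languages.

    Raises ValueError if a requested language pack is not installed, so the
    caller can tell the user which pack is missing.
    """
    missing = [lang for lang in langs if lang not in available]
    if missing:
        raise ValueError("missing language packs: " + ", ".join(missing))
    # Tesseract accepts several languages joined with '+', e.g. "eng+hin".
    return ["tesseract", image_path, output_base, "-l", "+".join(langs)]
```

A caller could pass the result of `installed_languages()` as `available` and hand the returned list to `subprocess.run`. Checking for missing packs up front keeps the error message the same on Linux and Windows, which is where an install helper for language packs could hook in later.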