RapidAI / RapidOCRPDF

Based on RapidOCR, extract the PDF content.
Apache License 2.0
131 stars 14 forks source link

RapidOCRPDF 默认调用的还是中文OCR 模型吗? #7

Closed SWHL closed 10 months ago

SWHL commented 10 months ago

Discussed in https://github.com/RapidAI/RapidOCRPDF/discussions/6

Originally posted by **cobaltautomationdev** November 27, 2023 默认的模型是中文,当识别英文文档的时候,会出现一些空格没有识别出来。可以加上paddleocr英文模型吗?或者提供一个选项让用户自己选择模型。 ![image](https://github.com/RapidAI/RapidOCRPDF/assets/145321686/2052c61e-0937-4ffe-94f1-d8e7794d6df6)
SWHL commented 10 months ago

已经在v0.0.8中实现