Menghuan1918 / pdfdeal

A python wrapper for the Doc2X API and comes with native PDF processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的PDF处理(提升PDF在RAG中的召回率)。
https://menghuan1918.github.io/pdfdeal-docs/
MIT License
162 stars 8 forks source link

Update to v0.1.0 #3

Closed Menghuan1918 closed 2 months ago

Menghuan1918 commented 2 months ago

Refactored Doc2X support using concurrency to speed up processing.