infiniflow / ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
https://ragflow.io
Apache License 2.0
11.16k stars 1.08k forks source link

[Question]: how to remove pdf watermark #1362

Open diaojunxian opened 4 days ago

diaojunxian commented 4 days ago

Describe your problem

deepdoc reads pdf very well, but I want to know how to effectively remove pdf watermarks?

KevinHuSh commented 4 days ago

Negative by now.

diaojunxian commented 4 days ago

Is there any simple way to support it?

Can I filter by rgb value?