-
For example:
```python
doc_parser = cmd2.Cmd2ArgumentParser(description='Add documents')
doc_parser.add_argument('path', help='Path of the document file or document folder')
@cmd2.…
-
### Describe your problem
hi
I previously hag ragflow working with law documents and I was very impressed.
I updated the ragflow and now have problems with redis.
i did change the web app port t…
-
Hey. Hey,
I want to search all files by the following mask: "515209.pdf" (exact number) or "515209 *.pdf" (space after number) or "515209-*.pdf" (dash after number). If I search for just a number o…
-
```
What steps will reproduce the problem?
Monitor memory usage while rendering the only page of the attached PDF.
/usr/bin/time -f "%M KB" out/Debug/pdfium_test "~/Downloads/Ruimte B8 magazijn
ETO…
-
Right now, it only seems to perform OCR. i.e., convert image to raw text. Is there any table-specific extraction performed? Basically, I'm researching about good algorithms to extract tabular data fro…
-
- [x] better patch management
- probably just hard-fork all deps within this repo (maybe git subtree?)
- "squash" mode looks alright https://www.atlassian.com/git/tutorials/git-subtree
- fi…
-
Documents related recommended hospital service costs by various state/city goverments. Collect and scrape the data.
https://github.com/datameet/covid19/tree/master/downloads/hospital-service-costs-…
-
If you haven't lost interest in this project, would it be possible to add fb2 support? This project is exactly what I was looking for, but most of my books are in fb2 format.
-
In the current implementation of CoherenceBot, we are not allowing HTML reports into Policy Commons. While there are many publishers who publish only in this format, there's simply too much possibili…
-