-
### Why is it worth to add this package?
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
### Home page URL
…
-
### Do you need to file an issue?
- [X] I have searched the existing issues and this feature is not already filed.
- [ ] My model is hosted on OpenAI or Azure. If not, please look at the "model pr…
-
I was trying to exclude a folder from message extraction through an ignore rule in the mapping file, and ended up having to read the source code to find out how to do it. Some of the questions that ar…
-
---
# Implement unit tests for newspaper front page application
## Description:
Create a comprehensive test suite for the main functionality of the newspaper front page application, focusing on ke…
-
### context
I'm looking to get the original token positions of keyterms when performing keyterm extraction with e.g. TextRank, but this can apply to the other extractors. Example:
```python
>>> d…
-
pypdf_table_extraction/camelot does not recognize the table on pages after page 1 with the lattice flavor.
With the stream method, I get a messed-up output like this one
```
0 1 …
-
2024-10-30 22:00:48,100 - Deleted File Path: E:\Python_Code\Neo4j-llm-graph-builder\backend\merged_files\test9.txt and Deleted File Name : test9.txt
2024-10-30 22:00:48,101 - file test9.txt deleted s…
-
Document the following features. Some of this documentation may need to be in the use cases in langchain extraction.
- [ ] Retrieval Mode
- [ ] Brute Force Extraction
- [ ] Deduplication -- how i…
-
-
Right now, if an API isn't accessible due to missing or incorrect credentials, litellm encounters the resulting 401 error. It looks like this:
```
Error code: 401 - {'error': {'message': 'Authentica…