-
[x] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
Hi! I'm currently working with `ragas` to test different RAG …
-
path1 = r"C:\Users\Downloads\PDF Extraction Project\Compensation_document.pdf"
tables = camelot.read_pdf(path1, flavor='stream', pages='all')
print("Total tables extracted:", tables.n)
writer =…
-
These repo uses the [cookie cutter](https://cookiecutter-hypermodern-python.readthedocs.io/en/2022.6.3.post1/guide.htmll#the-tests-workflow) which offers a lot of [tools](https://cookiecutter-hypermod…
-
Here is an example of PDF that has some incorrectly extracted data (in stream mode):
[V_1.pdf](https://github.com/camelot-dev/camelot/files/12279247/V_1.pdf)
![V_1](https://github.com/camelot-dev/…
igvk updated
7 months ago
-
[S05MoldedCaseCircuitBreakers.pdf](https://github.com/camelot-dev/excalibur/files/2651058/S05MoldedCaseCircuitBreakers.pdf)
Hi,
The file will not download.
The upload & Table extraction seem …
-
-
The engine used for extraction of tables from PDF files is a well-known Python library called camelot. However, this library requires that the processed PDF file contains text ("computer" text, not ju…
-
Note the last ruling line at the right of the table:
![screen shot 2015-05-30 at 6 17 16 pm](https://cloud.githubusercontent.com/assets/27584/7899609/14c8bee2-06f8-11e5-9654-114244a10e61.png)
Since …
-
In the below file headers of table are not detected as part of extraction.
[junghee.pdf](https://github.com/tabulapdf/tabula-java/files/2127047/junghee.pdf)
[junghee.xlsx](https://github.com/tabulap…
-
## Description
I've been using the /generate command to create Jupyter notebooks from text prompts, but it seems to be generating filenames that contain colons (:). This is causing issues, espe…