pymupdf/RAG
RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF
https://pymupdf.readthedocs.io/en/latest/pymupdf4llm
GNU Affero General Public License v3.0
302 stars, 57 forks
issues
Redirects documentation to main site.
#103
jamie-lemon
closed
1 month ago
0
RAG has any conversion limitation?
#100
Gurushesh-Metapercept
closed
1 month ago
2
The package sometimes stuck for too long
#99
IronK77
closed
1 month ago
3
Add Option to Control Code Block Formatting in Markdown Output.
#98
HiroshigeAoki
closed
1 month ago
2
Markup Links are associated to the whole text line instead of the original span
#97
DiazBejaranoD
opened
2 months ago
0
fixed quad type issue in is_significant
#95
bhashithe-air
closed
2 months ago
1
How to parse CID font?
#94
ITHealer
closed
1 month ago
0
Fixed quad abbreviation
#93
rca-umb
closed
2 months ago
1
Garbled code on Chinese reports
#92
IronK77
closed
1 month ago
5
Text Extraction from docx and pptx files
#91
simonschoe
closed
1 month ago
4
'Quad' object has no attribute 'tl'
#90
IronK77
closed
1 month ago
10
Remove fitz and use pymupdf only
#89
dantetemplar
closed
1 month ago
3
Bug in `is_significant` function
#88
dantetemplar
closed
1 month ago
4
Use Poetry
#87
dantetemplar
closed
1 month ago
0
Make it only one module named `pymupdf4llm`
#86
dantetemplar
closed
1 month ago
0
Fix imports in llama test
#85
dantetemplar
closed
1 month ago
1
Use Poetry to resolve dependencies, lock them, build and publish package
#84
dantetemplar
closed
1 month ago
3
Rename repository to pymupdf4llm
#83
dantetemplar
closed
1 month ago
0
Make it only one pymupdf4llm module instead of two (pymupdf4llm, pdf4llm)
#82
dantetemplar
closed
1 month ago
5
Issues with bullet points in PDFs
#81
Jaimish00
closed
1 week ago
5
ValueError: Expected collection name that...
#80
natea
opened
2 months ago
0
Can pymupdf4llm work with multiprocessing?
#79
IronK77
closed
1 month ago
2
multi column pdf file text extraction
#78
sanketpatel91
closed
1 week ago
6
Changes for v0.0.10
#77
JorjMcKie
closed
2 months ago
0
suggestion on useful api parameters
#76
kingennio
closed
2 months ago
8
Poor Markdown Generation for Particular PDF
#75
marty-sullivan
closed
2 months ago
8
minimum area for images & vector graphics
#74
hewliyang
closed
2 months ago
2
bug in to_markdown internal function
#73
kingennio
closed
2 months ago
2
Changes for v0.0.9
#72
JorjMcKie
closed
2 months ago
0
Unexpected results in pymupdf4llm but pymupdf works
#71
saturosfz
closed
2 months ago
2
Table formatting/ Table format extraction issue
#69
mk-docenty
closed
2 months ago
2
Issue with text extraction near footer of page
#68
Shreyanshcodes
closed
2 months ago
10
The Markdown syntax for images is always included in the Markdown output.
#67
tamdao
closed
2 months ago
3
Fix wrong code for markdown generation.
#66
plommon
closed
3 months ago
1
Bug in pymupdf4llm
#65
plommon
closed
3 months ago
1
fix the typo for itm
#64
mikeshi80
closed
3 months ago
3
Changes for v0.0.7
#63
JorjMcKie
closed
3 months ago
0
Table Formatting Preservation
#62
Hackersmate-Aditya
closed
2 months ago
2
Source Code Not Recognized in PDF Files in Version 0.0.6
#61
yewool0818
closed
3 months ago
3
A custom sorting method is required
#60
yoke233
closed
3 months ago
4
Changes for version 0.0.6
#58
JorjMcKie
closed
3 months ago
0
How to store images as blobs, instead writing into a directory?
#56
kevinmt24
closed
3 months ago
1
Bug in helpers/multi_column.py - IndexError: list index out of range
#55
shenyimings
closed
3 months ago
1
Mistakes in orchestrating sentences
#54
madlogos
closed
3 months ago
6
When the image is at the very end of the page, the image cannot be displayed
#53
rexyan
closed
3 months ago
2
Chunking of text files
#52
zymbuzz
closed
3 months ago
2
pymupdf4llm.
#50
hherb
closed
3 months ago
2
Fix bugs
#49
dantetemplar
closed
3 months ago
1
Add opportunity to filter out some images
#48
dantetemplar
closed
3 months ago
1
Workaround about diagram recognizing
#47
dantetemplar
closed
3 months ago
4