issues
search
QuivrHQ
/
MegaParse
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
https://pypi.org/project/megaparse/
Apache License 2.0
511
stars
37
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
add: llm megaparser
#42
chloedia
closed
3 months ago
0
chore(main): release 0.0.14
#41
StanGirard
closed
3 months ago
1
fix: remove nest asycio
#40
chloedia
closed
3 months ago
0
chore(main): release 0.0.13
#39
StanGirard
closed
3 months ago
1
fix: use aload_data
#38
chloedia
closed
3 months ago
0
Add image extraction to PDF. Polish code
#37
dSupertramp
closed
2 weeks ago
5
🎇Improvement suggestions : Image content recognition and PDF page number information
#36
g4ti0r
opened
3 months ago
0
chore(main): release 0.0.12
#35
StanGirard
closed
3 months ago
1
fix: fake fix README.md
#34
chloedia
closed
3 months ago
0
fix:delete markdownify dependency
#33
chloedia
closed
3 months ago
0
Can I use ollama instead of openai key?
#32
heweapon
closed
3 months ago
2
add: convert_tab
#31
chloedia
closed
3 months ago
0
chore(main): release 0.0.11
#30
StanGirard
closed
3 months ago
1
add: xlsx convertor
#29
chloedia
closed
3 months ago
0
add: xlsx convertor
#28
chloedia
closed
3 months ago
0
add: XLSXConvertor
#27
chloedia
closed
3 months ago
0
ImportError: cannot import name 'open_filename' from 'pdfminer.utils'
#26
iris-qq
opened
3 months ago
1
Fix DOCX reader. Add input tests
#25
dSupertramp
closed
3 months ago
2
Fix OpenAI key error. Add docstrings. Polish code
#24
dSupertramp
closed
4 months ago
1
chore: Add Dockerfile and Makefile for project setup
#23
StanGirard
closed
4 months ago
0
chore(main): release 0.0.10
#22
StanGirard
closed
4 months ago
1
Change from LiteralString to Literal (typing)
#21
dSupertramp
closed
4 months ago
1
chore(main): release 0.0.9
#20
StanGirard
closed
4 months ago
1
chore: Update README.md to include optional use of LlamaParse for improved results
#19
StanGirard
closed
4 months ago
0
Contributing + import not working
#18
dSupertramp
closed
4 months ago
2
chore(main): release 0.0.8
#17
StanGirard
closed
4 months ago
1
chore(main): release 0.0.7
#16
StanGirard
closed
4 months ago
1
feat: Update benchmark results in README.md
#15
StanGirard
closed
4 months ago
0
Docker deployment support
#14
TopGun666
opened
4 months ago
5
add: gpt cleaner for header and footer
#13
chloedia
closed
4 months ago
0
chore(main): release 0.0.6
#12
StanGirard
closed
4 months ago
1
chore(main): release 0.0.5
#11
StanGirard
closed
4 months ago
1
feat: Add instructions for installing poppler and tesseract
#10
StanGirard
closed
4 months ago
0
Add support for Unstructured Parser, improve Table and Image Parsing, and add TOC and Hyperlinks for Docx
#9
StanGirard
closed
4 months ago
0
chore(main): release 0.0.4
#8
StanGirard
closed
4 months ago
1
add: baseline evaluation
#7
StanGirard
closed
4 months ago
0
chore(main): release 0.0.3
#6
StanGirard
closed
4 months ago
1
chore(main): release 0.0.2
#5
StanGirard
closed
4 months ago
1
chore(main): release 0.0.2
#4
StanGirard
closed
4 months ago
0
chore(main): release 0.0.2
#3
StanGirard
closed
4 months ago
0
feat: Megaparse example and working
#2
StanGirard
closed
4 months ago
0
chore(main): release 0.0.2
#1
StanGirard
closed
4 months ago
1
Previous