issues
search
Filimoa
/
open-parse
Improved file parsing for LLM’s
https://filimoa.github.io/open-parse/
MIT License
2.18k
stars
83
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
add langchain document support
#56
priamai
opened
1 week ago
2
ValueError: Coordinate 'right' is less than 'left'"
#55
anthopit
closed
2 weeks ago
0
add homebrew installation path + fix linux prefix setting
#52
Gregory-Pereira
closed
1 month ago
1
Whitespace Issues
#51
Filimoa
closed
1 month ago
0
No whitespace in text?
#50
Filimoa
closed
1 month ago
2
Fixes LTAnno objects being skipped which contains the needed whitespace for some PDFs
#48
cipherCOM
closed
1 month ago
5
Fix CI & merge conflicts
#47
amonras
closed
2 months ago
1
Is the purpose of this project to interpret and comprehensively analyze the content of PDF documents?
#42
Bruce337f
closed
2 months ago
1
Llama Index Integration
#41
Filimoa
closed
2 months ago
0
open parse seems missing some blocks within pdf file
#40
DinoLiww
opened
2 months ago
3
[Memory Leak Fix] Create Fitz Pdf From Bytestream
#39
Filimoa
closed
2 months ago
0
Named temp directory never clears temp files
#38
bradfox2
closed
2 months ago
3
Does the original image information in the PDF need to be parsed?
#37
ic-xu
opened
2 months ago
1
ImportError: cannot import name 'DocumentParser' from partially initialized module 'openparse'
#36
spar025
closed
1 month ago
2
PyMuPdf Hierarchal Headings
#35
mingzhang798
closed
1 month ago
2
Config Object failing with AttributeError: module 'torch' has no attribute 'cuda'
#34
aman-paco
closed
1 month ago
2
Fix layout inversion bug
#33
ic-xu
opened
3 months ago
2
#29 [minor tweak to mashihua's branch]
#32
Filimoa
closed
3 months ago
0
TypeError: sequence item 13: expected str instance, NoneType found
#31
mingzhang798
closed
3 months ago
1
Ollama integration
#30
Kydlaw
closed
3 months ago
2
Fix bug with parse.py
#29
mashihua
closed
3 months ago
1
NoneType error occured in pymupdf.output_to_markdown function
#28
mashihua
closed
3 months ago
1
fix: Fix sequence item 2: expected str instance, NoneType found exception when table output is set to markdown.
#27
ic-xu
closed
3 months ago
0
Improving Table Performance
#26
brianjking
opened
3 months ago
8
update the cookbooks link
#24
brianjking
closed
3 months ago
0
More Embedding Models [Draft]
#23
Filimoa
opened
3 months ago
0
Request to Add License Information to PyPI
#22
fixxtion1
closed
2 months ago
1
support embeddings via ollama
#21
miku
closed
3 months ago
3
Update pymupdf.md
#20
ada-lovecraft
closed
3 months ago
1
Global PyTorch Config
#19
Filimoa
closed
3 months ago
0
fix cuda device error for tableformer/unitable
#18
jinmang2
closed
3 months ago
1
Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor) should be the same
#17
fjw1049
closed
3 months ago
5
integration to llamaindex and langchain
#15
saitej123
closed
2 months ago
3
UniTable Cookbook notebook has errors
#13
zacharysmithdatatonic
closed
3 months ago
3
Unable to run library following provided steps
#12
dvalletj
closed
3 months ago
2
ValueError: Coordinate 'right' is less than 'left'
#11
atgreen
opened
3 months ago
1
support for Litellm module and Azure , aws OCR modules
#10
saitej123
closed
3 months ago
2
Missing parts of documents
#9
zby
opened
3 months ago
3
Different Embedding Models
#8
gvlx
opened
3 months ago
2
nodes output
#7
atgreen
closed
3 months ago
6
Unitable
#6
Filimoa
closed
3 months ago
0
ScannedPDF
#5
atulpant
closed
3 months ago
2
TypeError: Rect.__init__() got an unexpected keyword argument 'x0'
#4
Noexpert
closed
3 months ago
1
NameError: name 'display' is not defined
#3
Noexpert
closed
3 months ago
1
🚀 Roadmap
#1
Filimoa
opened
4 months ago
9