issues
search
jrmuizel
/
pdf-extract
A rust library for extracting content from pdfs
396
stars
78
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Pack of files which cause crashes
#96
qarmin
opened
1 week ago
0
release a new version of pdf-extract
#95
prabirshrestha
closed
1 month ago
1
Panics with message "no widths"
#94
xrl1
opened
1 month ago
4
extract pdf by pages (based on https://github.com/jrmuizel/pdf-extract/pull/73)
#93
linusbierhoff
closed
13 hours ago
4
Using Encoding-RS instead
#92
pvichivanives
closed
2 months ago
1
Removed panics, todos, asserts, and unwraps in the lib code, and formatted code for improved readability.
#91
darxkies
opened
3 months ago
0
RUSTSEC-2021-0153 Switch to Encoding RS?
#90
pvichivanives
closed
2 months ago
1
panic: index out of bounds: the len is 1 but the index is 1
#89
Sinderella
opened
5 months ago
2
Unicode map unsafe get leads to panic
#88
DimitriTimoz
closed
5 months ago
2
Fix crashing debug output in PdfSimpleFont
#87
Bennett-Petzold
opened
5 months ago
1
add extract txt with page example
#86
BenLocal
opened
5 months ago
1
Fonts with custom encoding
#85
maxpowel
opened
6 months ago
4
Text result split by spacing
#84
frankvgompel
closed
6 months ago
2
Fix panic by setting default_width to Some(1.0)
#83
prscoelho
closed
7 months ago
6
Add decryption functions and attempt decrypt if pdf is encrypted
#82
prscoelho
closed
5 months ago
0
Upgraded lopdf version
#81
maxpowel
closed
7 months ago
0
Added support for missing colour spaces
#80
josemirm
closed
8 months ago
0
Text result is split by spacing
#79
Implocell
closed
8 months ago
8
FR: Make the HTML output buffer string available
#78
annie444
closed
8 months ago
2
fix: Use `get` instead of `[]` to avoid panic when key is missing
#77
dilawar
closed
1 month ago
1
panic while parsing PDF
#76
dilawar
closed
9 months ago
2
Multiple panics on Arxiv.org PDFs
#75
jlandahl
opened
10 months ago
2
unexpected smask type 168 0 R
#74
nbittich
closed
10 months ago
3
Added extract_text_by_page()
#73
JustBobinAround
opened
10 months ago
0
thread 'main' panicked at 'missing char 33 in map
#72
danindiana
closed
7 months ago
2
Spec violation TrueType without Encoding entry
#71
sftse
opened
11 months ago
0
/ToUnicode spec violation
#70
sftse
opened
1 year ago
2
add example document where characters of extracted text are poorly sp…
#69
sftse
closed
1 year ago
2
new line
#68
CaptainKludge
opened
1 year ago
0
panic : missing colorspace [67, 83, 112]
#67
nbittich
closed
1 year ago
2
panic on unwrap on a None value
#66
blankenshipz
opened
1 year ago
5
Panic: Unexpected smask type <</Type /Mask/S /Luminosity/G 6 0 R>>
#65
clarkmcc
closed
1 year ago
0
Bumping up version of linked-hash-map dependency. As there is version conflict between pdf-extract & insta package that I am using.
#64
vamseekm
closed
1 year ago
1
Added output_page fn
#63
JuniFruit
opened
1 year ago
0
Empty text output
#62
Palmik
opened
1 year ago
5
Sanity Check - Unicode Mismatch
#61
piotroxp
opened
1 year ago
6
Unsafe get and Missing char
#60
0xMimir
opened
1 year ago
3
added decryption logic for encrypted document
#59
russellwmy
opened
1 year ago
0
pdftotext -layout equivalent
#58
Sinderella
opened
1 year ago
1
thread 'main' panicked at 'assertion failed: name == \"Identity-H\"
#57
wingjson
opened
1 year ago
8
missing char 48 in map
#56
gravit22
opened
1 year ago
5
extract_text_from_mem not found in `pdf_extract`
#55
felixbecker
closed
1 year ago
3
Tests from pdf.link files
#54
joepio
closed
1 year ago
1
Making codebase more flexible and modularizing code for having a better software
#53
emadbaqeri
opened
1 year ago
2
remove some println! to be more CLI / TUI friendly
#52
qkzk
opened
1 year ago
2
Missing LICENSE File
#51
Endle
opened
1 year ago
1
Empty output file running extract example on a test pdf file
#50
bogct0mculhl
opened
1 year ago
2
Multiple improvements across multiple forks
#49
Hessesian
opened
1 year ago
3
Less panics, add error handling, add tests, re-export lopdf, linting, readme
#48
joepio
opened
1 year ago
9
Error handling - replace `.unwrap` and `panic` with `?`
#47
joepio
opened
1 year ago
7
Next