deanmalmgren/textract
extract text from any document. no muss. no fuss.
http://textract.readthedocs.io
MIT License
3.92k stars, 609 forks
issues
Scheduled biweekly dependency update for week 47
#491
pyup-bot
closed
12 months ago
2
extract-msg~=0.28.7
#490
mtasic85
opened
1 year ago
0
textract 1.6.5 has a non-standard dependency specifier extract-msg<=0.29.*
#489
SteveMwika
opened
1 year ago
2
Scheduled biweekly dependency update for week 45
#488
pyup-bot
closed
1 year ago
1
Non-Standard Dependency Specifier with pip 24.0
#487
mauricefreese
opened
1 year ago
1
Requesting compatibility for red hat linux
#486
Tylersuard
opened
1 year ago
0
Fix Parser to ignore encoding errors
#485
DmitryMalishev
opened
1 year ago
0
Scheduled biweekly dependency update for week 42
#484
pyup-bot
closed
1 year ago
1
error message whilst pip installing
#483
LouisK3itel
opened
1 year ago
1
Scheduled biweekly dependency update for week 40
#482
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 38
#481
pyup-bot
closed
1 year ago
1
Add Markdown support
#480
fakerybakery
closed
3 months ago
0
Scheduled biweekly dependency update for week 36
#479
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 34
#478
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 32
#477
pyup-bot
closed
1 year ago
1
textract 1.6.5 has a non-standard dependency specifier extract-msg<=0.29.*
#476
chapmanjacobd
opened
1 year ago
2
Support for .one (OneNote) files
#475
jw25116
opened
1 year ago
0
Scheduled biweekly dependency update for week 29
#474
pyup-bot
closed
1 year ago
1
Char fix
#473
rosewang01
closed
1 year ago
0
Scheduled biweekly dependency update for week 27
#472
pyup-bot
closed
1 year ago
1
checked with six version 1.15
#471
Svyat33
closed
1 year ago
0
Is textract still maintained?
#470
KamarajuKusumanchi
opened
1 year ago
5
adding encoding options for pdftotext
#469
Enzodtz
opened
1 year ago
0
Replace Antiword with a Python alternative
#468
SMillerDev
opened
1 year ago
2
progress bar for long documents
#467
chanansh
opened
1 year ago
0
Scheduled biweekly dependency update for week 23
#466
pyup-bot
closed
1 year ago
1
Use latest six
#465
I-Good-Vegetable
opened
1 year ago
3
textract3-1.6.4.post1 and textract-1.6.5 compilation error: error in beautifulsoup4 setup command: use_2to3 is invalid.
#464
ashish-2022
opened
1 year ago
1
Scheduled biweekly dependency update for week 18
#463
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 16
#462
pyup-bot
closed
1 year ago
1
Error in textract setup command w/ extract-msg<=0.29.* due to Wheel 0.40.0
#461
seankfh
opened
1 year ago
2
mp3 text extraction Exception - 5MB~ file
#460
RiccardoRomagnoli
opened
1 year ago
0
OS (WINDOWS) SUPPORT
#459
knana1662
opened
1 year ago
2
Scheduled biweekly dependency update for week 08
#458
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 06
#457
pyup-bot
closed
1 year ago
1
Enable encoding detection for the txt parser
#456
LoicGrobol
opened
1 year ago
0
Textract.process returns empty bytes object for EPUBs from DBNL collection
#455
bitsgalore
opened
1 year ago
0
Use of `antiword`
#454
p-linnane
closed
1 year ago
0
Scheduled biweekly dependency update for week 03
#453
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 01
#452
pyup-bot
closed
1 year ago
1
FR: Make SpeechRecognition etc. large AI libs just "extra" dependencies.
#451
kxrob
opened
1 year ago
1
Scheduled biweekly dependency update for week 51
#450
pyup-bot
closed
1 year ago
1
Issues with textract.process while run within an executable created by pyinstaller
#449
vq75
opened
1 year ago
1
Scheduled biweekly dependency update for week 49
#448
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 47
#447
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 45
#446
pyup-bot
closed
2 years ago
1
Text can't be extracted from scanned PDF, jpg and png.
#445
Takip31
opened
2 years ago
0
textract.exceptions.ShellError: The command antiword is not installed on your system. Please make sure the appropriate dependencies are installed before using textract
#444
faridelya
opened
2 years ago
0
Scheduled biweekly dependency update for week 42
#443
pyup-bot
closed
2 years ago
1
Scheduled biweekly dependency update for week 40
#442
pyup-bot
closed
2 years ago
1