issues
search
deanmalmgren
/
textract
extract text from any document. no muss. no fuss.
http://textract.readthedocs.io
MIT License
3.84k
stars
585
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Replace Antiword with a Python alternative
#468
SMillerDev
opened
1 year ago
2
progress bar for long documents
#467
chanansh
opened
1 year ago
0
Scheduled biweekly dependency update for week 23
#466
pyup-bot
closed
1 year ago
1
Use latest six
#465
I-Good-Vegetable
opened
1 year ago
3
textract3-1.6.4.post1 and textract-1.6.5 compilation error: error in beautifulsoup4 setup command: use_2to3 is invalid.
#464
ashish-2022
opened
1 year ago
1
Scheduled biweekly dependency update for week 18
#463
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 16
#462
pyup-bot
closed
1 year ago
1
Error in textract setup command w/ extract-msg<=0.29.* due to Wheel 0.40.0
#461
seankfh
opened
1 year ago
2
mp3 text extraction Exception - 5MB~ file
#460
RiccardoRomagnoli
opened
1 year ago
0
OS (WINDOWS) SUPPORT
#459
knana1662
opened
1 year ago
2
Scheduled biweekly dependency update for week 08
#458
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 06
#457
pyup-bot
closed
1 year ago
1
Enable encoding detection for the txt parser
#456
LoicGrobol
opened
1 year ago
0
Textract.process returns empty bytes object for EPUBs from DBNL collection
#455
bitsgalore
opened
1 year ago
0
Use of `antiword`
#454
p-linnane
closed
1 year ago
0
Scheduled biweekly dependency update for week 03
#453
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 01
#452
pyup-bot
closed
1 year ago
1
FR: Make SpeechRecognition etc. large AI libs just "extra" dependencies.
#451
kxrob
opened
1 year ago
1
Scheduled biweekly dependency update for week 51
#450
pyup-bot
closed
1 year ago
1
Issues with textract.process while run within and executable created by pyinstaller
#449
vq75
opened
1 year ago
0
Scheduled biweekly dependency update for week 49
#448
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 47
#447
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 45
#446
pyup-bot
closed
1 year ago
1
Text can't be extracted from scanned PDF, jpg and png.
#445
Takip31
opened
1 year ago
0
textract.exceptions.ShellError: The command antiword is not installed on your system. Please make sure the appropriate dependencies are installed before using textract
#444
faridelya
opened
1 year ago
0
Scheduled biweekly dependency update for week 42
#443
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 40
#442
pyup-bot
closed
1 year ago
1
Support of Open Office Extesions
#441
dezoito
opened
1 year ago
0
unsafe for multiprocessing?
#440
chapmanjacobd
opened
1 year ago
0
Scheduled biweekly dependency update for week 38
#439
pyup-bot
closed
1 year ago
1
Paddle ocr give multi language ?
#438
vinothkanagaraj
opened
1 year ago
0
MacOS installation is outdated
#437
roablep
opened
1 year ago
1
Update python
#436
raj5287
opened
1 year ago
1
Unable to Install on Airflow
#435
raj5287
closed
1 year ago
1
Scheduled biweekly dependency update for week 36
#434
pyup-bot
closed
1 year ago
1
Drop python2 support
#433
tehabstract
opened
1 year ago
2
Scheduled biweekly dependency update for week 33
#432
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 31
#431
pyup-bot
closed
1 year ago
1
docs: Fix a few typos
#430
timgates42
opened
1 year ago
0
Scheduled biweekly dependency update for week 29
#429
pyup-bot
closed
1 year ago
1
Scheduled biweekly dependency update for week 27
#428
pyup-bot
closed
2 years ago
1
Beautifulsoup version
#427
supermanIT
opened
2 years ago
0
text parsers doesn't support encodings other then 'utf-8'
#426
davidorlov12
opened
2 years ago
0
Scheduled biweekly dependency update for week 25
#425
pyup-bot
closed
2 years ago
2
Pdfminer on Windows searches for pdf2text.py.exe
#424
PeterTillema
opened
2 years ago
1
Textract extracts different text accroding to the OS it operates on
#423
bennnym
opened
2 years ago
0
Fix issue deanmalmgren#342
#422
TheElementalOfDestruction
opened
2 years ago
0
Scheduled biweekly dependency update for week 23
#421
pyup-bot
closed
2 years ago
1
Scheduled biweekly dependency update for week 20
#420
pyup-bot
closed
2 years ago
1
Please add .jpx images support for textract
#419
deepaksharmaofficial
opened
2 years ago
0
Previous
Next