issues
search
jwilk-archive
/
ocrodjvu
OCR for DjVu
GNU General Public License v2.0
45
stars
19
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Should print all missing language packs specified by -l/--language
#47
jwilk
opened
2 years ago
0
Retiring
#46
jwilk
opened
2 years ago
0
gocr: "trying pxH-fix by Hxp …"
#45
jwilk
opened
2 years ago
0
html5lib/_ihatexml.py:265: DataLossWarning: Coercing non-XML name
#44
jwilk
opened
2 years ago
0
Multiprocessing support
#43
FriedrichFroebel
closed
2 years ago
1
Unclear origin of OCR engine messages when using -j
#42
jwilk
opened
2 years ago
0
Port to python3
#41
bastien-roucaries
opened
3 years ago
1
-X extra_args='-psm 1' option
#40
derrikF
closed
4 years ago
3
Support Python 3
#39
madalu
opened
4 years ago
22
Keep maintaining the package in Debian and get the program back to the distribution
#38
jsbien
opened
5 years ago
5
DjVu to PAGE-XML converter
#37
jwilk
opened
5 years ago
0
OCR engine executable path should be configurable
#36
xelxebar
opened
5 years ago
3
Windows support
#35
jwilk
opened
5 years ago
0
adopt hOCR utilities from marasca
#34
jwilk
opened
5 years ago
0
adopt IETF language tags (BCP 47)
#33
jwilk
opened
5 years ago
2
Multiple jobs do not work with Tesseract 4
#31
ashipunov
opened
5 years ago
18
allow passing arbitrary options to Tesseract
#30
jsbien
closed
5 years ago
1
Tesseract 4: error: invalid language identifier: Latin
#29
vltavskachobotnice
closed
5 years ago
2
TSV support (tsv2djvused)
#28
jsbien
opened
6 years ago
1
Conversion of hocr to djvused as a separate utility
#27
jsbien
closed
5 years ago
3
Allow passing custom configfile parameters to Tesseract engine
#26
jsbien
closed
5 years ago
1
parallel mode for djvu2hocr
#25
jwilk
opened
6 years ago
0
quneiform support
#24
jwilk
closed
6 years ago
2
Non-ASCII filenames cause UnicodeEncodeError
#23
derrikF
closed
6 years ago
3
djvu2hocr: extract XMP metadata
#22
jwilk
opened
6 years ago
0
error msg "No image suitable for OCR" is too vague
#21
ghost
opened
7 years ago
1
[debian] ocrodjvu: error: OCR engine (ocropus) was not found
#20
ghost
closed
5 years ago
4
Allow editing hOCR (or TSV) files
#19
jwilk
opened
8 years ago
1
ValueError: need more than 0 values to unpack
#18
jwilk
closed
8 years ago
3
Support for UZN files?
#17
jwilk
opened
9 years ago
0
ocrodjvu hangs with DjVuLibre 3.5.26
#16
jwilk
closed
9 years ago
3
Sometimes ampersand is not escaped in the hOCR output
#15
jwilk
closed
8 years ago
4
ocrodjvu for tesseract 3.04.00
#14
jwilk
closed
6 years ago
6
djvused script without escaping Unicode characters
#13
jwilk
closed
9 years ago
5
ocrodjvu creates an incorrect djvused script?
#12
jwilk
closed
10 years ago
5
Version 0.7.18 does not start
#11
jwilk
closed
10 years ago
6
please add 'tesseract: ' prefix to Tesseract's stderr
#10
jwilk
closed
9 years ago
9
tesseract engine (v 3.03) not found
#9
jwilk
closed
10 years ago
6
Tesseract: 3.02: Malformed hOCR document: character zones intermixed with non-character zones
#8
jwilk
opened
10 years ago
2
Crash with empty page
#7
jwilk
closed
8 years ago
7
Fix & document exit codes
#6
jwilk
closed
9 years ago
10
freeze if a page cannot be decoded
#5
jwilk
closed
11 years ago
3
crashes on non-UTF-8 file identifiers
#4
jwilk
closed
11 years ago
3
Support multi-languages with Tesseract
#3
jwilk
closed
11 years ago
2
Support ocropus 0.6
#2
jwilk
opened
11 years ago
1
process multiple html files with hocr2djvused
#1
jwilk
closed
12 years ago
3