issues
search
internetarchive
/
archive-hocr-tools
Efficient hOCR tooling
Other
38
stars
9
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
EPUB: option to continue past singular Kakadu errors
#17
scottbarnes
closed
1 week ago
0
EPUB: convert images to 8 bit prior to saving as JPEG
#16
scottbarnes
closed
1 week ago
0
DAISY: remove Internet Archive item dependency
#15
scottbarnes
closed
2 weeks ago
0
DAISY: only add Arabic and Roman numerals to navigation
#14
scottbarnes
closed
2 weeks ago
0
hocr-to-daisy: use internetarchive-deriver-module for scandata
#13
scottbarnes
closed
2 weeks ago
1
Fix: handle `None` pageTypes
#12
scottbarnes
closed
1 month ago
1
Feature: Add ISO 639 part2b support for normalize_language
#11
scottbarnes
opened
1 month ago
0
Fix: use `iso-639` rather than `iso639`
#10
scottbarnes
closed
1 month ago
0
Feature/add hocr to daisy script
#9
scottbarnes
closed
1 month ago
1
Non-integer confidences cause error parsing
#8
whikloj
opened
7 months ago
1
mention hocr_to_pdf in README, along with other undocumented tools
#7
jrochkind
closed
1 year ago
3
Remove some unused import statements
#6
stweil
closed
2 years ago
1
hocr-combine-stream output contains multiple </body></html> tags, producing invalid xml
#5
jwachuta
closed
1 year ago
5
Add detection of image/graphics features
#4
johtso
closed
2 years ago
1
Fix typos discovered by codespell
#3
cclauss
closed
2 years ago
1
Undefined name: Initialize match_number = 0
#2
cclauss
closed
2 years ago
1
Make tools more usable with pipes
#1
MerlijnWajer
opened
2 years ago
0