issues
internetarchive/archive-hocr-tools
Efficient hOCR tooling
Other
40 stars
9 forks
#19 Epub: address `epubcheck` validation errors (scottbarnes, opened 2 months ago, 0 comments)
#18 DAISY fix: if no item title, set the book title to the identifier (scottbarnes, opened 2 months ago, 0 comments)
#17 EPUB: option to continue past singular Kakadu errors (scottbarnes, closed 2 months ago, 0 comments)
#16 EPUB: convert images to 8 bit prior to saving as JPEG (scottbarnes, closed 2 months ago, 0 comments)
#15 DAISY: remove Internet Archive item dependency (scottbarnes, closed 2 months ago, 0 comments)
#14 DAISY: only add Arabic and Roman numerals to navigation (scottbarnes, closed 2 months ago, 0 comments)
#13 hocr-to-daisy: use internetarchive-deriver-module for scandata (scottbarnes, closed 2 months ago, 1 comment)
#12 Fix: handle `None` pageTypes (scottbarnes, closed 3 months ago, 1 comment)
#11 Feature: Add ISO 639 part2b support for normalize_language (scottbarnes, opened 3 months ago, 0 comments)
#10 Fix: use `iso-639` rather than `iso639` (scottbarnes, closed 3 months ago, 0 comments)
#9 Feature/add hocr to daisy script (scottbarnes, closed 3 months ago, 1 comment)
#8 Non-integer confidences cause error parsing (whikloj, opened 9 months ago, 1 comment)
#7 mention hocr_to_pdf in README, along with other undocumented tools (jrochkind, closed 1 year ago, 3 comments)
#6 Remove some unused import statements (stweil, closed 2 years ago, 1 comment)
#5 hocr-combine-stream output contains multiple </body></html> tags, producing invalid xml (jwachuta, closed 1 year ago, 5 comments)
#4 Add detection of image/graphics features (johtso, closed 2 years ago, 1 comment)
#3 Fix typos discovered by codespell (cclauss, closed 3 years ago, 1 comment)
#2 Undefined name: Initialize match_number = 0 (cclauss, closed 3 years ago, 1 comment)
#1 Make tools more usable with pipes (MerlijnWajer, opened 3 years ago, 0 comments)