issues
search
UB-Mannheim
/
ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
https://digi.bib.uni-mannheim.de/ocr-fileformat/
MIT License
176
stars
23
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
docker: unlimit POST upload size, #136
#137
kba
closed
2 years ago
1
Web interface in Docker container/ Error when uploading document: "Must be either POST with the field 'file'...."
#136
cboulanger
opened
3 years ago
2
Support conversion to MiniOCR
#135
kba
opened
3 years ago
1
Page to alto python
#134
kba
closed
3 years ago
2
Proxy support
#133
mikegerber
closed
3 years ago
7
page schemas: use github not primaresearch.org
#132
kba
closed
3 years ago
0
⬆️ Update JPageConverter to 1.5.05
#131
mikegerber
closed
3 years ago
0
update hocr2alto to include filak/hOCR-to-ALTO#23
#130
kba
closed
3 years ago
3
alto to text: too many spaces
#129
jbarth-ubhd
closed
2 years ago
7
update JPAGEConverter to 1.5.04
#128
kba
closed
3 years ago
4
Update saxon9he.jar to version 9.9.1.7
#127
stweil
closed
4 years ago
5
ocr-transform: set -e before running saxon/transform scripts
#126
kba
closed
4 years ago
1
Google Cloud Vision to PAGE-XML
#125
kba
opened
4 years ago
8
New Saxon version 10.2 is out
#124
zuphilip
closed
2 years ago
8
"ocr-transform page alto ... ...": loosing text
#123
jbarth-ubhd
closed
1 year ago
13
Support conversion from and to Textract JSON
#122
scottschreckengaust
opened
4 years ago
4
GCV to HOCR or PAGE conversion not working
#121
OmriPi
opened
4 years ago
9
Release version 0.3.0 and 1.0.0
#120
zuphilip
opened
4 years ago
11
Add update mechanism
#119
zuphilip
opened
4 years ago
3
Pretty print option for CLI
#118
zuphilip
opened
4 years ago
1
Update Dockerfile as MAINTAINER is deprecated now
#117
zuphilip
closed
4 years ago
3
:arrow_up: Upgrade to new version of hOCR-to-ALTO
#116
zuphilip
closed
4 years ago
6
Simplify validations
#115
zuphilip
opened
4 years ago
2
Extend automated tests in CI
#114
zuphilip
opened
4 years ago
0
Add hocr__page transformation
#113
zuphilip
closed
4 years ago
1
:ambulance: Prevent multiple download events
#112
zuphilip
closed
4 years ago
0
Add script debugging when -d -d is used
#111
zuphilip
closed
4 years ago
0
Update Saxon version to 9.9.1.6
#110
zuphilip
closed
4 years ago
1
GCV2hocr not working: no file
#109
zuphilip
closed
4 years ago
2
Multiple downloads
#108
zuphilip
closed
4 years ago
1
Compatibility of XSLT 1.0 with new Saxon HE
#107
zuphilip
closed
4 years ago
0
Fix conversion from ALTO to PAGE and vice versa
#106
stweil
closed
4 years ago
11
[WIP] Fix page__alto and alto__page
#105
zuphilip
closed
4 years ago
4
Add PAGE XML examples
#104
zuphilip
closed
4 years ago
4
Update README.md
#103
zuphilip
closed
4 years ago
1
Fix ignored files
#102
zuphilip
closed
4 years ago
2
Implement command line option --version
#101
stweil
closed
4 years ago
5
Update documentation to reflect latest code
#100
stweil
closed
4 years ago
3
page2tsv
#99
kba
opened
4 years ago
0
[WIP] TEI to HOCR
#98
zuphilip
closed
4 years ago
2
Integrate PRIMA Labs PageConverter
#97
kba
closed
4 years ago
10
Converting hOCR to Alto
#96
asor12
closed
4 years ago
21
No transformation from alto3.0 (from Tesseract 4.1.0) to hocr
#95
jtlz2
closed
4 years ago
6
multi-choice of files in the web interface
#94
yanirmr
opened
5 years ago
3
loop of files downloading
#93
yanirmr
closed
4 years ago
1
Add abbyy2hocr transformation by @OCR-D
#92
zuphilip
closed
4 years ago
7
Add XSLT for transformation from PAGEXML to text
#91
zuphilip
closed
4 years ago
0
Add PAGE 2019-07-15 schema
#90
zuphilip
closed
4 years ago
0
alto2hocr: Content in BottomMargin is not considered (PrintSpace node is missing in this example)
#89
jtlz2
opened
5 years ago
15
installation problem under macOS 10.13.6
#88
jtlz2
closed
5 years ago
9
Previous
Next