-
Check your https://github.com/TurkuNLP/ocr-postcorrection-lm/blob/main/alignment-and-gridsearch/alignment_test.ipynb file
There is something visible in there which probably shouldn't be
R4ZZ3 updated
3 months ago
-
This seems to work:
```
OUT_DIR=.
apply-sliding-window \
…
-
In OCR-D, long ago we moved away from absolute filenames and `file://` refs in FLocat.
When calling `de.lmu.cis.ocrd.cli.PostCorrectionCommand` with an absolute path to the METS, it runs through, …
-
Tried with Eclipse 2022-03 & Spring Tools 4.15.2.RELEASE :
```
...
Run from the new project's context menu: Maven->Update project
New Java code from RAML is generated in /target/generated-sources/ra…
-
Hello,
The Git LFS storage is full and I cannot access the code repo. This is exciting work, and I am looking forward to getting access.
`[llm-transcript-postcorrection]$ git lfs pull
batch response: This rep…
-
Just a quick question: is there any chance that the rows and cells in the Abbyy file would be kept by the converter?
_Originally posted by @pirolen in https://github.com/LanguageMachines/foliautils/issues/62#…
-
In the OCR-D workflow, there are several steps that likely require input or output to be able to represent __word segmentation ambiguity__ and confidence values of word boundaries (whitespace characte…
-
For (slightly) skewed images like
https://iiif.bdrc.io/bdr:I1KG10195::I1KG101950044.jpg/full/max/0/default.jpg
the current line detection of the HOCR import gives an output of
```
ལེཞེས་
བཟོ…
eroux updated
10 months ago
-
Training is interrupted by a segmentation fault during the very first epoch of pretraining. The error is as follows:
`[dynet] random seed: 1678755796`
`…