-
Hi, is it possible for me to fine-tune the model with a custom dataset after training it on the synth dataset?
-
Can we use or modify this code for typed text segmentation?
-
First stage: I am dealing with Gallica OCRs and importing raw text from URLs **(I don't want to work with txt files)**
library(htm2txt) # a useful package to import raw text from an html…
-
In https://github.com/mjenckel/LAYoutERkennung/blob/master/ocrd_anybaseocr/ocrd-tool.json the parameter `parallel` with the description *numbers of CPUs to us* defaults to 0. Is this intended? What do…
wrznr updated
4 years ago
-
Provide possibility to configure uneven split of hugepages between NUMA nodes.
At the moment, one can only do one of the following two:
a) Evenly split all hugepages between both NUMA nodes:
~~~
[…
~~~
ghost updated
3 years ago
-
In [get_text()](https://github.com/OCR-D/core/blob/master/ocrd_validators/ocrd_validators/page_validator.py#L260), the `TextEquiv` with `index=1` is used if it exists. The way I read the documentation…
-
## The Basics
* Service team responsible for the client library: Form Recognizer Dev Team
* Link to documentation describing the service: https://docs.microsoft.com/en-us/azure/cognitive-servi…
-
The Ubuntu setup guide is missing a section on installing tesseract, and should possibly recommend `sudo apt install python3-opencv` instead of the pip ocr requirement
- at least I needed to install these manua…
-
Looks like the example with just question marks is good now:
```
>>> segmenter.segment("??")
['??']
```
but the example with double question marks as a token at the end of a sentence still loses …
-
heya, I'm using known-good hardware, with pull-ups on the SD CS line, no level shifters, and PCB traces rather than jumpers, but I'm getting 'flakiness' on init/mount
```
[E][sd_diskio.cpp:99] sdSelectCard(): timeo…
```