-
Words written with "е" in place of "ё" are not corrected by the spellchecker (the Soviet-era typographical simplification allows this spelling); however, such words are then not recognized by part-of-speech analysis.
ежик - -
ёжик …
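
One possible workaround, sketched here as an assumption and not as anything the project currently does, is to fold "ё" to "е" on both the lexicon side and the input side before lookup, so that "ежик" and "ёжик" resolve to the same entry. The lexicon contents and the names `fold_yo`, `build_index`, and `lookup` are hypothetical.

```python
# Hypothetical sketch: fold "ё" to "е" before looking a word up in a POS lexicon.
# The lexicon below is a toy stand-in, not the project's real dictionary.
YO_FOLD = str.maketrans({"ё": "е", "Ё": "Е"})


def fold_yo(word: str) -> str:
    """Return the word with every "ё"/"Ё" replaced by "е"/"Е"."""
    return word.translate(YO_FOLD)


def build_index(lexicon: dict) -> dict:
    """Re-key the lexicon by the folded spelling of each word."""
    return {fold_yo(word): tag for word, tag in lexicon.items()}


def lookup(word: str, index: dict):
    """Look up a word by its folded spelling; None if it is unknown."""
    return index.get(fold_yo(word))


lexicon = {"ёжик": "NOUN"}
index = build_index(lexicon)
```

Folding loses the distinction for genuinely different word pairs such as "все" and "всё", so a real implementation would probably need to keep both candidates and disambiguate later.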
-
Allowing non-XML to flow through the pipeline will be handy, but encoding/decoding may still be necessary at the boundaries. We could have p:encode/p:decode steps that convert to and from base64. They cou…
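
To illustrate what such boundary steps would do, here is a minimal Python sketch of a base64 round trip. It is not XProc and does not reflect any agreed step signature; `encode_step` and `decode_step` are hypothetical names standing in for the proposed p:encode/p:decode.

```python
import base64


def encode_step(payload: bytes) -> str:
    """Turn arbitrary bytes into base64 text that is safe to embed in XML."""
    return base64.b64encode(payload).decode("ascii")


def decode_step(text: str) -> bytes:
    """Recover the original bytes; validate=True rejects non-base64 characters."""
    return base64.b64decode(text, validate=True)


raw = b"\x00\xffnon-XML payload"
wrapped = encode_step(raw)    # text form for the XML side of the boundary
restored = decode_step(wrapped)  # bytes again for the non-XML side
```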
-
I'd love a method which gets the 'level' of a word.
Also, on my MacBook I can't even install this:
CPAN: Storable loaded ok (v2.41)
Reading '/Users/ydo/.cpan/Metadata'
Database was generat…
-
The piper-phonemizer setup is a bit confusing at the moment, as it is both included with some significant code and imported as a library at runtime. The two phonemizers, text and espeak, are both tightly …
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Feature Description
A Part-of-Speech tagger using a Hidden Markov Model (HMM) assigns grammatical categories t…
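
As a rough illustration of the idea, the following sketch decodes the most likely tag sequence with the Viterbi algorithm over a toy HMM. All probabilities, the tag set, and the names `viterbi`, `TAGS`, `START_P`, `TRANS_P`, and `EMIT_P` are made up for this example; a real tagger would estimate them from an annotated corpus.

```python
import math

# Toy model parameters (invented for illustration only).
TAGS = ["DET", "NOUN", "VERB"]
START_P = {"DET": 0.8, "NOUN": 0.1, "VERB": 0.1}
TRANS_P = {
    "DET": {"DET": 0.05, "NOUN": 0.9, "VERB": 0.05},
    "NOUN": {"DET": 0.1, "NOUN": 0.1, "VERB": 0.8},
    "VERB": {"DET": 0.6, "NOUN": 0.3, "VERB": 0.1},
}
EMIT_P = {
    "DET": {"the": 1.0},
    "NOUN": {"dog": 0.6, "barks": 0.4},
    "VERB": {"barks": 0.7, "dog": 0.3},
}


def viterbi(words, tags, start_p, trans_p, emit_p, floor=1e-12):
    """Return the most probable tag sequence for words under the HMM."""
    if not words:
        return []

    def logp(p):
        # Work in log space; unseen events get a small floor probability.
        return math.log(max(p, floor))

    # best[i][t]: log-probability of the best path ending in tag t at word i.
    best = [{t: logp(start_p.get(t, 0)) + logp(emit_p[t].get(words[0], 0))
             for t in tags}]
    back = [{}]  # back[i][t]: previous tag on that best path
    for i in range(1, len(words)):
        best.append({})
        back.append({})
        for t in tags:
            prev, score = max(
                ((p, best[i - 1][p] + logp(trans_p[p].get(t, 0))) for p in tags),
                key=lambda pair: pair[1],
            )
            best[i][t] = score + logp(emit_p[t].get(words[i], 0))
            back[i][t] = prev

    # Follow the back-pointers from the best final tag.
    path = [max(tags, key=lambda t: best[-1][t])]
    for i in range(len(words) - 1, 0, -1):
        path.append(back[i][path[-1]])
    return path[::-1]


tagged = viterbi("the dog barks".split(), TAGS, START_P, TRANS_P, EMIT_P)
```

With these toy numbers, `tagged` comes out as `["DET", "NOUN", "VERB"]`.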
-
This is an open issue where you can comment and add resources that might come in handy for NaNoGenMo.
NOTE: at some point I will turn this into a more organized document, probably on the [wiki for th…
-
Hi,
According to the [UD annotation guidelines](https://universaldependencies.org/u/dep/flat.html), `flat` seems to be the most apt relation for handling place names like "Rio de Janeiro", "Sao Paulo", …
-
```
I'm not sure to what extent this is possible; I've only looked into this
kind of thing very briefly,
but here goes.
Sometimes you'll have a blob of text with a few names in it, it'd be use…