-
first as on https://ichi.moe
- https://readevalprint.tumblr.com/post/97467849358/who-needs-graph-theory-anyway
- https://github.com/tshatrov/ichiran
or https://jisho.org
-
According to the JapaneseNumberFilter javadocs, it uses the attribute values of the last token used to compose the normalized number, which can be wrong. While this is documented it leads to a number …
-
Following the request #224 to know the preprocessing steps applied to the Wikipedia, I would like to go further and suggest the creation of a Tokenizer class that would wrap the references to those l…
-
Getting following error while using `kuroshiro` but it is only in some cases. 90% of the time, it is not throwing any error. I do not have the input to test for this case.
Stacktrace
```
TypeE…
-
#3 のコメントに書いた通り、まともな点字翻訳には助詞にあたるカナの変換(「は」→「わ」のようなもの)が不可欠になる。変換自体は自明だが、助詞を見つけ出す作業には日本語自然文に対する形態素解析が必要。
これを安直にMeCabでやろうとも考えたが、任意のCライブラリをインストールできないHerokuでは実行できなくなってしまう(基本的にGem経由でなければいけなさそう)。
Herokuに依存する…
tadd updated
10 years ago
-
If it is the combination of one kanji+ one kana, then it is likely to be a type I verb. Such 走る、歩く、切る,
(exceptions are 着る、寝る),while verbs with one+ kanji && one+ kana sould be type II, such as 起きる(on…
-
といってもどうする?
実際に何する?
-
**Describe the bug**
I'm using the OpenSearch Dashboards Dev Tools Console to make requests to an OpenSearch cluster. Some of the requests work as expected, but for a number of requests, the consol…
-
Unidic's lex data doesn't have enough information for the viterbi algorithm to distinguish words with the same readings and same word types in context. So お父さん is always interpreted as お・ちち・さん, instea…
-
First off, this is a genius project! Great use of Elastic ELK.
* 1) I should be able to send you something to set the default kibana index once I get back to my main computer this weekend.
* 2) H…