-
### Elasticsearch Version
8.14.1
### Installed Plugins
analysis-icu, analysis-kuromoji, analysis-nori, analysis-smartcn, analysis-stempel, analysis-ukrainian, ltr, mapper-size, repository-hdfs
### Java …
-
-
Tokenizing the sentence "寿司が美味しい。" produces the following tokens:
,,,
Tokenizing the same sentence written only in hiragana characters exhibits identical behavior, which is great.
,,,
However, for som…
-
MeCab has a `-N` flag that lets the user specify how many of the top-N results to return. On http://www.atilika.org/, the Viterbi algorithm's output graph shows all possible morphemes, along with the cost of …
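To make the top-N idea concrete, here is a minimal sketch of N-best extraction over a tiny morpheme lattice. It is not MeCab's or Kuromoji's implementation: the function `n_best`, the lattice layout, and every cost below are hypothetical, and the sketch uses word costs only (real analyzers also add connection costs between adjacent morphemes).

```python
import heapq

def n_best(lattice, length, n):
    """Return up to n segmentations with the lowest total cost.

    lattice maps a start offset to a list of (end_offset, surface, cost).
    Costs are assumed to be non-negative, so a best-first search pops
    complete paths in non-decreasing order of total cost.
    """
    heap = [(0, 0, [])]  # (cost so far, current offset, surfaces so far)
    results = []
    while heap and len(results) < n:
        cost, pos, path = heapq.heappop(heap)
        if pos == length:
            # Reached the end of the text: this is the next-best segmentation.
            results.append((cost, path))
            continue
        for end, surface, edge_cost in lattice.get(pos, []):
            heapq.heappush(heap, (cost + edge_cost, end, path + [surface]))
    return results

# Hypothetical lattice for the 3-character text "すしが" (costs are made up).
lattice = {
    0: [(2, "すし", 3), (1, "す", 4)],
    1: [(2, "し", 4)],
    2: [(3, "が", 1)],
}

for cost, morphemes in n_best(lattice, 3, 2):
    print(cost, morphemes)
```

With `n=1` this reduces to the usual single best path; larger values expose the lower-ranked segmentations that the output graph visualizes.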
-
Is it possible to add a feature so that one can find
"carry" with "carried"
"album" with "albums"
「止める」 with 「止められた」(Japanese)
especially when there would otherwise be 0 results?
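As a rough illustration of the requested behavior, the sketch below normalizes both the query and the indexed words before comparing them. The functions `simple_stem` and `search` are hypothetical and cover only a few English suffixes; this is not a real stemmer. The Japanese case (「止められた」 to 「止める」) cannot be handled by suffix rules like these and would need the base form from a morphological analyzer.

```python
def simple_stem(word: str) -> str:
    """Toy English suffix stripper (hypothetical, for illustration only)."""
    w = word.lower()
    if w.endswith(("ied", "ies")) and len(w) > 4:
        return w[:-3] + "y"   # carried -> carry, parties -> party
    if w.endswith("ed") and len(w) > 4:
        return w[:-2]         # walked -> walk
    if w.endswith("s") and not w.endswith("ss") and len(w) > 3:
        return w[:-1]         # albums -> album
    return w

def search(query: str, documents: list[str]) -> list[str]:
    """Return documents containing a word whose stem equals the query's stem."""
    key = simple_stem(query)
    return [
        doc for doc in documents
        if any(simple_stem(token) == key for token in doc.split())
    ]

docs = ["She carried the box", "two albums"]
print(search("carry", docs))
print(search("album", docs))
```

Applying the same normalization at index time and at query time is what lets "carry" and "carried" meet on a common key.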
-
### Overview
Upgrade Lucene to the 9.x series.
Remove unused libraries.
### Tasks (optional)
- org.apache.lucene:lucene-queryparser
- org.apache.lucene:lucene-analyzers-kuromoji -> org.apache.lucene:lucene-analysis-kuromoji
…
-
## Paper information
- Title: Sudachi: a Japanese Tokenizer for Business
- Authors: Kazuma Takaoka, Sorami Hisamoto, Noriko Kawahara, Miho Sakamoto, Yoshitaka Uchida, Yuji Matsumoto
- Paper link: https://www.aclweb.org/ant…
-
name | about | title | labels | assignees
-- | -- | -- | -- | --
💭 Proposal | Better support for German words decompounder | [PROPOSAL] | proposal |
## What kind of business use case are you tr…
-
First off, this is a genius project! Great use of Elastic ELK.
* 1) I should be able to send you something to set the default Kibana index once I get back to my main computer this weekend.
* 2) H…
-
This issue lists Renovate updates and detected dependencies. Read the [Dependency Dashboard](https://docs.renovatebot.com/key-concepts/dashboard/) docs to learn more.
## Rate-Limited
These updates a…