Closed mikemccand closed 5 years ago
Commit 2ac5fb668a83f6484276de8059875bebf6a7ce16 in lucene-solr's branch refs/heads/master from Namgyu Kim https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=2ac5fb6
LUCENE-8904: enhance Nori DictionaryBuilder tool
[Legacy Jira: ASF subversion and git services on Jul 11 2019]
Commit 2ac5fb668a83f6484276de8059875bebf6a7ce16 in lucene-solr's branch refs/heads/jira/SOLR-13565 from Namgyu Kim https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=2ac5fb6
LUCENE-8904: enhance Nori DictionaryBuilder tool
[Legacy Jira: ASF subversion and git services on Jul 12 2019]
Commit 70854dc1efbdc1d7efdb8ac0421c69d36ea6e31f in lucene-solr's branch refs/heads/branch_8x from Namgyu Kim https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=70854dc
LUCENE-8904: enhance Nori DictionaryBuilder tool
[Legacy Jira: ASF subversion and git services on Aug 08 2019]
Closing after the 9.0.0 release
[Legacy Jira: Adrien Grand (@jpountz) on Dec 08 2021]
It is the Nori version of @msokolov's LUCENE-8863. This patch has two changes. 1) Improve exception handling 2) Enable external dictionary for testing
Overall, it is the same as LUCENE-8863.
But there are some differences between Nori and Kuromoji. These can be slightly different on the code. 1) CSV field size Nori : 12 Kuromoji : 13 2) left context ID == right context ID Nori : can be different Kuromoji : always same 3) Dictionary Type Nori : just one type Kuromoji : IPADIC, UNIDIC
After this job, I'll apply LUCENE-8866 and LUCENE-8871 to Nori.
Legacy Jira details
LUCENE-8904 by Namgyu Kim (@danmuzi) on Jul 07 2019, resolved Aug 08 2019