issues
search
atilika
/
kuromoji
Kuromoji is a self-contained and very easy to use Japanese morphological analyzer designed for search
Apache License 2.0
945
stars
130
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Kuromoji_tokenizer: sort clause does not seem to work for some specific character combinations
#141
ajaypvymo
opened
4 months ago
0
Kanji penalty and other penalty
#140
elialm7
opened
7 months ago
0
Handling of userDictionary comments
#139
tottokug
opened
1 year ago
0
日本人 is not divided into two sections even in extended mode
#138
hohno-panopto
opened
2 years ago
0
ソーシャルメディア is not tokenized into two words
#137
hohno-panopto
opened
2 years ago
0
Improve Travis CI build Performance
#136
chenzhang22
closed
1 year ago
0
Question: Is there any way to update neologd dictionary?
#135
tstsukahara
opened
3 years ago
0
How to enable discardPunctuation in Kuromoji Java
#134
yanghanxy
opened
3 years ago
0
how to increase heap size other than MAVEN_OPS
#133
kazukousen
opened
3 years ago
0
Configuring with Maven
#132
Zurdge
closed
4 years ago
3
Next release?
#131
mpriala-code
opened
4 years ago
2
Optimization opportunity in the fst usage.
#130
fulmicoton
opened
4 years ago
2
Kuromoji POS Train
#129
abhinandansrivastava
opened
5 years ago
0
Fix bug with overflow bits in patricia trie
#127
emmanuellegedin
opened
5 years ago
0
Normalized surface in user dictionary.
#126
mrikitoku
opened
6 years ago
5
tokenize 一人(ひとり,hitori)will be seperate as 一(いち,ichi) 人(ひと,hito)
#125
andy840119
opened
6 years ago
1
Builder method to configure custom ResourceResolver
#124
logogin
opened
6 years ago
3
Nexus Repository is Offline?
#123
ryantenney
closed
6 years ago
1
Small typo in the README
#122
Pisush
opened
6 years ago
0
Obtain furigana?
#121
0x6C38
closed
6 years ago
1
How to use Kuromoji in Gradle?
#120
weituotian
closed
6 years ago
4
Why does tokenized Kanji features never contains Hiragana ?
#119
theGlenn
closed
6 years ago
1
Unidic design flaw
#118
wareya
opened
6 years ago
4
Internals documentation and academic papers?
#117
DarrenCook
closed
6 years ago
1
Possible Issue with tokenization when English+Japanese are adjacent in text
#116
bbguitar77
opened
7 years ago
0
Longer string in Katakana has low priority
#115
oharato
opened
7 years ago
0
Debug graph for multi-tokenization
#114
emmanuellegedin
opened
7 years ago
0
The tokenizing performance of mixed language
#113
kwkwvenusgod
opened
7 years ago
0
Fixed overflow error
#112
emmanuellegedin
closed
7 years ago
1
http://www.atilika.org showcases the outdated maven artifact repository information
#111
titsuki
opened
7 years ago
0
java.lang.RuntimeException: Could not load dictionaries. Caused by: java.io.IOException: Classpath resource not found: fst.bin
#110
proninalex
opened
7 years ago
0
Made code nicer
#109
emmanuellegedin
closed
7 years ago
1
Segmentation wrong with token contains square brackets?
#108
reckart
opened
7 years ago
3
Is there any example for Lemmatization?
#107
RangerWolf
opened
8 years ago
0
Feature n best search
#106
emmanuellegedin
closed
7 years ago
1
Tokenizing text in Hiragana character set
#105
mhko
opened
8 years ago
5
Compound word with nakaguro in it
#104
mhko
closed
6 years ago
2
Unidic Tokenization on Romaji Words
#103
tobias-khs
opened
8 years ago
0
Fixed bug with partially overlapping entries in user dictionary. Adde…
#102
gautela
closed
8 years ago
1
Prevent an array copy if it can be provided by the buffer itself
#101
cmoen
closed
8 years ago
0
Update to NIO read methods to support Android implementation
#100
gerryhocks
closed
8 years ago
1
Question: how to obtain multiple parsings?
#99
fasiha
opened
8 years ago
5
Android runtime exception when creating new Tokenizer using kuromoji-ipadic
#98
hk0i
closed
8 years ago
14
Changed user dictionary word cost to prefer longest match
#97
gautela
closed
8 years ago
0
Kuromoji on Android
#96
jendib
closed
8 years ago
9
Disable simple benchmark tests as they often deplete heap-size
#95
cmoen
closed
8 years ago
0
Upgraded NEologd version to 20151224 (Merry Christmas!)
#94
cmoen
closed
8 years ago
0
Fixed a few typos. Added full-features user dictonary test for kuromo…
#93
cmoen
closed
8 years ago
0
Fixed a typo that snuck into a test-case
#92
cmoen
closed
8 years ago
0
Added user dictionary support for entries with full features, including weights, etc.
#91
cmoen
closed
8 years ago
0
Next