issues
search
WorksApplications
/
SudachiPy
Python version of Sudachi, a Japanese tokenizer.
Apache License 2.0
392
stars
50
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Cython based optimization
#123
polm
closed
4 years ago
7
Improve error messages related to dictionary setup
#122
sorami
closed
4 years ago
0
Fix a bug causing … is converted to "", "", "…"
#121
sorami
closed
4 years ago
0
Tokenizing Ellipsis creates empty tokens
#120
polm
opened
4 years ago
6
Major content update
#119
sorami
closed
4 years ago
0
How do I run the tests?
#118
polm
closed
4 years ago
3
Speed up execution by re-using unk info
#117
polm
closed
4 years ago
1
Even if registered in the user dictionary, 「MI6」 is divided.
#116
maharada2
closed
4 years ago
5
Release v0.4.3
#115
hiroshi-matsuda-rit
closed
4 years ago
3
upgrade dartsclone from 0.7.0 to 0.9.0
#114
hiroshi-matsuda-rit
closed
4 years ago
1
Can I pickle WordInfoList?
#113
knok
closed
4 years ago
10
How can we detect unknown words?
#112
fullflu
closed
4 years ago
7
Resolve #99
#111
izziiyt
closed
4 years ago
6
Update SudachiDict_core and dartsclone version.
#110
kanjirz50
closed
4 years ago
3
Fix the disconnected EOS problem
#109
sorami
closed
4 years ago
1
feat: unuse symlink & use resoources/sudachi.json
#108
izziiyt
closed
2 years ago
9
SudachiPy doesn't work with Windows with "OSError: symbolic link privilege not held"
#107
chezou
closed
3 years ago
5
Fix the oov flag method call
#106
sorami
closed
5 years ago
0
OOV flag is not properly set
#105
sorami
closed
5 years ago
0
Fix a reading form error
#104
sorami
closed
5 years ago
0
No reading form for certain words
#103
sorami
closed
5 years ago
0
doc: update requirements.txt
#102
izziiyt
closed
5 years ago
0
Slack invitation link is expired.
#101
natsuume
closed
5 years ago
2
Is it possible to import and use sudachipy's full dictionary directly?
#100
BLKSerene
closed
3 years ago
2
"BufferError" happens when calling create method of dictionary object
#99
araiman
closed
4 years ago
2
update README.md
#98
izziiyt
closed
5 years ago
0
cythonize dartscloen
#97
izziiyt
closed
5 years ago
0
AttributeError: EOS is not connected to BOS
#96
sig-miyamoto
closed
4 years ago
5
UnicodeDecodeError
#95
KHiyowa
opened
5 years ago
0
optimize JoinNumericPlugin
#94
izziiyt
opened
5 years ago
0
optimize JoinKatakana plugin
#93
izziiyt
opened
5 years ago
0
tokenize -d option && Resolve #82
#92
izziiyt
closed
5 years ago
0
doc: modify README
#91
izziiyt
closed
5 years ago
0
Resolve #86
#90
izziiyt
closed
5 years ago
0
Resolve #65
#89
izziiyt
closed
5 years ago
0
What's the tagset used by SudachiPy?
#88
BLKSerene
closed
5 years ago
3
Failed to install the dictionary on Windows
#87
BLKSerene
closed
5 years ago
2
link command has inconsistent effect to build and tokenize command
#86
izziiyt
closed
5 years ago
0
Resolve #83
#85
izziiyt
closed
5 years ago
0
refactor: make _compile function clear
#84
izziiyt
closed
5 years ago
0
parsing user-dictionary name
#83
KHiyowa
closed
5 years ago
2
handling alphabet
#82
KHiyowa
closed
5 years ago
2
improve character category search
#81
izziiyt
closed
5 years ago
0
build user dictionary failed with a long csv
#80
AlloVince
closed
5 years ago
5
fix: fixed logger TypeError issue
#79
AlloVince
closed
5 years ago
1
Improve Speed
#78
izziiyt
closed
5 years ago
0
Improve Speed
#77
izziiyt
closed
5 years ago
0
fix: versioning when pip install
#76
izziiyt
closed
5 years ago
0
auto versioning & follow PEP396
#75
izziiyt
closed
5 years ago
0
faster parsing
#74
izziiyt
closed
3 years ago
8
Previous
Next