issues
search
Yoctol
/
purewords
Create pure sentences
3
stars
2
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
correct token
#46
SoluMilken
closed
6 years ago
0
added status badges
#45
SoluMilken
closed
6 years ago
0
Update setup
#44
SoluMilken
closed
6 years ago
0
added install-require
#43
SoluMilken
closed
6 years ago
0
default None
#42
SoluMilken
opened
6 years ago
0
(pattern)|(pattern) -> pattern|pattern
#41
SoluMilken
opened
7 years ago
0
patch
#40
SoluMilken
opened
7 years ago
1
What is token_num_filter ?
#39
SoluMilken
closed
7 years ago
2
Invertible filter
#38
SoluMilken
opened
7 years ago
0
Filters should be invertible
#37
SoluMilken
opened
7 years ago
1
remove empty line in clean document
#36
plliao
closed
7 years ago
0
Jieba object lock problem
#35
plliao
opened
7 years ago
0
fix multi thread bug
#34
plliao
closed
7 years ago
0
Fix tokenize sentence bug
#33
plliao
closed
7 years ago
0
有要統一lcut就是噴出list, cut就是個generator嗎????
#32
SoluMilken
opened
7 years ago
0
use find package in setup.py
#31
plliao
closed
7 years ago
0
import BaseFilterCollection in init.py
#30
plliao
closed
7 years ago
0
Default filter collection
#29
plliao
closed
7 years ago
0
escape . in abbreviation filter
#28
plliao
closed
7 years ago
0
Migrate filter object
#27
plliao
closed
7 years ago
0
Add base filter testcases
#26
plliao
closed
7 years ago
0
add show process
#25
SoluMilken
closed
7 years ago
1
migrate filter collection
#24
plliao
closed
7 years ago
0
Base filter object
#23
plliao
closed
7 years ago
0
add base filter
#22
plliao
closed
7 years ago
0
base_filter_collection
#21
SoluMilken
closed
7 years ago
2
refact split sentences
#20
plliao
closed
7 years ago
0
replace to specific tokens
#19
plliao
closed
7 years ago
0
add jieba testcases
#18
plliao
closed
7 years ago
0
structure testcases
#17
plliao
closed
7 years ago
0
Replace time, url, phone_number, number to specific token
#16
plliao
closed
7 years ago
1
let jieba tokenizer setting be independent
#15
plliao
closed
7 years ago
0
Jieba tokenizer
#14
plliao
closed
7 years ago
0
Add customed dictionary in purewords
#13
plliao
closed
7 years ago
1
add space on splitting token to produce correct word counting
#12
plliao
closed
7 years ago
0
Split document function does too many things
#11
plliao
closed
7 years ago
0
split with semi-colon and use word length in splitting
#10
plliao
closed
7 years ago
0
Add strict test cases
#9
plliao
opened
7 years ago
2
pass command line arguments into purewords
#8
plliao
closed
7 years ago
0
Update README.md
#7
plliao
closed
7 years ago
3
split sentence 和 cut sentence 名字好模糊阿阿阿阿阿
#6
SoluMilken
closed
7 years ago
0
Enrich ReadMe
#5
SoluMilken
closed
7 years ago
0
Several url can't be removed
#4
plliao
opened
7 years ago
0
add remove blank
#3
plliao
closed
7 years ago
0
add circleci.yml
#2
plliao
closed
7 years ago
0
Sentence preprocessing
#1
plliao
closed
7 years ago
1