Yoctol/strpipe
text preprocessing pipeline
Other
5 stars, 0 forks
issues
correct-badge
#76
SoluMilken
closed
5 years ago
1
added code conv
#75
SoluMilken
closed
5 years ago
1
Fixed cython version to 0.29.1
#74
SoluMilken
closed
5 years ago
0
int_normalizer
#73
SoluMilken
closed
5 years ago
1
fix_bug: Fixtures are not meant to be called directly
#72
SoluMilken
closed
5 years ago
2
Add a number processing op
#71
stegben
opened
5 years ago
0
[WIP] remove tokenizer-hub from requirements
#70
stegben
opened
5 years ago
0
Remove dependency: tokenizer_hub
#69
stegben
opened
5 years ago
0
version_050
#68
SoluMilken
closed
5 years ago
0
changed flake8 to flake-config-yoctol
#67
SoluMilken
closed
5 years ago
0
added en word tokenizer
#66
SoluMilken
closed
5 years ago
0
pass json dump kwargs when call save_json of pipe
#65
stegben
closed
5 years ago
0
Setup extensions
#64
stegben
closed
5 years ago
0
assert input list is immutable
#63
SoluMilken
closed
5 years ago
0
Fix mutable pad
#62
stegben
closed
5 years ago
0
fix upload bug without pxd file
#61
stegben
closed
5 years ago
0
Add pxd
#60
SoluMilken
closed
5 years ago
1
add skip-cleanup when CI
#59
stegben
closed
5 years ago
0
add checkpoint example in README
#58
stegben
closed
5 years ago
1
Refactor step_info serialization
#57
stegben
opened
5 years ago
0
Collect Intermediate data
#56
SoluMilken
closed
5 years ago
8
Separated ops
#55
SoluMilken
closed
5 years ago
0
Need fullwidth -> halfwidth normalizer ??
#54
SoluMilken
opened
5 years ago
1
rename TokenToIndexWithUnk to TokenToIndex
#53
stegben
closed
5 years ago
3
[BUG]: index2token change after serialization
#52
SoluMilken
closed
5 years ago
3
Update README.md
#51
stegben
closed
5 years ago
2
Not consistent
#50
SoluMilken
closed
5 years ago
1
reversible add sos eos and return sentlen after add sos eos
#49
SoluMilken
closed
5 years ago
2
sentlen ?
#48
SoluMilken
closed
5 years ago
1
added get state
#47
SoluMilken
closed
5 years ago
1
added char tokenizer ops
#46
SoluMilken
closed
5 years ago
0
annotated normalizer
#45
SoluMilken
opened
5 years ago
1
string join and split
#44
SoluMilken
closed
5 years ago
0
Need update pypi
#43
SoluMilken
closed
5 years ago
2
Modified readme
#42
SoluMilken
closed
5 years ago
0
Pipe helper
#41
SoluMilken
closed
5 years ago
3
fix method name of Pipe
#40
stegben
closed
5 years ago
0
add_step(s or no s)_by_op_name ????
#39
SoluMilken
closed
5 years ago
0
Pad
#38
absolutelyNoWarranty
closed
5 years ago
1
removed all .so files in strpipe
#37
SoluMilken
closed
5 years ago
0
[WIP]Pad
#36
absolutelyNoWarranty
closed
5 years ago
3
added normalizer op
#35
SoluMilken
closed
5 years ago
0
Fix docs building with cython
#34
absolutelyNoWarranty
closed
5 years ago
1
added zh char tokenizer op
#33
SoluMilken
closed
5 years ago
0
Fix docs makefile - include apidocs and readme link
#32
absolutelyNoWarranty
closed
5 years ago
2
Regression test
#31
stegben
closed
5 years ago
0
Property-based testing
#30
absolutelyNoWarranty
opened
5 years ago
0
token <-> index operation
#29
SoluMilken
closed
5 years ago
2
README add extend-ops guide
#28
stegben
closed
5 years ago
0
Candidate ops
#27
SoluMilken
opened
5 years ago
10