huu4ontocord/rio
Text pre-processing for NLP datasets
Apache License 2.0
11 stars
6 forks
issues
updated rulebase
#32
ianyu93
closed
2 years ago
0
Create CODE_OF_CONDUCT.md
#31
huu4ontocord
closed
2 years ago
0
Load multiple supported kenlm models and check fakename
#30
vumichien
closed
2 years ago
1
Add ontology.json.gz
#29
huu4ontocord
closed
2 years ago
0
Update ontology storage and access
#28
huu4ontocord
closed
2 years ago
0
Make all indentation to be 4 spaces
#27
edugp
closed
2 years ago
1
Generalize FakeNameGenerator to any pattern of name configurations
#26
edugp
closed
2 years ago
1
add catalan names
#25
mapama247
closed
2 years ago
2
Update fake_names.py
#24
shamikbose
closed
2 years ago
0
Add libpostal address detection
#23
huu4ontocord
opened
2 years ago
5
tokenizers
#22
PierreColombo
closed
2 years ago
0
Update README.md
#21
PierreColombo
closed
2 years ago
0
Update fake_names.py
#20
PierreColombo
closed
2 years ago
1
Update fake_names.py
#19
PierreColombo
closed
2 years ago
1
updated regex rulebase
#18
ianyu93
closed
2 years ago
0
updated regex rulebase
#17
ianyu93
closed
2 years ago
0
Update lexicon to fix diseases
#16
huu4ontocord
closed
2 years ago
0
Update language:country mappings
#15
j-chim
closed
2 years ago
0
More fake names
#14
huu4ontocord
opened
2 years ago
2
Tie fake data together
#13
huu4ontocord
opened
2 years ago
0
Refactor code from process.py
#12
huu4ontocord
opened
2 years ago
0
Add documentation
#11
huu4ontocord
opened
2 years ago
0
truncate batch width to 512
#10
huu4ontocord
closed
2 years ago
1
updated Chinese regex
#9
ianyu93
closed
2 years ago
1
Adding TurkuNLP sample data
#8
huu4ontocord
closed
2 years ago
0
Merge dev into main
#7
justinphan3110
closed
2 years ago
0
Some Cleaning up the code
#6
justinphan3110
closed
2 years ago
0
Add parallel processing of datasets
#5
huu4ontocord
opened
2 years ago
0
Do winner takes all for NER tags
#4
huu4ontocord
opened
2 years ago
1
Do span splitting and merging for final ner tags
#3
huu4ontocord
closed
2 years ago
0
Fix Zh NER tagging
#2
huu4ontocord
opened
2 years ago
0
CLI to test
#1
huu4ontocord
closed
2 years ago
1