mead-ml/mead-baseline
Deep-Learning Model Exploration and Development for NLP
Apache License 2.0
243 stars, 73 forks
Chore/improve spm int #905
Closed (dpressel closed 2 years ago)
dpressel commented 2 years ago
Adds an SPM "normal" vectorizer (no additional outside token handling)
Adds dict1d and label vectorizers for GPT2/RoBERTa, enabling tagging tasks
Changes the sentencepiece dependency
Updates a few pretraining defaults and includes SPM everywhere
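
For readers unfamiliar with why tagging tasks need a separate label vectorizer for subword models, here is a minimal sketch of one common approach: the label is kept on the first subword of each word and the remaining pieces are padded. This is an illustration under stated assumptions, not the code in this PR. `toy_subword_tokenize`, `align_labels`, and the `<PAD>` label are hypothetical names, and the fixed-width splitter only stands in for a real SentencePiece or BPE tokenizer.

```python
from typing import Callable, List, Tuple


def toy_subword_tokenize(word: str, max_len: int = 3) -> List[str]:
    """Hypothetical stand-in for a subword tokenizer.

    Splits a word into chunks of at most `max_len` characters. A real
    SentencePiece or BPE model would produce different pieces.
    """
    return [word[i:i + max_len] for i in range(0, len(word), max_len)]


def align_labels(
    words: List[str],
    labels: List[str],
    tokenize: Callable[[str], List[str]] = toy_subword_tokenize,
    pad_label: str = "<PAD>",
) -> Tuple[List[str], List[str]]:
    """Expand word-level tags so there is exactly one tag per subword.

    The first subword of each word keeps the word's tag; every following
    subword gets `pad_label`, which a loss function would typically ignore.
    """
    if len(words) != len(labels):
        raise ValueError("words and labels must have the same length")

    pieces: List[str] = []
    piece_labels: List[str] = []
    for word, label in zip(words, labels):
        subwords = tokenize(word)
        if not subwords:
            # Skip words that produce no pieces so the outputs stay aligned.
            continue
        pieces.extend(subwords)
        piece_labels.extend([label] + [pad_label] * (len(subwords) - 1))
    return pieces, piece_labels


if __name__ == "__main__":
    toks, tags = align_labels(["Barack", "Obama", "spoke"], ["B-PER", "I-PER", "O"])
    for tok, tag in zip(toks, tags):
        print(f"{tok}\t{tag}")
```

Labelling only the first piece keeps the number of scored positions equal to the number of words, so word-level evaluation is unaffected by how the tokenizer splits each word.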