google-research / uda

Unsupervised Data Augmentation (UDA)
https://arxiv.org/abs/1904.12848
Apache License 2.0

Is UDA (for text classification) purely based on back-translation? #57

Open abrhaleitela opened 4 years ago

abrhaleitela commented 4 years ago

I want to augment labeled data for sentiment analysis in a low-resource language for which there are no reasonable MT models to do back-translation. Is there a way to do that without using back-translation?

Thanks

michaelpulsewidth commented 4 years ago

You can also try a simpler data augmentation method, SwitchOut, which replaces words in a sentence with words sampled at random from your vocabulary.
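
For illustration, here is a minimal sketch of that kind of random word replacement. It is a simplification and not the reference SwitchOut implementation: as far as I understand, the published method samples how many positions to corrupt, whereas this sketch replaces each token independently with a fixed probability. The function name `random_word_replace` and the parameters `replace_prob` and `rng` are hypothetical names chosen for this example, and tokenization is assumed to be plain whitespace splitting.

```python
import random


def random_word_replace(tokens, vocab, replace_prob=0.1, rng=None):
    """Return a copy of `tokens` where each token is swapped, with
    probability `replace_prob`, for a word drawn uniformly from `vocab`.

    Simplified SwitchOut-style augmentation (assumption: independent
    per-token replacement instead of sampling the number of swaps).
    """
    rng = rng or random.Random()
    augmented = []
    for tok in tokens:
        if rng.random() < replace_prob:
            # Replace this position with a uniformly sampled vocabulary word.
            augmented.append(rng.choice(vocab))
        else:
            # Keep the original token unchanged.
            augmented.append(tok)
    return augmented


if __name__ == "__main__":
    sentence = "the movie was surprisingly good".split()
    # Toy vocabulary; in practice this would be built from your own corpus.
    vocab = ["film", "bad", "great", "story", "boring", "the", "was"]
    print(random_word_replace(sentence, vocab, replace_prob=0.2,
                              rng=random.Random(0)))
```

Since the vocabulary comes from your own corpus, this needs no MT model. A small `replace_prob` is probably safer for sentiment data, because swapping a polarity word can change the label.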