owos / afri_augs

Data Augmentation for Generative models

Create a script to perform back translations #6

Open Iambusayor opened 10 months ago

Iambusayor commented 10 months ago

Back-translation uses monolingual data to generate additional training data for a machine translation (MT) task. A backward (target-to-source) intermediate model is trained on the available parallel corpora and then used to translate monolingual target-language text into synthetic source sentences. The resulting synthetic pairs are mixed into the original training data. The paper "Measuring the Impact of Data Augmentation Methods for Extremely Low-Resource NMT" explains this further.

You're expected to write a script/class to generate additional training data for a given language using the back-translation technique.
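As a rough starting point, the steps above could be sketched like this. It is only an illustration, not the expected solution: `BackTranslator`, `backward_translate`, `generate`, `augment`, and `toy_backward_model` are all hypothetical names, and the toy model just uppercases text as a stand-in for a real trained target-to-source model.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple

Pair = Tuple[str, str]  # (source sentence, target sentence)


@dataclass
class BackTranslator:
    # Assumption: any callable that maps a batch of target-language
    # sentences to source-language sentences (the trained backward model).
    backward_translate: Callable[[List[str]], List[str]]
    batch_size: int = 32

    def generate(self, monolingual_target: Iterable[str]) -> List[Pair]:
        """Create synthetic (source, target) pairs from monolingual target text."""
        # Drop empty lines and surrounding whitespace before translating.
        sentences = [s.strip() for s in monolingual_target if s.strip()]
        synthetic: List[Pair] = []
        for start in range(0, len(sentences), self.batch_size):
            batch = sentences[start:start + self.batch_size]
            sources = self.backward_translate(batch)
            if len(sources) != len(batch):
                raise ValueError("backward model must return one output per input")
            # Synthetic source paired with the genuine target sentence.
            synthetic.extend(zip(sources, batch))
        return synthetic

    def augment(self, parallel: Iterable[Pair],
                monolingual_target: Iterable[str]) -> List[Pair]:
        """Mix the real parallel data with the synthetic pairs."""
        return list(parallel) + self.generate(monolingual_target)


def toy_backward_model(batch: List[str]) -> List[str]:
    # Placeholder only: a real backward model would translate, not uppercase.
    return [sentence.upper() for sentence in batch]


bt = BackTranslator(toy_backward_model, batch_size=2)
parallel = [("src sentence 1", "tgt sentence 1")]
monolingual = ["tgt sentence 2", "   ", "tgt sentence 3"]
augmented = bt.augment(parallel, monolingual)
```

Keeping the backward model behind a plain callable means the same class could wrap whichever MT toolkit the project ends up using, and the real target sentences always stay on the target side of each synthetic pair.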

ThisisPromise commented 10 months ago

Assign me this issue

ikeadeoyin commented 10 months ago

Can I work on this with @ThisisPromise?

Iambusayor commented 10 months ago

> Can I work on this with @ThisisPromise?

@ThisisPromise do you want any help with this?