How to use MASS for Style Transfer?

microsoft / MASS

MASS: Masked Sequence to Sequence Pre-training for Language Generation

https://arxiv.org/pdf/1905.02450.pdf

Other

1.11k stars 206 forks source link

Closed him-mah10 closed 4 years ago

him-mah10 commented 4 years ago

Hello, I want to finetune for style transfer task.

I have following doubts:

If I want to pre-train, what should be the value of arguments --lgs and --mass_steps? As per my understanding both of them should have value en.
When I- finetune what should be the value of argument --bt_steps? Now for en to fr conversion, the value was en-fr-en, but for style transfer how to finetune?
Can I use MASS-base-uncased as already pre-trained model and use it for fine tuning?

Thank-you!

StillKeepTry commented 4 years ago

Maybe you can consider A-style corpus as en and B-style corpus as fr. and training.
If you already have parallel data, maybe use --mt_steps 'A-B'. You can also use --bt_steps 'A-B-A' and 'B-A-B' for on-the-fly back-translation.
Of course, you can use Mass-base-uncased for directly fine-tuning.

him-mah10 commented 4 years ago

@StillKeepTry could you provide me a way to fine tune mass-base-uncased for style transfer tasks? TIA