Closed junwang-wish closed 2 years ago
I think that BLOOM might be a good candidate for that. After UL2 training you might want to try instruction tuning like BLOOMZ, FLAN or T0. But a good workaround could be (i) include instruction tuning samples (xp3mt, p3 etc) in the "prefix LM" objective function, (ii) include other objective function like span_corruption
and continue UL2 training.
Thanks!
Is mT0 suitable / recommended for continued training on mixture of denoising (span corruption, extreme span corruption, prefix LM) tasks similar to UL2? Like below
My domain text is quite different from internet text so I assume span corruption task would help mT0 learn special syntax / semantics of my domain.