EleutherAI / project-menu

[Project] Preparatory-Training (Placeholder name) #32

Closed: zphang closed this issue 1 year ago

zphang commented 3 years ago

tl;dr: Add a second phase of pretraining to explicitly train models to be good at prompt tuning and few-shot learning.

Full proposal: https://docs.google.com/document/d/1ER4tCfXO_Qn6GBWLi70sjrz-6kaMj8HCrTmFFnLf088/

Short version:

  1. Choose a pretrained model, e.g. GPT-J, T5, or GLM.
  2. Build a large set of NLU and NLG tasks, piggybacking on https://github.com/EleutherAI/lm_evaluation_harness/ where possible (see the first sketch below).
  3. Do a second phase of pretraining on the massive task dataset (important: it has to be at pretraining scale, not fine-tuning scale), incorporating prompt tuning during training (see the second sketch below).
  4. Evaluate!
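
A minimal sketch of step 2, assuming a hypothetical task registry in the spirit of the lm_evaluation_harness tasks; the task names, example fields, and templates below are illustrative, not the harness's actual API:

```python
import random

# Hypothetical task registry: each task supplies raw examples plus one or
# more prompt templates that render an example into a (prompt, target) pair.
TASKS = {
    "sentiment": {
        "examples": [{"text": "A delightful film.", "label": "positive"}],
        "templates": [
            lambda ex: (f"Review: {ex['text']}\nSentiment:", f" {ex['label']}"),
        ],
    },
    "summarization": {
        "examples": [{"document": "A long article ...", "summary": "A short summary."}],
        "templates": [
            lambda ex: (f"Summarize: {ex['document']}\nSummary:", f" {ex['summary']}"),
        ],
    },
}

def build_mixture(tasks, size, seed=0):
    """Sample a task-balanced stream of (prompt, target) training pairs."""
    rng = random.Random(seed)
    mixture = []
    for _ in range(size):
        task = tasks[rng.choice(sorted(tasks))]
        example = rng.choice(task["examples"])
        template = rng.choice(task["templates"])
        mixture.append(template(example))
    return mixture

for prompt, target in build_mixture(TASKS, size=4):
    print(repr(prompt), "->", repr(target))
```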
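And a minimal sketch of step 3's "incorporating prompt tuning during training", assuming a decoder-only LM wrapped so that trainable soft-prompt vectors are prepended to its token embeddings. The `embed` and `forward_embeds` hooks are hypothetical, not a real library interface; in the proposed second pretraining phase, the LM weights would presumably be updated jointly with the soft prompt, unlike classic prompt tuning, which freezes the LM:

```python
import torch
import torch.nn as nn

class SoftPromptWrapper(nn.Module):
    """Prepend trainable soft-prompt vectors to a language model's inputs.

    Assumes a hypothetical LM interface with `embed(input_ids)` returning
    token embeddings of shape (batch, seq, d_model) and
    `forward_embeds(embeds)` returning next-token logits.
    """

    def __init__(self, lm, n_prompt_tokens=20, d_model=4096):
        super().__init__()
        self.lm = lm
        # Small random init, as is common for soft prompts.
        self.soft_prompt = nn.Parameter(0.02 * torch.randn(n_prompt_tokens, d_model))

    def forward(self, input_ids):
        token_embeds = self.lm.embed(input_ids)            # (B, T, d)
        batch = token_embeds.size(0)
        prompt = self.soft_prompt.unsqueeze(0).expand(batch, -1, -1)
        embeds = torch.cat([prompt, token_embeds], dim=1)  # (B, P + T, d)
        return self.lm.forward_embeds(embeds)

# During the second pretraining phase, both the LM parameters and the soft
# prompt would receive gradients (classic prompt tuning would instead freeze
# the LM and train only `soft_prompt`):
# model = SoftPromptWrapper(lm, n_prompt_tokens=20, d_model=lm_hidden_size)
# optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
```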
StellaAthena commented 2 years ago

@zphang To what extent would you say that this was scooped by FLAN and T0?