minimaxir / gpt-2-simple

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
Other
3.4k stars 675 forks source link

New transformer architecture #289

Open aletote opened 2 years ago

aletote commented 2 years ago

I see that, like GANs in their time, a lot of new transformer architectures came out. Is there a better one that we could use now than gpt-2 to generate for example, in my case, ABC music?

NeonWizard commented 2 years ago

This project's name is "gpt-2-simple". It's a simplification wrapper around the GPT2 model. Changing to a different architecture or model is out of scope for this project.