vilmarzti / long_context_transformers

This is the repository for my Master Thesis where I analyse transformer architectures for long contexts
GNU General Public License v3.0
2 stars 0 forks source link

Vanilla Transformer (baseline) #3

Closed vilmarzti closed 2 years ago

vilmarzti commented 2 years ago

Maybe GPT-2?

vilmarzti commented 2 years ago

Implemented as GPT, might change to GPT2 later