issues
search
vilmarzti
/
long_context_transformers
This is the repository for my Master Thesis where I analyse transformer architectures for long contexts
GNU General Public License v3.0
2
stars
0
forks
source link
Vanilla Transformer (baseline)
#3
Closed
vilmarzti
closed
2 years ago
vilmarzti
commented
2 years ago
Maybe GPT-2?
vilmarzti
commented
2 years ago
Implemented as GPT, might change to GPT2 later
Maybe GPT-2?