issues
search
vilmarzti
/
long_context_transformers
This is the repository for my Master Thesis where I analyse transformer architectures for long contexts
GNU General Public License v3.0
2
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Batch perplexity
#13
vilmarzti
closed
2 years ago
0
Padding
#12
vilmarzti
closed
2 years ago
0
Routing Transformer weights available if useful
#11
GenTxt
opened
2 years ago
1
Training Routine
#10
vilmarzti
closed
2 years ago
0
Create Licence
#9
vilmarzti
closed
2 years ago
0
Implement Unviversal Transformer
#8
vilmarzti
opened
2 years ago
0
Implement Routing Transformer
#7
vilmarzti
opened
2 years ago
1
Transformer training settings
#6
vilmarzti
opened
2 years ago
0
Implement autoregression LM in LongFormer
#5
vilmarzti
opened
2 years ago
0
Scaling Transfomers
#4
vilmarzti
opened
2 years ago
0
Vanilla Transformer (baseline)
#3
vilmarzti
closed
2 years ago
1
LinFormer
#2
vilmarzti
opened
2 years ago
0
Compressive Transforer
#1
vilmarzti
opened
2 years ago
2