rkfg/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
MIT License
20 stars
7 forks
issues
Since the merge from nshepperd, the state of the Adam optimizer is no longer saved
#12
allo-
opened
4 years ago
7
GPT's (and GPT-2's) architecture
#11
ZheMann
opened
5 years ago
0
Error after 100th training epoch
#10
zendevil
opened
5 years ago
4
Protobuf::FatalException
#9
zendevil
closed
5 years ago
2
models/345M/checkpoint/run1; Not a directory
#8
zendevil
closed
5 years ago
20
Loss calculation and updating weights
#7
ZheMann
closed
5 years ago
7
Duration of encoding a dataset ~2.4GB
#6
ZheMann
closed
5 years ago
12
Use of prefix and suffix to distinguish between documents
#5
ZheMann
closed
5 years ago
3
Creating dictionary files
#4
ZheMann
closed
5 years ago
7
Mismatch between shape of tensors (due to vocabulary size)
#3
ZheMann
closed
5 years ago
9
AttributeError while running train.py
#2
ZheMann
closed
5 years ago
1
Training on Telugu-English corpus
#1
ghost
opened
5 years ago
3