issues
search
karpathy
/
minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
MIT License
20.3k
stars
2.53k
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Minor spelling in demo.ipynb
#144
MicahMaphet
opened
1 week ago
1
Reallly small correction
#143
manuelsh
opened
1 month ago
0
Data preprocessing
#142
umertens
closed
2 months ago
2
get it to run on aws ec2
#141
cchen9
closed
3 months ago
0
<|endoftext|> token isn't encoded correctly
#140
ttumiel
opened
5 months ago
2
Why not causal-masking MLP layer?
#139
MikeTkachuk
closed
5 months ago
0
DEMO: minGPT on tinygrad
#138
ziliangpeng
opened
7 months ago
0
rename minGPT to tinyGPT
#137
ziliangpeng
closed
7 months ago
0
We collect more than 60 open source software for CAX and want to train miniGP to be the AI assistant.
#136
fengsim
opened
7 months ago
0
What is the purpose of `c_proj` here?
#135
brynhayder
opened
7 months ago
1
GPT-2 implementation problem
#134
sanhai77
opened
8 months ago
0
where did the self.bias get defined in the casual attention class
#133
nebyu08
opened
8 months ago
1
Strange model behavior when taking the softmax in the wrong dimension
#132
Cloud299
opened
9 months ago
0
how to build a model and interact with it like chatgpt?
#131
IamExperimenting
opened
9 months ago
0
Support for Multi-GPU Parallel Training in chargpt.py
#130
JinXiaofeng1234
opened
9 months ago
0
concatenate two BPE tokenizer
#129
mackmake
opened
9 months ago
0
error line 200, in from_pretrained assert len(keys) == len(sd)
#128
Sandy4321
opened
10 months ago
7
which pytorch version should be used pls for windows OS only CPU use only for inference ?
#127
Sandy4321
opened
10 months ago
0
what is the minimum hardware requirement to train
#126
jorjiang
opened
11 months ago
0
What's the max output tokens this model supports?
#125
aletote
opened
1 year ago
1
Cs674
#124
Westen-M
opened
1 year ago
1
Should -1 marker (as special token) be counted in vocab_size?
#123
mw66
opened
1 year ago
1
bug fix - remove attn.bias keys from GPT state dict in 'from_pretrine…
#122
amnonbleich
opened
1 year ago
1
Check off a todo in utils: add 'freeze()' to freeze config.
#121
JosephCatrambone
opened
1 year ago
0
AssertionError when run generate.ipynb with default parameter
#120
jacquesqiao
opened
1 year ago
4
How can I run a trained model and can't run Test_ Hugging face_ Import. py
#119
linlong1314
opened
1 year ago
1
Output of CausalSelfAttention
#118
whchan05
opened
1 year ago
1
Added generator repl for using adder model.
#117
prasad83
opened
1 year ago
0
Rename transformer layers
#116
rjarun8
opened
1 year ago
0
Question: does it support other utf-8 natual language?
#115
yingshaoxo
opened
1 year ago
1
生成圖片
#114
rubucat
opened
1 year ago
0
About layer norm dimention parameter:
#113
vcvycy
opened
1 year ago
1
Simplifying weigh decay checking doesn't work
#112
rabinadk1
closed
1 year ago
3
Crashed Encoder possible data corruption
#111
DayneSorvisto
closed
1 year ago
0
Update bpe.py
#110
DayneSorvisto
closed
11 months ago
0
Adding a requirements.txt file
#109
kukuquack
opened
1 year ago
0
Update README.md
#108
hoangkimthuc
closed
1 year ago
0
Information leak in training procedure?
#107
ljch2018
closed
1 year ago
0
Zero-grad more aggressively to save memory
#106
cchan
opened
1 year ago
1
how does this compare to aitextgen?
#105
breadbrowser
closed
1 year ago
0
Stop words?
#104
BoyuanJackChen
opened
1 year ago
0
More consistent variable naming
#103
vwxyzjn
opened
1 year ago
0
Slightly Better README.md
#102
MohamedAliRashad
opened
1 year ago
0
CI for unittests
#101
ubgk
opened
1 year ago
1
fix: add missing dependency in `setup.py`
#100
ben-schulz
opened
1 year ago
0
tests do not run in project as built
#99
ben-schulz
opened
1 year ago
1
Facilitating setup with popular tools
#98
Utopiah
opened
1 year ago
0
Fix typo in bpe.py
#97
eltociear
opened
1 year ago
0
Update README.md
#96
chinhaihour
closed
1 year ago
0
Caching for generation
#95
murbard
opened
1 year ago
1
Next