karpathy minGPT issues - Githubissues

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

MIT License

19.94k stars 2.47k forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Data preprocessing

#142 umertens closed 1 week ago
2
get it to run on aws ec2

#141 cchen9 closed 1 month ago
0
<|endoftext|> token isn't encoded correctly

#140 ttumiel opened 3 months ago
2
Why not causal-masking MLP layer?

#139 MikeTkachuk closed 3 months ago
0
DEMO: minGPT on tinygrad

#138 ziliangpeng opened 5 months ago
0
rename minGPT to tinyGPT

#137 ziliangpeng closed 5 months ago
0
We collect more than 60 open source software for CAX and want to train miniGP to be the AI assistant.

#136 fengsim opened 5 months ago
0
What is the purpose of `c_proj` here?

#135 brynhayder opened 5 months ago
1
GPT-2 implementation problem

#134 sanhai77 opened 6 months ago
0
where did the self.bias get defined in the casual attention class

#133 nebyu08 opened 6 months ago
1
Strange model behavior when taking the softmax in the wrong dimension

#132 Cloud299 opened 7 months ago
0
how to build a model and interact with it like chatgpt?

#131 IamExperimenting opened 7 months ago
0
Support for Multi-GPU Parallel Training in chargpt.py

#130 JinXiaofeng1234 opened 7 months ago
0
concatenate two BPE tokenizer

#129 mackmake opened 8 months ago
0
error line 200, in from_pretrained assert len(keys) == len(sd)

#128 Sandy4321 opened 8 months ago
7
which pytorch version should be used pls for windows OS only CPU use only for inference ?

#127 Sandy4321 opened 8 months ago
0
what is the minimum hardware requirement to train

#126 jorjiang opened 9 months ago
0
What's the max output tokens this model supports?

#125 aletote opened 10 months ago
1
Cs674

#124 Westen-M opened 10 months ago
1
Should -1 marker (as special token) be counted in vocab_size?

#123 mw66 opened 1 year ago
1
bug fix - remove attn.bias keys from GPT state dict in 'from_pretrine…

#122 amnonbleich opened 1 year ago
1
Check off a todo in utils: add 'freeze()' to freeze config.

#121 JosephCatrambone opened 1 year ago
0
AssertionError when run generate.ipynb with default parameter

#120 jacquesqiao opened 1 year ago
4
How can I run a trained model and can't run Test_ Hugging face_ Import. py

#119 linlong1314 opened 1 year ago
1
Output of CausalSelfAttention

#118 whchan05 opened 1 year ago
1
Added generator repl for using adder model.

#117 prasad83 opened 1 year ago
0
Rename transformer layers

#116 rjarun8 opened 1 year ago
0
Question: does it support other utf-8 natual language?

#115 yingshaoxo opened 1 year ago
1
生成圖片

#114 rubucat opened 1 year ago
0
About layer norm dimention parameter:

#113 vcvycy opened 1 year ago
1
Simplifying weigh decay checking doesn't work

#112 rabinadk1 closed 1 year ago
3
Crashed Encoder possible data corruption

#111 DayneSorvisto closed 1 year ago
0
Update bpe.py

#110 DayneSorvisto closed 9 months ago
0
Adding a requirements.txt file

#109 kukuquack opened 1 year ago
0
Update README.md

#108 hoangkimthuc closed 1 year ago
0
Information leak in training procedure？

#107 ljch2018 closed 1 year ago
0
Zero-grad more aggressively to save memory

#106 cchan opened 1 year ago
1
how does this compare to aitextgen?

#105 breadbrowser closed 1 year ago
0
Stop words?

#104 BoyuanJackChen opened 1 year ago
0
More consistent variable naming

#103 vwxyzjn opened 1 year ago
0
Slightly Better README.md

#102 MohamedAliRashad opened 1 year ago
0
CI for unittests

#101 ubgk opened 1 year ago
1
fix: add missing dependency in `setup.py`

#100 ben-schulz opened 1 year ago
0
tests do not run in project as built

#99 ben-schulz opened 1 year ago
1
Facilitating setup with popular tools

#98 Utopiah opened 1 year ago
0
Fix typo in bpe.py

#97 eltociear opened 1 year ago
0
Update README.md

#96 chinhaihour closed 1 year ago
0
Caching for generation

#95 murbard opened 1 year ago
1
Renaming transformer.h into transformer.l

#94 marxav opened 1 year ago
0
Wrong definition of Query, Key, Value matrices? They shouldn't have bias=True

#93 LeoPerelli opened 1 year ago
3