gpt2 Search Results - Githubissues

1000+ results
for gpt2

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

langchain-ai/langchain #28056

TokenTextSplitter not loading up HF tokenizer from `.from_hu…

### Checked other resources - [X] I added a very descriptive title to this issue. - [X] I searched the LangChain documentation with the integrated search. - [X] I used the GitHub search to find a sim…

bhavnicksm updated 1 week ago
4
pistocop/pistoBot #10

Google Colab - Fails to Install Requirements for gpt2-scratc…

Dear Pistocop, Hope you're well! Thanks for putting together this cool example of how to build a what's app bot to chat like myself. I'm running into an error when running the Train a GTP2 model se…

evar18 updated 2 weeks ago
5
Shwai-He/MEO #4

Cannot setup MoE and MEO training for gpt2

Hi authors, I have a question about training the MoE and MEO models on GPT-2. I am using the `tasks/language-modeling/run_clm.py` script from the repository (as well as `sh run_clm.sh`). However, …

yujiaw98 updated 2 weeks ago
1
huggingface/accelerate #3257

[BUG] Accelerator.__init__() got an unexpected keyword argum…

### System Info ```Shell accelerate version: main python version: 3.11 torch version: 2.4 numpy version: 1.26.4 ``` ### Information - [X] The official example scripts - [ ] My own modified scri…

as12138 updated 3 days ago
3
princeton-nlp/MABEL #8

StereoSet benchmark for GPT2

I noticed that GPT2Tokenizer is used when evaluating GPT2, which doesn't have a mask_token. Will this impact the evaluation result? I think I should add a new one manually but I'm unsure which one I…

Lj1ang updated 3 months ago
2
microsoft/LMOps #280

Exception: Current loss scale already at minimum - cannot de…

Thank you for sharing your codes. When running gpt2/kd/kd_medium.sh on 2*3090, the program encountered this error. What should I do, such as adjusting the learning rate?

Z-eloto updated 4 days ago
3
lm-sys/FastChat #3309

Gpt2

12368w updated 5 months ago
3
databricks/megablocks #157

amp_C undefined symbol after installing Megablocks

I am trying to setup and use megablocks to train MoE models, but I see the following error: ``` Traceback (most recent call last): File "/n/holyscratch01/dam_lab/brachit/moes/megablocks/third_p…

RachitBansal updated 1 month ago
3
hikettei/Caten #102

Milestones

- [ ] September: Finish implementing GPT2 Inference w/ float32, clang. - [x] Solidify aIR System - [x] Make aIR type-safe - [x] Optimize AIR (FastGraph). Simplify GPT2 < 1.0s - [x] E…

hikettei updated 1 day ago
1
YifeiZhou02/ArCHer #14

Model Training Unstable（webshop，gpt2）

I encountered an issue while trying to reproduce the results by loading the gpt2_bc_webshop_history.pt model and running the run.py script. The training was initiated with the following parameters: …

RobertXWL updated 2 months ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for gpt2

1000+ results
for gpt2