-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a sim…
-
Dear Pistocop,
Hope you're well! Thanks for putting together this cool example of how to build a WhatsApp bot that chats like me. I'm running into an error when running the Train a GPT2 model se…
-
Hi authors,
I have a question about training the MoE and MEO models on GPT-2. I am using the `tasks/language-modeling/run_clm.py` script from the repository (as well as `sh run_clm.sh`). However, …
-
### System Info
```Shell
accelerate version: main
python version: 3.11
torch version: 2.4
numpy version: 1.26.4
```
### Information
- [X] The official example scripts
- [ ] My own modified scri…
-
I noticed that GPT2Tokenizer is used when evaluating GPT2, which doesn't have a mask_token. Will this impact the evaluation result?
I think I should add a new one manually but I'm unsure which one I…
-
Thank you for sharing your code.
When running gpt2/kd/kd_medium.sh on two 3090 GPUs, the program encountered this error. What should I do? Should I adjust the learning rate, for example?
-
I am trying to set up and use megablocks to train MoE models, but I see the following error:
```
Traceback (most recent call last):
File "/n/holyscratch01/dam_lab/brachit/moes/megablocks/third_p…
-
- [ ] September: Finish implementing GPT2 Inference w/ float32, clang.
- [x] Solidify aIR System
- [x] Make aIR type-safe
- [x] Optimize aIR (FastGraph). Simplify GPT2 < 1.0s
- [x] E…
-
I encountered an issue while trying to reproduce the results by loading the gpt2_bc_webshop_history.pt model and running the run.py script. The training was initiated with the following parameters:
…