-
I could not download roberta-tokenized gpt2 from Google Drive over the Internet. I found that there were no related files on Google Drive, and the roberta-tokenized gpt2 model could not be found on th…
-
## TODO
- [x] Optimize JIT, fix memory planner #193
- [x] Complete test-suite/test-dynamic-shape.lisp
- [x] More tests on the JIT kernel accuracy (compared to PyTorch, like Multi Head Attention an…
-
I followed the script in [https://towardsdatascience.com/time-llm-reprogram-an-llm-for-time-series-forecasting-e2558087b8ac](https://towardsdatascience.com/time-llm-reprogram-an-llm-for-time-series-forecasting-e2558087b8ac) to predict with Time-LLM using GPT2.
My code is exactly the same as…
-
### System information
- **Have I written custom code (as opposed to using a stock example script
provided in TensorFlow)**: No
- **OS Platform and Distribution (e.g., Linux Ubuntu 16.04)…
-
Hi
I ran into some issues with your notebook.
1) `args = get_parser()` is not defined anywhere. I assumed it was just the `run.py` args, wrapped inside a function?
2) Where is this `gpt2/g…
-
Hello, I am seeing strange behavior that puzzles me.
I use the following code to load the GPT2-xl model locally, and it runs and loads normally in a Jupyter notebook. When I use another sc…
-
GPT2 is a very standard transformer implementation (e.g. used in [Andrej Karpathy](https://www.youtube.com/@AndrejKarpathy)'s rather famous introduction to transformers: https://www.youtube.com/watch?…
-
When I use BEST_chekpoint_ViT-B_32.pth.tar as the model weights, running either eval2.py or caption_beams.py raises an error:
(gpt2) ➜ PromptCC-main git:(master) ✗ python caption_beams.py
11-03 12 : 14 : 01
model_name: ./checkpoints/cap_mo…
-
I want to load gpt2 offline, but it does not load directly. Did I do something wrong?
code:
from autotiktokenizer import AutoTikTokenizer
tokenizer = AutoTikTokenizer.from_pretrained("home/xx…
-
I'm trying to follow https://github.com/karpathy/llm.c/discussions/481 but I'm getting this error:
```
evaluating HellaSwag: 30/79
evaluating HellaSwag: 40/79
evaluating HellaSwag: 50/79
evalua…