-
### Proposal
Change some code that could load model locally.
### Motivation
Today I want to load gpt2 model that download from huggingface website locally like Llama, but it keeps try to connc…
-
To validate the high-level IR execution framework, code for models other than gpt2, such as gpt2-medium, gpt2-large, and gpt2-xl, is also needed.
-
To train Rrhf using my own Gpt2 model, do I need to first generate a response based on the query using my own model, and then have ChatGPT score it? This way, isn't wombat_train.json useless anymore?
-
Hi, I attempted to use speculative decoding but encountered some errors. May I ask for your assistance?
I used the parameters from the first example.
python ./examples/speculative_inference.py \…
-
Hi, your research is very interesting; right now I'm trying to replicate it for leukemia issues, but I have a problem when I try to run the pretrained_model.py file, it throws me the following error
…
-
In the GGML [repo](https://github.com/ggerganov/ggml) there are guides for converting those models into GGML format, including int4 support. I have successfully done so myself and ran those models us…
-
Is there something I'm missing with llm.c/train_gpt2.py running so slow? This is much slower than PyTorch even.
```
$ python3 train_gpt2.py --sequence_length 1024
ram used: 0.55 GB, lm_head.wei…
-
Hello once again, Max.
1. I am trying to fine-tune this model: **sberbank-ai/rugpt3small_based_on_gpt2** with aitextgen, and when I try to test it I get a compelete nonsence (random symbols like thos…
-
I get an error when I change from gpt to gpt2 in the interactive.py file.
However, when I change from gpt to gpt2 in the train.py file, everything is fine.
The error I get:
File "/transfer-lear…
-
if i run `python train.py --config conf/tutorial-gpt2-micro.yaml` I get:
FileNotFoundError: [Errno 2] No such file or directory: '/scr/dlwh/runs/gpt2-small-d=dlwh/wikitext_103_detokenized-n=-1-g=-1…
dlwh updated
4 months ago