-
It seems like mergekit didn't support the merge method of GPT-Neo, could anyone help me or just realize the function? I appreciate it !
-
Could you provide an example with gpt neo and english wav2vec models ?
Replacing one model throws an shape error here
-
![image](https://github.com/user-attachments/assets/f413c84a-cc4d-44e7-80fb-1d9c160a7c06)
1.How to get the scores through GPT-Neo-2.7B?
2.In which procedure,the prompt get positive or negative,after…
-
### System Info
- `transformers` version: 4.45.1
- Platform: Linux-5.4.0-193-generic-x86_64-with-glibc2.17
- Python version: 3.8.13
- Huggingface_hub version: 0.25.1
- Safetensors version: 0.4.…
-
Huggingface is adding PyTorch-based GPT-Neo support via https://github.com/huggingface/transformers/pull/10848
That's just the superlarge models (1.3B and 2.7B). If performance/support is good (sin…
-
### 🐛 Describe the bug
I'm trying to follow the instructions to efficiently load Hugging Face models from [`torchtitan`'s docs for FSDP1 -> FSDP2: Meta-Device Initialization](https://github.com/pyt…
-
-
Thanks so much for sharing your code.
I tested the local demo using 40 human-written stories (average length around 500 words) and got a few false alarms. 17 stories were flagged as having over a …
-
https://github.com/uber-research/PPLM/blob/e236b8989322128360182d29a79944627957ad47/run_pplm.py#L610
I'm trying to implement gpt-neo with PPLM. however gpt-neo meeds upgarde transformers liberary to …
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/eosphoros-ai/DB-GPT/issues?q=is%3Aissue) and found no similar issues.
### Operating system information
Linux
### P…