-
**Is your feature request related to a problem? Please describe.**
I see quite a few GPT-2 implementations, but a GPT-Neo/GPT-J implementation is missing. This is a variant of GPT-2 which is quite s…
-
```
causal_mask = self.bias[:, :, key_length - query_length : key_length, :key_length]
```
but in Structured Prompting the `key_length` exceeds `max_positions`.
How can this issue be addressed? Tha…
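
To illustrate the failure mode described above, here is a minimal sketch in pure Python. It uses a nested list as a stand-in for the precomputed `bias` buffer (the helper names `build_bias`, `sliced_mask`, and `dynamic_mask` are hypothetical and not part of any library). It assumes the buffer is a lower-triangular mask of size `max_positions`, and it shows one possible workaround: building the mask from the lengths instead of slicing a fixed buffer. This is not necessarily how the model in question should be patched.

```python
def build_bias(max_positions):
    """Lower-triangular causal buffer, a stand-in for a precomputed `bias`."""
    return [[col <= row for col in range(max_positions)]
            for row in range(max_positions)]

def sliced_mask(bias, query_length, key_length):
    """Mimic bias[key_length - query_length : key_length, :key_length] in 2-D."""
    rows = bias[key_length - query_length : key_length]
    return [row[:key_length] for row in rows]

def dynamic_mask(query_length, key_length):
    """Build the causal mask from the lengths, without a fixed-size buffer."""
    offset = key_length - query_length
    return [[col <= row + offset for col in range(key_length)]
            for row in range(query_length)]

max_positions = 8  # assumed buffer size for this sketch
bias = build_bias(max_positions)

# Within range, slicing the buffer and building the mask directly agree.
ok = sliced_mask(bias, query_length=3, key_length=6)
agrees_in_range = ok == dynamic_mask(3, 6)

# With key_length > max_positions, the slice is silently clamped,
# so the result no longer has shape (query_length, key_length).
bad = sliced_mask(bias, query_length=3, key_length=12)
bad_shape = (len(bad), len(bad[0]) if bad else 0)

good = dynamic_mask(3, 12)
good_shape = (len(good), len(good[0]))
```

In this sketch the out-of-range slice yields an empty mask rather than raising an error, which is why the problem tends to surface later as a shape mismatch. Enlarging the buffer is another option, at the cost of memory that grows quadratically with `max_positions`.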
-
## Description
---
**Message from the maintainers**:
Love this enhancement proposal? Give it a 👍. We prioritise the proposals with the most 👍.
-
Hi, I have a question about the tokenizer mismatch.
When the reference model is fixed to "gpt-j-6B", several scoring models do not share the same tokenizer, such as "gpt-neox-20b" and "llama". …
-
Goose AI offers API services for GPT-Neo/GPT-J, and it's much cheaper than OpenAI. I don't have much experience with LLMs, and I think the AI will be more based and sassy, as I've seen in YouTube videos…
-
### Describe the issue
I implemented a program with GPT-Neo in Python (program attached) and an equivalent version in C++. To acquire the exported GPT-Neo model I made some slight modification…
-
It looks like a file is missing.
Unrecognized model in D:\LIUGEGE\ComfyUI\models\Joy_caption_alpha\text_model. Should have a `model_type` key in its config.json, or contain one of the following strings in its name: albert, a…
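
As a quick way to check the condition the error message names, the following sketch reads `config.json` from a model directory and reports whether it has a `model_type` key. The helper `read_model_type` is hypothetical, and the demo uses temporary directories instead of the real model path. It only tests for the key's presence and makes no claim about which values a given library accepts.

```python
import json
import tempfile
from pathlib import Path

def read_model_type(model_dir):
    """Return `model_type` from config.json, or None if file or key is missing."""
    config_path = Path(model_dir) / "config.json"
    if not config_path.is_file():
        return None  # config.json itself is missing
    with config_path.open(encoding="utf-8") as f:
        config = json.load(f)
    if not isinstance(config, dict):
        return None  # unexpected JSON layout
    return config.get("model_type")

# Demo on throwaway directories (placeholder contents, not a real model).
with tempfile.TemporaryDirectory() as empty_dir:
    missing_file = read_model_type(empty_dir)

with tempfile.TemporaryDirectory() as model_dir:
    config_file = Path(model_dir) / "config.json"

    # A config without the key reproduces the condition in the message.
    config_file.write_text(json.dumps({"hidden_size": 16}), encoding="utf-8")
    missing_key = read_model_type(model_dir)

    # A config that carries the key passes the check.
    config_file.write_text(json.dumps({"model_type": "example"}), encoding="utf-8")
    present = read_model_type(model_dir)
```

If the function returns `None` for the directory in the error message, the download was probably incomplete or the `config.json` belongs to a different component of the model.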
-
### Feature request
Flash Attention 2 is a library that provides attention operation kernels for faster and more memory efficient inference and training: https://github.com/Dao-AILab/flash-attentio…
-
### Feature request
I encountered a KeyError while loading the phi3-v vision model into Optimum Huggingface. The error message states:
```
KeyError: 'phi3-v model type is not supported yet in Nor…
```
-
### Model description
Do we support Model2Vec embedding models?
E.g.: https://huggingface.co/minishlab/potion-base-8M
https://minishlab.github.io/tokenlearn_blogpost/
### Open source status
- […