-
I'm getting the following error:
```
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
Loading checkpoint shards: 100%|████████…
-
### *Project idea 3: Research about deploying LLM with Jina*
| info | details |
| ---------------- | ------------------------------…
-
I used mtq.INT8_default_CFG as recommended for CNN networks (mtq.quantize(model, config, forward_loop). My initial model ran at 80FPS after quantization it dropped to 40FPS? I checked the model struct…
-
Just a few questions about using LLMLingua.
1. How do I adjust the code so that I am using Llama instead of GPT?
2. The reason I am using Llama instead of GPT is because I don't want my data to be s…
-
I am trying to use OpenAI GPT-2 model for prompt compression. However, getting error "OpenAIGPTLMHeadModel.forward() got an unexpected keyword argument 'past_key_values'". Has anyone faced/facing simi…
-
The code is concise and very helpful. Please provide a demo for the client.
-
Hi,
I wanted to run some of the examples, but it seems like the dropbox links inside
[gptchem/data.py](https://github.com/kjappelbaum/gptchem/blob/main/src/gptchem/data.py)
only lead to dropb…
-
Hello,
1. The current implementation for matrix multiplication uses BRGEMM algorithm. Is there any implementation of "Low Rank Approximation approach" for matrix multiplication in oneDNN? Is there a…
-
## Blog: Techniques for training large neural networks
Link: https://openai.com/research/techniques-for-training-large-neural-networks
This blog explains the parallism technologies for training la…
-
### Is there an existing issue for the same bug?
- [X] I have checked the troubleshooting document at https://docs.all-hands.dev/modules/usage/troubleshooting
- [X] I have checked the existing iss…