-
### Feature request
Hello!
I would love to see StreamingLLM / Windowed Attention with Attention Sinks implemented, as proposed in https://arxiv.org/abs/2309.17453.
The primary author (@Guangxuan-…
-
This issue collects cases of missing spaces after a period, as discussed in https://github.com/petergtz/alexa-wikipedia/issues/37.
-
### Issue you'd like to raise.
I am testing some very basic code using LangChain, as follows:
```
from langchain import HuggingFaceHub
from langchain import PromptTemplate, LLMChain
import …
-
I invested dozens of hours in trying to get the best translation results into German.
Of all the translation models available, Alma 13 Lora is the best.
However, it's beaten by a margin from an o…
-
The examples load and serve the `meta-llama/Llama-2-7b-chat-hf` and `amazon/LightGPT` models without issue.
However, anytime I try other models such as
- `tiiuae/falcon-7b`
- mistralai/Mistral-7B-v0…
-
Not exactly a bug, but I'm about to try to get EasyDel to work on some AMD GPU servers I've got, and might need some help. Would it be possible to pay for support to get EasyDel working on these serve…
-
On Windows WSL2, with the CUDA Toolkit and Cuda-Container-Toolkit installed, I'm facing this issue when running the official Docker image:
```
ollama-ollama-1 | 2023/11/29 00:36:04 llama.go:2…
-
### System Info
A6000 GPU on runpod.
Copy-and-paste the text below in your GitHub issue and FILL OUT the two last points.
- `transformers` version: 4.35.0.dev0
- Platform: Linux-5.4.0-153-gene…
-
When I run the following command, I get an error:
az vm extension set \
--resource-group PALANI_DEV2_RG \
--vm-name testvm2 \
--name customScript \
--publisher Microsoft.Azure.Ext…
-
We are building a large Flutter application, so we have created multiple modules and use them in our Flutter app. Each module contains many screens.
Here is the folder structure for the app:
…