-
Dear collaborators
I use an Azure OpenAI Studio deployment with the model and deployment names set to the same values:
- `gpt-4-32k`
- `text-embedding-ada-002`
It's running on the Azure OpenAI s…
-
无法加载模型,后台报错:
1.当前版本为PYTHON 3.11.9,之前试过PYTHON3.10.X也报错
2. xinference . 0.12.2
3. pip install "xinference[all]" 安装后本地环境运行
4. Full stack of the error.
Traceback (most recent call last):
File …
-
# Prerequisites
When I install via pip install llama-cpp-python, there will be an error. It will occur on versions 0.2.81 and 0.2.80, The version 0.2.79 can be successfully installed.
python 3.11…
-
research autogenerating or auto suggesting different kinds of mnemonics, such as acrostics, story points, peg lists, etc using GPT-3
-
Thank you for such a great work. Recently, I delved into the paper and the code provided for the content-aware layout generation task, and it appears that Layoutprompter handles the underlay element i…
-
Hello, I am currently working with the new LLaMA 3.1 models by Meta and they require the newer versions of transformers, optimum, and accelerate. I ran into compatibility issues with XTTS regarding th…
-
### Describe the bug
since last week's update, I can't load any gguf model. i can still do so if i roll back.
### Is there an existing issue for this?
- [X] I have searched the existing iss…
-
ScienceQA was evaluated in your experiments. As I understand, ScienceQA is a benchmark associated with multi-modal tasks, whereas your model operates purely within the realm of text. Could you please …
-
So this is a strange one. I am stumped.
In way, this is sort of like #416, but I confirmed that if Batch==1, then the problem does not occur. (See below)
My inference loop looks like this
```
…
-
### System Info
tensorrt_llm 0.12.0.dev2024073000
CUDA 12.4
H100-PCIe
### Who can help?
@Tracin @byshiue
### Information
- [ ] The official example scripts
- [X] My own modified scr…