-
When converting [nemolita-21b](https://huggingface.co/win10/nemolita-21b), which is a merged model, `convert.py` runs into this error:
```shell
Traceback (most recent call last):
File "/hom…
```
-
We would like to produce a reference architecture and best practices for sustainable AI with cloud native environments. This will factor in the AI/ML lifecycle.
- The document should aim to…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related iss…
-
With the latest update, the Azure OpenAI key is not working. There are no changes to my configuration, and the API key itself works.
macOS 15
Version: 0.40.1
VSCode Version: 1.91.1
Commit: 58b91712431381a1b75817cd3437cee35dddcb30
…
-
### Description of the feature request:
**Feature requests:**
1 >>>
I am trying to develop an application using Gemini but it is not able to do very simple and easy tasks which can be done by …
-
### 1. Who do you think this talk is for?
Developers and AI enthusiasts
### 2. What do you think you'll learn from this talk?
- prompt engineering
- large language models
- automating content cre…
-
### Your current environment
```text
The output of `python collect_env.py`
```
### How would you like to use vllm
Can someone help explain what the `max_num_seqs` and `max_model_len` parameters …
-
How can I export a quantized Whisper small.en model?
For example, the int8 quantized version.
-
### What happened?
https://github.com/nomic-ai/gpt4all/issues/2204
Since I upgraded to gpt4all 2.6.2 (which updated llama.cpp) my speed dropped from 3-4 t/s to 1 t/s. I am getting 1/3 the speed acro…
-
### Your current environment
Why are the first few characters of the streaming output set to empty strings? Can't the model's generated text be output directly?
### How would you like to use vllm
Why are the first few characters of the streaming output set to empty strings? Can't the model's generated text be output directly?