-
Hi, this is a great project. Can you provide some sample data for local development testing? I want to test it out. thank you very much!
-
Hi,
I've followed the this blog post https://huggingface.co/blog/fine-tune-whisper to finetune Whisper with my own dataset.
Everything seems to be working as expected. However, I've noticed a stra…
-
### System Info / 系統信息
python 3.11.8
### Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?
- [ ] docker / docker
- [X] pip install / 通过 pip install 安装
- [ ] installation from source / 从源…
-
## Describe the bug
I have an intermittent error when streaming the Pile, where I get a JSON parse error which causes my program to crash.
This is intermittent - when I rerun the program with the …
-
The current interface of **setNextSentence** and **incrementWord** methods in **SegmentingTokenizerBase** do not define the checked exceptions, which makes it troublesome to be inherited.
For example…
-
In the 0.5 release, `summarize.py` is used for summarization benchmark. However, in the latest 0.6.1 release, the `summarize.py` does not exist. I can only find the `summarize_long.py`.
Following …
-
# Overview
I have long wanted proper streaming support in the `encoding/json` library. I’ve been doing some homework to understand the current state of things, and I think I’ve come to grips with m…
-
-
```
curl -X POST localhost:8000/v2/models/ensemble/generate -d '{"text_input": "What is machine learning?", "max_tokens": 20, "bad_words": "", "stop_words": ""}'
{"error":"in ensemble 'ensemble', [r…
-
### Your current environment
Dockerfile: vllm/vllm-openai:v0.6.3
Parameters:
`--enable-auto-tool-choice --tool-call-parser hermes`
### Model Input Dumps
_No response_
### 🐛 Describe th…