-
### System Info / 系統信息
python=3.11.10
vllm=0.6.3.post1
transformers=0.6.3.post1
vllm-cpp-python=0.3.1
Ubuntu="18.04.6 LTS (Bionic Beaver)"
### Running Xinference with Docker? / 是否使用 Docker 运行 Xi…
-
_As I am not familiar with python and coding, my issue report can be weird and I apologize for that, I try my best. Thank you for your help and patience!_
ISSUE: I am unable to launch the project a…
-
服务器部署ms-swift出现如下问题:[ERROR:swift] * Running on local URL: http://127.0.0.1:7860 To create a public link, set `share=True` in `launch()`.
但是cat web_ui.py只有如下内容:
# Copyright (c) Alibaba, Inc. and it…
-
### System Info / 系統信息
(chatglm_env) (base) root@di-20240511161733-mz478:/tiamat-vePFS/share_data/boyang/llms/GLM-4/finetune_demo# pip list
Package Version
-----------------------…
-
For example: https://github.com/intel-analytics/ipex-llm/blob/main/python/llm/example/GPU/PyTorch-Models/Model/qwen1.5/generate.py
The current inference output is generated all at once.
However, t…
-
Help! I got this error and can't fix it. I follow the instruction but got this。
Traceback (most recent call last):
File "D:\LLMs\Thinker_DecisionMakingAssistant\Thinker_DecisionMakingAssistant-m…
-
### Initial Checks
- [ ] I have searched GitHub for a duplicate issue and I'm sure this is something new
- [ ] I have read and followed [the docs & demos](https://github.com/modelscope/modelscope-age…
-
### What features would you like to see added?
Haystack is a great opportunity to use RAG. Haystack has an API and LibreChat is a great UI with all the good implementation of user and document manage…
-
### Describe the bug
The links all now have duplicated url segment, for example:
https://www.gradio.app/guides/guides/fastapi-app-with-the-gradio-client/
### Have you searched existing issues? …
-
问题:通过webui.py运行,推理模式选择预训练音色,点击生成音频报错,服务端显示:RuntimeError: "addmm_impl_cpu_" not implemented for 'Half',具体报错信息如下:
2024-09-27 16:32:01,942 INFO get sft inference request
tn 我是通义实验室语音团队全新推出的生成式语音大模型,提…