-
I am trying to install flash-attention on Windows 11, but it failed with the following message:
```
> pip install flash-attn --no-build-isolation
Looking in indexes: https://pypi.tuna.tsinghua.edu.cn/simple
Colle…
```
G4mot updated
4 months ago
-
### Is there an existing issue / discussion for this?
- [X] I have searched the existing issues / discussions
### Is this question answered in the FAQ? | Is there an existing…
-
Environment: Windows10_zh-cn, Anaconda Prompt
The prerequisite environment was installed successfully, but an error occurs when executing the following commands.
conda activate scepter
set PYTHONPATH…
-
2023-12-29 15:19:56,915 - modelscope - WARNING - ('PIPELINES', 'my-anytext-task', 'my-custom-pipeline') not found in ast index file
2023-12-29 15:19:56,915 - modelscope - INFO - initiate model from C…
-
Excellent work! When will the dataset be released?
nmll updated
2 months ago
-
sh scripts/run_assistant_server.sh --served-model-name Qwen2-7B-Instruct --model path/to/weights
Is inference with this slower than with vLLM?
-
### System Info
Miniconda3_3.10 ubuntu22.04 CUDA 12.1
### Running Xinference with Docker?
- [ ] docker
- [ ] pip install
- [X] …
-
### Motivation
Hi team,
ZhipuAI just released their multi-modal model `CogVLM2-Video-LLama3-Chat`. Can we support its serving with TorchEngine? It seems that they use a new causal model architectu…