-
@youkaichao
### Your current environment
My environment:
Name: vllm
Version: 0.4.2+cu117
### 🐛 Describe the bug
I quantized the model (Qwen2_72B) with AWQ myself; when I want to set api s…
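For reference, a minimal sketch of loading a locally AWQ-quantized checkpoint with vLLM 0.4.x's offline API; the checkpoint path and tensor-parallel size below are placeholders, not values from this report:

```python
from vllm import LLM, SamplingParams

# Load a locally AWQ-quantized checkpoint; the path is a placeholder.
# A 72B model generally needs tensor parallelism across several GPUs.
llm = LLM(
    model="/path/to/qwen2-72b-awq",
    quantization="awq",
    tensor_parallel_size=4,
    dtype="half",  # vLLM's AWQ kernels expect float16
)

outputs = llm.generate(["Hello"], SamplingParams(max_tokens=32))
print(outputs[0].outputs[0].text)
```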
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
### Describe the bug
When starting the 110b service, the following command produces no result for a long time:
curl…
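A request of roughly this shape can bound the wait instead of hanging indefinitely; the port (vLLM's default 8000), model name, and prompt are assumptions, not the reporter's actual command:

```python
import requests

# Assumes vLLM's OpenAI-compatible server on its default port 8000;
# the model name is a placeholder for whatever the server was started with.
try:
    r = requests.post(
        "http://localhost:8000/v1/chat/completions",
        json={
            "model": "qwen1.5-110b",
            "messages": [{"role": "user", "content": "hello"}],
            "max_tokens": 16,
        },
        timeout=60,  # fail fast rather than waiting forever
    )
    print(r.status_code, r.text)
except requests.exceptions.Timeout:
    print("no response within 60s -- the server may still be loading or is stuck")
```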
-
Steps:
Install ollama
Run ollama serve and ollama run qwen:0.5b
Install chatchat
Change the configuration
chatchat-config model --set_model_platforms '[{
"platform_name": "ollama…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
- [X] I have checked [#657](https://github.com/microsoft/graphrag/issues/657) to validate if my issue is covered …
-
### System Info
A100 80G
accelerate 0.31.0
aiohttp 3.9.5
aiosignal 1.3.1
annotated-types 0.7.0
async-timeout …
-
Kimi is currently deployed and works, but after switching Qwen to port 8001, access fails.
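A quick connectivity check (assuming the Qwen deployment exposes an OpenAI-compatible /v1/models route, which is common but not confirmed here):

```python
import requests

# Hypothetical probe: list models on the relocated Qwen endpoint.
try:
    r = requests.get("http://localhost:8001/v1/models", timeout=10)
    print(r.status_code, r.json())
except requests.exceptions.ConnectionError:
    print("nothing is listening on port 8001 -- check the service and port mapping")
```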
-
### Reminder
- [X] I have read the README and searched the existing issues.
### Reproduction
export USE_MODELSCOPE_HUB=1
# nohup sh ppo_qwen.sh > ppo_qwen.log 2>&1 &
CUDA_VISIBLE_DEVICES=0 py…
-
# Summary
Add support for INT4 and/or UINT4 quantization.
Refs:
https://intellabs.github.io/distiller/quantization.html
https://developer.nvidia.com/blog/int4-for-ai-inference/
https://arxiv.org/abs/2301.12017…
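For context on what INT4/UINT4 support entails at the storage level: two 4-bit values are packed per byte and unpacked before compute. A minimal numpy sketch of that packing, illustrative only and not any particular kernel's layout:

```python
import numpy as np

def pack_uint4(values: np.ndarray) -> np.ndarray:
    """Pack an even-length array of UINT4 values (0..15) into bytes, low nibble first."""
    v = values.astype(np.uint8)
    assert v.size % 2 == 0 and v.max(initial=0) <= 15
    return (v[0::2] | (v[1::2] << 4)).astype(np.uint8)

def unpack_uint4(packed: np.ndarray) -> np.ndarray:
    """Inverse of pack_uint4: recover the original UINT4 values."""
    lo = packed & 0x0F
    hi = packed >> 4
    return np.stack([lo, hi], axis=1).reshape(-1)

x = np.array([1, 15, 7, 0], dtype=np.uint8)
assert np.array_equal(unpack_uint4(pack_uint4(x)), x)
```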
-
#### Context
I am running a performance comparison between [llama.cpp](https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md) and vLLM in https://github.com/ggerganov/llama.…
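For a comparison like this, a simple latency probe against both servers' OpenAI-compatible endpoints (llama.cpp's server also exposes /v1/chat/completions) keeps the measurement symmetric; the ports and model name below are placeholders:

```python
import time
import requests

ENDPOINTS = {
    # Placeholder ports: adjust to wherever each server is listening.
    "llama.cpp": "http://localhost:8080/v1/chat/completions",
    "vLLM": "http://localhost:8000/v1/chat/completions",
}

payload = {
    "model": "default",  # placeholder; vLLM needs the served model name
    "messages": [{"role": "user", "content": "Say hello."}],
    "max_tokens": 64,
}

for name, url in ENDPOINTS.items():
    t0 = time.perf_counter()
    r = requests.post(url, json=payload, timeout=120)
    dt = time.perf_counter() - t0
    print(f"{name}: {r.status_code} in {dt:.2f}s")
```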
-
`st.experimental_rerun` will be removed after 2024-04-01.
Debug: Handling user request for session state: {'discussion': '', 'rephrased_request': '', 'api_key': '', 'agents': [], 'whiteboard': '', '…
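Since Streamlit removed `st.experimental_rerun` in favor of `st.rerun` (stable since Streamlit 1.27), the fix is a rename; a sketch, where the button is illustrative and the session-state key is taken from the debug dump above:

```python
import streamlit as st

# st.experimental_rerun() was removed; st.rerun() (Streamlit >= 1.27)
# is the drop-in replacement.
if st.button("Restart discussion"):
    st.session_state["discussion"] = ""
    st.rerun()
```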