-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
- `llamafactory` version: 0.9.1.dev0
- Platform: Linux-5.19.0-45-generic-x86_64-with-glibc2.35
- Python…
-
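**Problem description, log screenshots**
glm-4-9b is deployed with xinference and connected to fastgpt through oneapi. Chat with glm4 works normally, but **tool calls with glm4 fail with a 400 error**.
Related issue: https://github.com/labring/FastGPT/issues/1823
### Version info:
xinference: 0.12.2
fastgpt: 4.8.4-fix…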
-
(chatglm) n:\github\GLM-4>python openai_api_lby.py
2024-06-12 15:24:16,061 - Start initialize model...
Special tokens have been added in the vocabulary, make sure the associated word embeddings are …
-
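**Problem Description**
Currently using version 0.3.1
llm: qwen-max
platform: one-api
The agent prompt used for qwen is structured-chat-agent. Using the qwen prompt is even stranger: it gives an answer directly without even calling the tool. To test, try the weather query tool: it returns a query result without the tool ever being called. In the end I switched to struc…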
-
Please help confirm whether GLM-4-9B-Chat is supported. Thanks so much.
Docker images:intelanalytics/ipex-llm-serving-vllm-xpu-experiment
Tag:2.1.0b2
Image ID:0e20af44ad46
step:
…
-
Hi there, is it possible to add the new GLM-4V-9B model? Thank you.
https://github.com/THUDM/GLM-4
huggingface : https://huggingface.co/THUDM/glm-4v-9b
modelscope: https://modelscope.cn/models/…
-
### What happened?
When ctx kv quantization is enabled, if the task's context length exceeds the threshold and triggers a context shift, the application will throw an error and crash.
![image](h…
neavo updated 3 weeks ago
-
**H2O version, Operating System and Environment**
I am running H2O on Databricks with the following cluster settings:
- Single User Cluster
- 13.3 LTS (Apache Spark 3.4.1, Scala 2.12)
and the …
-
### System Info
absl-py 2.0.0
accelerate 0.33.0
addict 2.4.0
aiofiles 23.2.1
aiohttp …
-
**Problem Description**
Describe the problem in a clear and concise manner.
**Steps to Reproduce**
1. Run '...'
2. Click '...'
3. Scroll to '..…