-
here is my log
[15:09:15]: Uploading binary to App Store Connect
[15:09:16]: Going to upload updated app to App Store Connect
[15:09:16]: [32mThis might take a few minutes. Please don't interrup…
-
### Describe the bug
我们在压测xinference时候发现,V100 2卡,调用/v1/chat/completions接口,stream参数是True,模型用qwen-14b-chat,用jmeter10并发进行压测,压测1分钟xinference就挂了,如果stream是False,是可以的.
### 报错日志
```
2024-07-08 11:34:3…
-
**Check if issue already exists**
could not find similar issues.
Running the node on laptop (ubuntu 20.04, noetic) installed with "sudo apt install ros--depthai-ros" works but when running node on…
-
### Describe the issue as clearly as possible:
When running in a Colab notebook, I get a `UserWarning` when using the greedy sampler. Not sure if this can be fixed in `outlines`, or requires some up…
-
I tried fine-tuning the llama-2-7b model using LoRa on an RTX3090 with 24GB, where the memory usage was only about 17GB. However, when I used the same configuration on an A100 with 80GB, the memory us…
-
Hi, thank you for making this.
I receive this error on first launch of Comfyui after installing and following directions. I am on windows 11, python 3.11.9, cuda 12.4:
```
Traceback (most recent …
-
![image](https://github.com/yuanzhoulvpi2017/zero_nlp/assets/91042213/4e681244-224e-4473-8be7-61bc9f995f81)
在将模型保存下来后然后重新读取的时候,读取预处理器会报错。下边是错误内容,transformers版本为4.40.1, torch版本为2.0.1+cuda11.8, 设备为移动端R…
-
### System Info
text-generation-launcher --env
2024-07-26T03:39:42.960734Z INFO text_generation_launcher: Runtime environment:
Target: x86_64-unknown-linux-gnu
Cargo version: 1.79.0
Commit sha…
-
what should I do?
D:\LLM\unsloth>python test-lora.py
Traceback (most recent call last):
File "D:\LLM\unsloth\test-lora.py", line 2, in
from unsloth import FastLanguageModel
File "D:\LL…
-
2024-05-20 13:07:04.828611: I tensorflow/core/util/port.cc:110] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different …