-
### Describe the bug
I use lora to fintune the model of 'stabilityai/stable-diffusion-xl-base-1.0',get fellow error:
OSError: stabilityai/stable-diffusion-xl-base-1.0 does not appear to have a file …
-
## 🐛 Bug
The following code snippet from multihead attention module is using tensor.equal method to compare query, key and value to determine if the attention module is being used as self-attention…
-
### System Info / 系統信息
Python: Python 3.10.14
os:
```
DISTRIB_ID=Kylin
DISTRIB_RELEASE=V10
DISTRIB_CODENAME=kylin
DISTRIB_DESCRIPTION="Kylin V10 SP1"
DISTRIB_KYLIN_RELEASE=V10
DISTRIB_VER…
-
## 🚀 Feature
Remove the redundant `num_nodes` Trainer argument. Knowing the number of nodes is not required, and the world size is provided by the cluster environment anyway.
### Motivation
…
-
训练集:
{"content": "不错的上网本,外形很漂亮,操作系统应该是个很大的 卖点,电池还可以。整体上讲,作为一个上网本的定位,还是不错的。\t", "result_list": [{"text": "正向", "start": -7, "end": -5}], "prompt": "情感倾向[正向,负向]"}
{"content": " 推荐所有喜欢的红迷们一定要收藏这本书,要知道当…
-
## 🚀 Feature
Introduce a property/method `experiment_dir` on the abstract logger interface. All loggers must implement this and return a path to where their files are being saved.
### Motivatio…
-
Hi, OS: windows 10, Arc A750
Driver: 5081
请问chatglm3和Baichuan2-7B随着对话次数增加,内存不停增大。用这个KV cache demo也不能解决:
demo link: https://github.com/intel-analytics/BigDL/blob/main/python/llm/portable-zip/chat.py…
-
### 🐛 Describe the bug
I have been trying to JIT trace Whisper, using this code:
```
from transformers import WhisperProcessor, WhisperForConditionalGeneration
from datasets import load_datase…
-
### 🐛 Describe the bug
Test `TestOperatorsCUDA.test_vjp_nn_functional_scaled_dot_product_attention_cuda_float32` fails with `query: last dimension must be contiguous` on **H100**
Also the schema t…
-
when I run the scripts I get this funny error:
```python
/mnt/Samsung980Pro/WORK/AI/arch/magic-animate/magicanimate/pipelines/pipeline_animation.py:43: FutureWarning: Importing `DiffusionPipeline`…