-
### Description of the bug:
Hello,
I'm encountering an issue when trying to export a model to tflite with quantization. It appears that the tensor shapes are being altered incorrectly somewher…
-
### Your current environment
```text
Can't run since running on dockerized cluster. Using latest pip install for both vLLM and transformers + CUDA 12.1
```
### 🐛 Describe the bug
Running …
-
Model info after loading LoRA:
```bash
PeftModelForCausalLM(
(base_model): LoraModel(
(model): ChatGLMForConditionalGeneration(
(transformer): ChatGLMModel(
(embedding): Embeddin…
-
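Hello! I would very much like to train weights on my own dataset. For the pretraining part, was the model you provided trained with P2-weighting, or did you modify it in some way? Could you share the final pretraining code? I would be very grateful for a reply!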
-
# Notes
The error happens in the `ExportedProgram.run_decompositions()` call. The message is `Cannot view a tensor with shape torch.Size([1, 512, 32, 128]) and strides (2097152, 128, 65536, 1) as a t…
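For anyone reading the shape/stride pair above, here is a minimal pure-Python sketch of why that layout is non-contiguous. The helper names `contiguous_strides` and `can_merge` are made up for illustration and are not part of PyTorch. Since the target shape in the message is cut off, the sketch only assumes that the failing view flattens dims 1 and 2, which may not be what actually happens.

```python
def contiguous_strides(shape):
    """Row-major (C-order) strides, in elements, for a dense tensor of this shape."""
    strides = []
    step = 1
    for size in reversed(shape):
        strides.append(step)
        step *= size
    return tuple(reversed(strides))


def can_merge(shape, strides, i):
    """True if dims i and i+1 can be flattened into one dim without copying.

    Mirrors the contiguity-like condition documented for torch.Tensor.view:
    stride[i] == stride[i + 1] * size[i + 1].
    """
    return strides[i] == strides[i + 1] * shape[i + 1]


# Values copied from the error message.
shape = (1, 512, 32, 128)
strides = (2097152, 128, 65536, 1)

# The strides do not match a dense row-major layout for this shape.
is_contiguous = strides == contiguous_strides(shape)

# They do match a contiguous (1, 32, 512, 128) tensor with dims 1 and 2 swapped.
base = contiguous_strides((1, 32, 512, 128))
looks_transposed = strides == (base[0], base[2], base[1], base[3])

# Assumption: the failing view flattens dims 1 and 2 (target shape is truncated above).
mergeable = can_merge(shape, strides, 1)

print(is_contiguous, looks_transposed, mergeable)
```

In eager PyTorch, a layout like this usually needs `.reshape(...)` or `.contiguous()` before `.view(...)`. Whether that is the right fix inside `run_decompositions()` is not something this sketch establishes.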
-
I ran this command
"python src/launcher.py --phase=test --visualize --data-path=evaluation_data/svt/test.txt --data-base-dir=evaluation_data/svt --log-path=log.txt --load-model --model-dir=model --o…
-
### 🐛 Describe the bug
hi @hliuca ,
ROCm Nightly performance has improved greatly ever since the F.Linear fix, but unfortunately pytorch compile does not work on ROCm even though it works on…
-
Hi, I'm interested in your work and am trying to reproduce it, but there are some details that need to be confirmed.
The first one is the implementation of MVAE. The paper says:
> We copy the network archi…
-
Following [the tutorial](https://github.com/lllyasviel/ControlNet/blob/main/docs/train.md#step-3---what-sd-model-do-you-want-to-control) I can successfully download SD, add ControlNet, and train it.
…
-
**Describe the bug**
The order of the compute parameters (FP16/BF16) in the buffer can differ from the model's forward execution order. As a result, when the `--overlap-param-gather…