-
### 🚀 The feature, motivation and pitch
FlashAttention supports sliding window local attention.
Since PyTorch's FlashAttention has been upgraded to FlashAttention-2, I hope PyTorch will open the sl…
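
For readers unfamiliar with the term, here is a minimal sketch of what a sliding window (local) attention mask looks like. It is plain Python for illustration only, not FlashAttention's or PyTorch's API or kernel. The helper name `sliding_window_mask` and the convention that each query sees itself plus the `window` keys before it (causal) are assumptions made for this example.

```python
def sliding_window_mask(seq_len, window):
    """Build a seq_len x seq_len boolean mask for causal local attention.

    mask[i][j] is True when query position i may attend to key position j.
    Assumed convention (hypothetical, for illustration): a query sees itself
    and at most `window` keys to its left, and nothing to its right.
    """
    return [
        [0 <= i - j <= window for j in range(seq_len)]
        for i in range(seq_len)
    ]


# Visualize a 5-token sequence with a window of 2 previous tokens.
mask = sliding_window_mask(5, 2)
for row in mask:
    print("".join("x" if allowed else "." for allowed in row))
```

With a mask like this, each row has at most `window + 1` allowed positions, so the attention cost grows linearly with sequence length instead of quadratically. A fused kernel can skip the masked-out blocks entirely rather than materializing the mask.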
-
I can use ORPO without issues with transformers 4.45.2 and orpo_trainer.train(), the old way of starting the training loop.
When I use the new unsloth_train(orpo_trainer) for gradient_accumulation …
-
_Downstream PyTorch issue:_
https://github.com/pytorch/pytorch/issues/133780
I'm trying to run attention on a batch of size zero, because my program uses a static graph and I rely on zero-batching (in…
-
Below is the error I get when calling the generate API with the parameters below. The first call generates successfully with **prefix_pos**, but the next call fails with this error.
"use_beam_search": false,…
-
Hello.
Unfortunately, since I use Linux, I can't use this great mod. So I wonder: would it be possible to adapt the mod so that it also runs with a local TTS server such as Piper? This wo…
-
I want to fine-tune a model using unsloth. Everything works fine on Colab, but on my system I get the following:
{
"name": "NotImplementedError",
"message": "No operator found for `memory_efficie…
-
Hi, I'm trying to customize the default Shell top bar on Android and iOS.
I would like to give it rounded corners.
I'm trying to follow this https://vladislavantonyuk.github.io/articles/C…
-
An error occurs when running speech recognition:
>>> from paddlespeech.cli.asr.infer import ASRExecutor
>>> asr = ASRExecutor()
>>> result = asr(audio_file="zh.wav")
>>> print(result)
我认为跑步最重要的就是给我带来了身体健康
[2023-07-07 08:27:47,58…
-
### Branch/Tag/Commit
v5.3
### Docker Image Version
nvcr.io/nvidia/pytorch:22.12-py3
### GPU name
T4
### CUDA Driver
NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.8
###…
-
### Feature request
Currently, if fp16 is used with Grounding DINO via https://huggingface.co/docs/transformers/main/en/model_doc/grounding-dino, the following error occurs:
```
...
Fi…