-
This might be more of a general question, but is it possible to use [FlashAttention](https://github.com/Dao-AILab/flash-attention/tree/v1.0.9) with QLoRA in order to further decrease memory requiremen…
-
-
The model fails to do a forward pass in the train step. The error reported is just "Segmentation fault" :-
```
dataset cityscapes_train
batch_size 1
data_dir ./dataset/ci…
-
### System Info
- `transformers` version: 4.43.3
- Platform: Linux-5.4.0-26-generic-x86_64-with-glibc2.27
- Python version: 3.10.4
- Huggingface_hub version: 0.24.3
- Safetensors version: 0.4.3
…
-
# 🐛** C++ Inferencing using Torchscript Exported Torchvision model Erorr
I'm trying to use this approach to make my model (Mobilenetv3 small) using Torchvison models, In train and validation phase …
-
getting this error
-
Hello !
Regarding MCTformer+, in the previous code training, "la_crf_dir" and "ha_crf_dir" were not generated. Where do they obtain them from? Can you provide relevant code?
-
Hi all,
I've been trying to run a modified version of the mnist estimator example ([link to gist](https://gist.github.com/smurching/bb703e024507dde065ea11289b36da29)) using the `tf.estimator.train_…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What happened?
When running the UI with python launch.py --xformers, i g…
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [X] I am using the latest TensorFlow Model Garden release and TensorFlow 2.
- [X] I am reporting…