-
Python version: 3.9.6
Operating System: Mac
## Description
I'm fine-tuning a model pre-trained on VGGFace2 using data I collected, but the accuracy doesn't improve.
If you have any optio…
-
Not really a coding issue.
You assert in your paper that the code base is highly modular, but the algorithms are tightly coupled to your implementation of a replay buffer. All of the static method…
-
## Description
Add support for [HashiCorp Nomad](https://www.nomadproject.io/) as a scheduler.
## Motivation/Background
Nomad has a good scheduler, and PyTorch has good distributed training. Ho…
-
I use the settings below to train a FLUX LoRA:
```
accelerate launch --gpu_ids 0,1 --main_process_port 29502 --mixed_precision bf16 --num_cpu_threads_per_process=2 \
flux_train_network.py --pr…
-
![image](https://github.com/user-attachments/assets/50c4684d-b052-493a-9995-584b07c52b79)
Hi,
I've been attempting to reproduce an experiment involving fine-tuning the Llama-2-7b-hf model,…
-
Thanks a lot for this fantastic library - it is really awesome!
I'd love to hear your opinion on the following use case. I have access to a SLURM cluster where I am not allowed (or I am at least d…
-
I'm trying to prune [DiT-XL-2-256](https://huggingface.co/facebook/DiT-XL-2-256) with Taylor pruning.
While pruning the model, the following error occurs during `pruner.step()`.
```
import torch
…
-
## ❓ Questions and Help
Hi all,
I have this code:
```
import optuna
from torch.optim.lr_scheduler import ReduceLROnPlateau
# Assuming dataset is already defined
train_size = int(0.8 * len(da…
-
I’m working on multi-task classification with DistilBERT with 4 labels, based on your repo, and I was wondering if maybe you could help me, since I'm having a hard time trying to reach the hugging f…
-
### System Info
- Platform: Linux-5.15.0-86-generic-x86_64-with-glibc2.35
- Python version: 3.10.14
- PyTorch version: 2.4.1
- CUDA device: NVIDIA A100-SXM4-80GB
- Transformers version: 4.45.0.…