-
here is my code:
from torch_ema import ExponentialMovingAverage
model = ...
optimizer = ...
scheduler = ...
ema_model = ExponentialMovingAv…
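The snippet above is cut off, but the idea behind `torch_ema`'s `ExponentialMovingAverage` is a shadow copy of the parameters updated as `shadow = decay * shadow + (1 - decay) * param`. A framework-free sketch of that update (the function name and decay values here are hypothetical, not from the post):

```python
# Minimal pure-Python sketch of an exponential moving average (EMA)
# over model parameters. This mirrors the update rule used by EMA
# helpers such as torch_ema; the decay values are illustrative only.

def ema_update(shadow, params, decay=0.995):
    """Return updated shadow values: decay * shadow + (1 - decay) * param."""
    return [decay * s + (1.0 - decay) * p for s, p in zip(shadow, params)]

params = [1.0, 2.0]   # current model parameters
shadow = [0.0, 0.0]   # EMA copy, initialized separately here
shadow = ema_update(shadow, params, decay=0.5)
print(shadow)  # [0.5, 1.0]
```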
-
@tomaarsen Hello Tom, I hope you are doing well.
I am trying to enable DeepSpeed in the Sentence Transformers training arguments via `deepspeed="deepspeed_config.json"`, and I have also tried an accelerate config, but it'…
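For context, a minimal `deepspeed_config.json` of the kind that HF-style trainers accept might look like the sketch below; the specific keys and the `"auto"` placeholders are assumptions about a typical ZeRO stage 2 setup, not taken from the post:

```json
{
  "train_micro_batch_size_per_gpu": "auto",
  "gradient_accumulation_steps": "auto",
  "fp16": { "enabled": "auto" },
  "zero_optimization": {
    "stage": 2,
    "overlap_comm": true
  }
}
```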
-
Thanks for the excellent work! I have a question about the training hardware for text2image. The paper says it was trained on a single A100, but the settings in Table 15 seem to require more than 640GB of mem…
-
Hello,
I'm working on a multi-class classification model and the results look good. However, when I train the model after calling model.prune(), train_loss and test_loss easily become NaN, even if I set lr and ste…
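One common mitigation when a loss turns non-finite after pruning (independent of any particular framework) is to skip the optimizer step whenever the loss is NaN or inf. A hedged pure-Python sketch of that guard, with hypothetical names standing in for the real training-loop pieces:

```python
import math

def safe_step(loss, apply_update):
    """Apply the update only if the loss is finite; otherwise skip the step.

    `apply_update` stands in for optimizer.step() in a real training loop;
    both names here are hypothetical.
    """
    if not math.isfinite(loss):
        return False  # skip: loss is NaN or inf, e.g. right after pruning
    apply_update()
    return True

steps_taken = []
safe_step(0.37, lambda: steps_taken.append("step"))       # finite -> applied
safe_step(float("nan"), lambda: steps_taken.append("x"))  # NaN -> skipped
print(steps_taken)  # ['step']
```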
-
Hi, while executing:
`torchrun --nproc_per_node gpu -m sae meta-llama/Meta-Llama-3-8B --distribute_modules --batch_size 1 --layers 24 25 --grad_acc_steps 8 --ctx_len 2048 --k 192 --load_in_8bit --mic…
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### YOLOv8 Component
Train
### Bug
…
-
I'm working on a multi-task classification model with DistilBERT and 4 labels, based on your repo, and I was wondering if you could help me, since I'm having a hard time trying to reach the Hugging F…
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…
-
code:
```python
import torch
import transformers
from transformers import AutoTokenizer, AutoModelForCausalLM, TrainingArguments
import pyreft
from huggingface_hub import login

login(token="***")
model_n…
```
-
This extension is much more efficient and simpler to use than kohya; I like it a lot!
However, I am having a frequent issue where it fills up memory right before training starts. After the following…