on-device-training Search Results

1000+ results
for on-device-training

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

FlagOpen/FlagEmbedding #957

How to implement LongLLM in NPU device

When I create a LongLLM training environment on an NPU device, the installation of the flash-attention dependency is not possible. Are LongLLM training scripts allowed to be utilized on NPU?

yangjq713 updated 1 month ago
1
jdb78/pytorch-forecasting #1588

[BUG] Issue with using optimise_hyperparameter with PyTorch …

Hi community, I have been stuck on this issue for some time now and would greatly appreciate any help! I am trying to run the optimise_hyperparameter function over 2 A100GPU using PyTorch DDP strat…

aman1b updated 6 days ago
2
ParticleMedia/RAGTruth #9

[BUG] ValueError(f"Could not find the transformer layer clas…

Objective: To train and evaluate a model on RAGTruth dataset Settings: OS: Ubuntu WSL Python: 3.12.4 NVIDIA Driver Version: 536.23 CUDA Version: 12.2 Replication steps: 1. Git clone 2. Run…

Vanessa-Taing updated 20 hours ago
2
unslothai/unsloth #963

TypeError in `orpo_trainer.train()`: 'str' object is not cal…

When running the `ORPO Unsloth Example.ipynb` notebook, I encountered an error during the execution of `orpo_trainer.train()`. The error occurs consistently across different GPU types and persists eve…

kdunee updated 1 day ago
7
instructlab/instructlab #1344

`ilab train` issue on Windows

**Describe the bug** ilab train on windows exits with `PermissionError: [WinError 32] The process cannot access the file because it is being used by another process: './training_results/final\\mode…

sumanair updated 2 weeks ago
2
kohya-ss/sd-scripts #1551

Running Flux Lora training on 2 GPUs

First of all, many thanks for doing this! This is the only repo I'm aware of which allows doing Flux Lora training on a 16GB GPU. I appreciate this is new and the lack of information is unavoidable. …

innokean updated 1 day ago
11
yangjianxin1/Firefly #270

ValueError: You can't train a model that has been loaded in …

在用 torchrun --nproc_per_node=4 train.py --train_args_file train_args/sft/qlora/qwen2-7b-sft-qlora.json 训练qwen2+qlora+unsloth时（use_unsloth=true）出现错误： ValueError: You can't train a model that has bee…

WeixuanXiong updated 2 months ago
1
wavefrontshaping/complexPyTorch #30

ComplexDropout2d Device Error

Hi, thank you for the nice library. There seems to be a small mistake in the complexPyTorch.complexLayers.ComplexDropout2d layer, which gives a device mismatch error (torch version 2.0.1+cu118): …

lucacoma updated 1 day ago
2
huggingface/trl #1937

Is there a way to finetune a finetuned model?

I use peft and SFTrainer to train a lora model. And I want to fintune this lora model on new datasets. How can I manage this? When using the code below, the grad norm is always zero. ```python pef…

sherlcok314159 updated 2 weeks ago
1
openstreetmap/id-tagging-schema #1301

Make leisure=fitness_station preset same as leisure=playgrou…

Hello! I don't quite get a reason of difference between playground and fitness_station iD presets. Both places are basically the same, only user age changes. Current iD preset: * [`leisure=playgro…

radioxoma updated 3 weeks ago
4

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for on-device-training

1000+ results
for on-device-training