-
Hello, I applied FA3 when fine-tuning the Qwen2 model on an H800 machine. Under the same conditions, it tested slower than FA2.
I used FlashAttnFunc.forward in hopper/flash_attn_interface…
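For an apples-to-apples FA2 vs. FA3 comparison, it helps to time both kernels with the same harness, discarding warm-up iterations (the first calls pay for kernel compilation and allocator setup). A minimal, pure-Python sketch of such a harness; on GPU you would additionally call `torch.cuda.synchronize()` around each timed call so the measurement covers the actual kernel, not just the launch:

```python
import time
from statistics import median

def benchmark(fn, *args, warmup=3, iters=10):
    """Return the median wall-clock latency (seconds) of fn(*args),
    after `warmup` untimed calls to exclude one-time setup costs."""
    for _ in range(warmup):
        fn(*args)  # warm-up: kernel compilation, caches, allocator
    times = []
    for _ in range(iters):
        t0 = time.perf_counter()
        fn(*args)
        times.append(time.perf_counter() - t0)
    return median(times)
```

You would pass the FA2 and FA3 attention calls (with identical q/k/v shapes, dtype, and causal settings) as `fn` to rule out measurement artifacts.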
-
### Feature request
It seems there is no config for DeBERTa v1/v2/v3 as a decoder (while such configs exist for BERT/RoBERTa and similar models). This is needed in order to perform TSDAE unsupervised…
-
### Checklist
- [X] I have read the README.md and dependencies.md files
- [X] I have confirmed that no existing issue or discussion covers this BUG
- [X] I have confirmed that the problem occurs with the latest code or the stable release
- [X] I have confirmed that the problem is unrelated to the API
- [X] I have confirmed that the problem is unrelated to the WebUI
- [X] I have confirmed that the problem is unrelated to Finetune
##…
-
Thanks for your awesome work on model merging! I'm excited about the improvements you achieved compared to other merging methods. However, I noticed that the individually fine-tuned models still outperform WEM…
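For context, the simplest merging baseline that fine-tuned models are usually compared against is plain (weighted) averaging of checkpoint parameters. A minimal sketch, assuming all checkpoints share the same architecture and parameter names; this is a generic illustration, not the paper's method:

```python
def average_state_dicts(state_dicts, weights=None):
    """Element-wise weighted average of model state dicts.

    All dicts must have identical keys and matching tensor shapes.
    Works with plain numbers here; with torch tensors the same code
    applies since `+` and `*` broadcast element-wise."""
    n = len(state_dicts)
    weights = weights or [1.0 / n] * n  # default: uniform average
    return {
        k: sum(w * sd[k] for w, sd in zip(weights, state_dicts))
        for k in state_dicts[0]
    }
```

Individually fine-tuned models often retain an edge on their own task because averaging trades per-task specialization for a single shared set of weights.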
-
I'm sorry to bother you. I want to ask about the difference between the two ways of obtaining pre-trained models; I'm not sure whether my understanding is correct.
**The first is in the "Getting a pre-trained model for f…
-
In axolotl, there's a config parameter you can set:
`train_on_inputs: false`
It changes the way the loss is calculated when training a LoRA: it ignores the loss on input tokens and only tra…
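The usual mechanism behind this kind of option is label masking: prompt positions get the label `-100`, the index that PyTorch's cross-entropy loss ignores, so gradients flow only from completion tokens. A minimal sketch (function name hypothetical, not axolotl's actual implementation):

```python
IGNORE_INDEX = -100  # convention: cross-entropy skips positions with this label

def build_labels(input_ids, prompt_len, train_on_inputs=False):
    """Return per-token labels for causal-LM fine-tuning.

    With train_on_inputs=False, the first `prompt_len` positions are
    masked with IGNORE_INDEX, so the loss is computed only on the
    completion tokens; with True, every token contributes to the loss."""
    if train_on_inputs:
        return list(input_ids)
    return [IGNORE_INDEX] * prompt_len + list(input_ids[prompt_len:])
```

Example: for a 5-token sequence whose first 2 tokens are the prompt, the labels become `[-100, -100, t3, t4, t5]`.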
-
I had a question regarding LoRA support for image classification and segmentation. I understand that LoRA support is available for both as specified in the following tutorials:
https://github.com/hug…
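Regardless of the modality (text, classification, or segmentation), LoRA applies the same low-rank update to selected linear layers: the frozen weight `W` is augmented with a trainable product `B @ A` scaled by `alpha / r`. A minimal pure-Python sketch of the forward pass, for illustration only (not the PEFT library's implementation):

```python
def matvec(M, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m_ij * x_j for m_ij, x_j in zip(row, x)) for row in M]

def lora_forward(W, A, B, x, alpha=1.0):
    """LoRA-adapted linear layer: y = W x + (alpha / r) * B (A x).

    Shapes: W is d_out x d_in (frozen), A is r x d_in, B is d_out x r;
    only A and B are trained, so the trainable parameter count scales
    with the rank r rather than with d_out * d_in."""
    r = len(A)
    base = matvec(W, x)
    delta = matvec(B, matvec(A, x))  # low-rank correction
    return [b + (alpha / r) * d for b, d in zip(base, delta)]
```

The same module wraps attention or classifier-head linears in either task; only which layers you target differs between the tutorials.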
-
Hi!
When I queue an image for the first time, it takes significantly longer than subsequent requests. The issue seems to be related to the applied providers: it shows antelopev2 and buffalo_l in th…
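This first-request penalty is typical of lazy model loading: the initial call pays for loading the provider models, after which they are served from a cache. A common workaround is to warm the cache at startup so no user request pays that cost. A generic sketch (the model names come from the log above; the loader itself is a hypothetical stand-in):

```python
import time
from functools import lru_cache

@lru_cache(maxsize=None)
def get_model(name):
    """Load a model by name, caching the result.

    The first call per name pays the full load cost; later calls
    return the cached object near-instantly."""
    time.sleep(0.01)  # stand-in for a slow model/provider load
    return f"model:{name}"

def warm_up(names=("antelopev2", "buffalo_l")):
    """Pre-load models at startup so the first queued image is fast."""
    for n in names:
        get_model(n)
```

Calling `warm_up()` once when the server starts moves the slow load out of the request path.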
-
Hi,
Thank you for your great work.
If you don't mind, could you provide us with minimal code or instructions to reproduce the results from the paper?
Alternatively, a minimal script to run the code woul…
-
Hello, Professor!
When I fine-tune codebert, graphcodebert and unixcoder on the downstream tasks, they all fail with the same error, which is as follows: `==================== LOADING ====================
Loaded conf…