-
Hi,
Thank you for the code release and your continued support.
In the main paper (Sec. 4.2.3) you mention fine-tuning (FT) experiments following the FT protocol in "Training
data-efficient image transformers & di…
-
## New feature
It would be extremely useful if GPU usage metrics were recorded for GPU tasks.
## Usage scenario
Using GPU resources efficiently on HPC is often a challenge. For example, basec…
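To make the request concrete, here is a minimal, hedged sketch (not the tool's actual implementation) of how a wrapper around a GPU task could sample utilization and memory with `pynvml` (the NVIDIA Management Library bindings); the sampling interval and the threading scaffold are my assumptions.

```python
import time
import threading
import pynvml  # requires nvidia-ml-py and an NVIDIA GPU

def sample_gpu_metrics(samples, stop_event, interval_s=5.0, device_index=0):
    """Append (timestamp, gpu_util_percent, mem_used_bytes) tuples to
    `samples` until `stop_event` is set."""
    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(device_index)
    try:
        while not stop_event.is_set():
            util = pynvml.nvmlDeviceGetUtilizationRates(handle)  # .gpu is a percentage
            mem = pynvml.nvmlDeviceGetMemoryInfo(handle)         # .used is in bytes
            samples.append((time.time(), util.gpu, mem.used))
            stop_event.wait(interval_s)
    finally:
        pynvml.nvmlShutdown()

# Hypothetical usage around a GPU task:
samples, stop = [], threading.Event()
t = threading.Thread(target=sample_gpu_metrics, args=(samples, stop), daemon=True)
t.start()
# ... run the GPU task here ...
stop.set()
t.join()
```

Persisting the collected samples alongside the task's other logs would give exactly the per-task GPU usage record the feature request asks for.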
-
When using the default settings of
`stochastic_gradient_descend_hyperparam_optimization(X_train, y_train, init_param_guess=np.array([1.0, 100.0]), max_stagnating_iterations=8,
…
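The call above is truncated and the function's internals are not shown in the excerpt, so the following is only a sketch of the general pattern the parameter names suggest: a finite-difference gradient-descent search that stops once the loss has not improved for `max_stagnating_iterations` consecutive steps. Everything here except the two parameter names taken from the call is an assumption, not the library's actual implementation.

```python
import numpy as np

def sgd_hyperparam_search(loss_fn, init_param_guess, lr=1e-2,
                          max_stagnating_iterations=8, eps=1e-4, max_iters=1000):
    """Gradient descent with finite-difference gradients; stops after the
    loss has failed to improve for `max_stagnating_iterations` steps."""
    params = np.asarray(init_param_guess, dtype=float)
    best = loss_fn(params)
    stagnating = 0
    for _ in range(max_iters):
        # Central-difference gradient estimate, one coordinate at a time.
        grad = np.zeros_like(params)
        for i in range(params.size):
            step = np.zeros_like(params)
            step[i] = eps
            grad[i] = (loss_fn(params + step) - loss_fn(params - step)) / (2 * eps)
        params -= lr * grad
        loss = loss_fn(params)
        stagnating = 0 if loss < best else stagnating + 1
        best = min(best, loss)
        if stagnating >= max_stagnating_iterations:
            break  # stagnation-based early stop
    return params
```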
-
> Note: this post is largely excerpted from [^2]
**Figure 1: The evolutionary tree of large language models**[^1]
## 0x00 Fine-tuning large models
After pre-training, a large model acquires general capabilities for solving a wide range of tasks. However, a growing body of research shows that the capabilities of large language models can be further adapted to specific objectives.
This is what fine-tuning does. There are currently two main approaches to fine-tuning large models[^2]:
1. Instruction tuning, whose goal is to enhance (or unlock) the capabilities of large language models.
2. Alignment tuning, whose goal is to align large…
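As a rough illustration of the first approach (not from the post being excerpted): in its simplest form, instruction tuning is supervised fine-tuning on instruction-response pairs. A minimal sketch with the Hugging Face `transformers` Trainer; the base model, the two toy pairs, and all hyperparameters are placeholders.

```python
import torch
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          Trainer, TrainingArguments)

model_name = "gpt2"  # placeholder; any causal LM works
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Toy instruction-response pairs, formatted into single training strings.
pairs = [("Translate to French: hello", "bonjour"),
         ("What is 2 + 2?", "4")]
texts = [f"Instruction: {q}\nResponse: {a}" for q, a in pairs]

class InstructionDataset(torch.utils.data.Dataset):
    def __init__(self, texts):
        self.enc = tokenizer(texts, truncation=True, padding="max_length",
                             max_length=64, return_tensors="pt")
    def __len__(self):
        return self.enc["input_ids"].size(0)
    def __getitem__(self, i):
        ids = self.enc["input_ids"][i]
        # For a real run you would mask padding positions with -100.
        return {"input_ids": ids,
                "attention_mask": self.enc["attention_mask"][i],
                "labels": ids.clone()}  # causal LM: labels mirror inputs

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", num_train_epochs=1,
                           per_device_train_batch_size=2),
    train_dataset=InstructionDataset(texts),
)
trainer.train()
```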
-
This RFC was re-created due to a problem with the original. Summary of comments from previous issue below.
### 🚀 The feature
**TL;DR -** We want to lean into **modular Multi-Threading/Multi-Pr…
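The TL;DR is cut off, but to make "modular multi-threading/multi-processing" concrete, here is a minimal, hedged sketch (unrelated to the RFC's actual design) of a map stage whose parallelism backend is swappable via Python's `concurrent.futures`:

```python
from concurrent.futures import ThreadPoolExecutor, ProcessPoolExecutor

def parallel_map(fn, items, backend="thread", workers=4):
    """Apply fn to items with a swappable parallelism backend."""
    pool_cls = {"thread": ThreadPoolExecutor,
                "process": ProcessPoolExecutor}[backend]
    with pool_cls(max_workers=workers) as pool:
        return list(pool.map(fn, items))

def decode(x):  # module-level so it is picklable for process pools
    return x * x

if __name__ == "__main__":
    # Threads suit I/O-bound stages; processes sidestep the GIL for CPU-bound ones.
    print(parallel_map(decode, range(8), backend="thread"))
    print(parallel_map(decode, range(8), backend="process"))
```

Keeping the backend a per-stage parameter rather than a global setting is what makes this kind of design "modular": each stage of a pipeline can pick whichever executor fits its workload.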
-
### 🐛 Describe the bug
When trying to train both LoRA layers on the base model and also setting `modules_to_save` on the LoRA config, which makes the embedding layers trainable (my assumption is it also ap…
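For context, a minimal sketch of the kind of setup being described, using `peft`'s real `LoraConfig`/`get_peft_model` API; the base model (GPT-2) and the specific module names are placeholders I chose, not the reporter's actual config.

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("gpt2")  # placeholder model

config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["c_attn"],   # LoRA on GPT-2's attention projections
    modules_to_save=["wte"],     # also train a full copy of the embedding layer
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, config)
model.print_trainable_parameters()  # LoRA params + the fully trainable saved module
```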
-
# URL
- https://arxiv.org/abs/2404.03592
# Affiliations
- Zhengxuan Wu, N/A
- Aryaman Arora, N/A
- Zheng Wang, N/A
- Atticus Geiger, N/A
- Dan Jurafsky, N/A
- Christopher D. Manning, N/A
…
-
**As an** LLM researcher
**I need** to develop standardized testing protocols for fine-tuning LLMs
**So that** I can evaluate the effectiveness and efficiency of different fine-tuning strategies
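Nothing below comes from the story itself; it is just one hypothetical shape such a standardized protocol could take: fix the data, seed, and budget in one config, then run every strategy through the same harness so effectiveness (score) and efficiency (wall-clock time) are directly comparable. All names are made up.

```python
import time
from dataclasses import dataclass

@dataclass(frozen=True)
class EvalProtocol:
    """Settings held fixed across every strategy so results stay comparable."""
    dataset: str
    seed: int
    max_steps: int
    metric: str  # e.g. "accuracy"

def evaluate_strategy(name, finetune_fn, score_fn, protocol):
    """Run one fine-tuning strategy under the shared protocol and
    report both quality (effectiveness) and wall-clock cost (efficiency)."""
    start = time.time()
    model = finetune_fn(protocol.dataset, protocol.seed, protocol.max_steps)
    return {"strategy": name,
            "score": score_fn(model, protocol.dataset, protocol.metric),
            "wall_clock_s": round(time.time() - start, 2)}

# Hypothetical usage with stub strategies and a stub scorer:
protocol = EvalProtocol(dataset="toy", seed=0, max_steps=10, metric="accuracy")
strategies = {"full_ft": lambda d, s, m: "model-a",
              "lora": lambda d, s, m: "model-b"}
results = [evaluate_strategy(n, f, lambda mdl, d, k: 0.5, protocol)
           for n, f in strategies.items()]
print(results)
```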
###…
-
First I used the fine-tuning documentation to fine-tune it myself,
starting from "yolo_world_v2_s_vlpan_bn_2e-4_80e_8gpus_mask_refine_finetune_coco.pth" from the COCO dataset,
but using …
-
## Keyword: efficient
### End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs
- **Authors:** Javier Campos, Zhen Dong, Javier Duarte, Amir Gholami, Michael W. Mahoney,…