-
Hello, are there any recent plans to open-source the code for this paper?
-
I am trying to scale from single-GPU to multi-node distributed fine-tuning for the Llama3-70B and Llama3-8B models.
Below is my training configuration:
SFT (Llama3 8B & 70B)
Epochs: 3
Gradient Accumulatio…
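The configuration preview above is truncated. As a minimal sketch of the kind of entry script that scales from a single GPU to multiple nodes (assuming a `torchrun` launch; the model stand-in, hyperparameters, and script name `sft_train.py` are placeholders, not values from this report):
```python
# Minimal sketch (assumption): a script launched with torchrun that works
# unchanged from one GPU up to multiple nodes. torchrun sets RANK, LOCAL_RANK,
# WORLD_SIZE, MASTER_ADDR and MASTER_PORT; the model and loop are stand-ins,
# not the reporter's actual Llama3 SFT setup.
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP


def main():
    local_rank = int(os.environ["LOCAL_RANK"])
    dist.init_process_group(backend="nccl")
    torch.cuda.set_device(local_rank)

    model = torch.nn.Linear(4096, 4096).cuda(local_rank)  # stand-in for the LLM
    model = DDP(model, device_ids=[local_rank])
    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

    for step in range(10):  # stand-in for the SFT loop (epochs, grad accumulation, ...)
        x = torch.randn(2, 4096, device=local_rank)
        loss = model(x).pow(2).mean()
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```
A typical launch would be `torchrun --nnodes=<num_nodes> --nproc_per_node=<gpus_per_node> --rdzv_backend=c10d --rdzv_endpoint=<master_host>:<port> sft_train.py`; for 70B-scale models the DDP wrapper would normally be swapped for FSDP or a DeepSpeed ZeRO engine, but the process-group setup and launch command stay the same.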
-
### Required prerequisites
- [X] I have read the documentation.
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/safe-rlhf/issues) and [Discussions](https://github.com/PKU-…
-
Hi, I am confused by this bug when using memory_efficient_attention. It seems the embedding dimension per head you chose isn't supported by xformers?
```
NotImplementedError: No operator found for `memo…
```
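For what it is worth, one common cause of this error is a per-head embedding dimension that none of the installed xformers kernels accept (head dimensions that are multiples of 8 and at most 128 are the most broadly supported). A small sketch of the expected call, assuming the public `xformers.ops.memory_efficient_attention` API with tensors shaped `[batch, seq_len, num_heads, head_dim]`; the concrete sizes are illustrative only:
```python
# Hedged sketch: check the per-head dimension before calling xformers.
# Sizes are illustrative; requires a CUDA device and xformers installed.
import torch
import xformers.ops as xops

batch, seq_len, num_heads, head_dim = 2, 128, 16, 64  # 64 = widely supported head dim

q = torch.randn(batch, seq_len, num_heads, head_dim, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)

# Expected layout is [batch, seq_len, num_heads, head_dim]. An unusual head_dim
# (or an unsupported dtype/device) can leave no eligible kernel and raise the
# "No operator found for memory_efficient_attention_forward" error above.
out = xops.memory_efficient_attention(q, k, v)
print(out.shape)  # torch.Size([2, 128, 16, 64])
```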
-
### 🚀 The feature, motivation and pitch
Fuyou Training Framework Integration for PyTorch
Description:
Integrate the Fuyou training framework into PyTorch to enable efficient fine-tuning of larg…
-
To enable efficient training on GPUs and scale our repository to models with millions to billions of parameters, which is essential for working with large visual language models, we must implement optimization …
-
## Typology of Efficient Training
- Data & Model Parallel
- Data Parallel
- Tensor Parallel
- Pipeline Parallel
- Zero Redundancy Optimizer (ZeRO) (DeepSpeed, often works with CPU offloadi…
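As a quick illustration of the last item, a minimal sketch of turning on ZeRO stage 3 with CPU offloading through DeepSpeed (the tiny model, batch sizes, and learning rate are placeholders; the config keys are the standard DeepSpeed ones, and the script is assumed to be started with the `deepspeed` launcher):
```python
# Hedged sketch: DeepSpeed ZeRO stage-3 with parameter/optimizer offload to CPU.
# The model and hyperparameters are placeholders, not a recommendation.
import torch
import deepspeed

ds_config = {
    "train_micro_batch_size_per_gpu": 2,
    "gradient_accumulation_steps": 8,
    "optimizer": {"type": "AdamW", "params": {"lr": 1e-5}},
    "bf16": {"enabled": True},
    "zero_optimization": {
        "stage": 3,
        "offload_param": {"device": "cpu"},
        "offload_optimizer": {"device": "cpu"},
    },
}

model = torch.nn.Sequential(torch.nn.Linear(1024, 4096), torch.nn.Linear(4096, 1024))
engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)
# engine.backward(loss) and engine.step() then replace the usual
# loss.backward() / optimizer.step() calls.
```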
-
[PARAMETERS.txt](https://github.com/user-attachments/files/16852584/PARAMETERS.txt)
-
Does DeepSpeed support fine-tuning an extra model with LoRA?
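DeepSpeed itself does not provide LoRA; the usual pattern is to attach adapters with a library such as Hugging Face PEFT and hand only the trainable LoRA parameters to DeepSpeed. A minimal sketch under that assumption (checkpoint name, target modules, and config values are placeholders):
```python
# Hedged sketch: attaching LoRA adapters (via Hugging Face PEFT) and letting
# DeepSpeed train only the adapter parameters. All names/values are placeholders.
import deepspeed
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")  # placeholder checkpoint
lora_cfg = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # adjust to the actual model's module names
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, lora_cfg)

engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    # pass only the LoRA weights so the optimizer (and ZeRO partitioning) skips frozen params
    model_parameters=[p for p in model.parameters() if p.requires_grad],
    config={
        "train_micro_batch_size_per_gpu": 1,
        "optimizer": {"type": "AdamW", "params": {"lr": 1e-4}},
        "zero_optimization": {"stage": 2},
    },
)
```
The same wrap-then-initialize pattern would apply to any additional model (e.g. a reward or cost model) that needs its own adapters.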
-