-
To enable efficient training on GPUs and scale our repository to models with millions to billions of parameters, which is essential for working with large vision-language models, we must implement optimization …
-
### Question
Is there a plan to consider compiling Bootstrap into a native image? A native binary starts faster and uses fewer resources when scaling out to handle bursts of traffic.
1. Many of the frameworks it depends on do not support native compilation.
2. ShenYu itself has features, such as uploading and compiling jars, that a native image cannot support.
-
### Describe the feature
I want to continue pre-training Llama 2 70B using my own data, which is about 1B tokens. I have read [Fine-tuning Llama 2 70B using PyTorch FSDP](https://huggingface.co/bl…
-
## Current Implementation
Our `ResolveOperation` class uses a Union-Find (aka Disjoint Set Union) algorithm for grouping similar items efficiently. Here's how it works:
We've got two main data str…
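The description breaks off here, but for reference, a minimal Union-Find of the usual shape, two parallel arrays (parent pointers and per-root ranks) plus `find`/`union` operations, could look like the sketch below. The class and method names are illustrative and not taken from the actual `ResolveOperation` implementation.

```python
class UnionFind:
    """Minimal Disjoint Set Union sketch with path halving and union by rank.
    Illustrative only; not the actual ResolveOperation implementation."""

    def __init__(self, n: int):
        # The two core structures: parent pointers and per-root rank bounds.
        self.parent = list(range(n))
        self.rank = [0] * n

    def find(self, x: int) -> int:
        # Path halving: point every other visited node at its grandparent,
        # flattening the tree as a side effect of lookups.
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]
            x = self.parent[x]
        return x

    def union(self, a: int, b: int) -> bool:
        ra, rb = self.find(a), self.find(b)
        if ra == rb:
            return False  # already in the same group
        # Union by rank: attach the shallower tree under the deeper one.
        if self.rank[ra] < self.rank[rb]:
            ra, rb = rb, ra
        self.parent[rb] = ra
        if self.rank[ra] == self.rank[rb]:
            self.rank[ra] += 1
        return True


# Hypothetical usage: link similar items, then query their groups.
uf = UnionFind(5)
uf.union(0, 1)
uf.union(3, 4)
assert uf.find(0) == uf.find(1)
assert uf.find(0) != uf.find(3)
```

With both path halving and union by rank, each operation runs in near-constant amortized time, which is why the structure suits grouping large numbers of similar items.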
-
### Description & Motivation
The MeZO paper proposes a memory-efficient zeroth-order optimizer (MeZO), adapting the classical zeroth-order SGD method to operate in place, thereby fine-tuning language models (L…
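The excerpt cuts off, but the core trick MeZO describes, regenerating the random perturbation from a saved RNG seed so the gradient estimate and update run in place without storing a second copy of the parameters, can be sketched roughly as follows. This follows the paper's published algorithm at a high level; `mezo_step`, `loss_fn`, and the hyperparameter values are illustrative assumptions, not MeZO's actual API.

```python
import torch

def mezo_step(model, loss_fn, batch, eps=1e-3, lr=1e-6):
    """One in-place zeroth-order SGD step in the style of MeZO (sketch).
    loss_fn(model, batch) is assumed to return a scalar loss tensor."""
    # Save a seed so the same perturbation z can be regenerated on demand
    # instead of being stored alongside the parameters.
    seed = torch.randint(0, 2**31 - 1, (1,)).item()

    def perturb(scale):
        # Regenerate the same z from the seed; apply theta += scale * eps * z.
        gen = torch.Generator(device="cpu").manual_seed(seed)
        for p in model.parameters():
            z = torch.randn(p.shape, generator=gen).to(p.device)
            p.data.add_(scale * eps * z)

    with torch.no_grad():
        perturb(+1)                        # theta + eps*z
        loss_plus = loss_fn(model, batch)
        perturb(-2)                        # theta - eps*z
        loss_minus = loss_fn(model, batch)
        perturb(+1)                        # restore original theta

        # Projected gradient estimate, then the update theta -= lr * g * z,
        # regenerating z once more from the same seed.
        g = (loss_plus - loss_minus) / (2 * eps)
        gen = torch.Generator(device="cpu").manual_seed(seed)
        for p in model.parameters():
            z = torch.randn(p.shape, generator=gen).to(p.device)
            p.data.add_(-lr * g * z)
    return loss_plus
```

Because only two forward passes and no backward pass are needed, peak memory stays close to inference-time usage, which is the property that makes the method attractive for large models.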
-
### 🚀 The feature, motivation and pitch
Fuyou Training Framework Integration for PyTorch
Description:
Integrate the Fuyou training framework into PyTorch to enable efficient fine-tuning of larg…
-
### Question
I finally managed to fine-tune LLaVA on a custom dataset (LLaVA-1.5-7b on Google Colab using a single A100 GPU)
The output I got was mostly an `adapter_model.safetensors` file (610 MB) -- …
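The question is truncated here, but a common follow-up with an adapter-only checkpoint like `adapter_model.safetensors` is merging it back into the base weights. Below is a rough sketch using the `peft` library's `PeftModel.from_pretrained` and `merge_and_unload`; the model ID and paths are placeholders, and LLaVA checkpoints trained with the original LLaVA repo may need that repo's own loading utilities instead.

```python
from transformers import AutoModelForCausalLM
from peft import PeftModel

# Placeholder identifiers -- substitute your actual base model and adapter dir.
base = AutoModelForCausalLM.from_pretrained("base-model-id")

# Wraps the base model and loads adapter_model.safetensors from the directory.
model = PeftModel.from_pretrained(base, "path/to/adapter_dir")

# Fold the LoRA deltas into the base weights and drop the adapter wrapper,
# yielding a plain model that can be saved and loaded without peft.
merged = model.merge_and_unload()
merged.save_pretrained("path/to/merged-model")
```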
-
[PARAMETERS.txt](https://github.com/user-attachments/files/16852584/PARAMETERS.txt)
-
> Today we’re releasing the next step: QDoRA. This is just as memory efficient and scalable as FSDP/QLoRA, and critically is also as accurate for continued pre-training as full weight training. We thi…
-