-
Torchtune is a great project that explains such a complex fine-tuning process in such an elegant way.
I think having a simple benchmark against other popular LLM fine-tuning approaches would be valu…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
[2024-09-17 10:58:53,418] [INFO] [real_accelerator.py:203:get_accelerator] Setting ds_accelerator to cuda…
-
## Current Default
- `target_file_size_multiplier = 1`
- `block_size = 4096`
- `OptimizeLevelStyleCompaction(512M)` implies
  - `target_file_size_base = 64M`
- snappy/lz4 compression types
…
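A minimal sketch of how these defaults interact, assuming RocksDB's documented rule that `OptimizeLevelStyleCompaction(budget)` sets `target_file_size_base = budget / 8` (the helper name `derived_file_sizes` is hypothetical, for illustration only):

```python
def derived_file_sizes(memtable_memory_budget: int,
                       target_file_size_multiplier: int = 1,
                       num_levels: int = 4) -> tuple[int, list[int]]:
    """Derive per-level target file sizes from a memory budget.

    Assumption: OptimizeLevelStyleCompaction sets target_file_size_base
    to budget / 8, and level L files target base * multiplier ** (L - 1).
    """
    base = memtable_memory_budget // 8
    sizes = [base * target_file_size_multiplier ** level
             for level in range(num_levels)]
    return base, sizes

MB = 1024 * 1024
base, sizes = derived_file_sizes(512 * MB)
# With a 512M budget and multiplier = 1, every level targets 64M files.
```

With `target_file_size_multiplier = 1` (the current default above), files stay the same size at every level; a multiplier of 2 would instead double the target file size at each deeper level.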
-
Supports SFT and RLHF pipelines for instruction datasets such as Alpaca: https://github.com/hiyouga/LLaMA-Efficient-Tuning
LoRA fine-tuning runs on a single 3090 GPU, and QLoRA is also supported (minimum 12 GB of VRAM).
LoRA weights of the fine-tuned model: https://huggingface.co/hiyouga/baichuan-7b-sft
Run the following commands to…
-
### 🐛 Describe the bug
When I was fine-tuning Llama2-70b on Intel GPUs (64 GB/card, 8 cards), I hit an out-of-memory issue after the FSDP wrap. Here is the printed log from before and after the FSDP wrap:
>===memo…
-
### 🚀 The feature, motivation and pitch
PPO and a number of other LLM fine-tuning techniques require autoregressive generation as part of the training process. When using vLLM to speed up the autor…
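The generation step described here can be sketched as a toy rollout loop (the `policy` callable and token ids below are hypothetical stand-ins for a real model; in practice this inner decode loop is exactly what vLLM would accelerate):

```python
import random

def generate(policy, prompt, max_new_tokens=8):
    """Stub autoregressive decode: append one sampled token at a time.

    A real setup would run the model forward pass per step (or hand the
    whole decode to an engine like vLLM) instead of this toy policy.
    """
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        tokens.append(policy(tokens))
    return tokens

def ppo_rollout(policy, prompts):
    # Rollout phase of PPO: one full autoregressive generation per prompt.
    # This dominates training time, which motivates a faster backend.
    return [generate(policy, p) for p in prompts]

policy = lambda ctx: random.randrange(32000)  # toy "model" over a 32k vocab
rollouts = ppo_rollout(policy, [[1, 2], [3]])
```

The sketch only illustrates why generation sits on the training critical path; the scoring and policy-update phases of PPO are omitted.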
-
We had a bug in our code that caused us to publish thousands of events in a single unit of work. It ended up triggering some Axon behavior that brought our application to its knees.
If there is a v…
-
Hello,
Thanks for the great job you did. I was wondering: do you have any tips or ideas for improving the current accuracy? Where do you think it lacks the most? I would like to try and improve the …
-
## Is your feature request related to a problem? Please describe.
I'm always frustrated when I'm looking at the replication worker code. It takes an object from the storage, decompresses it, unmarsh…
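The decompress-then-unmarshal hot path described above can be sketched as follows (gzip and JSON are assumptions for illustration; the real worker's formats and the `load_object` name may differ):

```python
import gzip
import json

def load_object(blob: bytes) -> dict:
    # Given a blob fetched from storage, decompress it,
    # then unmarshal the payload (JSON assumed here).
    return json.loads(gzip.decompress(blob))

# Round-trip example with a toy record:
stored = gzip.compress(json.dumps({"id": 1, "state": "ok"}).encode())
obj = load_object(stored)
```

Doing both steps on every object per replication pass is the overhead the request is about.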
-
With all the growing activity and focus on multimodal models, is this library restricted to tuning text-only LLMs?
Do we plan to add support for tuning vision or, more generally, multimodal models?