TsinghuaC3I / Intuitive-Fine-Tuning

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
https://arxiv.org/abs/2405.11870

IFT: Intuitive Fine-Tuning

Overview

This repository contains the code for the paper "Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process".

The code is based on the eric-mitchell/direct-preference-optimization repository.

Setup

pip install -r requirements.txt

Running IFT

bash commands/run_mistral_ift.sh

Hyperparameters

Citing IFT

If you find IFT useful in your research, please consider citing the following paper:

@article{hua2024intuitive,
    title={Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process},
    author={Hua, Ermo and Qi, Biqing and Zhang, Kaiyan and Yu, Yue and Ding, Ning and Lv, Xingtai and Tian, Kai and Zhou, Bowen},
    journal={arXiv preprint arXiv:2405.11870},
    year={2024}
}