Dose the ref_model really need?

opendatalab / HA-DPO

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

https://opendatalab.github.io/HA-DPO

Apache License 2.0

66 stars 6 forks source link

Closed liuxingbin closed 4 months ago

liuxingbin commented 4 months ago

Hi, Thanks for the great work, I am curious about the red model, Can I set the ref_model to None, as I see here: https://github.com/opendatalab/HA-DPO/blame/main/ha_dpo/trainer/base_dpo_trainer.py#L298. When you set reference_free=False, you do not need a ref-model to save computation resources.