Closed alvarobartt closed 5 months ago
This PR adds the ORPO fine-tuning technique to the existing `README.md` files where the available techniques are listed, and adds https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1 to the "News" section.
Additionally, this PR fixes a small typo in `ModelArguments`, from `adatpers` to `adapters`.
Thanks @nisten for realising that the docs were missing https://github.com/huggingface/alignment-handbook/pull/143#issuecomment-2051497473
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.