allenai / fm-cheatsheet

Website for hosting the Open Foundation Models Cheat Sheet.
https://fmcheatsheet.org
256 stars 18 forks source link

Intro Text for Finetuning Page #16

Closed danmcduff closed 3 months ago

danmcduff commented 4 months ago

Replace:

Finetuning data is used to hone specific capabilities, orient the model to a certain task format, improve its responses to general instructions, mitigate harmful or unhelpful response patterns, or generally align its responses to human preferences. Developers use a variety of data annotations and loss objectives for finetuning, including traditional supervised finetuning, DPO, or reinforcement learning with human feedback. Explore various data catalogs, their attached documentation, and specialized finetuning data sources.

with:

Finetuning data is used to hone a model's specific capabilities, orient it to a certain task, improve its responses to instructions, mitigate harmful or unhelpful behaviors, and/or align it to human preferences. Given the thousands of specialized data sources for finetuning, we recommend using data catalogs that provide well documented datasets.

neural-loop commented 4 months ago

https://onm-demo.aimodels.org/foundation-model-resources/finetuning-data-catalogs/