argilla-io / distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
https://distilabel.argilla.io
Apache License 2.0
1.12k stars 70 forks source link

`FormatTextGenerationSFT` with function calling #746

Open plaguss opened 1 week ago

plaguss commented 1 week ago

Description

⚠️ Work in progress

This PR improves the FormatTextGenerationSFT task to allow preparing fine tuning datasets with function calling.

codspeed-hq[bot] commented 1 week ago

CodSpeed Performance Report

Merging #746 will not alter performance

Comparing format-functioncall (7a7a629) with develop (63ee8c5)

Summary

✅ 1 untouched benchmarks