argilla-io / distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
https://distilabel.argilla.io
Apache License 2.0
1.12k stars 70 forks source link

Add `MixtureOfAgentsLLM` #735

Closed gabrielmbmb closed 2 weeks ago

gabrielmbmb commented 2 weeks ago

Description

This PR adds the MixtureOfAgents llm, an special LLM that allows combining an aggregator_llm and a list of proporsers_llms as described in this paper: https://arxiv.org/abs/2406.04692. Although LLMs in distilabel are not in charge of tweaking the input, this special LLM does as it requires to inject the Mixture-of-Agents system prompt in the system prompt of the task allowing passing the references of the proposers_llms.

⚠️ This system prompt injection should work fine with every task, although it hasn't been tested with all the tasks, only with the TextGenerationtask.

In addition and motivated by the addition of the MixtureOfAgentsLLM, this PR adds/updates the following:

codspeed-hq[bot] commented 2 weeks ago

CodSpeed Performance Report

Merging #735 will not alter performance

Comparing mixture-of-agents (d74e91c) with develop (9d6a152)

Summary

✅ 1 untouched benchmarks