argilla-io / distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
https://distilabel.argilla.io
Apache License 2.0

[FEATURE] create a pipeline playground UI #368

Open davidberenstein1957 opened 4 months ago

davidberenstein1957 commented 4 months ago

Is your feature request related to a problem? Please describe. I was speaking to Oras Al-Kubaisi, who proposed that it might be nice to have a pipeline UI playground. I think it would be interesting to build something like this on an external API framework to test what the pipeline looks like.

Describe the solution you'd like. Perhaps something in Streamlit or Gradio could be an option, but I'm not sure how extensive they are w.r.t. being able to define components.

Describe alternatives you've considered. Re-running Python code and looking at pandas/datasets reprs.

Additional context N.A.

gabrielmbmb commented 2 months ago

Hi @davidberenstein1957! This is an idea I had myself too :) I was thinking of building a small UI using Svelte that allows you to drag and drop steps from a catalog and connect them together, like no-code tools. It would generate the YAML, so you can run the pipeline from the CLI. I was thinking of hosting this on a Hugging Face Space of course, and the pipeline could be executed from there too if desired :)
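
To make the "connect steps, then generate the YAML" idea a bit more concrete, here is a minimal sketch of the data model such a UI might keep behind the canvas. Everything in it is an assumption for illustration: `PipelineGraph`, the step type strings, and the YAML layout are hypothetical and are not distilabel's actual serialization format or API.

```python
from dataclasses import dataclass, field


@dataclass
class PipelineGraph:
    """Hypothetical in-memory model of what a drag-and-drop canvas holds."""

    name: str
    # step name -> step type picked from the (hypothetical) catalog
    steps: dict = field(default_factory=dict)
    # directed edges as (source step name, target step name)
    connections: list = field(default_factory=list)

    def add_step(self, name: str, step_type: str) -> None:
        """Register a step dropped onto the canvas."""
        if name in self.steps:
            raise ValueError(f"step {name!r} already exists")
        self.steps[name] = step_type

    def connect(self, src: str, dst: str) -> None:
        """Draw an edge between two steps that are already on the canvas."""
        for endpoint in (src, dst):
            if endpoint not in self.steps:
                raise KeyError(f"unknown step {endpoint!r}")
        self.connections.append((src, dst))

    def to_yaml(self) -> str:
        """Emit a small YAML document by hand.

        Assumes names and types are plain identifiers that need no quoting.
        The layout is made up for this sketch.
        """
        lines = ["pipeline:", f"  name: {self.name}"]
        if self.steps:
            lines.append("  steps:")
            for step_name, step_type in self.steps.items():
                lines.append(f"    - name: {step_name}")
                lines.append(f"      type: {step_type}")
        else:
            lines.append("  steps: []")
        if self.connections:
            lines.append("  connections:")
            for src, dst in self.connections:
                lines.append(f"    - from: {src}")
                lines.append(f"      to: {dst}")
        else:
            lines.append("  connections: []")
        return "\n".join(lines) + "\n"


if __name__ == "__main__":
    graph = PipelineGraph("playground_demo")
    graph.add_step("load_data", "loader")      # hypothetical catalog entries
    graph.add_step("generate", "text_generation")
    graph.connect("load_data", "generate")
    print(graph.to_yaml())
```

A real implementation would presumably emit whatever format the CLI already accepts rather than this invented layout; the point is only that the UI state reduces to a list of steps plus a list of edges, which is straightforward to serialize.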