argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
https://distilabel.argilla.io
Apache License 2.0
1.61k stars 127 forks

[FEATURE] Add `huggingface_hub.utils.telemetry` #810

Open davidberenstein1957 opened 3 months ago

davidberenstein1957 commented 3 months ago

Is your feature request related to a problem? Please describe. We want to include some basic telemetry to understand how distilabel is used and send it to the Hugging Face Hub.

Describe the solution you'd like Some metrics on the number of initialized steps, the number of initialized LLMs, the number of initialized pipelines, and their throughput.

Describe alternatives you've considered N.A.

Additional context https://github.com/argilla-io/argilla/pull/5218
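To make the request more concrete, here is a rough sketch of how the metrics listed above (initialized steps, LLMs, pipelines, and throughput) might be collected. Everything in it is hypothetical: `UsageTelemetry`, `track_init`, `track_throughput`, and the `distilabel/...` topic names are made up for illustration and are not part of distilabel. The sender is injected so the sketch runs without network access. In practice it could probably wrap `huggingface_hub.utils.send_telemetry`, but that wiring is an assumption to check against the installed `huggingface_hub` version.

```python
from collections import Counter
from typing import Callable, Dict, Optional

# Hypothetical sender signature: (topic, payload) -> None.
# Assumption: a real sender could wrap huggingface_hub.utils.send_telemetry,
# e.g. send_telemetry(topic=topic, library_name="distilabel", user_agent=payload).
Sender = Callable[[str, Dict[str, str]], None]


class UsageTelemetry:
    """Hypothetical in-process usage collector; not part of distilabel."""

    def __init__(self, sender: Optional[Sender] = None, enabled: bool = True) -> None:
        self._sender = sender
        self._enabled = enabled
        self.counts: Counter = Counter()

    def track_init(self, kind: str) -> None:
        """Count one initialization of a 'step', 'llm', or 'pipeline'."""
        if not self._enabled:
            return
        self.counts[kind] += 1
        self._send(f"distilabel/{kind}/init", {"count": str(self.counts[kind])})

    def track_throughput(self, kind: str, rows: int, seconds: float) -> float:
        """Report rows processed per second and return the computed rate."""
        rows_per_second = rows / seconds if seconds > 0 else 0.0
        if self._enabled:
            self._send(
                f"distilabel/{kind}/throughput",
                {"rows_per_second": f"{rows_per_second:.2f}"},
            )
        return rows_per_second

    def _send(self, topic: str, payload: Dict[str, str]) -> None:
        if self._sender is None:
            return
        try:
            self._sender(topic, payload)
        except Exception:
            # Telemetry failures should never break a user's pipeline.
            pass


# Usage: collect events in a list instead of sending them anywhere.
sent = []
telemetry = UsageTelemetry(sender=lambda topic, payload: sent.append((topic, payload)))
telemetry.track_init("pipeline")
telemetry.track_init("step")
telemetry.track_init("step")
telemetry.track_init("llm")
rate = telemetry.track_throughput("pipeline", rows=500, seconds=2.0)
```

Injecting the sender keeps the counting logic testable offline, and the `enabled` flag leaves room for an opt-out switch.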

davidberenstein1957 commented 3 months ago

@gabrielmbmb @plaguss, in case you have some input.

davidberenstein1957 commented 3 months ago

@plaguss @gabrielmbmb, in case you have some input, feel free to add it.