-
- [ ] Generate data separately for different classes.
- [ ] Shuffle the classes
-
E.g., a simple function that takes an input directory and writes out one static MEDS dataset that is version compatible that users can see in this repo that they can use to test downstream tools.
@…
-
Wrote the according to the following example at https://distilabel.argilla.io/latest/sections/how_to_guides/advanced/serving_an_llm_for_reuse/#serving-llms-using-vllm:
```
from distilabel.llms im…
-
### Environment Details
Please indicate the following details about the environment in which you found the bug:
* SDV version: 1.16.1
* Python version: 3.8.19
* Operating System: macOS Sonoma …
-
### Core Components
- **Schema Parser and Validator**: Takes user-provided schemas (JSON or SQL DDL) as input and outputs a structured representation of the schema, such as an abstract syntax tree (A…
-
Hey,
I was wondering if you think it would be possible to create a synthetic dataset for function calling tasks?
I would like to use that dataset for a finetuning experiment.
Thanks for any guida…
-
-
Create an Evaluator class, which takes two medrecords and evaluate the differences and similarities between them.
Tasks:
- [ ] #192
- [ ] descriptive statistics
- [ ] inferential statistics
- …
-
Using [fake](https://crates.io/crates/fake), and scanning for the [faker keywords](https://github.com/cksac/fake-rs#fakers-with-locale) in a `schema`-generated jsonschema description, create more real…
-
Thanks for your wonderful job, I'm confused about where I can find the python code for generating the synthetic data, could you please help me to find it out?
ZedFm updated
2 years ago