sdv-dev / SDV

Synthetic data generation for tabular data
https://docs.sdv.dev/sdv
Other
2.21k stars 289 forks source link

Create unified Metadata class #2104

Open amontanez24 opened 5 days ago

amontanez24 commented 5 days ago

Problem Description

As a user, it can be confusing to figure out which metadata to provide for different scenarios and how to format it. Since majority of the metadata is the same for all cases, it would be nice to only have one object and format to learn.

In an effort to streamline SDV workflows, we want to unify the metadata. This means we will just have one Metadata class that all synthesizers will take, whether single table, multi table or sequential.

Expected behavior

from sdv.metadata import Metadata

single_table_metadata = Metadata.load_from_json('single_table_metadata.json')
multi_table_metadata = Metadata.load_from_json('multi_table_metadata.json')

Additional context