sdv-dev / SDMetrics

Metrics to evaluate quality and efficacy of synthetic datasets.
https://docs.sdv.dev/sdmetrics
MIT License
201 stars 44 forks source link

Wrong chart title when generating a box plot for just the real data using `get_column_pair_plot()` #615

Closed srinify closed 1 month ago

srinify commented 1 month ago

Environment Details

Please indicate the following details about the environment in which you found the bug:

Error Description

When plotting just the real data using get_column_pair_plot(), the chart title describes "Real vs Synthetic" instead of just "Real". This only occurs when trying to generate a box plot in this function (not a heatmap or scatter plot).

Screenshot 2024-07-31 at 3 25 16 PM

Steps to reproduce

from sdv.datasets.demo import download_demo
from sdmetrics.visualization import get_column_pair_plot

data, metadata = download_demo(
    modality='single_table',
    dataset_name='census'
)

fig = get_column_pair_plot(
    real_data=data,
    synthetic_data=None,
    column_names=['veterans benefits', 'weeks worked in year'],
    plot_type='box'
)

fig.show()