Closed aplotnikov2020 closed 1 year ago
Hi, thanks for reporting this!
Currently, the HighDimSynthesizer
does not consider index values and so we don't support index preservation. If the index values are required, we would recommend using the reset_index()
method to transfer these index values to columns. We may decide to support this feature in a future release if there is enough interest in it.
If you're interested in time-series applications then have a look at our (new!) regular and event-based time-series synthesizers https://docs.synthesized.io/sdk/latest/user_guide/time_series_synthesis/. These are currently a beta release and only available on the paid version.
Thanks again for sharing this! For the meantime, we will close this issue as 'Not planned'.
Currently HighDimSynthesizer does not preserve Pandas DataFrame index. Let's consider the following example:
Original data:
Synthesized data:
Desired behavior I'd expect synthesized to produce index values along with
y
values.Workarounds If synthesized data has the same number of rows, we can just concatenate the original and produced DataFrames:
Possible caveats There might be an issue with the lack of unique values for the index