hitsz-ids / synthetic-data-generator

SDG is a specialized framework designed to generate high-quality structured tabular data.
Apache License 2.0
3.24k stars 541 forks source link

Information Data Preprocessing #152

Open kajpetersen opened 5 months ago

kajpetersen commented 5 months ago

❓Search before asking

I have searched for issues similar to this one.

❓Description

Is it possible to get/find the code that is used for the preprocessing of the dataset? After evaluating the code, I saw that there is almost no correlation anymore and I was wondering how this was done.

MooooCat commented 5 months ago

Good morning! thank you for this question @kajpetersen

I am currently working on this part.

The specific branch isfeature-intro-data-processor, the link.

It is expected to be merged into the main branch next week.

The data processor will include two parts: Data Pre-processing and Data Post-processing, and support the plug-in system.