Open SharonHill opened 6 years ago
Synthetic data quality assessment methodology
Our initial testing strategy will follow the method proposed in [1]. To assess the quality of the generated synthetic data, we will work as follows:
One output of this testing methodology will be a correlation matrix similar to the one presented in [1] (Fig. 1).
Fig. 1: The correlation matrix constructed in [1] as part of their synthetic data quality assessment methodology.
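To make the correlation-matrix output more concrete, here is a minimal sketch of how the real and synthetic datasets could be compared. It assumes pairwise Pearson correlation between columns and uses the largest absolute difference between matching entries as a single summary number. The helper names (`pearson`, `correlation_matrix`, `max_correlation_gap`, `make_toy_table`), the column names, and the toy data are all hypothetical and are not taken from [1] or from our codebase.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def correlation_matrix(columns):
    """Pairwise Pearson correlations for a dict of column name -> values."""
    names = list(columns)
    return {(a, b): pearson(columns[a], columns[b]) for a in names for b in names}


def max_correlation_gap(real, synthetic):
    """Largest absolute difference between matching entries of the two matrices."""
    real_corr = correlation_matrix(real)
    synth_corr = correlation_matrix(synthetic)
    return max(abs(real_corr[key] - synth_corr[key]) for key in real_corr)


def make_toy_table(rng, n_rows, slope):
    """Hypothetical stand-in data: two numeric columns with a linear relationship."""
    age = [rng.gauss(50, 10) for _ in range(n_rows)]
    sbp = [slope * a + rng.gauss(0, 5) for a in age]
    return {"age": age, "sbp": sbp}


# Assumption: both tables share the same column names; here both are simulated.
rng = random.Random(0)
real = make_toy_table(rng, 500, slope=1.0)
synthetic = make_toy_table(rng, 500, slope=1.0)

gap = max_correlation_gap(real, synthetic)
print(f"max |corr_real - corr_synth| = {gap:.3f}")
```

A small gap suggests the synthetic data preserves the pairwise linear structure of the real data. It says nothing about higher-order relationships or privacy, so it would be only one part of the assessment.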
[1] Brett K. Beaulieu-Jones, Zhiwei Steven Wu, Chris Williams, Ran Lee, Sanjeev P. Bhavnani, James Brian Byrd, Casey S. Greene. Privacy-preserving generative deep neural networks support clinical data sharing. bioRxiv 159756; doi: https://doi.org/10.1101/159756
This should be reviewed by Methodology, with their input.