Open lydiahjchung opened 3 years ago
The preprocessing code requires having the original dataset at the date that we downloaded it. Since the TCGA is still in progress work, and the volume is big, I would not be able to provide the code for doing this part. However, there are only few steps that we have done to reach the dataset which is available:
Hopefully this answer helps you to perform the preprocessing. Good luck
@MMostavi Hi, I am attempting to understand the preprocessing statement in your paper.
To test the robustness of our models, we added Gaussian noises with zero mean and standard deviations of 0–500% (k) of ith gene's average expression level (μi), or N(0, kμ) to each gene. We set noisy gene expression level to 0 if noise added expression level is less than 0.
Can you walk me through this step? I assume it is after filtering out mean < 0.5 and std < 0.8 in the previous step?
Thank you!
@MMostavi Hi, I am attempting to understand the preprocessing statement in your paper.
To test the robustness of our models, we added Gaussian noises with zero mean and standard deviations of 0–500% (k) of ith gene's average expression level (μi), or N(0, kμ) to each gene. We set noisy gene expression level to 0 if noise added expression level is less than 0.
Can you walk me through this step? I assume it is after filtering out mean < 0.5 and std < 0.8 in the previous step?
Thank you!
Hello, if you understand it can you help me . Thank you!
@MMostavi , would it be possible to disclose the preprocessing code making the input data files?