Hi,
I downloaded the distributed dataset in the .rds format and found there're >90k genes. Although the final integrated dataset has 2k genes, but I'm wondering how much efforts in cleaning up the dataset and make gene names consistent?
Also there're genes names as date format, apparently from excel... Here's a screenshot.... Is this the correct dataset you intended to release?
Thanks,
Hurley
Hi, I downloaded the distributed dataset in the .rds format and found there're >90k genes. Although the final integrated dataset has 2k genes, but I'm wondering how much efforts in cleaning up the dataset and make gene names consistent?
Also there're genes names as date format, apparently from excel... Here's a screenshot.... Is this the correct dataset you intended to release? Thanks, Hurley