Why are there >90k genes, and with gene names e.g. 8-Mar, 6-Sep....

Hi, I downloaded the distributed dataset in the .rds format and found there're >90k genes. Although the final integrated dataset has 2k genes, but I'm wondering how much efforts in cleaning up the dataset and make gene names consistent?

Also there're genes names as date format, apparently from excel... Here's a screenshot.... Is this the correct dataset you intended to release? Thanks, Hurley

gustaveroussy / FG-Lab

Why are there >90k genes, and with gene names e.g. 8-Mar, 6-Sep.... #2