Open jimmybgammyknee opened 6 years ago
But the gct file format is pretty strict, it requires only 2 lines of description (the version line, and the row&col count line), because the following script used in the gtex pipeline is line-dependent(quite lazy). And the gene name in the outcome is depend on the the gene name used in feature count
Ah so its only for genome coordinates? I thought i saw an example which had functional information in that field...
Right, you mean the second column, I thought you mean the metadata like in the vcf files. I'll check biomart to see if I can make a dictionary obj to add that.
Maybe this? https://github.com/daler/biomartpy
Can some meaningful annotation be added to the description field? Perhaps being able to download the gene information (HUGO gene name or function) from biomart using the gene names in the first field?