Golob-Minot / geneshot

MIT License

/ref/taxonomy in HDF has tax_ids and parent IDs as objects. Should be integers #63

Closed: jgolob closed this issue 3 years ago

jgolob commented 3 years ago

The taxonomy cached in the HDF stores the tax_ids as objects rather than integers, and the same is true of the parent IDs. Ideally both would be integers, to facilitate merges later.

sminot commented 3 years ago

Ah yes, the age-old debate of tax IDs as strings vs. integers! In this case I come down on the side of 'always strings' for three reasons:

Looking forward to hearing your thoughts, @jgolob

jgolob commented 3 years ago

The case here for strings makes sense. I may instead adjust this issue so that /ref/taxonomy and /annot/gene/tax both use the same type, either strings or integers, for the tax_id column. Currently, /ref/taxonomy uses strings and /annot/gene/tax uses integers. Perhaps swap the latter to strings?
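
For illustration only, here is a minimal pandas sketch of why the two tables should agree on the type of tax_id before a merge. The two DataFrames are hypothetical stand-ins for /ref/taxonomy and /annot/gene/tax; every column name other than tax_id, and all of the rows, are assumptions made for the example and are not taken from the geneshot code.

```python
import pandas as pd

# Hypothetical stand-in for /ref/taxonomy: IDs stored as strings (object dtype).
taxonomy = pd.DataFrame({
    "tax_id": ["1", "2", "562"],
    "parent": ["1", "131567", "561"],
    "name": ["root", "Bacteria", "Escherichia coli"],
})

# Hypothetical stand-in for /annot/gene/tax: IDs stored as integers.
gene_tax = pd.DataFrame({
    "gene": ["geneA", "geneB"],
    "tax_id": [562, 2],
})

# Cast the integer key to strings so both tables share one key dtype.
gene_tax["tax_id"] = gene_tax["tax_id"].astype(str)

# With matching dtypes, a left merge attaches the taxon name to each gene.
merged = gene_tax.merge(taxonomy, on="tax_id", how="left")
print(merged[["gene", "tax_id", "name"]])
```

Casting once, when the table is written, keeps every downstream merge free of per-call type conversions.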

sminot commented 3 years ago

Great idea! I'll add that now.

sminot commented 3 years ago

Closed by a9751f5