Closed nick-youngblut closed 4 years ago
names.dmp: "scientific name" not "scientific_name"
$ more names.dmp
1 | all | | synonym |
1 | root | | scientific name |
2 | Bacteria | Bacteria <bacteria> | scientific name |
2 | Monera | Monera <bacteria> | in-part |
That did it. Thanks!
I created a script to convert the Genome Taxonomy Database (GTDB) taxonomy to nodes.dmp + names.dmp files. The output looks like:
names.dmp
nodes.dmp
taxonkit list
works as expected, buttaxonkit lineage
does not provide any lineage info. For example:Any idea why I'm not getting the full lineage info? I tried to look at the taxonkit code to see if it was filtering based on the embl code or something else, but I don't see what's the problem (it doesn't help that I don't know
go
).