zyxue / ncbitax2lin

🐞 Convert NCBI taxonomy dump into lineages
MIT License
140 stars 29 forks source link

How use your script to trace protein ID based on its tax ID? #25

Open Xueliang24 opened 1 year ago

Xueliang24 commented 1 year ago

Hi, I am a PhD student and use the ncbitax2lin recently. I want to trace the proten ID with NCBI taid, I have download prot.accession2taxid and get the second column and third column (they are gene ID and tax ID), I also downloaded the RefSeq-release215.catalog, because I saw a method the two files can get the lineage information. I also tried use prot.accession2taxid and name.dmp or lineages.csv to get the information according to the FAQ. Maybe it was failed due to the format of these files?

zyxue commented 1 year ago

ncbitax2lin currently can only generate lineage info. Not sure about protein IDs.

Xueliang24 commented 1 year ago

ncbitax2lin currently can only generate lineage info. Not sure about protein IDs.

Or could you tell me how to realize the FAQ?Just like ncbitax2lin ncr_accession2taxid names.dmp.I would like to change the format of these files