luizirber / 2020-cami

Preparing sourmash for CAMI 2 evaluations

Better name -> taxid assignment #3

Open luizirber opened 4 years ago

luizirber commented 4 years ago

The RefSeq DB was built from signatures calculated with --name-from-first, and the signature name is used to look up the taxid for that signature. Some signatures end up without a taxid, because the first record name is sometimes not in the accession2taxid file provided by CAMI.
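To make the failure mode concrete, here is a minimal sketch of the lookup, not the code used in this repo. It assumes the signature name is the first FASTA record's header, that the accession is its first whitespace-delimited token, and that the accession2taxid file is tab-separated with the accession in the first column and the taxid in the second (the real CAMI file may be laid out differently). The function names, accessions, and organism names are made up for illustration.

```python
import csv
import io


def load_accession2taxid(handle):
    """Read an accession -> taxid table.

    Assumed layout (hypothetical): tab-separated, accession in column 1,
    taxid in column 2. Rows without a numeric taxid (e.g. a header) are skipped.
    """
    mapping = {}
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) < 2 or not row[1].isdigit():
            continue
        mapping[row[0]] = int(row[1])
    return mapping


def taxid_for_name(sig_name, acc2taxid):
    """Look up the taxid using the first token of the signature name.

    Returns None when the name is empty or the accession is not in the table.
    """
    parts = sig_name.split()
    if not parts:
        return None
    return acc2taxid.get(parts[0])


# Made-up stand-in for the accession2taxid file.
table = io.StringIO("ACC_0001.1\t562\nACC_0003.1\t1280\n")
acc2taxid = load_accession2taxid(table)

# Made-up signature names, as if taken from the first record of each genome.
names = [
    "ACC_0001.1 hypothetical organism A chromosome",
    "ACC_0002.1 hypothetical organism B plasmid",  # first record not in the table
]

assigned = {name: taxid_for_name(name, acc2taxid) for name in names}
missing = [name for name, taxid in assigned.items() if taxid is None]

print(f"{len(missing)} of {len(names)} signatures have no taxid")
for name in missing:
    print("missing:", name)
```

Running something like this over the real signature names would give a list of the signatures that need a different assignment strategy.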

Possible solutions: