Open smehringer opened 1 year ago
Hello,
We built an index over RefSeq genomes. The downloaded filenames are named like this:
/path/GCF_000019125.1_ASM1912v1_genomic.fna.gz
/path/GCF_000019165.1_ASM1916v1_genomic.fna.gz
...
When searching the index, the result looks as follows:
*query1 XXX
GCF_000019125 XXX
GCF_000019165 XXX
...
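Luckily for us, the names are still unique after truncation, so with some effort we should be able to match the output against our file list and reconstruct the full reference names.
However, this output format is lossy: everything after the first dot in the filename is dropped. If the names were not unique before the first dot, several references would be reported under the same name, which might even lead to severe false negatives if the user does not notice.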
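To illustrate the concern, here is a minimal Python sketch. It assumes that the reported name is the file's basename cut at the first dot, which is inferred from the output above and not confirmed from the tool's source. The helper names (`truncated_name`, `build_lookup`, `find_collisions`) and the `.2` filename are hypothetical and only serve the illustration.

```python
from collections import defaultdict
from pathlib import Path


def truncated_name(path: str) -> str:
    """Assumed model of the observed behaviour: basename up to the first dot."""
    return Path(path).name.split(".", 1)[0]


def build_lookup(paths):
    """Map each truncated name to every full filename that produces it."""
    lookup = defaultdict(list)
    for p in paths:
        lookup[truncated_name(p)].append(Path(p).name)
    return dict(lookup)


def find_collisions(lookup):
    """Return only the truncated names shared by more than one file."""
    return {name: files for name, files in lookup.items() if len(files) > 1}


# The two filenames from this report: truncation keeps them distinct.
paths = [
    "/path/GCF_000019125.1_ASM1912v1_genomic.fna.gz",
    "/path/GCF_000019165.1_ASM1916v1_genomic.fna.gz",
]
lookup = build_lookup(paths)
print(find_collisions(lookup))

# Hypothetical second version of the same accession: now two files collapse
# onto one reported name and can no longer be told apart in the output.
paths_with_clash = paths + ["/path/GCF_000019125.2_hypothetical_genomic.fna.gz"]
print(find_collisions(build_lookup(paths_with_clash)))
```

Running a check like this over the input file list before building the index would show whether the truncated names can still be mapped back unambiguously; `lookup` itself can then be used to reconstruct the full names from the search output.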
Best, Svenja
Thanks for pointing this out, @smehringer. I don't understand why I didn't get notified of your comment. I will follow this up, but Leandro has left the project, so there will be a delay.