Of the 2,820,860 sequences in the v16.0 FASTA file, 2,811,816 have headers containing what appears to be a date, such as 25-AUG-2016, instead of the virus name or description. Only the remaining 9,044 sequences have what appear to be regular names.
For example:
>AB504233.1 25-AUG-2016
This record has the name "Sapovirus Tamagawa River/Site2_a/Nov2003/JP gene for capsid protein, partial cds" on NCBI (https://www.ncbi.nlm.nih.gov/nuccore/AB504233.1). I'm not sure where the 25-AUG-2016 comes from.
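In case it helps reproduce the counts, here is a rough sketch of how such headers could be tallied. It assumes each header has the form `>ACCESSION DESCRIPTION` and treats a description as "date-like" when it consists only of a DD-MON-YYYY token. The function name `count_date_headers` and the regular expression are my own illustration, not part of any database tooling.

```python
import re
from typing import Iterable, Tuple

# Assumption: a "date-like" description is exactly one DD-MON-YYYY token,
# e.g. "25-AUG-2016", with nothing else after the accession.
DATE_ONLY = re.compile(r"^\d{2}-[A-Z]{3}-\d{4}$")


def count_date_headers(lines: Iterable[str]) -> Tuple[int, int]:
    """Return (total_headers, date_only_headers) for FASTA-formatted lines."""
    total = 0
    date_only = 0
    for line in lines:
        if not line.startswith(">"):
            continue  # sequence data, not a header
        total += 1
        # Split "accession description" on the first run of whitespace.
        parts = line[1:].strip().split(None, 1)
        description = parts[1] if len(parts) > 1 else ""
        if DATE_ONLY.match(description):
            date_only += 1
    return total, date_only
```

Passing an open file handle for the FASTA file to `count_date_headers` should give the total number of headers and the number whose description is only a date, which can then be compared with the figures above.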