Hmm, does this mean we could be failing to ingest some other quite important data? (Not a criticism, just for my understanding. I guess I previously thought we were capturing 100% of the data.)
(And it's a genuine question: maybe this is the only thing that we are not capturing.)
in #2832
Ingest currently only knows about things that NCBI Virus emits. It is known that there is (sometimes) more data available in individual GenBank files that is not (yet) parsed by NCBI Virus.
We could manually request those individual files in ingest and parse out extra metadata - this is not urgent but nice to have, depending on how much extra metadata we could be pulling in like this.
Agree that it would be amazing to have. In particular, sequencing technologies and full author names would be cool! But I also agree that it's not urgent.
@theosanderson raised a good point:
in #2832
Ingest currently only knows about things that NCBI Virus emits. It is known that there is (sometimes) more data available in individual GenBank files that is not (yet) parsed by NCBI Virus.
We could manually request those individual files in ingest and parse out extra metadata - this is not urgent but nice to have, depending on how much extra metadata we could be pulling in like this.
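To make the idea a bit more concrete, here is a rough sketch of what "request the individual file and parse out extra metadata" could look like. It is not how ingest works today. It assumes the record is fetched as a GenBank flat file through the NCBI E-utilities `efetch` endpoint, and that the extra fields of interest are the `AUTHORS` block of the first reference and any `key :: value` lines in a structured comment (where a sequencing technology is sometimes recorded). The function names `genbank_url` and `parse_extra_metadata` and the accession `XX000000` are made up for illustration.

```python
import re
from urllib.parse import urlencode

# NCBI E-utilities efetch endpoint; returns a GenBank flat file for a nuccore ID.
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"

# Structured-comment lines look like "            Sequencing Technology :: Illumina".
STRUCTURED_FIELD = re.compile(r"^\s+(.+?)\s+::\s+(.+)$")


def genbank_url(accession: str) -> str:
    """Build the efetch URL for one record (hypothetical helper, not part of ingest)."""
    query = urlencode(
        {"db": "nuccore", "id": accession, "rettype": "gb", "retmode": "text"}
    )
    return f"{EFETCH_URL}?{query}"


def parse_extra_metadata(record: str) -> dict:
    """Extract the first AUTHORS block and structured-comment fields from a flat file."""
    author_parts: list[str] = []
    structured: dict[str, str] = {}
    in_authors = False
    authors_done = False

    for line in record.splitlines():
        if line.startswith("  AUTHORS") and not authors_done:
            # Start of the first reference's author list.
            in_authors = True
            author_parts.append(line[len("  AUTHORS"):].strip())
        elif in_authors and line.startswith(" " * 12):
            # Continuation lines are indented to column 13.
            author_parts.append(line.strip())
        else:
            if in_authors:
                in_authors = False
                authors_done = True
            match = STRUCTURED_FIELD.match(line)
            if match:
                structured[match.group(1).strip()] = match.group(2).strip()

    author_text = " ".join(author_parts)
    # Names are written "Surname,Initials", separated by ", " and a final " and ".
    authors = [a for a in re.split(r",\s+|\s+and\s+", author_text) if a]
    return {"authors": authors, "structured_comment": structured}
```

The parser takes the record text as a string, so it can be tried on saved files without any network access. Note that the `AUTHORS` field in a flat file is usually surname plus initials, so how much this adds over what NCBI Virus already emits would need checking on real records.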