GenSpectrum / cov-spectrum-website

A web platform to detect and analyze variants of SARS-CoV-2
https://cov-spectrum.org
GNU General Public License v3.0

ENH: Submission date filter, submission delay filter and clock-filter metric #610

Open corneliusroemer opened 1 year ago

corneliusroemer commented 1 year ago

Date errors are a big problem when investigating when the first sequence of a variant was found. When there are millions of Omicrons, the earliest ones are inevitably data entry errors. See this: there are 700 Omicrons dated before there could possibly have been one.

image

It would be great if we could filter these erroneous date entries out somehow.

A few possibilities exist:

Submission date and delay filtering should be fairly easy to do with LAPIS as is. It would be great to have that, and it would make covSpectrum even more useful ❤️

chaoran-chen commented 1 year ago

Thanks for the suggestions, @corneliusroemer! For the submission date filter, we already have #581 but the other aspects are new!

corneliusroemer commented 1 year ago

In particular, the submission delay filter could be super useful, as its threshold doesn't need to be readjusted for different time periods. The default could be something like 6 months, which would keep most sequences but get rid of most date errors :)
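To make the idea concrete, here is a minimal sketch of a submission delay filter. It is not LAPIS or covSpectrum code: the `SequenceRecord` shape, the field names, and the `filter_by_submission_delay` helper are all hypothetical, and "6 months" is approximated as 183 days.

```python
from datetime import date, timedelta
from typing import Iterable, List, NamedTuple


class SequenceRecord(NamedTuple):
    # Hypothetical record shape; field names are assumptions for illustration.
    name: str
    collection_date: date
    submission_date: date


def filter_by_submission_delay(
    records: Iterable[SequenceRecord],
    max_delay_days: int = 183,  # assumed stand-in for "6 months"
) -> List[SequenceRecord]:
    """Keep records submitted within max_delay_days of their collection date."""
    max_delay = timedelta(days=max_delay_days)
    kept = []
    for record in records:
        delay = record.submission_date - record.collection_date
        # A negative delay (submitted before collected) is also a date error.
        if timedelta(0) <= delay <= max_delay:
            kept.append(record)
    return kept


records = [
    # Plausible entry: submitted 15 days after collection.
    SequenceRecord("seq-ok", date(2021, 11, 25), date(2021, 12, 10)),
    # Likely typo in the year: collection date is 380 days before submission.
    SequenceRecord("seq-wrong-year", date(2020, 11, 25), date(2021, 12, 10)),
    # Impossible entry: collection date is after submission.
    SequenceRecord("seq-negative", date(2022, 1, 5), date(2021, 12, 10)),
]

kept_names = [r.name for r in filter_by_submission_delay(records)]
print(kept_names)
```

Because the threshold is relative to each sequence's own submission date, the same default works for any variant or time window, unlike an absolute date cutoff.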