TYMichaelsen / covid19

Bioinformatic pipelines used to assemble SARS-CoV-2 genomes from DK
GNU General Public License v3.0
3 stars 3 forks source link

Create data cleansing script #20

Closed tomersagi closed 4 years ago

tomersagi commented 4 years ago
tomersagi commented 4 years ago

List: https://github.com/KasperSkytte/covid19/wiki/data_cleansing

biocyberman commented 4 years ago

The team at nextrain already did something for this purpose. May be we can reuse and extend?

tomersagi commented 4 years ago

@biocyberman thanks, but these things are very specific to your schema and data sources. I have already finished most of the work. Just have some common sense tests left to code in.

biocyberman commented 4 years ago

@tomersagi Thought it's also gisaid data, but great that you got some progress already.