jeromekelleher / sc2ts

Infer a succinct tree sequence from SARS-COV-2 variation data

Add date_summary data frame #97

Open jeromekelleher opened 1 year ago

jeromekelleher commented 1 year ago

Add a function that returns a dataframe summarising each day of data, including:

  1. Number of samples
  2. Number of recombinants
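
A minimal sketch of the two per-day counts listed above. It is not the sc2ts API: the `nodes` records and their field names (`date`, `is_sample`, `is_recombinant`) are hypothetical stand-ins for whatever the node metadata actually provides, and it uses only the standard library.

```python
from collections import defaultdict

# Hypothetical node records. The field names are assumptions made for
# illustration; they are not taken from the sc2ts metadata schema.
nodes = [
    {"date": "2020-03-01", "is_sample": True, "is_recombinant": False},
    {"date": "2020-03-01", "is_sample": True, "is_recombinant": False},
    {"date": "2020-03-02", "is_sample": True, "is_recombinant": False},
    {"date": "2020-03-02", "is_sample": False, "is_recombinant": True},
    {"date": "2020-03-02", "is_sample": True, "is_recombinant": False},
]


def date_summary(nodes):
    """Return one row (a dict) per day with sample and recombinant counts."""
    counts = defaultdict(lambda: {"samples": 0, "recombinants": 0})
    for node in nodes:
        day = counts[node["date"]]
        if node["is_sample"]:
            day["samples"] += 1
        if node["is_recombinant"]:
            day["recombinants"] += 1
    # ISO-formatted date strings sort chronologically.
    return [{"date": date, **counts[date]} for date in sorted(counts)]


summary = date_summary(nodes)
```

The rows are plain dicts, so `pandas.DataFrame(summary)` would turn them into a dataframe with `date`, `samples` and `recombinants` columns.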

We should also add summaries of other types of nodes, but I'm not sure if we've been systematic enough about the "data_added" metadata on all nodes.

Any other information we can usefully add about a given day of data should be included.

szhan commented 1 year ago

Maybe also these to help us troubleshoot?

  1. Array of the submission delay of each sample in the daily batch.
  2. Array of the number of mutations explaining each sample (all samples, not just recombinants).
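
A rough sketch of how these two per-day arrays could be collected. The `samples` records and their fields (`date`, `date_submitted`, `num_mutations`) are hypothetical, and it assumes that the daily batch is keyed on `date` and that the submission delay is the number of days from `date` to `date_submitted`.

```python
from collections import defaultdict
from datetime import date

# Hypothetical sample records. The field names are assumptions made for
# illustration; they are not taken from the sc2ts metadata schema.
samples = [
    {"date": "2020-03-01", "date_submitted": "2020-03-11", "num_mutations": 0},
    {"date": "2020-03-01", "date_submitted": "2020-03-04", "num_mutations": 2},
    {"date": "2020-03-02", "date_submitted": "2020-04-01", "num_mutations": 1},
]


def daily_arrays(samples):
    """Return one row per day holding per-sample delays and mutation counts."""
    delays = defaultdict(list)
    mutations = defaultdict(list)
    for sample in samples:
        collected = date.fromisoformat(sample["date"])
        submitted = date.fromisoformat(sample["date_submitted"])
        # Assumed definition: delay in whole days between the two dates.
        delays[sample["date"]].append((submitted - collected).days)
        mutations[sample["date"]].append(sample["num_mutations"])
    return [
        {"date": day, "submission_delay": delays[day], "num_mutations": mutations[day]}
        for day in sorted(delays)
    ]


arrays = daily_arrays(samples)
```

Keeping the raw per-sample values, rather than a mean or median, leaves outliers visible when troubleshooting a particular day.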