Open ArtPoon opened 9 months ago
First steps would be to investigate how feasible this would be to build and maintain
Omit rows from *.mapped.csv
inputs where the mutation was observed in one read only, i.e., frequency * coverage = 1
. This will exclude roughly one-fifth of rows from the database
The script populating the database died on Paphlagon
Please refer to collate_mapped.py
and aggregate_mapped.py
regarding target outputs
The resulting database would be enormous, though. It might be more efficient to just use
grep
across outputs.