katerinakazantseva / strainy

Graph-based assembly phasing

Skip / downsample unitigs with very high coverage #52

Closed by mikolmogorov 6 months ago

mikolmogorov commented 1 year ago

Some graph unitigs may turn out to be highly repetitive, and the phasing step never finishes on them. For example, in the nano_uni_5 dataset, using the unpolished metaFlye graph, the coverage of edge_208 is 900+.

One way to handle this is to downsample the reads on any unitig whose coverage exceeds 100x. Or maybe use a relative threshold instead, such as 2 x mean_dataset_coverage?
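A rough sketch of what the downsampling could look like, to make the two threshold options concrete. This is only an illustration and not Strainy code: the function names `coverage_cap` and `downsample_reads`, the read-ID list, and the fixed random seed are all hypothetical, and it assumes per-unitig coverage and the list of reads aligned to the unitig are already available.

```python
import random


def coverage_cap(mean_dataset_coverage, fixed_cap=100.0, mean_multiplier=None):
    """Pick the coverage above which a unitig gets downsampled.

    Hypothetical helper: uses the fixed cap (100x) by default, or
    mean_multiplier * mean_dataset_coverage when a multiplier is given.
    """
    if mean_multiplier is not None:
        return mean_multiplier * mean_dataset_coverage
    return fixed_cap


def downsample_reads(read_ids, unitig_coverage, cap, seed=0):
    """Randomly keep about cap / unitig_coverage of the reads on one unitig.

    Unitigs at or below the cap are returned unchanged. The seed is fixed
    so that repeated runs select the same reads.
    """
    read_ids = list(read_ids)
    if unitig_coverage <= cap:
        return read_ids
    # Scale the read count so the expected coverage drops to roughly `cap`.
    keep = max(1, round(len(read_ids) * cap / unitig_coverage))
    rng = random.Random(seed)
    return sorted(rng.sample(read_ids, keep))


if __name__ == "__main__":
    # Toy stand-in for a 900x unitig such as edge_208: 900 made-up read IDs.
    reads = [f"read_{i}" for i in range(900)]
    cap = coverage_cap(mean_dataset_coverage=30.0)  # fixed 100x cap
    kept = downsample_reads(reads, unitig_coverage=900.0, cap=cap)
    print(len(kept))
```

Uniform random sampling is the simplest choice here; it ignores read length and position, so the resulting coverage is only approximately at the cap.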

mikolmogorov commented 6 months ago

Added in rc_2024