seattleflu / augur-build

the (in development) augur build for understanding influenza dynamics in Seattle
https://seattleflu.org
0 stars 2 forks source link

Filter cluster output #3

Closed trvrb closed 5 years ago

trvrb commented 5 years ago

@miparedes @cassiawag ---

Could you add two command line options to extract_cluster_fastas.py? These are:

I'd suggest doing these as two separate feature branches / PRs.

cassiawag commented 5 years ago

--min-size is done!

trvrb commented 5 years ago

The use of --min-size is causing an issue with Snakemake if cluster0.fasta is deleted. I think I'd modify extract_cluster_fastas.py to renumber cluster FASTAs to run contiguously from cluster1.fasta, cluster2.fasta, etc...

cassiawag commented 5 years ago

Do you mean to renumber them after the small clusters are dropped due to --min-size?

trvrb commented 5 years ago

@cassiawag: Changes in 1ca1ea21bf765e7fb7560175209c5ec817a749fc have obviated the need for this renumbering. We're no longer relying on the existence of cluster0.fasta.

@miparedes: I made some changes to extract_cluster_fastas.py in the above. You may want to move your changes to be on top of the current master branch.

trvrb commented 5 years ago

This is resolved via #4 and #12.