brian-cleary / LatentStrainAnalysis

Partitioning and analysis methods for large, complex sequence datasets
MIT License

Can we have a discussion about selecting the clustering threshold? #12

Open nmb85 opened 8 years ago

nmb85 commented 8 years ago

How is everyone deciding on the clustering threshold for their data? Right now I am running LSA/kmer_cluster_index.py in a loop, stepping the threshold (-t) from 0.70 to 0.99 in increments of 0.01 and appending the number of clusters from each run to numClusters.txt, to see what that curve looks like for my data. I'm not sure where to go from there. Any suggestions?
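One possible next step for a sweep like this is to look for the "knee" of the clusters-versus-threshold curve, i.e. the threshold past which the cluster count starts to climb steeply. The sketch below is a generic elbow heuristic (the point farthest from the straight line joining the two ends of the curve), not something LSA itself provides or recommends. It assumes numClusters.txt holds one integer per line, in the same order as the thresholds; the helper names `load_counts` and `find_knee` are made up for illustration.

```python
def load_counts(path):
    """Read one cluster count per line (assumed file layout, not an LSA format)."""
    with open(path) as handle:
        return [int(line) for line in handle if line.strip()]


def find_knee(thresholds, counts):
    """Return the threshold whose point lies farthest from the chord joining
    the first and last points of the curve, after scaling both axes to [0, 1]."""
    if len(thresholds) != len(counts) or len(thresholds) < 3:
        raise ValueError("need at least three matching (threshold, count) pairs")
    x0, x1 = thresholds[0], thresholds[-1]
    y_min, y_max = min(counts), max(counts)
    if x0 == x1 or y_min == y_max:
        raise ValueError("curve is flat; there is no knee to find")

    # Scale both axes so neither dominates the distance calculation.
    xs = [(t - x0) / (x1 - x0) for t in thresholds]
    ys = [(c - y_min) / (y_max - y_min) for c in counts]

    # Perpendicular distance of each point from the chord between the end points.
    dx, dy = xs[-1] - xs[0], ys[-1] - ys[0]
    norm = (dx * dx + dy * dy) ** 0.5

    def distance(i):
        return abs(dy * (xs[i] - xs[0]) - dx * (ys[i] - ys[0])) / norm

    best = max(range(len(xs)), key=distance)
    return thresholds[best]


# Usage sketch for the sweep described above (0.70 to 0.99 in steps of 0.01):
# thresholds = [round(0.70 + 0.01 * i, 2) for i in range(30)]
# counts = load_counts("numClusters.txt")
# print(find_knee(thresholds, counts))
```

The knee is only a starting point: it says where the curve bends, not whether the resulting partitions are biologically sensible, so it is worth inspecting cluster sizes at a few thresholds on either side of it.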