--rcluster-max default - Githubissues

brettc / partitionfinder

PartitionFinder discovers optimal partitioning schemes for DNA sequences.

Other

60 stars 42 forks source link

The current default behavior seems to be correct. I let PF2 run without specifying rcluster-max in the command line and it was suggesting schemes with 9K and 7K subsets in steps 1 and 2, respectively, and it is still going. Running PF2 with rcluster-max set at 1000 or 50 in the command line ends up with a final scheme with 12K subsets, hundreds less, but still too many. This program needs to be sped up though, using MPI parallelization, for use with large datasets because my PF2 run is now on its fourth day and will probably run for at least a few more; whereas running unpartitioned ML analysis with bootstrapping on the same dataset took only a little more than 2 days with ExaML.

brettc / partitionfinder

--rcluster-max default #130