uci-cbcl / genomix

Parallel genome assembly using Hyracks

Anbangx/find start seed min length #107

Closed anbangx closed 10 years ago

anbangx commented 10 years ago

@jakebiesinger @JavierJia @Elmira88 @Nan-Zhang

This PR covers findMinScaffoldingMinLength: it splits setCutoffCoverage and findMinScaffoldingMinLength out of the previous PR I sent. HPC is currently broken, and I need more time to test the Gaussian model on real data, so this PR can be used to get startSeed in the meantime.

jakebiesinger commented 10 years ago

One major problem here: we need to condition the length distribution! We want the top 1% of lengths, conditional on their coverage being above some minimum threshold.
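
For illustration, a minimal Python sketch of that conditioning. The function name, the `(length, coverage)` input format, and the tie handling are assumptions made for this example, not the code in this PR: filter nodes by coverage first, then take the length cutoff over the survivors only.

```python
import math


def min_length_for_top_fraction(nodes, min_coverage, top_fraction=0.01):
    """Length cutoff for the top `top_fraction` of nodes by length,
    conditional on coverage > `min_coverage`.

    `nodes` is an iterable of (length, coverage) pairs (hypothetical format).
    Returns None when no node passes the coverage filter.
    """
    # Condition first: drop every node at or below the coverage threshold.
    lengths = sorted(
        (length for length, coverage in nodes if coverage > min_coverage),
        reverse=True,
    )
    if not lengths:
        return None
    # Keep at least one node so small inputs still yield a cutoff.
    keep = max(1, math.ceil(len(lengths) * top_fraction))
    # The shortest length among the kept nodes is the minimum seed length.
    return lengths[keep - 1]
```

Computing the percentile over all nodes and filtering by coverage afterwards would answer a different question: a long, low-coverage node could set the cutoff even though it would never qualify as a seed.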

anbangx commented 10 years ago

@jakebiesinger Refactored based on your feedback.