lutteropp / hakmer-ng-redesign

0 stars 0 forks source link

Plot sequence data usage for each minimum seed size if we wouldn't care about overlaps/ reusage of sequence data #56

Closed lutteropp closed 5 years ago

lutteropp commented 5 years ago

And use it as a way for guessing a good initial value for minK

lutteropp commented 5 years ago

Tried this, didn't work well as an estimate. Instead, what worked better was reducing the tail at the end of the seed size distribution, such that the elbow criterion gets less confused.