Preseq lc_extrap: Too many defects in the approximation

Preseq `lc_extrap`Approximation Error

Preseq lc_extrap fails when with extremely low read depth, and produces the following error message:

ERROR: too many defects in the approximation, consider running in defect mode.

From a maintainer of preseq:

    Preseq uses the observed duplicate count histogram to extrapolate
    the expected number of new distinct reads will be gained with additional 
    sequencing. This will not work if the duplicate count histogram is not 
    sufficiently full, which may be happening in the 2.5M read samples.

Runnning preseq in defect mode (specified by the -D) solves the problem. Alexei has tested the changes (using the -D flag) by running preseq lc_extrap with good and bad data. With good data resulting NRF values are unchanged, and with bad data preseq does not error out.

CCBR / Pipeliner

Preseq lc_extrap: Too many defects in the approximation #405

Preseq `lc_extrap`Approximation Error

CCBR / Pipeliner

Preseq lc_extrap: Too many defects in the approximation #405

Preseq lc_extrapApproximation Error

Preseq `lc_extrap`Approximation Error