ksahlin / IsoCon

Derives consensus sequences from a set of long noisy reads by clustering and error correction.
GNU General Public License v3.0

Final candidates header explanation #9

Open ksahlin opened 2 years ago

ksahlin commented 2 years ago

The header has the following format:

> acc + "_" + support + "_" + p_value + "_" + N_t + "_" + delta_size
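
For anyone reading these headers in a script, here is a minimal sketch of how the five fields could be split apart. It is not part of IsoCon: the function name `parse_candidate_header`, the `CandidateHeader` tuple, and the example header string are all hypothetical. It assumes that the four trailing fields contain no underscores (the accession itself may), and that a leading `>` is just the FASTA record marker. Fields are kept as strings so no assumption is made about their numeric formatting.

```python
from typing import NamedTuple


class CandidateHeader(NamedTuple):
    """Hypothetical container for the five underscore-separated fields."""
    acc: str
    support: str
    p_value: str
    n_t: str
    delta_size: str


def parse_candidate_header(header: str) -> CandidateHeader:
    """Split a header of the form acc_support_p_value_N_t_delta_size.

    Splits from the right so that underscores inside the accession are
    preserved. Assumes the last four fields contain no underscores.
    """
    # Drop a leading FASTA '>' marker and surrounding whitespace, if present.
    text = header.lstrip(">").strip()
    parts = text.rsplit("_", 4)
    if len(parts) != 5:
        raise ValueError(f"expected 5 underscore-separated fields: {header!r}")
    return CandidateHeader(*parts)


# Made-up header for illustration only, not real IsoCon output.
fields = parse_candidate_header(">read_42_7_0.01_20_4")
print(fields.acc, fields.support, fields.p_value, fields.n_t, fields.delta_size)
```

Splitting from the right rather than the left is the main design choice here, since read accessions often contain underscores of their own.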

Here,

Example

Suppose there are two clusters, c1 and c2, with 10 and 5 reads respectively, and their total distance is 3 (e.g., one SNP and one indel of length 2). Let's say you are testing whether the SNP and deletion that c2 has with respect to c1 are statistically supported, and the hypothesis test comes back significant (i.e., the variants are statistically supported). Then you will have in c2's header: