Closed rania-o closed 2 years ago
Hi @rania-o,
Sorry for the delayed reply!
The default --readcount_min
is 1 for xpore dataprep
.
If the kmer and model_kmer columns of a site in the eventalign.txt
from nanopolish
do not match, xpore
drops those sites, which may explain the drop from 20k to 998.
One potential reason why you get KeyError: 'GAA'
from xpore may be that your eventalign.txt
is truncated, and you can check it by tail eventalign.txt
to see whether all columns of the last line is present.
Thanks!
Best wishes, Yuk Kei
Hello @yuukiiwa
Thank you for your answer. I've checked my eventalign file and indeed the columns kmer and model kmer are absent for some lines. Here is an example :
contig position reference_kmer read_index strand event_index event_level_mean event_stdv event_length model_kmer model_mean model_stdv standardized_level start_idx end_idx dystro-oligo 0 GCCAA 0 t 4 78.27 1.380 0.01328 GCCAA 73.26 2.11 1.75 28133 28173 dystro-oligo 1 CCAA 0 t 5 92.55 2.484 0.00465 CCAA 87.19 3.02 1.31 28119 28133 dystro-oligo 1 CCAA 0 t 6 95.85 1.139 0.00465 CCAA 87.19 3.02 2.12 28105 28119 dystro-oligo 2 CAA 0 t 7 98.24 2.083 0.00564 CAA 105.72 2.68 -2.06 28088 28105 dystro-oligo 2 CAA 0 t 8 96.94 3.260 0.00598 CAA 105.72 2.68 -2.42 28070 28088 dystro-oligo 3 AA 0 t 9 122.06 1.616 0.00299 AA 108.90 2.68 3.63 28061 28070 dystro-oligo 4 A 0 t 10 119.61 3.024 0.01295 A 108.90 2.68 2.95 28022 28061 dystro-oligo 4 A 0 t 11 101.06 3.389 0.00996 A 108.90 2.68 -2.16 27992 28022 dystro-oligo 5 0 t 12 98.53 1.871 0.00730 108.90 2.68 -2.86 27970 27992
How can I fix this please ? Rania
Hi @yuukiiwa
It turns out my eventalign file was truncated as you said. I fixed it by converting U to T in the reference sequence, and it worked.
Thank you for your help.
Hi,
I've already run the dataprep command and I got results. This is what my read count file contains :
I have more than 20 000 reads mapped to my reference, Is there a coverage filter or something that makes the number of reads drop to 998 ?
Also when I run the diff_mod with the correct dataprep results, I get this error message, and my terminal is frozen at this step and I have to control+C to exit Xpore and get back on it. :
Thanks, Rania