Closed JinxinMonash closed 4 years ago
Thanks for opening this issue! A couple questions:
1) Did things run successfully on the test dataset?
2) Can you list the final directory structure of the working directory after the error?
3) Can you count the number of lines in the output files and list them for me with the wc -l
command?
Based on what I am seeing here, it looks like no termini were detected, which means MGEfinder probably can't help you. This may be because the samples are too similar to the reference.
Sorry about that, hopefully we'll figure out what's wrong, but it looks like there just may not be any insertions. I should account for this in the workflow.
Thank you very much for the quick response!
I do run mgefinder find xxxx.bam for the same sample successful. I have also attached the output file.
From this output file, there are several insertions. Please correct me if I was wrong.
Best, Jason
Great, thanks for doing that.
Looks like it found some potential insertion termini, but it wasn't able to recover the full sequences from the assembly files or the reference itself, meaning they may be false positives.
Could you run
ls ~/Jinxin/dataSet2/workdir/01.mgefinder/*/*
and
wc -l ~/Jinxin/dataSet2/workdir/01.mgefinder/*/*
And tell me the output?
Thanks.
Thank you very much. Sure, pls kindly find them as attached.
Great, sorry one more thing, try running:
wc -l ~/Jinxin/dataSet2/workdir/01.mgefinder/*/*/*
Thanks.
No worries.
Thank you very much.
Yeah, as I suspected, MGEfinder was not able to find any insertions using these two files. It identifies candidate insertions in the *find.*.AB5075UW.tsv
files, but then it is not able to determine the full identity of the inserted element, if it exists at all. The insertion flanks identified in the find
files could just as easily be inversions, for example, which MGEfinder is not designed to identify in the subsequent steps.
If you have more isolates, I would recommend including those in the working directory as well, because this should increase sensitivity. But from what I can see, it can't detect any large insertions with these isolates.
Sorry about that! Good luck.
Thank you very much, that helps a lot. Yea, it might be due to the low mutation frequency (~0.1-0.3). I will include more and have another test.
Best, Jinxin
Hi all,
I would like to use MGEfinder to detect insertion sequence using the whole workflow. I prepared the dics (00.genome, 00.bam, 00.assembly)and files (sample.refer.bam etc) following the tutorial. However, when I run mgefinder workflow /workdir I get the error below. It would be highly appreciated if anyone could help with it. Thank you very much in advance.
Best, Jason