Closed Runningchuan closed 5 years ago
Dear @Runningchuan,
Please note that find_circ2 is still unpublished and cannnot be considered ready-to-use as a black-box tool. Most of the documentations still refers to find_circ1. This will hopefully change soon but is still the current state. Note that the main difference between find_circ1 and find_circ2 is that the latter is not meant to be run on bowtie2-mapped anchors but (unsorted!) bwa-mem-mapped reads.
Hope this helps to get you started.
Best, Marcel
Hi @mschilli87 Thank you for your reply help me to solve this problem. I try to use find_circ1 to processing my data again,but at the step of find_circ.py have an error:
This is the code and result:
1. bowtie2 \ -p 8 \ --very-sensitive \ --phred64 \ --mm \ --score-min=C,-15,0 \ -x /media/lyc/data/refence/mouse/mm10_bowtie2_index/mm10 \ -q -1 /media/lyc/work/circRNA/CIRI2/con1_1.P.trim.gz \ -2 /media/lyc/work/circRNA/CIRI2/con1_2.P.trim.gz \ 2>con1_bowtie2_mapp.result.txt\ | samtools view -hbuS \ | samtools sort \ | samtools view -hf 4\ | samtools view -Sb \
con1_unmap_sort.bam
This is the bowtie2 log:
55092837 reads; of these:
55092837 (100.00%) were paired; of these:
14453309 (26.23%) aligned concordantly 0 times
33928030 (61.58%) aligned concordantly exactly 1 time
6711498 (12.18%) aligned concordantly >1 times
----
14453309 pairs aligned concordantly 0 times; of these:
1397170 (9.67%) aligned discordantly 1 time
----
13056139 pairs aligned 0 times concordantly or discordantly; of these:
26112278 mates make up the pairs; of these:
18712081 (71.66%) aligned 0 times
6859099 (26.27%) aligned exactly 1 time
541098 (2.07%) aligned >1 times
83.02% overall alignment rate
2. unmapped2anchors.py script
python unmapped2anchors.py \
con1_unmap_sort.bam \
| gzip \
>con1_anchors.fastq.gz
3. Then bowtie2 the con1_anchors.fastq.gz
bowtie2 \
-p 8 \
--score-min=C,-15,0 \
--reorder \
--mm \
-q \
-U con1_anchors.fastq.gz \
-x /media/lyc/data/refence/mouse/mm10_bowtie2_index/mm10 \
2>second_bowtie.log \
>con1_anchonor_map.bam
This is the second_bowtie2.log:
37424162 reads; of these:
37424162 (100.00%) were unpaired; of these:
6360193 (16.99%) aligned 0 times
16625264 (44.42%) aligned exactly 1 time
14438705 (38.58%) aligned >1 times
83.01% overall alignment rate
4. At last , I use the find_circ.py , but it got an error :
cat con1_anchonor_map.bam \
| ./find_circ.py \
-G /media/lyc/data/refence/mouse/mm10.fa \
--name=con1
The error is below:
ERROR:root:Unhandled exception raised while processing alignment pair 458549, on input starting at SAM line 917098, FASTQ line 3668392
Traceback (most recent call last):
File "./find_circ.py", line 651, in
Then I check the SAM file line 917098 with upper 8 line ,and FASTQ file line 3668392 upper 20lines ,but I found no difference. I don't kown the reason.
This is the SAM file
-- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | --
This is the FASTQ file
-- | --
Could you help me help me to solve this problem?
Thanks!
@Runningchuan: I'm sorry but find_circ1 is not used anymore since many years and I really cannot offer any support for it anymore. We are aware that teh state of find_circ2 is anything but ideal but given the EOL of Python 2.x there is no point in updating the documentation before we finde the time to port it to Python 3.x.
Until then, I highly recommend using find_circ2 as-is in combination with bwa-mem. I could give you further assistence if you get stuck anywhere down that road.
I have finsh my circRNA work with CIRI_v2.0.6, and get nearly 20000 circRNAs from each sample. Then i start my work with find_circ2,but i got no results at the last step. I don't the reason.
This is the code and results i have got.
1.
This is the bowtie2 log:
2.
3.
This is the bowtie2 log:
And this is the run.log in the find_circ_run folder:
the other files such as circ or lin splice_sites.bed and spliced_reads.fastq.gz have no sequences, only have the header.
i don't which step was wrong.
Thanks!
edit (@mschilli87): style