Closed sidizhao closed 2 years ago
The error correction step by lordec is very slow, it may need some more time to finish. Lordec only use the first short-read file to name the generated graph file (.h5), but it should contain all the short-read sequence information.
The 'pacbioFile' message was from the lordec program . This error correction program was initially designed for pacbio long reads, but it also shows good performance on nanopore reads. There isn't any option to specify the read type for lordec, so basically you are doing the right thing.
Yan
Thank you so much. As of now, it took more than 21 hours but the program is moving forward. It’s looking for TRFs now. I think it’s functioning well.
Hi there,
I've been able to run isoCirc with
isocirc -t 1 $PATH/FAQ07459_pass_a4f2108a.fastq $PATH/all-chrs.fa $PATH/hg38_ref_all.gtf $PATH/annotation.bed $PATH/isocirc_output
and get intended results. However, when I tried to add the--short
parameter to the run command, it seems to me that there is either a really long run time, or I'm not running it correctly.I have 2 sets of pair-end short-read sequencing data, and here's the command I used:
isocirc -t 1 --short-read $PATH/TruSeq_R1.fastq,$PATH/TruSeq_R2.fastq,$PATH/New_England_R1.fastq,$PATH/New_England_R2.fastq $PATH/FAQ07459_pass_a4f2108a.fastq $PATH/all-chrs.fa $PATH/hg38_ref_all.gtf $PATH/annotation.bed $PATH/isocirc_outputP_short_read
When I look into the log of the run, it shows this (path edited and omitted similar lines for simplicity):
It seems to me that only one of the short-read files were used, and there hasn't been any more lines printed to the output file. It still says the job is running, but I don't see it proceeding to the next step (finding TRFs).
I also have a question. With this line
illumina: $PATH/TruSeq_R1.fastq,$PATH/TruSeq_R2.fastq,$PATH/New_England_R1.fastq,$PATH/New_England_R2.fastq $PATH/HCT116_Illumina_TruSeq_R1.fastq_multi_k21_s3.h5 pacbioFile: $PATH/FAQ07459_pass_a4f2108a.fastq kmer_len: 21 solid_kmer_thr: 3
it looks like it's taking my long read data as PacBio generated. Mine is actually nanopore. Is there anywhere I can specify that?Really appreciate your help! Please advise me on what I should do next.