Closed adhamzul closed 2 years ago
Hej Adham,
The file "cdna_classifier_report.tsv " is not linked in the process preprocess_reads, when use_pychopper is set to false. You can add a ln to it after line 102 in main.nf to fix this.
Best! Philipp
Hello Philipp,
Thank you for the reply.
I am sorry but I don't really get what you mean...
I checked main.nf and line 102 is ln -s
realpath input_reads.fqfull_length_reads.fq
.
93 script:
94 """
95 fastcat -s ${sample_id} -r ${sample_id}.stats -x ${directory} > input_reads.fq
96
97 if [[ ${params.use_pychopper} == true ]];
98 then
99 cdna_classifier.py -t ${params.threads} ${params.pychopper_opts} input_reads.fq full_length_reads.fq
100 generate_pychopper_stats.py --data cdna_classifier_report.tsv --output .
101 else
102 ln -s `realpath input_reads.fq` full_length_reads.fq
103 fi
104 """
Where do I need to insert ln
exactly?
Regards, Adham
Hej,
A rather hackish way to solve this is to clone the repo and run it locally. Then you can edit the main.nf file:
realpath input_reads.fq
full_length_reads.fq" command:
ln -s ${mytsv} cdna_classifier_report.tsvThis will make the tsv available to the pipeline. Hope this helps. Philipp
Hello!
I did direct RNA sequencing last week using MinION and I basecalled the data using Guppy's RNA config. I have the sample's full genome (fasta) and its annotation (GFF3) as well as 1.4 million basecalled reads in a single fastq.gz.
I first ran the workflow with the following command as a test.
nextflow run wf-isoforms --fastq input-rna/RNA.fastq.gz --ref_genome input-genome/genome.fasta --ref_annotation input-gff/annotations.gff --out_dir outdir/ -profile conda
The workflow completed successfully. But, when I check the transcriptomes assembled, there were only eight sequences. I figured that pychopper scans for primer used when generating cDNA and that is also why almost all of my reads were categorized as 'Unusable' in cdna_classifier_report.tsv.
When I ran the following command, I get an error saying that the workflow expected a
cdna_classifier_report.tsv
.Command:
nextflow run wf-isoforms --use_pychopper false --fastq input-rna/RNA.fastq.gz --ref_genome input-genome/genome.fasta --ref_annotation input-gff/annotations.gff --out_dir outdir_2/ -profile conda
Output:I don't see any options to skip the preprocess_reads step and I am stuck.
Any ideas to what I am doing wrong? Any insights would be really great!
Adham