Open ZainabKamel opened 4 years ago
We then added to it trimmomatic SE -phred33 ~/Desktop/PROJECT/SAMPLE_DATA/hs_T47D_shCTRL_RNAseq_rep1.fastq.gz results.fq.gz ILLUMINACLIP:TRUSeq-SE:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36 HEADCROP:13 CROP:61
The last command allowed us to crop the bases with problems from the start and end
After looking at the QC report, we get the best parameters to modify our data that are mentioned above in the command after trying the parameters in Galaxy which are:
Maximum mismatch count which will still allow a full match to be performed=2 How accurate the match between the two 'adapter ligated' reads must be for PE palindrome read alignment=30 How accurate the match between any adapter etc. sequence must be against a read=10 Minimum length of adapter that needs to be detected (PE specific/palindrome mode)=8
Number of bases to average across=4 Average quality required=15
Minimum quality required to keep a base=3
Minimum quality required to keep a base=3
Minimum length of reads to be kept=36
Number of bases to remove from the start of the read=13
Number of bases to keep from the start of the read=61
We figured out it didn't work with us because we are not working on paired end reads