Open russellxie opened 1 year ago
Also paste my yaml file here
sequence_files:
file1:
name: /gne/data/lab-shares/xie-lab/Sequencing_Data/2023/fastq/20230830_smartseq_test2/20230829_run/Undetermined_S0_R1_001.fastq.gz
base_definition:
- cDNA(23-75)
- UMI(12-19)
find_pattern: ATTGCGCAATG
file2:
name: /gne/data/lab-shares/xie-lab/Sequencing_Data/2023/fastq/20230830_smartseq_test2/20230829_run/Undetermined_S0_I1_001.fastq.gz
base_definition:
- BC(1-8)
file3:
name: /gne/data/lab-shares/xie-lab/Sequencing_Data/2023/fastq/20230830_smartseq_test2/20230829_run/Undetermined_S0_I2_001.fastq.gz
base_definition:
- BC(1-8)
reference:
STAR_index: /gne/data/lab-shares/xie-lab/RefGenome/star_2.7.9/
GTF_file: /gne/data/lab-shares/xie-lab/RefGenome/raw_files/hg38/Homo_sapiens.GRCh38.93.gtf
additional_STAR_params: '--clip3pAdapterSeq CTGTCTCTTATACACATCT --clip3pAdapterMMp 0.1'
out_dir: /gne/data/lab-shares/xie-lab/Sequencing_Data/2023/mapping/20230830_smartseq3_run2/mapping_test
num_threads: 20
mem_limit: 50
filter_cutoffs:
BC_filter:
num_bases: 3
phred: 20
UMI_filter:
num_bases: 2
phred: 20
barcodes:
barcode_num: ~
barcode_file: /gne/data/lab-shares/xie-lab/Sequencing_Data/2023/mapping/20230830_smartseq3_run2/expected_barcodes.txt
automatic: no
BarcodeBinning: 1
nReadsperCell: 100
demultiplex: no
counting_opts:
introns: yes
downsampling: '10000'
strand: 0
Ham_Dist: 1
write_ham: no
velocyto: no
primaryHit: yes
twoPass: yes
make_stats: yes
which_Stage: Counting
samtools_exec: samtools
pigz_exec: pigz
STAR_exec: STAR
Rscript_exec: Rscript
read_layout: SE
Hello,
I got the following error in the Counting stage.
When I trace back this to the original code, it seems to be happening when it's trying to combine the barcode chunks (as shown below).
I am using the known barcode list so a few barcodes got very few reads. Could that be the problem?