Open minhtien-trinh opened 3 months ago
Hi, the reason for this error may be that your fastq files were not read correctly or were incomplete. For example, in the following situations: 1) They are soft links 2) The content stored in fastq are incorrect, as shown below:
3)The fastq files are incomplete, for example, zcat 1. fq. gz | tail
will prompt 'unexpected end'
Could you find the *. bcPara in 00.mapping and check if 'in1' and 'in2' correctly record the path to fastq files, and checking the integrity like MD5 value?
Thank you for the fast response. Here is the output for the .bcPara file:
in=/home/minh/Chip_C.barcodeToPos.h5
in1=/home/minh/ChipC_R1.fastq.gz
in2=/home/minh/ChipC_R2.fastq.gz
barcodeReadsCount=/home/minh/st_test/results/00.mapping/ChipC_R1.barcodeReadsCount.txt
barcodeStart=0
barcodeLen=25
umiStart=25
umiLen=10
mismatch=1
bcNum=599522906
polyAnum=15
mismatchInPolyA=2
The path to the fastq files seems to be correct from what I can tell.
The files look fine to me but I'm uncertain if the content is stored correctly. Do you know if the tail for the fastq files is correct?
I have the exact same problem. I've checked my R1.fastq.gz & R2.fastq.gz following what minhtien-trinh
mentioned above and confirmed my R1&R2 are integrate and are not soft links. What happened in the alignment? I've run SAW7.1 for multiple times but never met this error before.
This is the first time I'm using SAW but my guess is that my RNA-seq reads are too long (151 bp). In this paper where they used SAW as well the reads are 35/100 bp long.
This is the first time I'm using SAW but my guess is that my RNA-seq reads are too long (151 bp). In this paper where they used SAW as well the reads are 35/100 bp long.
@minhtien-trinh Yes, the maximum read length supported by SAW is 127. I think the method your sample used to construct the library was different from ours? If so, we are sorry that we currently do not enable the parameter for such a long read length. We recommend that you use ST_BarcodeMap(https://github.com/STOmics/ST_BarcodeMap) to decode your h5 file, obtain the CID sequence and its spatial coordinates, and use other tools for mapping and annotation.
I have the exact same problem. I've checked my R1.fastq.gz & R2.fastq.gz following what
minhtien-trinh
mentioned above and confirmed my R1&R2 are integrate and are not soft links. What happened in the alignment? I've run SAW7.1 for multiple times but never met this error before.
@AritaZ-hang Hi, could you attach the content of your .bcPara file and run the following code to check the output to confirm whether there are reads longer than 127bp?
gunzip -c /path_to_your/R2.fq.gz | awk '{if(length($0)>127){print}}'
I have the exact same problem. I've checked my R1.fastq.gz & R2.fastq.gz following what
minhtien-trinh
mentioned above and confirmed my R1&R2 are integrate and are not soft links. What happened in the alignment? I've run SAW7.1 for multiple times but never met this error before.@AritaZ-hang Hi, could you attach the content of your .bcPara file and run the following code to check the output to confirm whether there are reads longer than 127bp?
gunzip -c /path_to_your/R2.fq.gz | awk '{if(length($0)>127){print}}'
Hi Cloute, I've checked my R2. and found almost all the reads are of 151bp length, longer than the maximum requirements of 127bp. I will use other tools to map this spatial data. Anyway, thanks for your prompt reply.
The contents of .bcPara
file are listed below. it seems all good.
in=/home/arita/spatial/data//barcodeToPos.h5
in1=/home/arita/spatial/fastqs//H_R1_final.fastq.gz
in2=/home/arita/spatial/fastqs//H_R2_final.fastq.gz
barcodeReadsCount=/home/arita/spatial/test/results/00.mapping/H_R1_final.barcodeReadsCount.txt
barcodeStart=0
barcodeLen=25
umiStart=25
umiLen=10
mismatch=1
bcNum=359440836
polyAnum=15
mismatchInPolyA=2
Hello,
I'm encountering an error when attempting to build a mapping using the following command:
Error Details:
Full Error Traceback
``` Fr 9. Aug 11:04:50 CEST 2024 singularity check: pass, and singularity path is /usr/local/bin/singularity Fr 9. Aug 11:04:50 CEST 2024 singularity image file check: file exist and SIF path is /home/minh/SAW_7.1.sif Fr 9. Aug 11:04:50 CEST 2024 => splitMask, compute CID count and predict the memory of mapping start...... WARNING: While bind mounting '/home/minh:/home/minh': destination is already in the mount point list Command being timed: "singularity exec /home/minh/SAW_7.1.sif CIDCount -i /home/minh/Chip_C.barcodeToPos.h5 -s Fomes_fomentarius -g 1G" User time (seconds): 23.52 System time (seconds): 1.63 Percent of CPU this job got: 120% Elapsed (wall clock) time (h:mm:ss or m:ss): 0:20.91 Average shared text size (kbytes): 0 Average unshared data size (kbytes): 0 Average stack size (kbytes): 0 Average total size (kbytes): 0 Maximum resident set size (kbytes): 694544 Average resident set size (kbytes): 0 Major (requiring I/O) page faults: 79 Minor (reclaiming a frame) page faults: 120643 Voluntary context switches: 272085 Involuntary context switches: 231 Swaps: 0 File system inputs: 10068 File system outputs: 8 Socket messages sent: 0 Socket messages received: 0 Signals delivered: 0 Page size (bytes): 4096 Exit status: 0 Your sequencing reads are in Q40 format. Fr 9. Aug 11:05:11 CEST 2024 => CID mapping, adapter filtering and RNA alignment start...... ~~~ mapping - ChipC_R1.fastq.gz ~~~ WARNING: While bind mounting '/home/minh:/home/minh': destination is already in the mount point list WARNING: While bind mounting '/home/minh:/home/minh': destination is already in the mount point list WARNING: While bind mounting '/home/minh:/home/minh': destination is already in the mount point list --- Error: The length of reads exceeds the size of buf. --- Error: The length of reads exceeds the size of buf. --- Error: The length of reads exceeds the size of buf. --- Error: The length of reads exceeds the size of buf. bcSTAR: ReadsParse.cpp:911: ThreadBufWrapper::GetBatchReads(ReadsBuf*, char (*)[128], meta*, mmp*, stage*)::I have attempted to increase various buffer and RAM limits, as shown in the command, but the issue persists. The error appears to stem from a potential hardcoded limit in bcSTAR, specifically in the ReadsParse.cpp file. When I attempt to access bcSTAR inside Singularity at /opt/saw_st_software/pipeline/mapping/bcSTAR, I encounter binary output, suggesting that the source code is not included within the container. Additionally, I looked through the bcSTAR source code repository, but I could not locate the ReadsParse.cpp file.
Could you provide guidance on how to resolve this buffer size issue? Specifically:
Thank you for your assistance.