Closed ghost closed 1 year ago
Hi,
Thank you for your interest in the software. Could you provide the first 1,000,000 lines of your BAM file?
Thanks.
Thank you for the quick response. Due to the upload limit of file size, here is the compressed file of the first 900,000 lines of the BAM file.
Hi,
I noticed that the alignments you provided are all paired-end reads with unmapped mates. This is not typically what we see from STAR's paired end reads. This would explain the error, as TEcount is trying to estimate fragment length assuming that it's a paired-end alignment, but the other end is unmapped (and thus fragment length is undeterminable).
Could you either double-check your STAR command line, or provide that here?
If you run the following, do you get any output?
samtools view -F 4 -F 8 [BAM] > [output file]
Thanks
Appreciate your kind help. There is no output from the code you asked, and here are the STAR command line I runned and the log from it - there was no problem when running it and TEcounts in other datasets. Actually, there was one strange point I noticed from this dataset - the size of read1 files are approximately twice the size of read2 files.
/data/tools/STAR-2.5.2b/bin/Linux_x86_64/STAR \ --outSAMtype BAM Unsorted \ --runThreadN 4 \ --genomeDir /data_lu/users/shinminkyu1_ct7/STAR_hg38 \ --genomeLoad LoadAndRemove \ --outFilterMultimapNmax 100 \ --winAnchorMultimapNmax 100 \ --readFilesIn /data_lu/users/shinminkyu1_ct7/GSE165252_BAM//SRR13521096_T/SRR13521096_T_R1.fastq /data_lu/users/shinminkyu1_ct7/GSE165252_BAM//SRR13521096_T/SRR13521096_T_R2.fastq 2>&1| [Wed May 10 21:46:29 KST 2023] May 10 21:46:29 ..... started STAR run [Wed May 10 21:46:29 KST 2023] May 10 21:46:29 ..... loading genome [Wed May 10 21:46:48 KST 2023] May 10 21:46:48 ..... started mapping [Wed May 10 21:50:21 KST 2023] May 10 21:50:21 ..... finished successfully [Wed May 10 21:50:22 KST 2023] No other jobs are attached to the shared memory segment, removing it. [Wed May 10 21:50:24 KST 2023] Elapsed: 00:03:55
Hi,
That is very strange, as it would mean that there were more reads in one direction than the other. I just quickly downloaded SRR13521096, and noticed that while Read 1 is 58nt, Read 2 is only 7nt in length (and all Ns). I wonder if they uploaded the wrong file. You can see the read length distribution here.
In fact, I noticed that this is the case for all the files in this project, SRX9932092.
These might be best treated as single-end libraries.
Sorry that I couldn't be of more help.
Thanks.
Okay, I understand the problem. I should have first checked the fastq files.. Sorry to bother you and thank you so much for your great help and suggestion.
Really appreciate your efforts in this area. Wish you a good day.
Hello, thank you very much for this amazing tool. I get this error when running TEcounts in a dataset, which I haven't seen in other datasets.
But the input BAM file, which was aligned using STAR as I did in other datasets, seems to have an enough number of mapped reads.
Could you please help me find out a solution for this problem?
Here is the header of BAM file: