Closed wolfgangrumpf closed 4 years ago
I check the files and here's the result:
fam43_isoseq_out.fastq.split.tama.renamed.renamed.fasta looks fine; it has a series of fasta sequences along the lines of:
>PB.1.35
AAAAATCAGATGGGGACTTTATTGTGATGGTGGCAGGTCCACCAGCA etc. etc. etc.
These files are empty: fam43_isoseq_out.fastq.split.tama.renamed.renamed_corrected.fasta fam43_isoseq_out.fastq.split.tama.renamed.renamed_corrected.gtf fam43_isoseq_out.fastq.split.tama.renamed.renamed_corrected.sam
fam43_isoseq_out.fastq.split.tama.renamed.renamed_corrected.gtf.tmp only has the header line.
If the .sam file is empty doesn't that suggest that mapping failed? I didn't see anything in the console to suggest that.....
Did you manage to solve the problem I have the same issue?
Same for me.
@wolfgangrumpf , @mysecretbackyard , @kgiperez ,
The issue comes from GMAP and I probably have some bug in there. The reason "*corrected.fasta" is empty is an indication of SQANTI2 failing to run GMAP to get an aligned SAM and convert it to a genome-corrected fasta.
I hate to do this - but can you guys switch to minimap2 for the time being?
I'll put on my ToDo to get GMAP fixed again.
Alternatively, supply directly the GMAP output in GFF3/GTF format, by running with --gtf input.gtf
instead of input.fasta
. When SQANTI2 sees --gtf
it will not run the aligner step and directly go to ORF prediction.
--Liz
I'm actually using minimap2. Tried with or without --aligner_choice=minimap2 but I still get the same error.
If you supply GTF would it works? Is this human? Would you be willing to share the input fasta confidentially for testing ?
On Mon, Nov 18, 2019 at 6:28 PM Kevin Perez notifications@github.com wrote:
I'm actually using minimap2. Tried with or without --aligner_choice=minimap2 but I still get the same error.
— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/Magdoll/SQANTI2/issues/42?email_source=notifications&email_token=AAEQE3ZERCF52FT3Q7PDSZLQUNFMXA5CNFSM4JBBEGXKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEEMUB2Q#issuecomment-555303146, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAEQE37U2SS6G7X6ILXSCXTQUNFMXANCNFSM4JBBEGXA .
-- Sent from Gmail Mobile. Excuse any possible typos.
Hi,
I just tested GMAP and I'm unable to produce your errors.
The command I used is below. If any of you could share confidentially your input fasta or gtf for me to reproduce your errors I'd appreciate it. I'd just need your email (which you can email me at etseng@pacb.com)
python ~/GitHub/SQANTI2/sqanti_qc2.py \
-o TEST -d test_out \
-t 30 --aligner_choice gmap \
-x ~/share/gmap_db_new/hg38 \
test.fasta \
gencode.v31.chr_patch_hapl_scaff.annotation.gtf \
hg38.fa
Hi,
@Magdoll unfortunately, my input is confidential patient data so I can not share it. I used --aligner_choice=minimap2 and input fa file from cupcake.collapsed.rep.fa\ Homo_sapiens.GRCh38.97.gtf\ Homo_sapiens.GRCh38.dna.primary_assembly.fa I got the same error.
Hi @mysecretbackyard ,
Can you please run the test data and see if it works? I'd like to discern whether the issue is your input or the software.
There is a test.gtf
and test_mini.fasta
:
python sqanti_qc2.py --gtf test.gtf \
gencode.v29.annotation.gtf hg38.fa
python sqanti_qc2.py test_mini.fasta \
gencode.v29.annotation.gtf hg38.fa
Hi @Magdoll ,
Thank you for your response. I tried both test_mini.fasta and my input without --aligner_choice and it works!
Hi @mysecretbackyard ,
Good to hear.
Closing this issue unless others continue to experience this problem. -Liz
Posting the original issue here first....
I've mapped some isoseq data and then collapsed it with TAMA and am now trying to do the final processing in SQANTI2 - I can't get past this error: