Closed yaada100 closed 2 years ago
Hi Yakup,
I'll explain the results in the context of one of the test files:
perl make-SIFT-db-all.pl --config test_files/candidatus_carsonella_ruddii_pv_config.txt
Run the above command, and you'll generate a bunch of files. One example file: SIFT_prediction/BAF35125.aligned.fasta
gene-annotation-src/Candidatus_carsonella_ruddii_pv.ASM1036v1.34.gtf.gz
Best, Pauline
Hey Pauline,
first of all thank you for your fast reply. To point 2. and 3. what i meant with query is the original query. So in context of candidatus I meant Candidatus_carsonella_ruddii_pv.ASM1036v1.dna.chromosome.Chromosome.fa as "Query". So can those results be closed back to this sequence.
Best, Yakup
If we define the genomic sequence as "Query", then go to candidatus_carsonella_ruddii_pv/ASM1036v1.34/Chromosome.gz
In this file: Column 1 is the position in the query sequence Column 2: the reference DNA allele
SIFT information is in other columns.
Dear Pauline and Robert,
I hope it is alright to ask these questions on here.
I have a question regarding the output. The code results in a folder called SIFT_predictions, with alignment fasta files. If I’ve understood it correctly those sequences are with Sw aligned, so pairwise and locally with the query sequence. The identity of the sequences are listed, but what about the query sequence? There is only the ID in the file name (i.e. : ENST000063518).
Thanks in advance, Yakup