pauline-ng / SIFT4G_Create_Genomic_DB

Create genomic databases with SIFT predictions. Input is an organism's genomic DNA (.fa) file and the gene annotation file (.gtf). Output will be a database that can be used with SIFT4G_Annotator.jar to annotate VCF files.
GNU General Public License v3.0
21 stars 7 forks source link

Problem with Creating Local Database #91

Closed VNF1981 closed 3 months ago

VNF1981 commented 4 months ago

Hello,

I have successfully created local databases and annotated the whole genome sequences of more than 50 species via SIFT4G. However, I encountered the following issues with four species.

1) I am getting the "Alignment score and position are not consensus" error for one species.

Searching database for candidate sequences

Aligning queries with candidate sequences Alignment score and position are not consensus.7.50/100.00% *

///////////////////////////////////////////////////////////////////////////////////////

2) For three species, the process of creating databases breaks without any error during the alignment:

Searching database for candidate sequences

Aligning queries with candidate sequences

And here is the content of log file:

converting gene format to use-able input done converting gene format making single records file done making single records template making noncoding records file done making noncoding records make the fasta sequences done making the fasta sequences start siftsharp, getting the alignments

/////////////////////////////////////////////////////////////////////////////////////////////////

Any advice on these would be highly appreciated in advance!

pauline-ng commented 4 months ago

Congrats on making almost 50 databases!

If you used the gtf file to pull out the underlying DNA sequence for these transcripts, are they valid bases (A/C/G/T) or are there lots of N's?

Because SIFT is translating the DNA into amino acid sequence, and if there are N's, it won't be able to translate it into a protein (and SIFT won't work)

VNF1981 commented 3 months ago

Hi Pauline,

The issue has been resolved! I'm not sure what caused it, but I adjusted the method for creating annotations for species that lack them, and now everything is working.

Thanks once again for your quick response!

pauline-ng commented 3 months ago

Great!