Create genomic databases with SIFT predictions. Input is an organism's genomic DNA (.fa) file and the gene annotation file (.gtf). Output will be a database that can be used with SIFT4G_Annotator.jar to annotate VCF files.
GNU General Public License v3.0
25
stars
7
forks
source link
Use of uninitialized value $fasta_subseq in concatenation (.) or string at generate-fasta-subst-files-BIOPERL.pl line 446, <IN_TX> line 37345. #59
A big log file with repetitive warnings like "Use of uninitialized value $fasta_subseq in concatenation (.) or string at generate-fasta-subst-files-BIOPERL.pl line 446, line 37345."
When I run the sample files, SIFT4G databases created correctly.
I look in the fasta directory for my file , I see protein sequences.
This is my file tree:
|-- chr-src genomic.fa.gz
|-- gene-annotation-src genomic.gtf.gz
|-- dbSNP xx.vcf.gz
My config-file:
GENETIC_CODE_TABLE=1
GENETIC_CODE_TABLENAME=Standard
MITO_GENETIC_CODE_TABLE=5
MITO_GENETIC_CODE_TABLENAME=Invertebrate Mitochondrial
@pauline-ng Hi Pauline,
A big log file with repetitive warnings like "Use of uninitialized value $fasta_subseq in concatenation (.) or string at generate-fasta-subst-files-BIOPERL.pl line 446, line 37345."
When I run the sample files, SIFT4G databases created correctly.
I look in the fasta directory for my file , I see protein sequences.
This is my file tree: |-- chr-src genomic.fa.gz |-- gene-annotation-src genomic.gtf.gz |-- dbSNP xx.vcf.gz
My config-file: GENETIC_CODE_TABLE=1 GENETIC_CODE_TABLENAME=Standard MITO_GENETIC_CODE_TABLE=5 MITO_GENETIC_CODE_TABLENAME=Invertebrate Mitochondrial
PARENT_DIR=/public/home/lizhimin/biosoft/sift4g/scripts_to_build_SIFT_db/hefeishi_test/hefeishi ORG=hefeishi ORG_VERSION=GRCh38.83
DBSNP_VCF_FILE=hefeishi.LDfilter-SRR-remove.vcf.gz
Running SIFT 4G
SIFT4G_PATH=~/anaconda3/bin/sift4g PROTEIN_DB=/public/home/lizhimin/biosoft/sift4g/scripts_to_build_SIFT_db/hefeishi_test/protein.faa
Any advice?
Thanks, LiZhimin