Closed shigej38 closed 4 years ago
Hi @shigej38, the output you see only means that there are no counts for the variant. If you are expecting a result could you please send me the input variants and also the command for how you call the Mastermind plugin together with the Mastermind filename and assembly version? Thank you, Anja
Hi @at7
The size of my input file is quite large. There are variants belonging to about 20,000 genes in the absence of any bi-mastermind matching.
I am using command for calling is:
--plugin Mastermind, / mnt / 7C033C763BE8E7D8 / ensembl-data / mastermind / mastermind_cited_variants_reference-2019.06.14-grch37.vcf.gz
my Mastermind file version is 2019.06.14-grch37
and I constantly update my Mastermind.pm plugin file.
Thank you for sending the additional information. I tried running Mastermind for an example variant (rs699, location 1:230845794-230845794) and get results back from the plugin using the same Mastermind file as you do. Could you please use the same example variant as I did and see if you get any results? I also recommend that you check out the latest Mastermind code. We did push some bug fixes in the last weeks.
In my file, although the appropriate variant in the header section of the description line belonging to the mastermine should not be empty or (?)?
I've updated all of the ensembl-vep and mastermind files, but this update didn't solve my problem. Can you send a sample vcf file if you don't know how to use the instance variable you mentioned? my e-mail: ertan.yilmaz@detagen.com.tr
I used the following variant for tests:
1 230845794 rs699 A G . .
vermiş olduğunuz variantı şu şekilde kaydettim.
#CHROM POS ID REF ALT QUAL FILTER INFO 1 230845794 rs699 A G . .
bu girdiye göre şu şekilde çıktı oluşturdu.
`## ENSEMBL VARIANT EFFECT PREDICTOR v97.3
`
As you can see the output information about the mastermind in the info field "?" is shown. Previously, these lines contained information about what each column meant.
`## MMCNT1 : ?
I think the problem is that your Mastermind.pm is not up-to-date. How did you install the plugins? Can you please replace your Mastermind.pm file with the latest version from github: https://github.com/Ensembl/VEP_plugins/blob/release/97/Mastermind.pm Thank you.
I've updated ensembl vep and Mastermind.pm plugin.
The command I used to update ensembl vep is:
git pull
git chechkout release/97
perl INSTALL.pl
The command I used to update Mastermind:
git clone https://github.com/Ensembl/VEP_plugins/blob/release/97/Mastermind.pm
cd VEP_plugins
cp Mastermind.pm /home/$USER/.vep/Plugins
I updated the Mastermind database file by re-registering.
my problem still continues.
git clone https://github.com/Ensembl/VEP_plugins/blob/release/97/Mastermind.pm doesn't work for me. Can you please try: git clone https://github.com/Ensembl/VEP_plugins.git cp VEP_plugins/Mastermind.pm /home/$USER/.vep/Plugins/ Thank you
As you have already specified, I have used the "git clone" command with git clone to perform a single file operation. therefore ongoing commands do not work. because I copied the last file, it's not like that or the command I used is:
git clone https://github.com/ensembl/vep_plugins.git
the updates you mentioned did not solve my problem.
Could you please check that the Mastermind.pm file which you have stored under /home/$USER/.vep/Plugins contains the following 2 lines: https://github.com/Ensembl/VEP_plugins/blob/release/97/Mastermind.pm#L134-L135? Thank you
my mastermind.pm file content is line 131 to 138
`sub get_header_info {
return{ 'Mastermind_counts' => 'Mastermind number of citations in the medical literature. Output includes three unique counts: MMCNT1|MMCNT2|MMCNT3. MMCNT1 - Count of Mastermind articles with cDNA matches for this specific variant; MMCNT2 - Count of Mastermind articles with variants either explicitly matching at the cDNA level or given only at protein level; MMCNT3 - Count of Mastermind articles including other DNA-level variants resulting in the same amino acid change.', 'Mastermind_MMID3' => 'Mastermind MMID3 variant identifier(s), as gene:key, for MMCNT3.', };
}`
That looks good! What are you using in your --fields list? Maybe you need to update to Mastermind_counts and Mastermind_MMID3?
my fields command is:
"Uploaded_variation,cDNA_position,REF_ALLELE,Allele,Protein_position,Amino_acids,Codons,SYMBOL,EXON,INTRON,ZYG,Consequence,CLIN_SIG,Existing_variation,rs_dbSNP150,G2P_complete,G2P_flag,G2P_gene_req,MMCNT1,MMCNT2,MMCNT3,MMID3,clinvar_rs,clinvar_trait,DOMAINS,Interpro_domain,hg18_chr,hg18_pos(1-based),hg19_chr,hg19_pos(1-based),ExAC_AC,ExAC_AF,gnomAD_exomes_AC,gnomAD_exomes_AF,gnomAD_genomes_AC,gnomAD_genomes_AF,Ensembl_geneid,Ensembl_proteinid,Ensembl_transcriptid,HGVSc,HGVSp,Location,Gene,Feature,Feature_type,CDS_position,IND,ALLELE_NUM,IMPACT,DISTANCE,STRAND,FLAGS,VARIANT_CLASS,SYMBOL_SOURCE,HGNC_ID,BIOTYPE,CANONICAL,MANE,CCDS,ENSP,SWISSPROT,TREMBL,UNIPARC,REFSEQ_MATCH,SOURCE,SIFT,PolyPhen,HGVS_OFFSET,AF,AFR_AF,AMR_AF,EAS_AF,EUR_AF,SAS_AF,AA_AF,EA_AF,gnomAD_AF,gnomAD_AFR_AF,gnomAD_AMR_AF,gnomAD_ASJ_AF,gnomAD_EAS_AF,gnomAD_FIN_AF,gnomAD_NFE_AF,gnomAD_OTH_AF,gnomAD_SAS_AF,SOMATIC,PHENO,OverlapBP,OverlapPC,1000Gp3_AC,1000Gp3_AF,1000Gp3_AFR_AC,1000Gp3_AFR_AF,1000Gp3_AMR_AC,1000Gp3_AMR_AF,1000Gp3_EAS_AC,1000Gp3_EAS_AF,1000Gp3_EUR_AC,1000Gp3_EUR_AF,1000Gp3_SAS_AC,1000Gp3_SAS_AF,ALSPAC_AC,ALSPAC_AF,AltaiNeandertal,Ancestral_allele,CADD_phred,CADD_raw,CADD_raw_rankscore,DANN_rankscore,DANN_score,Denisova,ESP6500_AA_AC,ESP6500_AA_AF,ESP6500_EA_AC,ESP6500_EA_AF,Eigen-PC-phred,Eigen-PC-raw,Eigen-PC-raw_rankscore,Eigen-phred,Eigen-raw,Eigen_coding_or_noncoding,ExAC_AFR_AC,ExAC_AFR_AF,ExAC_AMR_AC,ExAC_AMR_AF,ExAC_Adj_AC,ExAC_Adj_AF,ExAC_EAS_AC,ExAC_EAS_AF,ExAC_FIN_AC,ExAC_FIN_AF,ExAC_NFE_AC,ExAC_NFE_AF,ExAC_SAS_AC,ExAC_SAS_AF,ExAC_nonTCGA_AC,ExAC_nonTCGA_AF,ExAC_nonTCGA_AFR_AC,ExAC_nonTCGA_AFR_AF,ExAC_nonTCGA_AMR_AC,ExAC_nonTCGA_AMR_AF,ExAC_nonTCGA_Adj_AC,ExAC_nonTCGA_Adj_AF,ExAC_nonTCGA_EAS_AC,ExAC_nonTCGA_EAS_AF,ExAC_nonTCGA_FIN_AC,ExAC_nonTCGA_FIN_AF,ExAC_nonTCGA_NFE_AC,ExAC_nonTCGA_NFE_AF,ExAC_nonTCGA_SAS_AC,ExAC_nonTCGA_SAS_AF,ExAC_nonpsych_AC,ExAC_nonpsych_AF,ExAC_nonpsych_AFR_AC,ExAC_nonpsych_AFR_AF,ExAC_nonpsych_AMR_AC,ExAC_nonpsych_AMR_AF,ExAC_nonpsych_Adj_AC,ExAC_nonpsych_Adj_AF,ExAC_nonpsych_EAS_AC,ExAC_nonpsych_EAS_AF,ExAC_nonpsych_FIN_AC,ExAC_nonpsych_FIN_AF,ExAC_nonpsych_NFE_AC,ExAC_nonpsych_NFE_AF,ExAC_nonpsych_SAS_AC,ExAC_nonpsych_SAS_AF,FATHMM_converted_rankscore,FATHMM_pred,FATHMM_score,GERP++_NR,GERP++_RS,GERP++_RS_rankscore,GM12878_confidence_value,GM12878_fitCons_score,GM12878_fitCons_score_rankscore,GTEx_V6p_gene,GTEx_V6p_tissue,GenoCanyon_score,GenoCanyon_score_rankscore,H1-hESC_confidence_value,H1-hESC_fitCons_score,H1-hESC_fitCons_score_rankscore,clinvar_clnsig,clinvar_golden_stars,HUVEC_confidence_value,HUVEC_fitCons_score,HUVEC_fitCons_score_rankscore,LRT_Omega,LRT_converted_rankscore,LRT_pred,LRT_score,M-CAP_pred,M-CAP_rankscore,M-CAP_score,MetaLR_pred,MetaLR_rankscore,MetaLR_score,MetaSVM_pred,MetaSVM_rankscore,MetaSVM_score,MutPred_AAchange,MutPred_Top5features,MutPred_protID,MutPred_rankscore,MutPred_score,MutationAssessor_UniprotID,MutationAssessor_pred,MutationAssessor_score,MutationAssessor_score_rankscore,MutationAssessor_variant,MutationTaster_AAE,MutationTaster_converted_rankscore,MutationTaster_model,MutationTaster_pred,MutationTaster_score,PROVEAN_converted_rankscore,PROVEAN_pred,PROVEAN_score,Polyphen2_HDIV_pred,Polyphen2_HDIV_rankscore,Polyphen2_HDIV_score,Polyphen2_HVAR_pred,Polyphen2_HVAR_rankscore,Polyphen2_HVAR_score,REVEL_rankscore,REVEL_score,Reliability_index,SIFT_converted_rankscore,SIFT_pred,SIFT_score,SiPhy_29way_logOdds,SiPhy_29way_logOdds_rankscore,SiPhy_29way_pi,TWINSUK_AC,TWINSUK_AF,Transcript_id_VEST3,Transcript_var_VEST3,Uniprot_aapos_Polyphen2,Uniprot_acc_Polyphen2,Uniprot_id_Polyphen2,VEST3_rankscore,VEST3_score,aaalt,aapos,aaref,alt,cds_strand,chr,codon_degeneracy,codonpos,fathmm-MKL_coding_group,fathmm-MKL_coding_pred,fathmm-MKL_coding_rankscore,fathmm-MKL_coding_score,genename,gnomAD_exomes_AFR_AC,gnomAD_exomes_AFR_AF,gnomAD_exomes_AFR_AN,gnomAD_exomes_AMR_AC,gnomAD_exomes_AMR_AF,gnomAD_exomes_AMR_AN,gnomAD_exomes_AN,gnomAD_exomes_ASJ_AC,gnomAD_exomes_ASJ_AF,gnomAD_exomes_ASJ_AN,gnomAD_exomes_EAS_AC,gnomAD_exomes_EAS_AF,gnomAD_exomes_EAS_AN,gnomAD_exomes_FIN_AC,gnomAD_exomes_FIN_AF,gnomAD_exomes_FIN_AN,gnomAD_exomes_NFE_AC,gnomAD_exomes_NFE_AF,gnomAD_exomes_NFE_AN,gnomAD_exomes_OTH_AC,gnomAD_exomes_OTH_AF,gnomAD_exomes_OTH_AN,gnomAD_exomes_SAS_AC,gnomAD_exomes_SAS_AF,gnomAD_exomes_SAS_AN,gnomAD_genomes_AFR_AC,gnomAD_genomes_AFR_AF,gnomAD_genomes_AFR_AN,gnomAD_genomes_AMR_AC,gnomAD_genomes_AMR_AF,gnomAD_genomes_AMR_AN,gnomAD_genomes_AN,gnomAD_genomes_ASJ_AC,gnomAD_genomes_ASJ_AF,gnomAD_genomes_ASJ_AN,gnomAD_genomes_EAS_AC,gnomAD_genomes_EAS_AF,gnomAD_genomes_EAS_AN,gnomAD_genomes_FIN_AC,gnomAD_genomes_FIN_AF,gnomAD_genomes_FIN_AN,gnomAD_genomes_NFE_AC,gnomAD_genomes_NFE_AF,gnomAD_genomes_NFE_AN,gnomAD_genomes_OTH_AC,gnomAD_genomes_OTH_AF,gnomAD_genomes_OTH_AN,integrated_confidence_value,integrated_fitCons_score,integrated_fitCons_score_rankscore,phastCons100way_vertebrate,phastCons100way_vertebrate_rankscore,phastCons20way_mammalian,phastCons20way_mammalian_rankscore,phyloP100way_vertebrate,phyloP100way_vertebrate_rankscore,phyloP20way_mammalian,phyloP20way_mammalian_rankscore,pos(1-based),ref,refcodon,Condel"
That must be it. You still use MMCNT1,MMCNT2,MMCNT3,MMID3 in your fields list. Can you replace those please with _Mastermind_counts,MastermindMMID3? Thank you!
@at7 Thank You Sir yes, this method solved my problem. but asking for a little customize. can I write the names of the columns I want with this method?
Hello: When I run the Mastermind extension, it adds the comments to the output file in this way and does not add any mastermind information, but fills its columns with "-". What is this error? How can I solve.
INFO SPACE `
MMCNT1 : ?
MMCNT2 : ?
MMCNT3 : ?
MMID3 : ?
`
COLUMN SPACE EXAMPLE `
`