Open MikeHala opened 2 weeks ago
Hi @MikeHala,
We are working on supporting gnomAD version 4.1 for the next Ensembl release which will be early Autumn. You can check the updates here.
should I keep this line: "use_seq_region_synonyms": 1? (and why there is no equivalent for exomes)
use_seq_region_synonyms
is necessary when the VCF file has different names from the regions in the Ensembl db. The gnomAD files should work without this option.
do these matter "display_group_priority": 1.6 (for genomes) and "display_group_priority": 1.4?
display_group_priority
retrieves the priority of the group of populations enabling the ordering of frequency data on the population genetics webpage. This should not impact your work.
what will be appropriate values for the populations IDs - currently 9900028 for genomes and 9900010 for exomes?
Yes, for genomes it starts on 9900028 and for exomes on 9900010.
Let me know if you have more questions.
Best wishes, Diana
Describe the issue
Hi,
We are currently using the G2P plugin based on the the global MAF in the locally downloaded gnomAD variants (
gnomADg_r3.1.1_GRCh38
andgnomADe_r2.1.1_GRCh38
).The call to G2P is done as
To facilitate using the global MAF annotation from the locally downloaded gnomAD VCFs (and only them) we have modified the
ensembl-vep-100.4-0/Bio/EnsEMBL/Variation/DBSQL/vcf_config.json
file (see https://www.ebi.ac.uk/gene2phenotype/g2p_vep_plugin, theaf_from_vcf
parameter), and the json file looks like thisThis is all working.
With the release of gnomAD r4.1.0, we would like to update our scripts to using it and replace the old gnomAD VCFs.
My questions are related to (after downloading the gnomAD r.4.1.0 data and adjusting accordingly
id
,description
andfilename_template
) how should I adapt thevcf_config.json
file?"use_seq_region_synonyms": 1
? (and why there is no equivalent for exomes)"display_group_priority": 1.6
(for genomes) and"display_group_priority": 1.4
?9900028
for genomes and9900010
for exomes?Thank you in advance, Mike