bigbio / py-pgatk

Python tools for proteogenomics analysis toolkit
Apache License 2.0

VCF to proteinDB #10

Closed: husensofteng closed this issue 5 years ago

husensofteng commented 5 years ago

Convert a VEP-annotated VCF into protein sequences.

Input:

Process:

Output:

husensofteng commented 5 years ago

The main difference from COSMIC and cBioPortal is:

husensofteng commented 5 years ago

Done. Example command:

python pypgatk_cli.py vcf-to-proteindb --config_file config/ensembl_config.yaml --vep_annotated_vcf testdata/test.vcf --input_fasta testdata/test.fa --gene_annotations_gtf testdata/test.gtf --output_proteindb testdata/proteindb_from_ENSEMBL_VCF.fa
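For anyone unsure what "VEP-annotated" means for the input: VEP writes its per-transcript annotations into the VCF INFO column under the CSQ key. The sub-fields are separated by "|" and named in the "Format:" part of the CSQ header line. The snippet below is only a minimal sketch of reading that field. It is not the py-pgatk implementation; the helper names (csq_fields_from_header, parse_csq), the header line, and the record are all made up for illustration, and real files will carry whatever sub-fields VEP was configured to emit.

```python
import re


def csq_fields_from_header(header_line):
    """Extract CSQ sub-field names from a VEP ##INFO=<ID=CSQ,...> header line."""
    match = re.search(r'Format: ([^"]+)"', header_line)
    if match is None:
        raise ValueError("no 'Format:' description found in CSQ header")
    return match.group(1).strip().split("|")


def parse_csq(info_column, field_names):
    """Return one dict per annotation found in the CSQ entry of an INFO column."""
    for entry in info_column.split(";"):
        if entry.startswith("CSQ="):
            # Annotations (typically one per transcript) are comma-separated.
            annotations = entry[len("CSQ="):].split(",")
            return [dict(zip(field_names, ann.split("|"))) for ann in annotations]
    return []


# Hypothetical header and INFO column, invented for this example.
header = (
    '##INFO=<ID=CSQ,Number=.,Type=String,Description="Consequence annotations '
    'from Ensembl VEP. Format: Allele|Consequence|SYMBOL|Feature|'
    'Protein_position|Amino_acids">'
)
info = (
    "DP=42;CSQ=T|missense_variant|GENE1|ENST00000000001|12|A/V,"
    "T|synonymous_variant|GENE1|ENST00000000002|12|A"
)

fields = csq_fields_from_header(header)
records = parse_csq(info, fields)

# Keep only annotations that change the protein sequence (assumed filter).
# VEP can join several consequence terms with "&", hence the substring test.
missense = [r for r in records if "missense_variant" in r["Consequence"]]
for r in missense:
    print(r["Feature"], r["Protein_position"], r["Amino_acids"])
```

The substring test on Consequence is a deliberately simple filter; which consequence types should actually end up in the protein database is a choice this sketch does not try to settle.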