UW-GAC / wgsaparsr

Code for parsing TOPMED variant annotation files produced by the WGSA annotation tool.

reorganize columns of parsed output files #75

Closed (jaind closed this issue 5 years ago)

jaind commented 6 years ago

The following columns, in this order, should always be the first columns reported in all parsed output files:

CHROM
POS
REF
ALT
FILTER
chr_hg19
pos_hg19
alt_hg19
ref_hg19
ref_hg19_equals_ref_hg38
rs_dbSNP150 (or the equivalent rsID column in future WGSA releases)

For indels, the two columns below should come immediately after the eleven columns above:

focal_snv_number
indel_focal_length

For dbNSFP annotations, the three columns below should come immediately after the eleven columns above:

aaref
aaalt
Ensembl_geneid
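
To make the requested ordering concrete, here is a minimal sketch in Python. It is only an illustration, not wgsaparsr's implementation. The function name `reorder_columns`, the output-type labels (`"snv"`, `"indel"`, `"dbnsfp"`), and the column `other_annotation` are hypothetical. The sketch also assumes that columns not named above keep their original relative order, and that a requested column missing from the input is simply skipped.

```python
# The eleven leading columns, in the order requested above.
CORE_COLUMNS = [
    "CHROM", "POS", "REF", "ALT", "FILTER",
    "chr_hg19", "pos_hg19", "alt_hg19", "ref_hg19",
    "ref_hg19_equals_ref_hg38", "rs_dbSNP150",
]

# Extra columns that follow the core set; the keys are hypothetical labels.
EXTRA_COLUMNS = {
    "snv": [],
    "indel": ["focal_snv_number", "indel_focal_length"],
    "dbnsfp": ["aaref", "aaalt", "Ensembl_geneid"],
}


def reorder_columns(columns, output_type):
    """Return `columns` with the requested leading columns moved to the front.

    Columns not named in the request keep their original relative order.
    Requested columns absent from the input are skipped (an assumption).
    """
    leading = CORE_COLUMNS + EXTRA_COLUMNS[output_type]
    present = set(columns)
    front = [name for name in leading if name in present]
    front_set = set(front)
    rest = [name for name in columns if name not in front_set]
    return front + rest


# Example: a shuffled indel header with one unrelated, made-up column.
header = ["POS", "indel_focal_length", "other_annotation",
          "CHROM", "focal_snv_number", "REF", "ALT"]
print(reorder_columns(header, "indel"))
```

For this header, the call returns `CHROM, POS, REF, ALT, focal_snv_number, indel_focal_length, other_annotation`. Keeping the order in plain lists means a future rename of the rsID column would only require changing one entry.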