If a VCF record being annotated has more than one entry in the alt field, nanno splits that record into 2 records.
e.g.
chr1 1000 A C,T .....
will become
chr1 1000 A C .....
and
chr1 1000 A T .....
However, nanno was using the whole of the alt field (e.g. C,T) when matching against snpEff annotation files, so records with more than one entry in the alt field never got a match from snpEff.
The fix splits the alt field when it contains more than one value, so that the VCF record being annotated can be matched against each of these values.
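The idea behind the fix can be sketched roughly as follows. This is a minimal illustration, not nanno's actual code: the class name `AltSplitSketch`, both method names, and the `chr:pos:ref:alt` key format are hypothetical and chosen only to show one lookup key being built per alt value instead of a single key for the whole alt field.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of nanno.
final class AltSplitSketch {

    /** Splits a VCF alt field such as "C,T" into its individual values. */
    static List<String> splitAlt(String alt) {
        List<String> alleles = new ArrayList<>();
        for (String allele : alt.split(",")) {
            alleles.add(allele);
        }
        return alleles;
    }

    /**
     * Builds one lookup key per alt value, so that each value can be
     * matched against an annotation source separately.
     * The "chr:pos:ref:alt" key format is an assumption made for this sketch.
     */
    static List<String> lookupKeys(String chr, int pos, String ref, String alt) {
        List<String> keys = new ArrayList<>();
        for (String allele : splitAlt(alt)) {
            keys.add(chr + ":" + pos + ":" + ref + ":" + allele);
        }
        return keys;
    }
}
```

With the example record above, `lookupKeys("chr1", 1000, "A", "C,T")` yields one key for C and one for T, rather than a single key containing "C,T" that no per-allele annotation could match.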
Other changes:
reverted a change made by the previous PR #341 that removed a build.last step
renamed ChrPositionRefAlt.getName() to getRef()
added GATK GT field to nanno output
added original_alt field to nanno output (useful when splitting 1/2 variants)
keep original GATK_AD values rather than manipulating them (original_alt field allows for this)
update Executor so that spaces in the classpath (hello IDEA) don't cause the Process to fall over
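On the Executor change in the last item above: the PR does not say how the fix was made, so the following is only a sketch of one common way to keep a classpath containing spaces intact. It passes each argument as a separate list element to `ProcessBuilder`, instead of handing a single command string to `Runtime.exec(String)`, which tokenizes on whitespace. The class name `ClasspathLaunchSketch`, the method name, and the example paths are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper for illustration only; not the project's Executor class.
final class ClasspathLaunchSketch {

    /**
     * Builds a "java -cp <classpath> <mainClass> <args...>" command where
     * every argument is its own list element, so a classpath containing
     * spaces is passed to the child process as a single argument.
     */
    static ProcessBuilder javaCommand(String classpath, String mainClass, String... args) {
        List<String> command = new ArrayList<>();
        command.add("java");
        command.add("-cp");
        command.add(classpath);   // kept whole, even if it contains spaces
        command.add(mainClass);
        command.addAll(Arrays.asList(args));
        return new ProcessBuilder(command);
    }
}
```

Calling `javaCommand("/Users/me/IDEA Projects/app/classes", "org.example.Main")` keeps the classpath as one argument; calling `start()` on the returned builder would then launch the process.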
Type of change
Please delete options that are not relevant.
[X] Bug fix (non-breaking change which fixes an issue)
How Has This Been Tested?
New unit tests, along with testing on cluster
Are WDL Updates Required?
No
Checklist:
[X] My code follows the style guidelines of this project
[X] I have performed a self-review of my own code
[X] I have commented my code, particularly in hard-to-understand areas
[X] I have made corresponding changes to the documentation
[X] My changes generate no new warnings
[X] I have added tests that prove my fix is effective or that my feature works
[X] New and existing unit tests pass locally with my changes