Closed ffranr closed 6 years ago
This command finds new variation with respect to an inferred personal reference.
Inputs These are the inputs to this command:
Procedure The following is an outline of how this command works:
The inferred personal reference and reads are passed to cortex which generates a VCF. This VCF will be referred to as vcf_cortex. vcf_cortex is then transformed/projected such that the reference is the same as that used by vcf_build (O-ref).
The infer
command can generate either a VCF vcf_infer
or FASTA reference_infer
output. The discover
command should take vcf_infer
as an input and not reference_infer
. This will make it easier to "transpose" the cortex output VCF vcf_cortex
on to the reference found in the gram directory.
Will update top comment.
Example command currently looks like this:
gramtools discover --gram-directory ./gram --inferred-vcf ./infer.vcf --reads ./reads --output-vcf ./discover.vcf
Cortex (or other caller) will need the reference_infer also.
This command finds new variation with respect to an inferred personal reference.
Inputs
These are the inputs to this command:
vcf_build
and a referencereference_build
)vcf_infer
withreference_build
as a reference (VCF format output of theinfer
command)Procedure
The following is an outline of how this command works:
vcf_infer
calledreference_infer
. This is accomplished by selecting the alternative allele for each entry withinvcf_infer
.reference_infer
and reads are passed to cortex which generates a VCF. This VCF will be referred to asvcf_cortex
.vcf_cortex
such that the reference isreference_build
instead ofreference_infer
. This VCF file will be referred to asvcf_discover
. Usingvcf_infer
instead ofreference_infer
is necessary at this step.Output
Discover produces a single VCF file as an output:
vcf_discover
. This VCF file usesreference_build
as a reference.Notes
Each entry within
vcf_cortex
can contain at most one alternative. Furthermore, these entries do not overlap. This should also hold true forvcf_discover
.