Closed swomics closed 1 year ago
These are all excellent questions.
vg
. The idea will be to keep the full graph and then, when mapping with giraffe
, dynamically clip down to the subgraph that most fits the reads. This way it can ignore common variants that aren't in the sample while keeping rare variants that are. As for targetted clipping, there is a bit of an interface in vg clip
where you can pass in a BED file, but you'd need to do it manually outside of cactus-graphmap-join
. Thank you for the detailed reply!
Hi,
Working with the cactus/vg programs has been really insightful and has produced some very interesting results so far. I'm currently genotypying some short read samples based on known SVs present in some long-read genomes. The region I am interested has lots of structural variation, which I think lends itself to this pangenome framework well. As my datasets have increased in size and overlapping complexity at the region of interest, I'm coming up against the limits of the data/programs. I would like to optimize the genotyping of this complex region as much as possible, and I have three questions: