vgteam / vg

tools for working with genome variation graphs
https://biostars.org/tag/vg/
Other
1.09k stars 193 forks source link

ancient DNA #1709

Open ekg opened 6 years ago

ekg commented 6 years ago

I'd like to validate the utility of vg in the alignment of ancient DNA. The shortest path to this would be through simulation of aDNA-like reads. We already have the capacity to take recalibrated BAMs as input to get per-base error estimates. However, we don't have the ability to vary the read length according to the distribution of lengths that we see in this input. It seems like the simplest thing to do in order to be able to simulate something reasonably similar to aDNA to make the argument that our alignment works on it would be to add the read length distribution sampler into the simulation.

jeizenga commented 6 years ago

Nice. I've been wanting to implement the empirical read-length distribution for a while but just never got around to it.