Question about the coordinates in bubble output

lh3 / minigraph

Sequence-to-graph mapper and graph generator

MIT License

417 stars 38 forks source link

Dear Dr. Li, When running minigraph --call, how are the positions of the "alleles" (bubble vertices) determined? I am assuming that the reference sequences are mapped to the graph, and that's how the coordinates are calculated. Is that correct? Below, see an example where I queried the graph with the same sequence that was used as the reference sample during the graph construction phase (minigraph -xasm --call foo.gfa Bd21C1.fa)

Bd21C1  481920  481939  >s1     >s3     >s2:19:+:Bd21C1:481915:481941

Notice how the allele coordinate is different from the bubble position (presumably on the same sequence). Is that a mapping artifact?

Also, what happens when a sequence maps to the bubble in two or more places?

Thank you

lh3 / minigraph

Question about the coordinates in bubble output #50