vg validate -A -a remapped.gam /private/groups/patenlab/anovak/projects/hprc/lr-giraffe/graphs/hprc-v1.1-mc-chm13.d9.gbz
Invalid Alignment:
Length of node 4465278 (1) exceeded by Mapping with offset 0 and from-length 4:
{"edit": [{"from_length": 1, "to_length": 1}, {"sequence": "AAGG", "to_length": 4}, {"from_length": 3}], "position": {"node_id": "4465278"}, "rank": "63"}
alignment: invalid
This is because a "from_length": 3 deletion edit that might make sense in a mapping to node 4465277 is smushed into the mapping for the previous node, node 4465278. It also happens to be immediately after an insertion edit; generally we don't want to abut those.
We have to figure out how this base-level alignment is being generated, and at least get the edit into the right mapping, if not prohibit the adjacent indels entirely.
If you map read S1_73477 from the 1m simulated R10 reads with vg
ef2a1384b
, like this:Then the alignment doesn't validate:
This is because a
"from_length": 3
deletion edit that might make sense in a mapping to node 4465277 is smushed into the mapping for the previous node, node 4465278. It also happens to be immediately after an insertion edit; generally we don't want to abut those.We have to figure out how this base-level alignment is being generated, and at least get the edit into the right mapping, if not prohibit the adjacent indels entirely.
Without this, surject can't process the read.