Closed rbuels closed 7 months ago
MAF format http://www.bx.psu.edu/~dcking/man/maf.xhtml
data formats to consider:
for clustalW output we would need to process it to make one of the sequences the reference, and have the other alignment variants be deletions and insertions with respect to it.
UCSC uses MAF via the BigMaf format, which looks like it actually stores the MAF info in a BigBed with an AutoSql.
True :) have parsed this format in https://github.com/cmdcolin/mafviewer/
One important part of this discussion is representing gaps in the reference sequence track and making annotations aware of that (since an alignment can introduce gaps into the reference sequence to represent insertions relative to the query)
Note that things like samtools tview fully does this for any viewing of a genome alignments file, which helps elucidate indels
Here is samtools tview showing gapped reference on our volvox-sorted.bam
Also looks like this with a reference sequence available
I keep poking this thread occaisionally but I do think it's sort of an interesting issue
It brings into light modified reference coordinates
This SV visualizer is part of SV plaudit and does a good job of comparing reference to alt
https://github.com/svviz/svviz
If we could make a custom view type or LGV mod that supports such a thing it could be quite powerful
seems like there can be only hacks that try to bolt this onto lgv. i think future true alignments to a graph genome type thing are the better option, and allowing jbrowse to flip between refs from graph for example
like here's a clustalw output: