Closed cmdcolin closed 5 months ago
uses a redispatching approach similar to gff3tabix to handle cases where you retrieve a child and need to re-fetch to get all children within it's bounds. it's a heuristic that could be broken but hopefully holds up for general usage
on the UCSC Gencode bigGenePred file, i think it has much nicer default behavior
this branch
main branch
note that a similar thing could be done to bedTabix potentially, but the UCSC bedTabix that i exported from the sql tables don't have any geneName type attribute to aggregate by afaik
This adds the ability to aggregate multiple bigbed entries into a single 'gene' feature based on a attribute such as geneName
BigBed /BED in general is only capable of storing one transcript per line and doesnt explicitly acknowledge child->parent relationships with a gene level feature
result on volvox
before
this PR
motivation: better UCSC2jbrowse mega-instance functionality