standage / genhub

Explore eukaryotic genome composition and organization with iLoci
BSD 3-Clause "New" or "Revised" License
6 stars 3 forks source link

NT_ and NW_ accessions in mammalian genomes correspond to patches, variants #52

Closed standage closed 8 years ago

standage commented 8 years ago

The mouse and human genomes are very well finished, and the chromosome sequences are assigned NC accessions. NW and NT_ do not correspond to unplaced genomic scaffolds as they do in many other species, they correspond to patches or variants not (yet?) integrated into a major build release. This information is redundant and should be filtered out in preprocessing. Filtering annotations is simple, but if we don't want redundant sequences to be included in calculations this will require implementing a new filtering mechanism.