biocommons / biocommons.seqrepo

non-redundant, compressed, journalled, file-based storage for biological sequences
Apache License 2.0
39 stars 35 forks source link

Fix alias parsing and release fixed database #76

Closed reece closed 4 years ago

reece commented 4 years ago

Something changed last year in a way that caused the entire defline to be loaded as an alias. seqrepo releases since last June or so are affected. This affects NCBI and Ensembl. ~See comments in cli.py::load()~

This issue is a master list for work to comprehensively fix parsing. It will need to span several package and data releases.

0.5.5