superphy / spfy

Spfy: an integrated graph database for real-time prediction of Escherichia coli phenotypes and downstream comparative analyses
https://lfz.corefacility.ca/superphy/grouch/
Apache License 2.0

Hashing problem #206

Open jamez-eh opened 7 years ago

jamez-eh commented 7 years ago

Currently, Spfy names genomes based on a sorted hash of the entire contents of a file. It would be more durable to name genomes based on a hash of the sequence alone, in case anyone renames the headers of a file before uploading a genome that is otherwise identical to one already stored in the database.
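A minimal sketch of what sequence-only hashing could look like, not Spfy's actual code. The function name `sequence_hash` is hypothetical, and the choice of SHA-1, the uppercasing of bases, and the sorting of contigs before hashing are assumptions made for illustration. The idea is to drop every FASTA header line and hash only the sequence data, so that renaming headers (or reordering contigs) does not change the resulting name.

```python
import hashlib


def sequence_hash(fasta_text: str) -> str:
    """Hash only the sequence content of a FASTA file (hypothetical helper).

    Header lines (starting with '>') are ignored, so renaming them
    does not change the digest.
    """
    contigs = []   # one joined sequence string per FASTA record
    current = []   # sequence lines of the record being read

    for line in fasta_text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A header starts a new record; flush the previous one.
            if current:
                contigs.append("".join(current))
                current = []
        else:
            # Assumption: case is not meaningful, so normalize to uppercase.
            current.append(line.upper())
    if current:
        contigs.append("".join(current))

    # Assumption: contig order should not matter, so sort before hashing.
    digest = hashlib.sha1()
    for seq in sorted(contigs):
        digest.update(seq.encode("utf-8"))
        digest.update(b"\n")  # separator so contig boundaries stay distinct
    return digest.hexdigest()


original = ">contig1 original\nACGT\nTTGA\n>contig2\nGGCC\n"
renamed = ">renamed_B\nGGCC\n>renamed_A\nacgtttga\n"
different = ">contig1\nACGA\n"

# Same sequences with different headers, line wrapping, and order
# give the same name; a different sequence gives a different name.
print(sequence_hash(original) == sequence_hash(renamed))
print(sequence_hash(original) == sequence_hash(different))
```

Note that line wrapping is also normalized here, since the sequence lines of each record are joined before hashing. Whether that, or the case normalization, is desirable for Spfy is an open design question.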