superphy / spfy

Spfy: an integrated graph database for real-time prediction of Escherichia coli phenotypes and downstream comparative analyses

https://lfz.corefacility.ca/superphy/grouch/

Apache License 2.0

Hashing problem #206

Open jamez-eh opened 7 years ago

jamez-eh commented 7 years ago

Currently, Spfy names genomes based on a sorted hash of the entire contents of a file. It would be more robust to name genomes based on a hash of the sequence alone. That way, if someone renames the headers of a file before uploading it, the genome is still recognized as identical to one already stored in the database.
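
A minimal sketch of what this could look like, assuming the uploads are FASTA files (headers start with `>`). The function name `sequence_only_hash`, the choice of SHA-1, the uppercasing, and the sorting of records are all assumptions for illustration, not Spfy's actual code:

```python
import hashlib


def sequence_only_hash(fasta_text: str) -> str:
    """Hash only the sequence data of a FASTA string, ignoring header lines.

    Hypothetical helper: not part of Spfy.
    """
    sequences = []
    current = []
    for line in fasta_text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A header starts a new record; flush the previous one.
            if current:
                sequences.append("".join(current))
                current = []
        else:
            # Assumption: case and line wrapping are not significant.
            current.append(line.upper())
    if current:
        sequences.append("".join(current))

    # Sort the records so that contig order does not change the result.
    digest = hashlib.sha1()
    for seq in sorted(sequences):
        digest.update(seq.encode("utf-8"))
        digest.update(b"\n")
    return digest.hexdigest()


original = ">contig1\nACGT\nTTGA\n>contig2\nGGCC\n"
renamed = ">my_new_name_A\nACGTTTGA\n>my_new_name_B\nGGCC\n"

# Same sequences, different headers and line wrapping: same name.
print(sequence_only_hash(original) == sequence_only_hash(renamed))
```

With this approach, two files that differ only in their headers (or in how the sequence lines are wrapped) would map to the same genome name, while any change to the sequence itself would produce a different one.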