luispedro / waldo

Waldo Project
MIT License

Data Policy #2

Closed magsol closed 14 years ago

magsol commented 14 years ago

The protein locator needs a data policy, and must adhere to it, for how it accesses database files on the filesystem. Whether the databases live in a single folder or in a separate repository, the code and data must stay separate while remaining accessible to one another from centralized locations.

magsol commented 14 years ago

I'm unsure exactly what is meant by "having a lone script that updates the files." Does this mean there is a script/cronjob that periodically re-downloads each database file in its entirety? The UniProt file, for example, is 4.5 GB.

magsol commented 14 years ago

A new "settings.py" file will store the paths to all the flat data source files, and the rest of the application will read the paths from it. The folder structure still needs to be tweaked, but this is the data source policy.
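
For illustration, here is a minimal sketch of what such a "settings.py" could look like. It is not the project's actual file: the `WALDO_DATA_DIR` environment variable, the folder layout, the file names, and the helpers `data_path` and `missing_sources` are all assumptions made up for this example.

```python
"""settings.py: one central place for data source paths (hypothetical sketch)."""
import os
from pathlib import Path

# Root folder holding all flat data files, kept separate from the code.
# WALDO_DATA_DIR is a hypothetical environment variable; "data" is a
# placeholder default, not the project's real layout.
DATA_DIR = Path(os.environ.get("WALDO_DATA_DIR", "data")).resolve()

# Map each data source to its flat file. The keys and file names here are
# placeholders chosen for the example.
DATA_SOURCES = {
    "uniprot": DATA_DIR / "uniprot" / "uniprot.dat",
    "example_db": DATA_DIR / "example_db" / "example.txt",
}


def data_path(name):
    """Return the configured path for a data source name."""
    try:
        return DATA_SOURCES[name]
    except KeyError:
        raise KeyError(f"unknown data source: {name!r}") from None


def missing_sources():
    """Return the sorted names of data sources whose file is not on disk."""
    return sorted(name for name, path in DATA_SOURCES.items() if not path.exists())
```

With something like this in place, the rest of the application would call `data_path("uniprot")` instead of hard-coding a location, so a change to the folder structure only touches this one file.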