cschlaffner / PROTzilla2

12 stars 0 forks source link

Unify UniProt Protein Graph Data across Runs #464

Closed antonneubauer closed 2 months ago

antonneubauer commented 3 months ago

Issue description

For every protein graph that is generated, a file needs to be downloaded from the Uniprot database. For a given protein id, this is always the same file. As these files are saved per run, the same file might be present multiple times in the user data, unnecessarily taking up space. Additionally, if the user is offline and the file was not previously downloaded in the current run, the protein graph cannot be created as the file cannot be downloaded even though, technically, all the necessary data might be present.

The goal is to save the protein files in central location (potentially /user_data/external_data/protein_graph_files/, still up for consideration) so that the above mentioned challenges are resolved.

up for debate

Acceptance Criteria