populationgenomics / seqr

web-based analysis tool for rare disease genomics
GNU Affero General Public License v3.0
3 stars 1 forks source link

Seqr Cost Recovery #111

Open illusional opened 2 years ago

illusional commented 2 years ago

As part of recovering costs for external datasets, we need to store and determine some reporting mechanism for the different costs we accrue for processing and storing data (re @lgruen):

As part of doing this, we need to ensure that costs are counted in the correct currency, and for the correct region.

illusional commented 2 years ago

@vladsaveliev, I've assigned us in the short term, but mostly just to keep us thinking about it. I'm having a deeper look into hail batch billing costs at the moment.

illusional commented 2 years ago

Noting here that the hail batch costs are derived from the resource table, which by default stores cost for the us-east1 region (eg: populationgenomics/hail:batch/sql/insert_nonpreemptible_resources.py#L13-L14).

There are a couple of things we'll need to consider:

violetbrina commented 2 years ago

Planning document

violetbrina commented 1 year ago

Write both into a python script that dumps out a csv for loading elsewhere.