datamade / wopr-data

Deprecated: Scripts for creating and updating datasets for plenario
MIT License
2 stars 2 forks source link

Parameters for our data warehouse #12

Open svetlozarn opened 10 years ago

svetlozarn commented 10 years ago

Finally got response to our request for a postgres DB: Here are my estimates for the parameters of our data warehouse. Please, feel free to comment and adjust them:

Hi Svetlozar,

We’ll need you to answer a few questions in order to properly size the new database virtual machine. I’m sure we’ll have more questions but to get started do you happen to know the following?

Size of the database? estimated number of remote connections? estimated peak number of transactions per second?

My response, for now:

I expect that the size of the database would initially be between 10-50GB depending on indexing and materialized views. We expect the size to grow to a few hundred GB by the summer.

During the development period, the simultaneous remote connections should be less than 50. In the future, the number may increase quite a bit but we will discuss it with the RCC team first.

We will use the database for data warehousing and analytical purposes, so we don't expect many transactions but rather longer lasting queries and updates.