neondatabase / neon

Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.
https://neon.tech
Apache License 2.0
15.15k stars 442 forks source link

Epic: set up infra for periodic perf tests with bigger databases #2016

Closed kelvich closed 3 weeks ago

kelvich commented 2 years ago

Right now we have some infrastructure to run periodic tests, but it needs more love. For some amount of time safekeepers were not capable of automatically deleting WAL, so we avoided big databases as they were causing disk overflow on staging. Now safekeepers can clean their WAL so let's start doing more realistic tests.

As result we want to have the following:

Approximate steps:

bayandin commented 2 years ago

https://github.com/neondatabase/neon/issues/2221 is required

stepashka commented 2 years ago

during the meeting today @hlinnaka mentioned that we're ready to increase the scale up more towards the 300GB, e.g. 50GB first?

stepashka commented 2 years ago

is this still blocked by neondatabase/neon#2221 ?

bayandin commented 2 years ago

during the meeting today @hlinnaka mentioned that we're ready to increase the scale up more towards the 300GB, e.g. 50GB first?

I've experemented with pgbench -i, setting up a 50 GB (scale 3424) database took ~ 1h (with enabled prefetch):

------------------------------ Benchmark results -------------------------------
test_pgbench_remote_init[3600-3424].scale: 3424 
test_pgbench_remote_init[3600-3424].init.start_timestamp: 1665488570 
test_pgbench_remote_init[3600-3424].init.end_timestamp: 1665492349 
test_pgbench_remote_init[3600-3424].init.duration: 3,778.830 s
test_pgbench_remote_init[3600-3424].init.drop_tables: 0.030 s
test_pgbench_remote_init[3600-3424].init.create_tables: 0.130 s
test_pgbench_remote_init[3600-3424].init.client_side_generate: 1,419.680 s
test_pgbench_remote_init[3600-3424].init.vacuum: 1,607.980 s
test_pgbench_remote_init[3600-3424].init.primary_keys: 750.590 s

https://github.com/neondatabase/neon/actions/runs/3226525705/jobs/5280150287

A couple of clarifying questions:

@hlinnaka what do you think?

is this still blocked by https://github.com/neondatabase/neon/issues/2221 ?

Now with creating projects via API it doesn't really blocked by it. I believe with increasing db size we could be blocked by https://github.com/neondatabase/cloud/issues/1872

abhishektvz commented 1 year ago

Hey folks, we were trying to deploy neon tech for 10 TB+ data would we need 10 TB storage on the page server ?

stepashka commented 1 year ago

this is is Alexander's backlog, still in progress

ololobus commented 4 months ago

@bayandin should we just close it and open another issue if something else is needed?

mickael-carl commented 3 months ago

Removing from the infra board as I don't think there is anything for us to do at this time. If I'm mistaken I'd just ask for a concrete action item description so we can follow up on it 🙂

bayandin commented 3 weeks ago

We're tracking multi-TB databases in a separate issue — https://github.com/neondatabase/cloud/issues/14727 This can be closed