datastax / cstar_perf

Apache Cassandra performance testing platform
Apache License 2.0
72 stars 34 forks source link

Refactor storage of blobs into more manageable chunks #124

Open EnigmaCurry opened 8 years ago

EnigmaCurry commented 8 years ago

Test artifacts are stored on the frontend in a single table with each artifact a single row. This has worked fine for small tests, but this will hemorrhage with larger data sets as it's being stored in a single blob.

Current schema:

CREATE TABLE test_artifacts (test_id timeuuid, artifact_type text, description text, artifact blob, PRIMARY KEY (test_id, artifact_type));

Current update path:

UPDATE test_artifacts SET description = ?, artifact = ? WHERE test_id = ? AND artifact_type = ?;

I propose that we chunk out the blobs into 10MB (configurable?) chunks. This will fix two foreseeable issues:

Necessary changes:

EnigmaCurry commented 8 years ago

Some stats as of today: