cube-js / cube

📊 Cube — The Semantic Layer for Building Data Applications
https://cube.dev
Other
17.85k stars 1.77k forks source link

difference between CUBEJS_DB_EXPORT_BUCKET, CUBESTORE_GCS_BUCKET and Local storage #6436

Open jxperf opened 1 year ago

jxperf commented 1 year ago

Hi I'm trying to understand a couple of things

My understanding is that with each cube-router/worker there are 2 levels of storage:

The documentation points to the fact that remote storage should use a remote blob storage e.g. gcs to guarantee consistency.

My questions are

ovr commented 1 year ago

Hello @jxperf,

  1. Cube uses the export bucket to transfer data from DWH to Cube Store. Technically Cube asks DWH to export data (UNLOAD), and then it asks Cube Store to import data directly from cloud storage. Cube Store uses remote storage as persistent storage.
  2. Yes.
  3. It's temporary storage for each pod.

Thanks

jxperf commented 1 year ago

Hi @ovr thanks for this! For the Local storage do you recommend setting persistent volume in the case of a pod restart or does it not really matter? if so should it be one volume per worker or should it be a shared persistent volume i.e. one which is accessed by all workers?

Thanks!!

paveltiunov commented 1 year ago

@jxperf Please see https://cube.dev/docs/caching/running-in-production#storage