stackabletech / stackablectl

Commandline tool to interact with a Stackable Data Platform
Other
8 stars 1 forks source link

[Merged by Bors] - Add demo data-warehouse-iceberg-trino-spark #144

Closed sbernauer closed 1 year ago

sbernauer commented 1 year ago

Description

Needs a larger k8s cluster! I use IONOS k8s with 9x 4 cores (8 threads), 20GB ram and 30GB hdd disk Maybe we can also offer a smaller variant later on.

Otherwise business as usual. From feature-branch run stackablectl --additional-stacks-file stacks/stacks-v1.yaml --additional-releases-file releases.yaml --additional-demos-file demos/demos-v1.yaml demo install data-warehouse-iceberg-trino-spark

I'm not happy with some parts but i think an iterative approach is best:

To get to the Spark UI kubectl port-forward $(kubectl get pod -o name | grep 'spark-ingest-into-warehouse-.*-driver') 4040

Review Checklist

Once the review is done, comment bors r+ (or bors merge) to merge. Further information

sbernauer commented 1 year ago

It kept running over the weekend. We are now at

|files|total_size     |avg_size   |total_records |avg_records|smallest_file|
|-----|---------------|-----------|--------------|-----------|-------------|
|298  |150,318,197,593|504,423,481|36,430,087,583|122,248,616|7,101,325    |
sbernauer commented 1 year ago

@maltesander could you please do another round reviewing the docs?

sbernauer commented 1 year ago

Many many thanks!

sbernauer commented 1 year ago

bors r+

bors[bot] commented 1 year ago

Pull request successfully merged into main.

Build succeeded: