turbot / steampipe

Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.
https://steampipe.io
GNU Affero General Public License v3.0
6.79k stars 263 forks source link

Performance issue with many GCP projects #4135

Closed nirmesika closed 6 months ago

nirmesika commented 6 months ago

Describe the bug have 7 gcp orgs with combined volume of 12K (12000) projects. when trying to load the service or refresh the connections it takes over an hour for steampipe to load the schema for those projects.

Steampipe version (steampipe -v) Example: v0.21.7

To reproduce have a gcp.spc file with 12K projects.

Expected behavior loading time of a few minutes.

Additional context running on an AWS EC2 instance with 16GB ram and 4 CPU's, Any ideas on how to resolve

e-gineer commented 6 months ago

Glad you got this resolved @nirmesika ... that's an impressive number of projects!

Steampipe will create schemas for connection (project) the first time you run. Future runs will not need to do that.

Some tips to help performance here:

Please hit us up on Slack (or GitHub) if you have any questions or need any help as you scale Steampipe. Keen to learn from your experience.