-
### Description
In order to populate the delivery dashboard with metrics calculated based on data pulled from GitHub, we need a strategy to run the analytics pipeline created in the `analytics/` sub-…
-
After delivering several editions of the original "Software engineering principles" course #13 we got generally good feedback 🎉 and an ample majority of beginners felt that the course was worth their…
-
### Describe the feature
Currently, we can make use of [CallAwsService](https://docs.aws.amazon.com/cdk/api/v2/docs/aws-cdk-lib.aws_stepfunctions_tasks.CallAwsService.html), however it would be nice …
-
Start with my questions:
1) I am trying to work with a minimal set of events - is this the right approach?
2) Any suggestions as to how to modify the data I am trying to shred and ingest in orde…
-
**Relevant system information:**
- OS: Debian 9
- PostgreSQL version (output of `postgres --version`): postgres (PostgreSQL) 12.4 (Debian 12.4-1.pgdg90+1)
- TimescaleDB version (output of `\dx` …
-
Hi, I try to write a row from Azure Databricks (11.3 LTS (includes Apache Spark 3.3.0, Scala 2.12) ) to Azure SQL DB using com.microsoft.azure:spark-mssql-connector_2.12:1.3.0-BETA library but unfortu…
-
Goal is to build a single big (but modular) pipeline that can combine `off-chain` and `on-chain` data related to Arbitrum ecosystem in a manner that serves multiple analytical needs at the same time.…
-
**What is the goal / desired outcome?**
Output many (100k+) files from a single pipeline. Example use case: shuffling a DB dump to sort purchases by user, when there are more than 100k users in the…
-
### Elasticsearch Version
8.9.0, tested also on 8.5 and 8.6
### Installed Plugins
_No response_
### Java Version
_bundled_
### OS Version
N/A
### Problem Description
When us…
-
The `pipeline_ml_factory` method in kedro-mlflow is a useful method to store artifacts (transformers, models) automatically (using kedro-mlflow hook). However, this method calls the method [extract_pi…