subugoe / leine

Data Pipelines for @subugoe/wag
https://subugoe.github.io/leine
MIT License

consider non-SQL MPP #11

Open maxheld83 opened 3 years ago

maxheld83 commented 3 years ago

Aside from SQL, we sometimes have more involved analyses, which we'd typically run in R. For example, we might already have some complicated regex for extracting license info coded in R. (This is not a great example, because it could perhaps be done with just SQL and custom functions in BigQuery, but still.)

For these expensive, non-SQL analyses we need an MPP (massively parallel processing) solution, ideally tightly integrated with our data warehouse.
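To make the kind of workload concrete, here is a rough sketch of the license-regex example, split into chunks and fanned out over local worker processes. It is written in Python rather than R purely for illustration, and everything in it is an assumption: the patterns, the labels, and the names `classify_license`, `classify_chunk` and `classify_all` are hypothetical and not taken from this repo. Local processes only stand in for a real MPP backend here.

```python
import re
from concurrent.futures import ProcessPoolExecutor

# Hypothetical patterns for illustration only; not the project's actual rules.
LICENSE_PATTERNS = [
    ("cc-by-nc", re.compile(r"creativecommons\.org/licenses/by-nc/\d\.\d", re.I)),
    ("cc-by", re.compile(r"creativecommons\.org/licenses/by/\d\.\d", re.I)),
    ("cc0", re.compile(r"creativecommons\.org/publicdomain/zero/", re.I)),
]


def classify_license(text):
    """Return the label of the first matching pattern, or "unknown"."""
    for label, pattern in LICENSE_PATTERNS:
        if pattern.search(text):
            return label
    return "unknown"


def classify_chunk(chunk):
    """Classify one chunk of license strings; this is the unit of parallel work."""
    return [classify_license(text) for text in chunk]


def classify_all(texts, workers=1, chunk_size=1000):
    """Split texts into chunks and classify them, preserving input order.

    With workers <= 1 the chunks are processed serially; otherwise they are
    distributed over a pool of local processes.
    """
    chunks = [texts[i:i + chunk_size] for i in range(0, len(texts), chunk_size)]
    if workers <= 1:
        results = [classify_chunk(chunk) for chunk in chunks]
    else:
        with ProcessPoolExecutor(max_workers=workers) as pool:
            results = list(pool.map(classify_chunk, chunks))
    return [label for chunk in results for label in chunk]


if __name__ == "__main__":
    sample = [
        "https://creativecommons.org/licenses/by/4.0/",
        "http://creativecommons.org/licenses/by-nc/3.0/",
        "all rights reserved",
    ]
    print(classify_all(sample, workers=2, chunk_size=2))
```

The point of the sketch is only the shape of the problem: an arbitrary per-record function that is awkward to express in SQL, applied independently to chunks of rows. Whatever MPP solution we pick would replace the process pool, ideally reading from and writing back to the warehouse directly.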

We might have several MPP needs: