-
This should be done as a pre-processing step, part of overall ETL pipeline, such that each individual analysis does not need to do normalization
Currently done for water packages here:
https://nbv…
-
有两个pipeline(单向,mysql5.7.16-log)跑着跑着突然就挂起,报了这个错,没找到其他相关日志。。
```
pid:15 nid:1 exception:setl:com.alibaba.otter.node.etl.select.exceptions.SelectException: java.lang.NullPointerException
at com.alib…
luyee updated
2 years ago
-
Hello
I'm trying to replicate your example in my own project.
But I have an issue with python udf: always run into this error `ModuleNotFoundError: No module named 'pipelines'`
I simply changed …
-
# Description
The first version of good tables is about what good tables does best - data validation. Additionally, all the infrastructure will be in place for a useful data processing service. We …
-
Investigate whether it is possible to use SystemDS with CDAP.io or cloud data fusion instances.
https://cdap.atlassian.net/wiki/spaces/DOCS/overview
https://cloud.google.com/blog/products/data-ana…
-
### Description
In order to populate the delivery dashboard with metrics calculated based on data pulled from GitHub, we need a strategy to run the analytics pipeline created in the `analytics/` sub-…
-
## Tell us about the problem you're trying to solve
AWS Redshift is used as a datawarehouse and for powering reverse etl pipelines. It acts as a source for all of our pipelines. Since we have to send…
-
It's possible to write rudimentary tests in DAX Studio using a combinations of VAR for ExpectedValues, CalculatedValues
And then check values match using an IF equality check.
It would be good i…
-
## 🚀 Feature
Recently, we implemented `bbs_database download` for multiple different sources. It might be a good idea to extend our integration test to actually use this download (rather than using…
-
BigQuery has pretty nice support for adding descriptions to tables and columns. We aren't yet making heavy use of it, but we _are_ starting to add some descriptions into JSON schemas in mozilla-pipeli…