-
## Context
Once we are done completing the creation of the ETL pipeline to download, filter, and parse papers from the various sources (see #562), we need to run this pipeline for the first time to e…
-
currently failures are logged by timestamp and a failure might be hidden in between lots and lots of output
to make it easier to see what the problem was it would be nice to collect all errors and p…
-
Extend the Flask API to read from a .jsonl file of scraped Officers data and insert it into the Postgres DB.
**Is your feature request related to a problem? Please describe.**
As part of our data …
-
See [this execution](https://demo.etl.linkedpipes.com/#/pipelines/edit/canvas?pipeline=https:%2F%2Fdemo.etl.linkedpipes.com%2Fresources%2Fpipelines%2F1597218835622&execution=https:%2F%2Fdemo.etl.linke…
-
See [this execution](https://demo.etl.linkedpipes.com/#/pipelines/edit/canvas?pipeline=https://demo.etl.linkedpipes.com/resources/pipelines/1563878656717&execution=https://demo.etl.linkedpipes.com/res…
-
Line 188 in etl_pipeline (https://github.com/coursera/dataduct/blob/develop/dataduct/etl/etl_pipeline.py) passes variable "load_min" as minute component of specified schedule time from YAML file. Howe…
-
### Describe the feature
Incorporate the `ChunkedFileReader` into the `FileClient`, as opposed to `ChunkedFileReader::next_chunk` returning a new `FileClient`. Instead, a new chunk is read upon cal…
-
## Description:
It would be extremely helpful to have the ability to save and restore sessions in Data Wrangler. This feature would allow users to:
- Save the current state of data transformatio…
-
I keep getting `java.util.NoSuchElementException: No value present` error when trying to to run my etl pipeline
the lineage information is not sent to my openmetadata instance, find code snippet fo…
-
The data-processing cluster in mlab-sandbox & mlab-staging is in us-east, while the archive-measurement-lab bucket is in us-central1. These clusters should be redeployed to us-central, and their outp…