-
Tasks:
- [ ] Land nodes ingest step
- [ ] Land edges ingest step (already scaffolded, but needs the nodes to be completed)
- [ ] Food groups ingest (file `UniqueFG1_FG2.csv` seems to include the …
-
-
Includes a Workflow Engine, an implementation of Apache Airflow, to orchestrate manifest/metadata ingestion via Storage Service
-
# Overview for the next few tasks
- Ingest all 2023 data sources relevant to the banking dashboard project
- Run the 2023 data through the 2022 data processing pipeline
- Inspect the process / re…
-
### Is it easy to find the information you need?
No
### Are the instructions clear?
No
### How could we improve the Timescale documentation site?
The page currently provides no insight how to get…
-
**Deliverable this task is associated with**
_See Deliverables tab here: _
Identifiers
doi 10.46936/fics.proj.2021.60033/60000394
JGI ITS proposal ID 508059
GOLD study ID Gs0160700
EMSL proj…
aclum updated
2 weeks ago
-
Currently we only ingest the WholeCellMask data if available.
We should also ingest the NucleusMask data as regions of interest (ROIs).
-
Sometimes candidate names that should have the suffix "Jr." leave out the ".", and when that is the case AND there is no middle name, our script reads "Jr" as the last name.
e.g. "Francis Bigaouette …
-
Blitz on the following before beta launch:
-
[We currently ingest dbt sources from CaDeT](https://github.com/ministryofjustice/data-catalogue/blob/2e59cfb6c08c2b0e8de4cafd30c5bb97b782520a/ingestion/cadet.yaml#L27)
However, these are excluded …