-
# Data-oblivious Programming
A data-oblivious program is one that decouples data input from program execution. Such programs exhibit control-flow and memory access patterns that are independent of …
-
Load data into a local postgres instance manually from the parquet files. Create the DBT transformation steps.
-
## Terminology
- **data sources**: a tabular dataset with arbitrary schema (column
names & types) in a local file; could be CSV, Excel, Parquet,
JSON(LD), or a database
- **transformation**:…
-
# Bug
## Describe the Bug
ReportStream's ORM to FHIR transformation is now losing data from a variety of fields, including:
- OBX-3
- OBX-6
- OBX-5-3
- OBR-4
- OBR-7
- ORC-21
- SPM-4
- S…
-
### Description
Currently there are two ways we can handle transformations performed by ray data:
1. Materialize ahead of time, then all downstream actions (multiple training epochs) will use the ca…
-
We want to enable users to add/change/remove columns on the way in. Now that we support arrow as our data format we might be able to leverage https://arrow.apache.org/datafusion/ to make those kind of…
-
### Bug Description
In the `run()` method (line 542 in llama_index.core.ingestion.**pipeline.py**) the parameter show_progress is passed to the `run_transformation()` method, but this method doesn'…
-
### Version
5.0.2
### Steps to reproduce
i cannot find a API to get the result after dataset has transformed.
### What is expected?
A API or another solution to get the transformed data.
#…
-
We need to refactor the `Model` class to store pointers to `Node` and (potentially) `Constraint` objects -- currently, we are holding copies to such objects which makes transformations difficult to ke…
-
Originally submitted by @kindly on ckan/ckan#1394
## Purpose
To give CKAN basic data transformation abilities. These transformations will happen to data within the CKAN datastore.
This will initi…