google / fhir-data-pipes

A collection of tools for extracting FHIR resources and analytics services on top of that data.
https://google.github.io/fhir-data-pipes/
Apache License 2.0
151 stars 84 forks source link

Update documentation to clarify feature roadmap (e.g. what is ready, what is not yet ready). #557

Open jjtswan opened 1 year ago

jjtswan commented 1 year ago

For example, I think we want to indicate that we want to enable support of single-machine deployment using FHIR search, instead of the native HAPI integration - but that it is not yet ready.

williamito commented 1 year ago

My understanding of current features:

Pipeline

Transform data from a FHIR-based data source to Parquet files or a different FHIR store.

Controller

Schedule incremental runs for a Pipeline to Parquet files. Provides a GUI to run Pipeline, see its status, and see its settings.

Single Machine Deployment

A single Docker Compose configuration to run Controller and query the Parquet files via a Spark thrift server.

Test docker images

Docker images to try out Pipelines with different sources and sinks.

Synthea HIV

Generate synthetic HIV patient data using Synthea. Upload data to HAPI, GCP, or OpenMRS. Use pre-generated test data in your own development.

DWH Query Library [deprioritized?]

Simplifies querying of FHIR-based data warehouses by providing a unified query API across Spark and BigQuery.

bashir2 commented 1 year ago

This is not a requirement for Beta launch so moving to "post-beta" milestone.