google / fhir-data-pipes

A collection of tools for extracting FHIR resources and analytics services on top of that data.
https://google.github.io/fhir-data-pipes/
Apache License 2.0
142 stars 82 forks source link

Added queries_large.ipynb with related changes #922

Closed bashir2 closed 6 months ago

bashir2 commented 6 months ago

Description of what I changed

This adds another sample Jupyter notebook to compare Spark+Parquet and PostgresSQL+views queries. Note this is on top of #919 so only the second commit belongs to this PR.

E2E test

TESTED:

Ran the pipeline on the large dataset with SQLonFHIR-v2 view-generation enabled. Then built the custom Jupyter docker image and ran queries_large.ipynb.

Checklist: I completed these to help reviewers :)