subugoe / leine

Data Pipelines for @subugoe/wag
https://subugoe.github.io/leine
MIT License

consider ELT with BigQuery as pseudo-schemaless db #1

Open maxheld83 opened 3 years ago

maxheld83 commented 3 years ago

These approaches might work for us. I still need to do more research; for now this is just a place to keep the links:

maxheld83 commented 3 years ago

this is also what (some) big nerds apparently do: https://github.com/mozilla/gcp-ingestion/issues/423

maxheld83 commented 3 years ago

this is a good summary of the case for ELT https://www.stitchdata.com/resources/what-is-elt/
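
To make the ELT idea concrete, here is a minimal sketch of the pattern: load records untouched as JSON strings first, then derive typed columns inside the database. It rests on loud assumptions: an in-memory SQLite database stands in for BigQuery, the records and the names `RAW_RECORDS`, `json_get`, `load_raw`, `transform` and `run_elt` are hypothetical, and nothing here reflects how leine is actually built. In BigQuery the transform step would presumably use its own JSON functions (such as `JSON_EXTRACT_SCALAR`) rather than the helper registered below.

```python
import json
import sqlite3

# Hypothetical raw records, e.g. as they might arrive from an upstream API.
RAW_RECORDS = [
    {"doi": "10.1000/a", "title": "First", "year": 2019},
    {"doi": "10.1000/b", "title": "Second"},  # no "year": the raw table does not care
]


def json_get(doc, key):
    """Return a top-level field of a JSON document, or None if it is absent."""
    return json.loads(doc).get(key)


def load_raw(conn, records):
    """Extract + Load: store every record unchanged as one JSON string."""
    conn.execute("CREATE TABLE raw (payload TEXT)")
    conn.executemany(
        "INSERT INTO raw VALUES (?)",
        [(json.dumps(record),) for record in records],
    )


def transform(conn):
    """Transform: derive a table with named columns from the raw payloads, in SQL."""
    conn.execute(
        "CREATE TABLE works AS "
        "SELECT json_get(payload, 'doi') AS doi, "
        "       json_get(payload, 'year') AS year "
        "FROM raw"
    )
    return conn.execute("SELECT doi, year FROM works ORDER BY doi").fetchall()


def run_elt(records):
    """Run load and transform against an in-memory database (BigQuery stand-in)."""
    conn = sqlite3.connect(":memory:")
    # Register the Python helper as a SQL function so the transform stays in SQL.
    conn.create_function("json_get", 2, json_get)
    load_raw(conn, records)
    rows = transform(conn)
    conn.close()
    return rows


if __name__ == "__main__":
    for row in run_elt(RAW_RECORDS):
        print(row)
```

The point of the design is that the load step never rejects a record for having missing or extra fields; only the transform step decides which fields matter, and it can be rewritten and rerun later without re-ingesting anything.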

maxheld83 commented 3 years ago

This appears to have been implemented here:

and schema here: https://github.com/The-Academic-Observatory/observatory-platform/blob/6f4a67b6119492d4ebc55e89be1c4a73c0be833c/observatory-dags/observatory/dags/database/schema/crossref_metadata_2020-09-01.json