hasadna / datacity-k8s


odata as a data lake - containing all raw data #95

Open OriHoch opened 3 years ago

OriHoch commented 3 years ago

A data lake exposes all the raw, unprocessed, schemaless data (as opposed to a data warehouse, which exposes processed data with a well-defined schema).

odata should be this data lake, exposing the metadata via the CKAN API and the data via the CKAN datastore.
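A minimal sketch of what consuming such a data lake could look like from a client, using CKAN's Action API (`package_show` for dataset metadata, `datastore_search` for the raw rows). The base URL is a placeholder and the helper names (`action_url`, `call_action`, `dataset_metadata`, `raw_records`) are hypothetical, not part of this repository:

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Placeholder base URL (assumption); replace with the real CKAN instance.
CKAN_BASE_URL = "https://ckan.example.org"


def action_url(action, **params):
    """Build a CKAN Action API (v3) URL for an action and its query parameters."""
    query = urlencode(params)
    return f"{CKAN_BASE_URL}/api/3/action/{action}" + (f"?{query}" if query else "")


def call_action(action, **params):
    """Call a CKAN action over HTTP GET and return its 'result' payload."""
    with urlopen(action_url(action, **params)) as response:
        body = json.load(response)
    if not body.get("success"):
        raise RuntimeError(f"CKAN action {action} failed: {body.get('error')}")
    return body["result"]


def dataset_metadata(dataset_id):
    """Metadata side of the data lake: the dataset and its resource list."""
    return call_action("package_show", id=dataset_id)


def raw_records(resource_id, limit=100):
    """Data side of the data lake: raw rows stored in the CKAN datastore."""
    return call_action("datastore_search", resource_id=resource_id, limit=limit)["records"]
```

A client would call `dataset_metadata(...)` to discover a dataset's resources, then pass a resource id to `raw_records(...)` to read the unprocessed rows.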

This data should also be available in Redash, to allow joining data from multiple data sources.

OriHoch commented 3 years ago

implementation suggestion:

OriHoch commented 3 years ago

This could work around some of the CKAN datastore bugs and ingestion problems.