sul-dlss-deprecated / dataOps

data operations ("dataOps") repo for issue queues & any version-controlled documentation
Traject+ : Compare to DPLA Ingest 3 as alternate ETL approaches #24

Open cmharlow opened 7 years ago

cmharlow commented 7 years ago

This comparison is part of an effort to identify, and build momentum behind, a single ETL framework within DLSS that can also serve other data engineering requests. The criteria: it can run over Spark; it supports configuration-based transforms that non-developers can write; it has a data model validation step; and it can take streaming data input.

Traject+: https://github.com/sul-dlss/dlme/tree/master/lib/traject

DPLA Ingestion 3: https://github.com/dpla/ingestion3
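
To make three of the criteria above concrete (configuration-based transforms, a data model validation step, streaming input), here is a minimal Python sketch. It does not use the API of either Traject+ or ingestion3. The field names, `CONFIG`, `REQUIRED_FIELDS`, and all function names are hypothetical and exist only for illustration, and the Spark criterion is not covered.

```python
from typing import Any, Dict, Iterable, Iterator, List, Tuple

Record = Dict[str, Any]

# Hypothetical mapping config: target field -> source field.
# A non-developer could edit this without touching the code below.
CONFIG: Dict[str, str] = {"id": "identifier", "title": "dc_title"}

# Hypothetical data model: fields every output record must carry.
REQUIRED_FIELDS: List[str] = ["id", "title"]


def transform(record: Record, config: Dict[str, str]) -> Record:
    """Copy each configured source field to its target field, if present."""
    return {
        target: record[source]
        for target, source in config.items()
        if source in record
    }


def validate(record: Record, required: List[str]) -> List[str]:
    """Return the required fields that are missing or empty."""
    return [field for field in required if not record.get(field)]


def run(
    records: Iterable[Record], config: Dict[str, str], required: List[str]
) -> Iterator[Tuple[Record, List[str]]]:
    """Lazily transform and validate records one at a time.

    Because this is a generator, the input can be any iterable,
    including a stream that is never fully held in memory.
    """
    for record in records:
        out = transform(record, config)
        yield out, validate(out, required)


# Example input: an iterator standing in for a streaming source.
source = iter([
    {"identifier": "abc", "dc_title": "Map"},
    {"identifier": "def"},
])
results = list(run(source, CONFIG, REQUIRED_FIELDS))
```

Each item in `results` pairs a transformed record with its list of missing fields, so a caller can route invalid records to a report instead of halting the run. Comparing how each candidate framework expresses these same three pieces is one way to structure the evaluation.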