kreuzwerker / kreuzlaker


Move scoofy data generation into this repo #24

Open jankatins opened 1 year ago

jankatins commented 1 year ago

Currently, the scoofy data generation happens outside of this project (basically a Lambda function plus an S3 bucket) and needs a cross-account setup to copy the data into the data lake. It makes sense to write the generated data directly into the raw data lake bucket, similar to what the copy job currently does. This would also mirror what we expect to happen in the CDC-from-RDBMS case (#7).
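
A rough sketch of what the in-repo generator could look like, to make the idea concrete. Everything specific here is an assumption for illustration: the record fields, the `raw/scoofy/journeys` prefix, the `RAW_BUCKET_NAME` environment variable, and the key layout are hypothetical and not taken from the existing lambda. The only real API used is boto3's `put_object`.

```python
import json
import random
from datetime import datetime, timezone
from typing import Optional


def build_raw_key(prefix: str, now: datetime) -> str:
    """Build a date-partitioned object key below the given prefix (hypothetical layout)."""
    return f"{prefix}/{now:%Y/%m/%d}/journeys-{now:%H%M%S}.json"


def generate_records(count: int, seed: Optional[int] = None) -> list:
    """Generate fake records; the field names are placeholders, not the real scoofy schema."""
    rng = random.Random(seed)
    return [
        {"scooter_id": rng.randint(1, 100), "distance_km": round(rng.uniform(0.1, 15.0), 2)}
        for _ in range(count)
    ]


def write_batch(s3_client, bucket: str, prefix: str, count: int,
                now: Optional[datetime] = None) -> str:
    """Write one batch as newline-delimited JSON straight into the raw bucket."""
    now = now or datetime.now(timezone.utc)
    key = build_raw_key(prefix, now)
    body = "\n".join(json.dumps(record) for record in generate_records(count))
    s3_client.put_object(Bucket=bucket, Key=key, Body=body.encode("utf-8"))
    return key


def handler(event, context):
    """Lambda entry point; imports are local so the helpers stay testable without AWS."""
    import os

    import boto3  # shipped with the AWS Lambda Python runtime

    bucket = os.environ["RAW_BUCKET_NAME"]  # assumed to be set by the stack
    key = write_batch(boto3.client("s3"), bucket, "raw/scoofy/journeys", 100)
    return {"key": key}
```

Since the lambda would live in the same account as the raw bucket, it only needs write access to its own prefix, and the cross-account copy step goes away.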

Tasks:

DoD:

(see also https://github.com/kreuzwerker/xw-data-toolkit/issues/21 for adjusting the current setup in multiple ways)