Open aadrian opened 9 years ago
We'll be taking a final pass over the examples for the next release. I'm happy to do another SQL example. What would be a good realistic use case, in your opinion?
What would be a good realistic use case, in your opinion?
As input, at least one DB schema with several tables (with relationships between them), than some non-trivial transformations, and as output another DB schema, with several tables (and relationships).
Basically that's what ETLs most of the time are used for.
I mentioned H2 since it's Java based, no setup required and it's also one of the fastest DBs (of course, not with the in-memory mode since that's not really for "production") http://www.h2database.com/
I'll have to prod you once more for a suggestion. I'm juggling a lot of things right now so every bit helps.
What would be a helpful non-trivial transformation? In my ETL experience, most of the data I was moving was copied through from point A to B. H2 sounds good though.
What would be a helpful non-trivial transformation?
Example of transformations quite often required:
In my ETL experience, most of the data I was moving was copied through from point A to B
Well, except that:
Those are quite usual scenarios that need to be done all the time in most applications I've encountered, do not involve terabyes of data, at most a few hundred gigabyte, so it's also expected to be run on a single machine.
Okay, perhaps we can try some of these. I can't address this in the short term (e.g. next 2-3 weeks), but maybe someone else will be able to pick it up before I can. If you want to give it a shot, send a PR in and we can help fix any problems. Thanks for the suggestion!
Please add a better and more realistic example using SQL (e.g. with a H2 Database) (for input and output too).
Thank you.