Open soumilshah1995 opened 3 months ago
I am assuming the answer would be to set up a MultiTable Delta Streamer for each topic. I want to make sure this is the correct approach, or whether there are any better recommendations.
@soumilshah1995 Using MultiTable Delta Streamer would be the best available option for now.
Wouldn't the streamer push each topic to a different target path? What if you want to build one table in which hive.public.sales_part_0, hive.public.sales_part_1, hive.public.sales_part_2, and hive.public.sales_part_3 each become a partition?
Kind of like Notion's architecture:
https://www.notion.so/blog/building-and-scaling-notions-data-lake
How do we achieve something like this? Any pointers or suggestions?
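To make the "one table, many topics" idea concrete, here is a minimal sketch of the mapping being asked about: every `<table>_part_<n>` topic is folded back into its parent table, and the numeric suffix is kept as a partition value. This is plain Python and not a Hudi or Debezium API; the `_part_<n>` naming pattern and the helper names `logical_table` and `group_topics` are assumptions taken from the topic names in this thread.

```python
import re
from collections import defaultdict
from typing import Dict, List, Optional, Tuple

# Assumed naming convention: "<table>_part_<n>" is partition n of "<table>".
PART_SUFFIX = re.compile(r"^(?P<table>.+)_part_(?P<part>\d+)$")


def logical_table(topic: str) -> Tuple[str, Optional[int]]:
    """Return (parent table, partition number) for a topic name.

    Topics without a "_part_<n>" suffix map to themselves with no partition.
    """
    match = PART_SUFFIX.match(topic)
    if match is None:
        return topic, None
    return match.group("table"), int(match.group("part"))


def group_topics(topics: List[str]) -> Dict[str, List[Tuple[Optional[int], str]]]:
    """Group topic names under the single logical table they belong to."""
    groups: Dict[str, List[Tuple[Optional[int], str]]] = defaultdict(list)
    for topic in topics:
        table, part = logical_table(topic)
        groups[table].append((part, topic))
    return dict(groups)


topics = [
    "hive.public.sales",
    "hive.public.sales_part_0",
    "hive.public.sales_part_1",
    "hive.public.sales_part_2",
    "hive.public.sales_part_3",
]

for table, members in group_topics(topics).items():
    print(table, members)
```

All five topics end up under the single key `hive.public.sales`, which is the shape a single target table with one partition per source topic would need. How to wire this into the actual ingestion job is still the open question here.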
I got this question answered and I will be trying this out
I don't think the approach mentioned on the community sync will solve the issue. I will attend the next sync and ask my further questions.
Hi everyone,
I’m working with a PostgreSQL table that uses a hash-based partitioning strategy. Here is the setup:
I have set up a Debezium connector with the following configuration:
This configuration creates the following topics:
hive.public.sales
hive.public.sales_part_0
hive.public.sales_part_1
hive.public.sales_part_2
hive.public.sales_part_3

I am looking for recommendations on how to set up DeltaStreamer jobs for each partition. What is the best approach to handle this setup effectively?
Thank you!