practo / tipoca-stream

Near real time cloud native data pipeline in AWS (CDC+Sink). Hosts code for RedshiftSink. RDS to RedshiftSink Pipeline with masking and reloading support.
https://towardsdatascience.com/open-sourcing-tipoca-stream-f261cdcc3a13
Apache License 2.0
47 stars 5 forks source link

Bulk insert or UNLOAD + COPY? #60

Open alok87 opened 4 years ago

alok87 commented 4 years ago

Keep the onea which are faster for small and bulk insert use cases.. https://stackoverflow.com/questions/63942859/unload-with-data-containing-quotes-and-delimiters?noredirect=1#comment113084781_63942859 WhatsApp Image 2020-09-18 at 11 32 15 AM

alok87 commented 4 years ago

Need to benchmark(prometheus metrics) and have a flag for this.