delta-io / kafka-delta-ingest

A highly efficient daemon for streaming data from Kafka into Delta Lake
Apache License 2.0
337 stars 72 forks

Run loop refactor and starting offsets feature #87

Closed xianwill closed 2 years ago

xianwill commented 2 years ago

This PR refactors the run loop structure and adds support for specifying explicit starting offsets.

Apologies, there is a lot to look at here, but it needed to be done. The lion's share of the changes is in lib.rs.

The main changes include:

In the near future, I think we may look at breaking some of lib.rs out into separate files as another cleanup pass.