splitgraph / seafowl

Analytical database for data-driven Web applications 🪶
https://seafowl.io
Apache License 2.0
386 stars 9 forks source link

Data replication interface #527

Closed gruuya closed 3 weeks ago

gruuya commented 1 month ago

What

This PR introduces a flight-compatible protocol for syncing record batches from a remote source.

How

It facilitates the arrow flight do_put call to upload record batches representing data changes in a remote system. It also passes a command containing the metadata about the action.

These are in turn stored in a cache, which for a given lag and size based criteria will flush the batches from memory to object storage.

TODOs (here or in follow-on PRs)