Nike-Inc / koheesio

Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
https://engineering.nike.com/koheesio/
Apache License 2.0
599 stars 19 forks source link

[FEATURE] Add streaming option to FileLoader #50

Closed BrendBraeckmans closed 4 months ago

BrendBraeckmans commented 4 months ago

Is your feature request related to a problem? Please describe.

Having a streaming option for the FileLoader would be beneficial during UT's or for people that are not on Databricks and can't make use of the Databricks proprietary Autoloader.

Describe the solution you'd like

A streaming option like the DeltaTableReader has as seen here

Describe alternatives you've considered

AutoLoader is an alternative but isn't present outside of Databricks ...