Nike-Inc / koheesio

Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
https://engineering.nike.com/koheesio/
Apache License 2.0
599 stars, 19 forks

Add streaming option to FileLoader. #51

Closed: BrendBraeckmans closed this 4 months ago

BrendBraeckmans commented 4 months ago

Description

Add streaming option to FileLoader. Default behaviour is batch.
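
As a rough illustration of what such a switch could look like, here is a minimal sketch, not the code from this PR. The class name `FileReaderSketch` and its fields (`path`, `file_format`, `streaming`, `schema`, `options`) are hypothetical and chosen only for this example. It assumes a PySpark `SparkSession` is passed in, and relies on `spark.read` (batch) and `spark.readStream` (streaming) exposing the same `format` / `schema` / `options` / `load` chain.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional


@dataclass
class FileReaderSketch:
    """Hypothetical stand-in for a file reader; NOT koheesio's actual FileLoader."""

    path: str
    file_format: str = "parquet"
    streaming: bool = False  # batch stays the default, as the description states
    schema: Optional[Any] = None  # e.g. a StructType or a DDL string
    options: Dict[str, str] = field(default_factory=dict)

    def read(self, spark: Any) -> Any:
        """Return a DataFrame read from `path`, in batch or streaming mode."""
        # spark.read is a DataFrameReader; spark.readStream is a DataStreamReader.
        reader = spark.readStream if self.streaming else spark.read
        reader = reader.format(self.file_format)
        if self.schema is not None:
            # Streaming file sources generally need an explicit schema
            # unless schema inference has been enabled in the Spark config.
            reader = reader.schema(self.schema)
        return reader.options(**self.options).load(self.path)
```

With a real session, `FileReaderSketch("/some/dir", "json", streaming=True, schema="id INT").read(spark)` would be expected to return a streaming DataFrame, while leaving `streaming` at its default would return a batch one. The path and schema here are placeholders.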

Related Issue

Issue #50

Motivation and Context

A streaming option for the FileLoader would be useful in unit tests, and for users who are not on Databricks and therefore cannot use Databricks' proprietary Auto Loader.
