Gemma-Analytics / ewah

ELT With Airflow Helper - Classes and functions to make apache airflow life easier
MIT License
12 stars 2 forks source link

Load Sharepoint in Batches #238

Closed soltanianalytics closed 11 months ago

soltanianalytics commented 11 months ago

This code is untested right now.

The idea is to yield the sharepoint data in batches. Currently, it loads the entire workbook into memory and then on top, reads the workbook and keeps another copy of it in memory as a list of dicts. This way, at least the second part consumes far less memory.