A data pipeline orchestration library for rapid iterative development with automatic cache invalidation allowing users to focus writing their tasks in pandas, polars, sqlalchemy, ibis, and alike.
In general, we can expand testing for deferred table store ops. We test with real project code. But this should also be tested with unit tests in this repo:
def _trigger_deferred_table_store_ops(self, stage_id: int):
if (
stage_id in self._deferred_table_store_ops
and len(self._deferred_table_store_ops[stage_id]) > 0
):
if len(self._deferred_table_store_op_threads.get(stage_id, {})) == 0:
self._deferred_table_store_op_threads[stage_id] = {}
started_ops_end = 0
else:
started_ops_end = max(
self._deferred_table_store_op_threads[stage_id].keys()
) + 1
vs.
def _trigger_deferred_table_store_ops(self, stage_id: int):
if (
stage_id in self._deferred_table_store_ops
and len(self._deferred_table_store_ops[stage_id]) > 0
):
if len(self._deferred_table_store_op_threads.get(stage_id, {})) == 0:
self._deferred_table_store_op_threads[stage_id] = {}
started_ops_end = 0
else:
started_ops_end = max(
self._deferred_table_store_op_threads[stage_id].keys()
)
In general, we can expand testing for deferred table store ops. We test with real project code. But this should also be tested with unit tests in this repo:
vs.