Open astrojuanlu opened 3 months ago
I'm wondering, does this "100 tasks per job" limitation depend on the Databricks config/cluster? Or is it universal?
My hunch is 1 task is 1 container - the same reason you shouldn't need 28 containers to run spaceflights
Think you're right. As far as I can tell, it's universal. But I haven't really found any mention of it outside of this issue from 2021: https://community.databricks.com/t5/data-engineering/how-many-jobs-can-i-create-in-my-databricks-workspace/td-p/18111
https://linen-slack.kedro.org/t/22732083/announcing-kedro-databricks-https-github-com-jenspederm-kedr#0dffcc72-1ac6-4c4f-9bb0-47a654643cd0
A possible solution would be to group tasks, like kedro-airflow does since https://github.com/kedro-org/kedro-plugins/pull/241. cc @ankatiyar, @sbrugman
There may be other possible solutions too.
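To make the grouping idea concrete, here is a rough sketch of one way it could work: merge nodes that exchange data only through in-memory datasets into a single task, so the job ends up with fewer tasks than the pipeline has nodes. This is not taken from kedro-airflow or kedro-databricks. `group_nodes`, the plain-dict pipeline description, and the choice of which datasets count as in-memory are all assumptions made for illustration.

```python
from collections import defaultdict


def group_nodes(nodes, memory_datasets):
    """Group nodes that share an in-memory dataset into one task.

    nodes: dict mapping node name -> (inputs, outputs), both lists of dataset names.
    memory_datasets: set of dataset names that are NOT persisted (hypothetical input).
    Returns a sorted list of groups, each a sorted list of node names.
    """
    # Union-find over node names.
    parent = {name: name for name in nodes}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    # Collect, per in-memory dataset, every node that reads or writes it.
    touching = defaultdict(list)
    for name, (inputs, outputs) in nodes.items():
        for dataset in set(inputs) | set(outputs):
            if dataset in memory_datasets:
                touching[dataset].append(name)

    # Nodes touching the same in-memory dataset must run in the same task.
    for names in touching.values():
        for other in names[1:]:
            union(names[0], other)

    groups = defaultdict(list)
    for name in nodes:
        groups[find(name)].append(name)
    return sorted(sorted(group) for group in groups.values())


# Illustrative pipeline loosely modelled on spaceflights; treating X_train and
# y_train as in-memory is an assumption, not the starter's actual catalog.
nodes = {
    "preprocess_companies": (["companies"], ["preprocessed_companies"]),
    "preprocess_shuttles": (["shuttles"], ["preprocessed_shuttles"]),
    "create_model_input_table": (
        ["preprocessed_companies", "preprocessed_shuttles", "reviews"],
        ["model_input_table"],
    ),
    "split_data": (["model_input_table"], ["X_train", "y_train"]),
    "train_model": (["X_train", "y_train"], ["regressor"]),
}
groups = group_nodes(nodes, memory_datasets={"X_train", "y_train"})
print(len(groups), groups)
```

Here five nodes collapse into four tasks, because `split_data` and `train_model` only communicate through in-memory data. A real implementation would also have to check that merging does not create a cycle between groups, and might need a second pass (for example grouping by namespace) if the result is still over the limit.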