spotify / luigi

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Apache License 2.0

A task fails if the subsequent task is already completed #2997

Open KansaiUser opened 4 years ago

KansaiUser commented 4 years ago

I am just starting to use and understand Luigi, and I would appreciate some help here.

I have a pipeline built with Luigi in which some tasks require other tasks, and each task creates a file. Something like:

TaskA-------->TaskB---------> TaskC-------->TaskD
(fileA)       (fileB)         (fileC)       (fileD)

Yes, fileB gets recreated, but TaskC fails with an error message saying that fileC already exists.

Is there a way for the subsequent files to be recreated, or for fileC to be overwritten?
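
One possible direction, sketched here under assumptions rather than taken from the Luigi codebase: by default Luigi treats a task as complete when all of its outputs exist, so a downstream file that is older than its regenerated input still counts as done. The two helpers below, `output_is_fresh` and `write_output`, are hypothetical names and not part of Luigi. The first compares modification times so that a stale fileC can be detected, and the second overwrites an existing file instead of failing when it is already there. The sketch assumes plain local files.

```python
import os


def output_is_fresh(output_path, input_paths):
    """Return True if output_path exists and is at least as new as every input.

    Hypothetical helper (not a Luigi API). A missing output or a missing
    input both count as "not fresh".
    """
    if not os.path.exists(output_path):
        return False
    out_mtime = os.path.getmtime(output_path)
    return all(
        os.path.exists(path) and os.path.getmtime(path) <= out_mtime
        for path in input_paths
    )


def write_output(output_path, text):
    """Overwrite output_path by writing a temporary file and renaming it.

    Hypothetical helper (not a Luigi API). os.replace overwrites an
    existing destination, so an old fileC does not cause an error.
    """
    tmp_path = output_path + ".tmp"
    with open(tmp_path, "w") as handle:
        handle.write(text)
    os.replace(tmp_path, output_path)
```

In a task such as TaskC, this could be wired in by overriding `complete()` to return `output_is_fresh(self.output().path, [self.input().path])` and by calling `write_output(self.output().path, ...)` from `run()`, assuming a single required task and `luigi.LocalTarget` outputs. As far as I understand, Luigi only looks at the requirements of a task that reports itself incomplete, so every task in the chain would likely need the same override for a regenerated fileB to propagate downstream.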

stale[bot] commented 2 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. If closed, you may revisit when your time allows and reopen! Thank you for your contributions.