SwissDataScienceCenter / renku-python

A Python library for the Renku collaborative data science platform.
https://renku-python.readthedocs.io/
Apache License 2.0

Cannot output to a non-empty directory under renku run #1178

Closed ciyer closed 3 years ago

ciyer commented 4 years ago

Describe the bug

I have the following folder setup:

data/my-dataset/file1.txt
data/my-dataset/file2.txt

And I have a script script.py that takes data/my-dataset as an input and generates file1.txt, leaving file2.txt untouched.
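For illustration only, a minimal sketch of a script with this behavior. The real script.py lives in the linked project and is not shown in this issue, so the function name `regenerate_file1` and the file contents written here are assumptions.

```python
import sys
from pathlib import Path


def regenerate_file1(dataset_dir: Path) -> Path:
    """Rewrite file1.txt inside dataset_dir and touch nothing else.

    Hypothetical stand-in for the script described in this issue.
    """
    target = dataset_dir / "file1.txt"
    # Only file1.txt is opened for writing; file2.txt is never read or modified.
    target.write_text("regenerated contents\n")
    return target


# Invoked as: script.py data/my-dataset
if __name__ == "__main__" and len(sys.argv) == 2:
    regenerate_file1(Path(sys.argv[1]))
```

After a run, data/my-dataset still contains both files, and only file1.txt has changed.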

When I execute renku run script.py data/my-dataset, I get the following error:

Error: The output directory "data/my-dataset" is not empty. 

Delete existing files before running the command:
  (use "git rm <file>..." to remove them first)

        data/my-dataset/file2.txt

Once you have removed files that should be used as outputs,
you can safely rerun the previous command.

Specifying the output explicitly, either with --output or via .renku/tmp/outputs.txt, does not fix the problem. The repo is in a clean state, and the script does not touch data/my-dataset/file2.txt.

Link to project

https://renkulab.io/projects/covid-19/covid-19-public-data

To Reproduce

See description above.

Expected behavior


Panaetius commented 3 years ago

This shouldn't happen anymore if the directory is specified as an explicit output.