Closed MichaelFlec closed 3 years ago
This has to do with how Git
handles empty folders. The initialized folder structure (that I have locally, but isn't on remote yet) is:
├── data
│ ├── external <- Data from third party sources.
│ ├── interim <- Intermediate data that has been transformed.
│ ├── processed <- The final, canonical data sets for modeling.
│ └── raw <- The original, immutable data dump.
For now, these folders are empty since we do not have any datasets yet. Thus, Git
ignores them. As soon as those contain files, they will be tracked. This is why the error vanishes as soon as there is a folder structure present.
There are workarounds to this, such as using .gitkeep files, though I don't think that's necessary here. As you described, the error will resolve itself as soon as the dirs are populated with data.
When running: make data within my local rep produced the stated error below. Manually creating the foulder fixed the problem.
Try 'make_dataset.py --help' for help.
Error: Invalid value for 'INPUT_FILEPATH': Path 'data/raw' does not exist. make: *** [data] Error 2