pzimbrod / ML-for-PhaseField

MIT License
3 stars 1 forks source link

missing empty foulder: data/raw #2

Closed MichaelFlec closed 3 years ago

MichaelFlec commented 3 years ago

When running: make data within my local rep produced the stated error below. Manually creating the foulder fixed the problem.

Try 'make_dataset.py --help' for help.

Error: Invalid value for 'INPUT_FILEPATH': Path 'data/raw' does not exist. make: *** [data] Error 2

pzimbrod commented 3 years ago

This has to do with how Git handles empty folders. The initialized folder structure (that I have locally, but isn't on remote yet) is:

├── data
    │   ├── external       <- Data from third party sources.
    │   ├── interim        <- Intermediate data that has been transformed.
    │   ├── processed      <- The final, canonical data sets for modeling.
    │   └── raw            <- The original, immutable data dump.

For now, these folders are empty since we do not have any datasets yet. Thus, Git ignores them. As soon as those contain files, they will be tracked. This is why the error vanishes as soon as there is a folder structure present.

There are workarounds to this, such as using .gitkeep files, though I don't think that's necessary here. As you described, the error will resolve itself as soon as the dirs are populated with data.