Closed jnfrye closed 7 years ago
As of commit 9ca86c6, all code can access the data output folder via the global variables in the configuration file.
As of commit b168017, the data coordinator module is implemented in all scraping scripts, and I am almost done putting it in the cleansing scripts.
As of commit 1b8d30f, all scripts use the data coordinator to locate paths.
As of commit e86d557, the changes have been merged into master. There is some cleanup that can be done, but for now it is working so I'm going to move on to other issues.
I want the data output to be less hard-coded and more flexible.
I am thinking of having a
coordinate_data.py
file that can be imported by scripts and used to get the appropriate folders for input/output. Also, in thesetup.py
file, I want to make it so it creates a configuration file or something that specifies where to send data, with the default behavior being a folder located relative to the setup script:../data/
etc.Coordination of data output is taking place in branch data_output_configuration.