raspstephan / sugar-flower-fish-or-gravel

Repository for the Zooniverse cloud classification project.
https://www.zooniverse.org/projects/raspstephan/sugar-flower-fish-or-gravel
5 stars 4 forks source link

Clean up notebooks #5

Open raspstephan opened 5 years ago

raspstephan commented 5 years ago

Hi Hauke,

I think we should clean up the notebook structure a little. Like merge the two folders we have right now.

I briefly went through all the notebooks to see what is done in them:

In analysis:

In notebooks_SR:

I would suggest to put the current notebooks into one folder called notebooks (for now, we can later on subdivide it). All not current notebooks (like the early ML ones), I will put in a subfolder called old_notebooks.

Then we should merge some notebooks. All IoU stuff should go into one notebook. All basic analysis (i.e. everything that does not require images or ERA data) should go into one notebook. All data wrangling should go into the organize... notebook.

Utility functions that are used more than once should go into pyclouds/helpers.py. I hope that our work here will be used by others at some point (e.g. at a hackathon with the data), so I think it's great to write nice code (as you are already doing, I need to force myself to write proper docstrings).

Does all that sound good?

observingClouds commented 5 years ago

Yes, that sounds all good! I try to keep to that structure

raspstephan commented 5 years ago

I started the restructuring. Will continue tomorrow. I went through all your analysis and tried to write handy functions to ease the data wrangling. I also tried to do as much as I could with pure Pandas.

Tomorrow, I will start with some analysis on user agreement. Trying to come up with some easy-to-understand agreement metrics.