Closed WilsonGregory closed 2 years ago
I now realize that all the _empty.ipynb are copies of the associated .ipynb files, from this commit: 70a61f40eee256b183f381bb66cf1fd163ae1476
It makes sense to have empty notebooks notebooks because it is "cleaner" from the perspective of other people using the code and wanting to run it for themselves. It also doesn't necessarily make sense to have git version control on the outputs of notebooks which are often going to be different because they rely on non-deterministic processes.
At the same time it makes to have the non-empty notebooks so you can see the results without having to run it yourself.
Having two copies of the same notebook is problematic because it means if I make code changes to one, I would have to make the exact same code changes to the other to keep them in sync.
I am leaning towards only having the empty versions in the github.
Great idea!
Hi @beelze-b, thank you for taking a look at the code! What do you think about the tradeoffs of having empty notebooks vs notebooks with output? I am thinking that just having the empty notebooks might be best. So I am thinking:
What do you think?
I agree just having the empty notebooks would be best. I verified that all the data is on the GCloud, so clearing the SummaryTable notebook (or any other notebook) will not cause in any loss of pass results.
I didn't know there were hooks to clear notebooks' outputs. Please go ahead and use them if you wish and it is no effort to you.
Changes
How did I test