Isabel-Gan / quantifying-notebook-features

Python scripts to detect and quantify features in Jupyter notebooks
2 stars 0 forks source link

gathering commits for notebook file #16

Closed Isabel-Gan closed 4 years ago

Isabel-Gan commented 4 years ago

not detecting certain commits, see nb 116896 and nb 295335

also, didn't have time to run on all testing notebooks (api response too slow), so run another full test after this fix

Isabel-Gan commented 4 years ago

script is hanging on nb 502567

Isabel-Gan commented 4 years ago

for 116896 and 295335, seems that the commit is too large to be loaded in by the API request; for example for 116896, see the actual commit page: https://github.com/mutaihillary/Mytodo_app/commit/4dec8e39632565fb774297e66e551c955c929da5#diff-01f0c1b97eba2b55bc67f1586aa51c40, but at the API link, https://api.github.com/repos/mutaihillary/Mytodo_app/commits/4dec8e39632565fb774297e66e551c955c929da5, I wrote code to print out the filenames and count, and it only loaded in 300 files

Isabel-Gan commented 4 years ago

got help from Shurui: https://stackoverflow.com/questions/15831313/is-it-possible-to-get-commits-history-for-one-file-in-github-api, fixed by https://github.com/Isabel-Gan/quantifying-notebook-features/commit/541359b14348a16feb93647133a37833aa810eb8

Isabel-Gan commented 4 years ago

not catching the case where the notebook file has been renamed, see nb 1102235, https://github.com/amoudgl/gsoc-2016/blob/master/notebooks/c%2B%2B/Autoencoder_Multiclass.ipynb, should have one commit; if you go to the commit api link (https://api.github.com/repos/amoudgl/gsoc-2016/commits/8499dc3d1ddae5b8ff99b8eec74e9bd8a99f256b), the notebook was only renamed, and when you try to get the commits pertaining to the filename (https://api.github.com/repos/amoudgl/gsoc-2016/commits?path=notebooks/c++/notebooks/c++/Autoencoder_Multiclass.ipynb), nothing shows up