Closed ilkarman closed 4 years ago
Hi @ilkarman, our code saves the surface values by the rank 0 process in default after collecting values calculated by multiple processes, as you can see in https://github.com/tomgoldstein/loss-landscape/blob/master/plot_surface.py#L88 and https://github.com/tomgoldstein/loss-landscape/blob/master/plot_surface.py#L136.
It seems that you are not using mpi and your process rank value is 2, so it might be the reason why the surface values are not saved into the h5 file. It could be an easy fix if you change the default rank values to 2 or figure out why it is not zero.
Thank you! One of the params for crunch() was being overwritten instead of rank.
I run the plot_surface code like so:
And it seem to calculate the loss fine:
However, the plot functions do not work because 'train_loss' is not found:
And if I print the keys(), it's just:
<KeysViewHDF5 ['dir_file', 'xcoordinates', 'ycoordinates']>
Not sure what I'm doing wrong?