Closed kks32 closed 4 months ago
I have reviewed the changes in train.py
and looks great with tensorboard monitoring. I think npeochs
and nsteps
are both fine, but nsteps
would have more control to stop in the middle of training, particularly when there are a lot of training examples are in train.npz
.
Thanks @yjchoi1 . I'm working on cleaning up and adding new features. We can discuss epochs vs steps as an RFC. We are planning a rework for V2. So please add RFCs to improve the GNS code.
Describe the PR Add logging with tensorboard and progress bar with each epoch
Related Issues/PRs
74
Additional context Also added formatter to auto format our python files. This shows I touched a lot of files, but it is just formatting. Please only check
train.py
for review.npeochs
instead ofnsteps
?Outputs look like this:
Loss histories are available in logs via tensorboard.
Also includes metadata and hyperparameters