sfvnielsen closed this issue 7 years ago
The classes are highly imbalanced, so we need metrics that take this into account (ROC, F1, ...).
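To illustrate why plain accuracy is misleading here, a minimal sketch in pure Python. The helper names (`confusion_counts`, `accuracy`, `f1`) and the toy labels are made up for illustration and are not part of the project's score framework. It compares accuracy and F1 for a classifier that always predicts the majority class:

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    tp = fp = fn = tn = 0
    for t, p in zip(y_true, y_pred):
        if p == positive and t == positive:
            tp += 1
        elif p == positive:
            fp += 1
        elif t == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    return (tp + tn) / (tp + fp + fn + tn)


def f1(y_true, y_pred):
    """F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is 0."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Hypothetical toy data: 95 negatives and 5 positives.
y_true = [0] * 95 + [1] * 5
# A degenerate classifier that always predicts the majority class.
always_negative = [0] * 100

print(accuracy(y_true, always_negative))  # 0.95
print(f1(y_true, always_negative))        # 0.0
```

The degenerate classifier looks strong on accuracy but scores zero on F1, because it never finds a single positive example.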
I'm working on a general framework for scores. F1 will be included in the initial commit.
The latest commits include a lot of measures. Printing them all requires xarray, though (pandas' multidimensional sibling).
How should we quantitatively evaluate the performance of the network we train?