NicolaBernini / PapersAnalysis

Analysis, summaries, cheatsheets about relevant papers

Paper Read - Negative eigenvalues of the Hessian in deep neural networks #30

Open NicolaBernini opened 4 years ago

NicolaBernini commented 4 years ago

Overview

Reading "Negative eigenvalues of the Hessian in deep neural networks"

Abstract

The loss function of deep networks is known to be non-convex but the precise nature of this nonconvexity is still an active area of research. In this work, we study the loss landscape of deep networks through the eigendecompositions of their Hessian matrix. In particular, we examine how important the negative eigenvalues are and the benefits one can observe in handling them appropriately.
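
To make the role of negative Hessian eigenvalues concrete, here is a minimal sketch on a toy problem; it is not taken from the paper. It assumes a hypothetical two-parameter "network" with loss (w1 * w2 - 1)^2, whose Hessian is written out by hand, and uses the closed-form eigenvalues of a symmetric 2x2 matrix. All function names are illustrative. For real networks one would more plausibly use automatic differentiation and a numerical eigensolver.

```python
import math

def loss(w1, w2):
    # Toy non-convex loss (an assumption for illustration): fit w1 * w2 to 1.
    return (w1 * w2 - 1.0) ** 2

def hessian(w1, w2):
    # Analytic second derivatives of the toy loss.
    # Returns (h11, h12, h22) of the symmetric matrix [[h11, h12], [h12, h22]].
    h11 = 2.0 * w2 ** 2
    h12 = 4.0 * w1 * w2 - 2.0
    h22 = 2.0 * w1 ** 2
    return h11, h12, h22

def sym2x2_eigenvalues(a, b, c):
    # Closed-form eigenvalues of [[a, b], [b, c]], smallest first.
    mean = 0.5 * (a + c)
    radius = math.hypot(0.5 * (a - c), b)
    return mean - radius, mean + radius

# At the origin the gradient vanishes and the Hessian is [[0, -2], [-2, 0]],
# so one eigenvalue is negative: the point is a saddle, not a minimum.
saddle_min, saddle_max = sym2x2_eigenvalues(*hessian(0.0, 0.0))

# The eigenvector for the negative eigenvalue is (1, 1) / sqrt(2);
# a small step along it lowers the loss.
step = 0.1
d = 1.0 / math.sqrt(2.0)
loss_at_saddle = loss(0.0, 0.0)
loss_after_step = loss(step * d, step * d)

# At the minimum (1, 1) no eigenvalue is negative.
minimum_min, minimum_max = sym2x2_eigenvalues(*hessian(1.0, 1.0))

print("saddle eigenvalues:", saddle_min, saddle_max)
print("loss at saddle:", loss_at_saddle, "after step:", loss_after_step)
print("minimum eigenvalues:", minimum_min, minimum_max)
```

In this toy case the negative eigenvalue marks a direction of negative curvature along which the loss decreases even though the gradient is zero. This is one simple reading of what handling negative eigenvalues "appropriately" could mean.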