Unveiling the Structure of Wide Flat Minima in Neural Networks
The success of deep learning has revealed the application potential of neural networks across the sciences and opened up fundamental theoretical problems. In particular, the fact that learning algorithms based on simple variants of gradient methods are able to find near-optimal minima of highly nonconvex loss functions is an unexpected …