Unveiling the Structure of Wide Flat Minima in Neural Networks

The success of deep learning has revealed the application potential of neural networks across the sciences and opened up fundamental theoretical problems. In particular, the fact that learning algorithms based on simple variants of gradient methods are able to find near-optimal minima of highly nonconvex loss functions is an unexpected …