Ask a Question

Prefer a chat interface with context about you and your work?

Label noise (stochastic) gradient descent implicitly solves the Lasso for quadratic parametrisation

Label noise (stochastic) gradient descent implicitly solves the Lasso for quadratic parametrisation

Understanding the implicit bias of training algorithms is of crucial importance in order to explain the success of overparametrised neural networks. In this paper, we study the role of the label noise in the training dynamics of a quadratically parametrised model through its continuous time version. We explicitly characterise the …