Overparameterization

High-dimensional Asymptotics of Langevin Dynamics in Spiked Matrix Models

Tengyuan Liang, Subhabrata Sen, Pragya Sur Information and Inference: A Journal of the IMA

We study Langevin dynamics for recovering the planted signal in the spiked matrix model. We provide a path-wise characterization of the overlap between the output of the Langevin algorithm and the planted signal. This overlap is characterized in terms of a self-consistent system of integro-differential equations, usually referred to as the Crisanti-Horner-Sommers-Cugliandolo-Kurchan (CHSCK) equations in the spin glass literature.

Interpolating Classifiers Make Few Mistakes

Tengyuan Liang, Benjamin Recht Journal of Machine Learning Research

This paper provides elementary analyses of the regret and generalization of minimum-norm interpolating classifiers.

Overparameterization Statistical Learning The Interpolation Regime

Mehler’s Formula, Branching Process, and Compositional Kernels of Deep Neural Networks

Tengyuan Liang, Hai Tran-Bach Journal of the American Statistical Association

We utilize a connection between compositional kernels and branching processes via Mehler’s formula to study deep neural networks. This new probabilistic insight provides us with a novel perspective on the mathematical role of activation functions in compositional neural networks. We study the unscaled and rescaled limits of the compositional kernels and explore the different phases of the limiting behavior as the compositional depth increases.

Overparameterization Statistical Learning The Interpolation Regime

A Precise High-Dimensional Asymptotic Theory for Boosting and Minimum-L1-Norm Interpolated Classifiers

Tengyuan Liang, Pragya Sur The Annals of Statistics

This paper establishes a precise high-dimensional asymptotic theory for boosting on separable data, taking statistical and computational perspectives.

Overparameterization Statistical Learning The Interpolation Regime

On the Multiple Descent of Minimum-Norm Interpolants and Restricted Lower Isometry of Kernels

Tengyuan Liang, Alexander Rakhlin, Xiyu Zhai Conference on Learning Theory

We study the risk of minimum-norm interpolants of data in Reproducing Kernel Hilbert Spaces. Our upper bounds on the risk are of a multiple-descent shape. Empirical evidence supports our finding that minimum-norm interpolants in RKHS can exhibit this unusual non-monotonicity in sample size.

Overparameterization Statistical Learning The Interpolation Regime

Training Neural Networks as Learning Data-adaptive Kernels: Provable Representation and Approximation Benefits

Xialiang Dou, Tengyuan Liang Journal of the American Statistical Association

What are the provable benefits of the adaptive representation by neural networks compared to the pre-specified fixed basis representation in the classical nonparametric literature? We answer this question via a dynamic reproducing kernel Hilbert space (RKHS) approach indexed by the training process of neural networks.

Overparameterization Statistical Learning The Interpolation Regime

Just Interpolate: Kernel Ridgeless Regression Can Generalize

Tengyuan Liang, Alexander Rakhlin The Annals of Statistics

In the absence of explicit regularization, an interpolating kernel machine can fit the training data perfectly and, at the same time, still generalize well on test data. We isolate a phenomenon of implicit regularization for minimum-norm interpolated solutions.

Overparameterization Statistical Learning The Interpolation Regime