Papers

"Quasi-Random" Samples of Research, by Year Written

2023 –

Distributional Shrinkage II: Higher-Order Scores Encode Brenier Map

This paper revisits the 70-year-old shrinkage problem and establishes a precise connection among three subjects: higher-order Fisher-type information, optimal transport, and the combinatorics of integer partitions.

Distributional Shrinkage I: Universal Denoiser Beyond Tweedie's Formula

Tengyuan Liang arXiv preprint

Empirical Bayes tends to produce overly aggressive shrinkage as a denoiser. We introduce new denoisers that optimally shrink the distribution toward the true signal distribution with order-of-magnitude improvements. Unlike empirical Bayes denoiser, our denoisers are universal and agnostic to the signal and noise distributions. One immediate application of our distributional shrinkage theory is to enhance generative modeling: we can replace the stochastic backward diffusion process with optimal deterministic denoisers to achieve higher-order accuracy.

Distributional Shrinkage Generative Models Optimal Transport The Distributional Regime

No-Regret Generative Modeling via Parabolic Monge-Ampère PDE

Nabarun Deb, Tengyuan Liang The Annals of Statistics

We introduce a novel generative modeling framework called parabolic Monge-Ampère PDE sampler. We establish theoretical guarantees for generative modeling through the lens of no-regret analysis, demonstrating that the iterates converge to the optimal Brenier map under a variety of step-size schedules. We derive a new Evolution Variational Inequality connecting geometry, transportation cost, and regret.

Generative Models Optimal Transport Stochastic Dynamics The Distributional Regime

Gaussianized Design Optimization for Covariate Balance in Randomized Experiments

Wenxuan Guo, Tengyuan Liang, Panos Toulis Journal of the Royal Statistical Society: Series B

This paper presents Gaussianized Design Optimization, a novel framework for optimally balancing covariates in experimental design.

Experimental Design Causal Inference Uncertainty Quantification The Causal Shift

Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity

Tengyuan Liang, Kulunu Dharmakeerthi, Takuya Koriyama Transactions on Machine Learning Research

Adding noise is easy; what about denoising? Diffusion is easy; what about reverting a diffusion? We provide a fine-grained analysis of the diffuse-then-denoise process. We discover a notion of multi-scale curvature complexity that collectively determines the success or failure mode of probabilistic diffusion models.

Generative Models Optimal Transport Stochastic Dynamics The Distributional Regime

A Convexified Matching Approach to Imputation and Individualized Inference

YoonHaeng Hur, Tengyuan Liang arXiv preprint

We introduce a new convexified matching method for missing value imputation and individualized inference inspired by computational optimal transport.

Statistical Inference Optimal Transport The Causal Shift

Learning When the Concept Shifts: Confounding, Invariance, and Dimension Reduction

Kulunu Dharmakeerthi, YoonHaeng Hur, Tengyuan Liang Journal of the American Statistical Association

Confounding can obfuscate the definition of the best prediction model (concept shift) and shift covariates to domains yet unseen (covariate shift). Therefore, a model maximizing prediction accuracy in the source environment could suffer a significant accuracy drop in the target environment. We propose a new domain adaptation method for observational data in the presence of confounding, and characterize the the stability and predictability tradeoff leveraging a structural causal model.

Statistical Learning The Causal Shift

Randomization Inference When N Equals One

Tengyuan Liang, Benjamin Recht Biometrika

A statistical theory for N-of-1 experiments, where a unit serves as its own control and treatment in rapid interleaving time windows.

Causal Inference Experimental Design Uncertainty Quantification The Causal Shift

Detecting Weak Distribution Shifts via Displacement Interpolation

YoonHaeng Hur, Tengyuan Liang Journal of Business & Economic Statistics

Detecting weak, systematic distribution shifts and quantitatively modeling individual, heterogeneous responses to policies or incentives have found increasing empirical applications in social and economic sciences. We propose a model for weak distribution shifts via displacement interpolation, drawing from the optimal transport theory.

Statistical Inference Optimal Transport The Causal Shift

2019 – 2022

Blessings and Curses of Covariate Shifts: Adversarial Learning Dynamics, Directional Convergence, and Equilibria

Tengyuan Liang Journal of Machine Learning Research

Blessings and curses of covariate shifts, directional convergece, and the connection to experimental design.

Statistical Learning Experimental Design The Distributional Regime The Causal Shift

High-dimensional Asymptotics of Langevin Dynamics in Spiked Matrix Models

Tengyuan Liang, Subhabrata Sen, Pragya Sur Information and Inference: A Journal of the IMA

We study Langevin dynamics for recovering the planted signal in the spiked matrix model. We provide a path-wise characterization of the overlap between the output of the Langevin algorithm and the planted signal. This overlap is characterized in terms of a self-consistent system of integro-differential equations, usually referred to as the Crisanti-Horner-Sommers-Cugliandolo-Kurchan (CHSCK) equations in the spin glass literature.

Stochastic Dynamics Overparameterization The Distributional Regime

Online Learning to Transport via the Minimal Selection Principle

Wenxuan Guo, YoonHaeng Hur, Tengyuan Liang, Christopher Ryan Conference on Learning Theory

Motivated by robust dynamic resource allocation in operations research, we study the Online Learning to Transport (OLT) problem where the decision variable is a probability measure, an infinite-dimensional object. We draw connections between online learning, optimal transport, and partial differential equations through an insight called the minimal selection principle, originally studied in the Wasserstein gradient flow setting by Ambrosio et al. (2005).

Stochastic Dynamics Optimal Transport The Distributional Regime

Reversible Gromov-Monge Sampler for Simulation-Based Inference

YoonHaeng Hur, Wenxuan Guo, Tengyuan Liang SIAM Journal on Mathematics of Data Science

Motivated by the seminal work on distance and isomorphism between metric measure spaces, we propose a new notion called the Reversible Gromov-Monge (RGM) distance and study how RGM can be used to design new transform samplers to perform simulation-based inference.

Generative Models Optimal Transport The Distributional Regime

Universal Prediction Band via Semi-Definite Programming

Tengyuan Liang Journal of the Royal Statistical Society: Series B

This paper proposes a computationally efficient method to construct nonparametric, heteroscedastic prediction bands for uncertainty quantification.

Uncertainty Quantification Statistical Learning The Causal Shift The Interpolation Regime

Interpolating Classifiers Make Few Mistakes

Tengyuan Liang, Benjamin Recht Journal of Machine Learning Research

This paper provides elementary analyses of the regret and generalization of minimum-norm interpolating classifiers.

Overparameterization Statistical Learning The Interpolation Regime

Mehler’s Formula, Branching Process, and Compositional Kernels of Deep Neural Networks

Tengyuan Liang, Hai Tran-Bach Journal of the American Statistical Association

We utilize a connection between compositional kernels and branching processes via Mehler’s formula to study deep neural networks. This new probabilistic insight provides us a novel perspective on the mathematical role of activation functions in compositional neural networks. We study the unscaled and rescaled limits of the compositional kernels and explore the different phases of the limiting behavior, as the compositional depth increases.

Overparameterization Statistical Learning The Interpolation Regime

A Precise High-Dimensional Asymptotic Theory for Boosting and Minimum-L1-Norm Interpolated Classifiers

Tengyuan Liang, Pragya Sur The Annals of Statistics

This paper establishes a precise high-dimensional asymptotic theory for boosting on separable data, taking statistical and computational perspectives.

Overparameterization Statistical Learning The Interpolation Regime

On the Multiple Descent of Minimum-Norm Interpolants and Restricted Lower Isometry of Kernels

Tengyuan Liang, Alexander Rakhlin, Xiyu Zhai Conference on Learning Theory

We study the risk of minimum-norm interpolants of data in Reproducing Kernel Hilbert Spaces. Our upper bounds on the risk are of a multiple-descent shape. Empirical evidence supports our finding that minimum-norm interpolants in RKHS can exhibit this unusual non-monotonicity in sample size.

Overparameterization Statistical Learning The Interpolation Regime

Training Neural Networks as Learning Data-adaptive Kernels: Provable Representation and Approximation Benefits

Xialiang Dou, Tengyuan Liang Journal of the American Statistical Association

What are the provable benefits of the adaptive representation by neural networks compared to the pre-specified fixed basis representation in the classical nonparametric literature? We answer the above questions via a dynamic reproducing kernel Hilbert space (RKHS) approach indexed by the training process of neural networks.

Overparameterization Statistical Learning The Interpolation Regime

2015 – 2018

Deep Neural Networks for Estimation and Inference

Max H. Farrell, Tengyuan Liang, Sanjog Misra Econometrica

Can deep neural networks with standard archtectures estimate treatment effects and perform downstream uncertainty quantification tasks?

Causal Inference Uncertainty Quantification The Causal Shift

Just Interpolate: Kernel Ridgeless Regression Can Generalize

Tengyuan Liang, Alexander Rakhlin The Annals of Statistics

In the absence of explicit regularization, interpolating kernel machine has the potential to fit the training data perfectly, at the same time, still generalizes well on test data. We isolate a phenomenon of implicit regularization for minimum-norm interpolated solutions.

Overparameterization Statistical Learning The Interpolation Regime

Interaction Matters: A Note on Non-asymptotic Local Convergence of Generative Adversarial Networks

Tengyuan Liang, James Stokes International Conference on Artificial Intelligence and Statistics

Motivated by the pursuit of a systematic computational and algorithmic understanding of Generative Adversarial Networks (GANs), we present a simple yet unified non-asymptotic local convergence theory for smooth two-player games, which subsumes several discrete-time gradient-based saddle point dynamics. The analysis reveals the surprising nature of the off-diagonal interaction term as both a blessing and a curse.

Stochastic Dynamics Generative Models The Distributional Regime

Local Optimality and Generalization Guarantees for the Langevin Algorithm via Empirical Metastability

Belinda Tzen, Tengyuan Liang, Maxim Raginsky Conference on Learning Theory

We study the detailed path-wise behavior of the discrete-time Langevin algorithm for non-convex Empirical Risk Minimization (ERM) through the lens of metastability, adopting some techniques from Berglund and Gentz (2003).

Stochastic Dynamics The Distributional Regime

How Well Generative Adversarial Networks Learn Distributions

Tengyuan Liang Journal of Machine Learning Research

This paper studies the rates of convergence for learning distributions implicitly with the adversarial framework and Generative Adversarial Networks (GANs), which subsume Wasserstein, Sobolev, MMD GAN, and Generalized/Simulated Method of Moments (GMM/SMM) as special cases. We study a wide range of parametric and nonparametric target distributions under a host of objective evaluation metrics. We investigate how to obtain valid statistical guarantees for GANs through the lens of regularization.

Generative Models Statistical Learning The Distributional Regime

Statistical Inference for the Population Landscape via Moment Adjusted Stochastic Gradients

Tengyuan Liang, Weijie J. Su Journal of the Royal Statistical Society: Series B

Modern statistical inference tasks often require iterative optimization methods to compute the solution. Convergence analysis from an optimization viewpoint only informs us how well the solution is approximated numerically but overlooks the sampling nature of the data. We introduce the moment-adjusted stochastic gradient descents, a new stochastic optimization method for statistical inference.

Statistical Inference Stochastic Dynamics

Learning with Square Loss: Localization through Offset Rademacher Complexity

Tengyuan Liang, Alexander Rakhlin, Karthik Sridharan Conference on Learning Theory

We introduce Offset Rademacher Complexity as a tool for analyzing the localization properties and fast rates of learning algorithms under square loss.

Statistical Learning