Optimal Transport

Distributional Shrinkage II: Higher-Order Scores Encode Brenier Map

This paper revisits the 70-year-old shrinkage problem and establishes a precise connection among three subjects: higher-order Fisher-type information, optimal transport, and the combinatorics of integer partitions.

Distributional Shrinkage I: Universal Denoiser Beyond Tweedie's Formula

Tengyuan Liang arXiv preprint

Empirical Bayes tends to produce overly aggressive shrinkage as a denoiser. We introduce new denoisers that optimally shrink the distribution toward the true signal distribution with order-of-magnitude improvements. Unlike empirical Bayes denoiser, our denoisers are universal and agnostic to the signal and noise distributions. One immediate application of our distributional shrinkage theory is to enhance generative modeling: we can replace the stochastic backward diffusion process with optimal deterministic denoisers to achieve higher-order accuracy.

Distributional Shrinkage Generative Models Optimal Transport The Distributional Regime

No-Regret Generative Modeling via Parabolic Monge-Ampère PDE

Nabarun Deb, Tengyuan Liang The Annals of Statistics

We introduce a novel generative modeling framework called parabolic Monge-Ampère PDE sampler. We establish theoretical guarantees for generative modeling through the lens of no-regret analysis, demonstrating that the iterates converge to the optimal Brenier map under a variety of step-size schedules. We derive a new Evolution Variational Inequality connecting geometry, transportation cost, and regret.

Generative Models Optimal Transport Stochastic Dynamics The Distributional Regime

Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity

Tengyuan Liang, Kulunu Dharmakeerthi, Takuya Koriyama Transactions on Machine Learning Research

Adding noise is easy; what about denoising? Diffusion is easy; what about reverting a diffusion? We provide a fine-grained analysis of the diffuse-then-denoise process. We discover a notion of multi-scale curvature complexity that collectively determines the success or failure mode of probabilistic diffusion models.

Generative Models Optimal Transport Stochastic Dynamics The Distributional Regime

A Convexified Matching Approach to Imputation and Individualized Inference

YoonHaeng Hur, Tengyuan Liang arXiv preprint

We introduce a new convexified matching method for missing value imputation and individualized inference inspired by computational optimal transport.

Statistical Inference Optimal Transport The Causal Shift

Detecting Weak Distribution Shifts via Displacement Interpolation

YoonHaeng Hur, Tengyuan Liang Journal of Business & Economic Statistics

Detecting weak, systematic distribution shifts and quantitatively modeling individual, heterogeneous responses to policies or incentives have found increasing empirical applications in social and economic sciences. We propose a model for weak distribution shifts via displacement interpolation, drawing from the optimal transport theory.

Statistical Inference Optimal Transport The Causal Shift

Online Learning to Transport via the Minimal Selection Principle

Wenxuan Guo, YoonHaeng Hur, Tengyuan Liang, Christopher Ryan Conference on Learning Theory

Motivated by robust dynamic resource allocation in operations research, we study the Online Learning to Transport (OLT) problem where the decision variable is a probability measure, an infinite-dimensional object. We draw connections between online learning, optimal transport, and partial differential equations through an insight called the minimal selection principle, originally studied in the Wasserstein gradient flow setting by Ambrosio et al. (2005).

Stochastic Dynamics Optimal Transport The Distributional Regime

Reversible Gromov-Monge Sampler for Simulation-Based Inference

YoonHaeng Hur, Wenxuan Guo, Tengyuan Liang SIAM Journal on Mathematics of Data Science

Motivated by the seminal work on distance and isomorphism between metric measure spaces, we propose a new notion called the Reversible Gromov-Monge (RGM) distance and study how RGM can be used to design new transform samplers to perform simulation-based inference.

Generative Models Optimal Transport The Distributional Regime