Numerical Analysis (cs.NA)

Interpolating Classifiers Make Few Mistakes

Mehler’s Formula, Branching Process, and Compositional Kernels of Deep Neural Networks

Escaping the Local Minima via Simulated Annealing: Optimization of Approximately Convex Functions