Numerical Analysis (cs.NA)

Mehler’s Formula, Branching Process, and Compositional Kernels of Deep Neural Networks

Interpolating Classifiers Make Few Mistakes

Escaping the Local Minima via Simulated Annealing: Optimization of Approximately Convex Functions