Anne Auger - Slow Convergence of Stochastic Optimization Algorithms Without Derivatives Is Avoidable
Many approaches to optimization without derivatives rooted in probability theory are variants of stochastic approximation such as the well-known Kiefer-Wolfowitz method, a finite-difference stochastic approximation (FDSA) algorithm that estimates gradients using finite differences. Such methods are known to converge slowly: in many cases the best possible convergence rate is governed by the Central Limit Theorem, leading to a mean square error that vanishes at a rate inversely proportional to the number of iterations. In this talk, I will show that those slow convergence rates are not a foregone conclusion for stochastic algorithms without derivatives. I will present a class of adaptive stochastic algorithms originating from the class of Evolution Strategy algorithms, where we can prove asymptotic geometric convergence of the mean square error on classes of functions that include non-convex and non-quasiconvex functions. This corresponds to linear convergence in optimization. I will highlight the main differences compared to FDSA algorithms and explain how the stability analysis of an underlying Markov chain enables linear convergence guarantees. I will discuss the connection to the analysis of the Covariance Matrix Adaptation Evolution Strategy (CMA-ES), widely regarded as one of the most effective stochastic algorithms for solving complex derivative-free optimization problems.

Anne Auger (Télécom Paris)

=== Find this and many more scientific videos on https://www.carmin.tv/ - a French video platform for mathematics and their interactions with other sciences offering extra functionalities tailored to meet the needs of the research community. ===
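To make the idea of geometric (linear) convergence concrete, here is a minimal sketch of a step-size adaptive Evolution Strategy. It is not taken from the talk: it assumes a (1+1)-ES with a one-fifth success rule, a simple and commonly described member of this family, run on the sphere function. The names `sphere` and `one_plus_one_es`, the factors `exp(1/3)` and `exp(-1/12)`, the starting point and the seed are all illustrative choices.

```python
import math
import random


def sphere(x):
    """Convex quadratic test function f(x) = sum of x_i^2 (illustrative choice)."""
    return sum(v * v for v in x)


def one_plus_one_es(f, x0, sigma0, iterations, seed=0):
    """(1+1)-ES with a one-fifth success rule (a sketch, not the talk's algorithm).

    Only comparisons of f-values are used: no gradients, no finite differences.
    Returns the final point, its f-value, the final step size and the
    best-so-far f-value after each iteration.
    """
    rng = random.Random(seed)
    x = list(x0)
    fx = f(x)
    sigma = sigma0
    history = [fx]
    for _ in range(iterations):
        # Sample one candidate from an isotropic Gaussian around the current point.
        candidate = [xi + sigma * rng.gauss(0.0, 1.0) for xi in x]
        fc = f(candidate)
        if fc <= fx:
            # Success: accept the candidate and enlarge the step size.
            x, fx = candidate, fc
            sigma *= math.exp(1.0 / 3.0)
        else:
            # Failure: keep the current point and shrink the step size.
            # The two factors balance when about 1 in 5 candidates succeeds.
            sigma *= math.exp(-1.0 / 12.0)
        history.append(fx)
    return x, fx, sigma, history


if __name__ == "__main__":
    _, _, _, hist = one_plus_one_es(sphere, [1.0] * 5, 1.0, 2000)
    # With geometric convergence, log10 f(x) falls roughly linearly in the iteration count.
    for t in range(0, 2001, 400):
        print(t, math.log10(hist[t]))
```

Because the step size shrinks together with the distance to the optimum, the logarithm of the error decreases at a roughly constant rate per iteration, in contrast to the 1/t mean square error decay of FDSA schemes mentioned in the abstract. CMA-ES additionally adapts a full covariance matrix, which this sketch omits.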
