Publications of Eduardo D. Sontag jointly with M.K. Wafi

Conference articles

M.K. Wafi, A.C.B. de Oliveira, and E.D. Sontag. On the (almost) global exponential convergence of overparameterized policy optimization for the LQR problem. In 2026 American Control Conference (ACC), 2026. Note: To appear. Also 2025 arXiv:2510.02140. [PDF] Keyword(s): machine learning, artificial intelligence, gradient dominance, gradient flows, gradient dynamics, gradient descent, gradient systems, numerical methods, dynamics of algorithms, LQR, reinforcement learning. Abstract:

In this work we study the convergence of gradient methods for nonconvex optimization problems -- specifically the effect of the problem formulation on the convergence behavior of the solution of a gradient flow. We show through a simple example that, surprisingly, the gradient flow solution can be exponentially or asymptotically convergent, depending on how the problem is formulated. We then deepen the analysis and show that a policy optimization strategy for the continuous-time linear quadratic regulator (LQR) (which is known to present only asymptotic convergence globally) presents almost global exponential convergence if the problem is overparameterized through a linear feed-forward neural network (LFFNN). We prove that this qualitative improvement always happens for a simplified version of the LQR problem and derive explicit convergence rates for the gradient flow. Finally, we show through numerical simulations that both the qualitative improvement and the quantitative rate gains persist in the general LQR problem.

Disclaimer:

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders.

Last modified: Thu Feb 12 10:13:41 2026
Author: sontag.

This document was translated from BibTeX by bibtex2html