Publications about 'data-driven control'

Publications about 'data-driven control'

Articles in journal or book chapters

E.D. Sontag. Remarks on input to state stability of perturbed gradient flows, motivated by model-free feedback control learning. Systems and Control Letters, 161:105138, 2022. Note: Important: there is an error in the paper. For the LQR application, the paper only shows iISS, not ISS. See the paper Small-disturbance input-to-state stability of perturbed gradient flows: Applications to LQR problem for details.[PDF] Keyword(s): iss, input to state stability, data-driven control, gradient systems, steepest descent, model-free control, gradient dynamics, gradient descent, gradient systems, gradient descent, numerical methods, dynamics of algorithms. Abstract:

Recent work on data-driven control and reinforcement learning has renewed interest in a relatively old field in control theory: model-free optimal control approaches which work directly with a cost function and do not rely upon perfect knowledge of a system model. Instead, an "oracle" returns an estimate of the cost associated to, for example, a proposed linear feedback law to solve a linear-quadratic regulator problem. This estimate, and an estimate of the gradient of the cost, might be obtained by performing experiments on the physical system being controlled. This motivates in turn the analysis of steepest descent algorithms and their associated gradient differential equations. This paper studies the effect of errors in the estimation of the gradient, framed in the language of input to state stability, where the input represents a perturbation from the true gradient. Since one needs to study systems evolving on proper open subsets of Euclidean space, a self-contained review of input to state stability definitions and theorems for systems that evolve on such sets is included. The results are then applied to the study of noisy gradient systems, as well as the associated steepest descent algorithms.

Conference articles

M. Sznaier, F. Allgower, A. C. B. de Oliveira, N. Ozay, and E. D. Sontag. Tutorial: Data driven and learning enabled control. In Proc. 64th IEEE Conference on Decision and Control (CDC), 2025. Note: To appear.Keyword(s): data-drive control, reinforcement learning. Abstract:

Data-driven control (DDC), that is the design of controllers directly from observed data, has attracted substantial attention in recent years due to its advantages over model-based control. DDC avoids a computationally expensive, potentially conservative model identification step and bypasses practically difficult questions such as model order/class selection. This tutorial paper seeks to offer a sampling of the different approaches that have been recently used to synthesize data driven controllers and filters, covering both analytic approaches and learning enabled ones, indicating the relative strengths of each. A second objective is to provide a key to the rapidly expanding literature in the subject, to help researchers newly interested in this field to quickly come up to speed.

A.C.B de Oliveira, M. Siami, and E.D. Sontag. Remarks on the gradient training of linear neural network based feedback for the LQR Problem. In Proc. 2024 63rd IEEE Conference on Decision and Control (CDC), pages 7846-7852, 2024. [PDF] Keyword(s): machine learning, artificial intelligence, neural networks, overparametrization, gradient descent, gradient dynamics, gradient descent, gradient systems, gradient descent, numerical methods, dynamics of algorithms, input to state stability, gradient systems, feedback control, LQR. Abstract:

Motivated by the current interest in using Artificial intelligence (AI) tools in control design, this paper takes the first steps towards bridging results from gradient methods for solving the LQR control problem, and neural networks. More specifically, it looks into the case where one wants to find a Linear Feed-Forward Neural Network (LFFNN) that minimizes the Linear Quadratic Regulator (LQR) cost. This work develops gradient formulas that can be used to implement the training of LFFNNs to solve the LQR problem, and derives an important conservation law of the system. This conservation law is then leveraged to prove global convergence of solutions and invariance of the set of stabilizing networks under the training dynamics. These theoretical results are then followed by and extensive analysis of the simplest version of the problem (the ``scalar case'') and by numerical evidence of faster convergence of the training of general LFFNNs when compared to traditional direct gradient methods. These results not only serve as indication of the theoretical value of studying such a problem, but also of the practical value of LFFNNs as design tools for data-driven control applications.

BACK TO INDEX

Disclaimer:

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders.

Last modified: Sun Jan 4 22:55:38 2026
Author: sontag.

This document was translated from BibT_EX by bibtex2html