Progressive Optimal Path Sampling for Closed-Loop Optimal Control Design with Deep Neural Networks

Xuanxi Zhang; Jihao Long; Wei Hu; Weinan E; Jiequn Han

doi:10.4208/jml.250213

Authors

Xuanxi Zhang New York University
Jihao Long Institute for Advanced Algorithms Research
Wei Hu Institute for Advanced Algorithms Research
Weinan E AI for Science Institute , Peking University
Jiequn Han Flatiron Institute

DOI:

https://doi.org/10.4208/jml.250213

Keywords:

Closed-loop optimal control, Distribution mismatch, Adaptive sampling, Supervised learning

Abstract

Closed-loop optimal control design for high-dimensional nonlinear systems has been a long-standing challenge. Traditional methods, such as solving the associated Hamilton-Jacobi-Bellman equation, suffer from the curse of dimensionality. Recent literature proposed a new promising approach based on supervised learning, by leveraging powerful open-loop optimal control solvers to generate training data and neural networks as efficient high-dimensional function approximators to fit the closed-loop optimal control. This approach successfully handles certain high-dimensional optimal control problems but still performs poorly on more challenging problems. One of the crucial reasons for the failure is the so-called distribution mismatch phenomenon brought by the controlled dynamics. In this paper, we investigate this phenomenon and propose the progressive optimal path sampling method to mitigate this problem. We theoretically prove that this enhanced sampling strategy outperforms both the vanilla approach and the widely used dataset aggregation method on the classical linear-quadratic regulator by a factor proportional to the total time duration. We further numerically demonstrate that the proposed sampling strategy significantly improves the performance on tested control problems, including the optimal landing problem of a quadrotor and the optimal reaching problem of a 7-DoF manipulator.

Author Biographies

Xuanxi Zhang

Courant Institute of Mathematical Sciences, New York University, New York, NY 10012, United States.
Jihao Long

Institute for Advanced Algorithms Research, Shanghai 201308, P.R. China
Wei Hu

Institute for Advanced Algorithms Research, Shanghai 201308, P.R. China
Weinan E

AI for Science Institute, Beijing 100080, P.R. China

School of Mathematical Science, Peking University, Beijing 100871, P.R. China

Center for Machine Learning Research, Peking University, Beijing 100871, P.R. China
Jiequn Han

Center for Computational Mathematics, Flatiron Institute, New York, NY 10010, United States

Progressive Optimal Path Sampling for Closed-Loop Optimal Control Design with Deep Neural Networks

Authors

DOI:

Keywords:

Abstract

Author Biographies

Downloads

Published

Abstract View

Pdf View

Issue

Section

Submit Article