An Actor Critic Machine Learning Method for Free Terminal Time Optimal Control

BURTON Evan, NAKAMURA-ZIMMERER Tenavi, GONG Qi, KANG Wei

系统科学与复杂性(英文) ›› 2025

PDF(2217 KB)
PDF(2217 KB)
系统科学与复杂性(英文) ›› 2025

An Actor Critic Machine Learning Method for Free Terminal Time Optimal Control

    BURTON Evan1, NAKAMURA-ZIMMERER Tenavi2, GONG Qi1, KANG Wei3
作者信息 +

An Actor Critic Machine Learning Method for Free Terminal Time Optimal Control

    BURTON Evan1, NAKAMURA-ZIMMERER Tenavi2, GONG Qi1, KANG Wei3
Author information +
文章历史 +

摘要

Optimal feedback control of nonlinear system with free terminal time present many challenges including nonsmooth in the value function and control laws, and existence of multiple local or even global optimal trajectories. To mitigate these issues, the authors introduce an actor-critic method along with some enhancements. The authors demonstrate the algorithm's effectiveness on a prototypical example featuring each of the main pathological issues present in problems of this type as well as a higher dimensional example to show that the solution method presented can scale.

Abstract

Optimal feedback control of nonlinear system with free terminal time present many challenges including nonsmooth in the value function and control laws, and existence of multiple local or even global optimal trajectories. To mitigate these issues, the authors introduce an actor-critic method along with some enhancements. The authors demonstrate the algorithm's effectiveness on a prototypical example featuring each of the main pathological issues present in problems of this type as well as a higher dimensional example to show that the solution method presented can scale.

关键词

Hamilton-Jacobi-Bellman equations / machine learning / optimal feedback control

Key words

Hamilton-Jacobi-Bellman equations / machine learning / optimal feedback control

引用本文

导出引用
BURTON Evan , NAKAMURA-ZIMMERER Tenavi , GONG Qi , KANG Wei. An Actor Critic Machine Learning Method for Free Terminal Time Optimal Control. 系统科学与复杂性(英文), 2025
BURTON Evan , NAKAMURA-ZIMMERER Tenavi , GONG Qi , KANG Wei. An Actor Critic Machine Learning Method for Free Terminal Time Optimal Control. Journal of Systems Science and Complexity, 2025

基金

This work was supported in part by the Air Force Office of Scientific Research (AFOSR), USA under Grant No. FA9550-21-1-0113, the National Science Foundation (NSF), USA under Grant Nos. 2134235 and 2202668.
PDF(2217 KB)

26

Accesses

0

Citation

Detail

段落导航
相关文章

/