New Results in Cooperative Adaptive Optimal Output Regulation

DONG Yuchen, GAO Weinan, JIANG Zhong-Ping

系统科学与复杂性(英文) ›› 2024, Vol. 37 ›› Issue (1) : 253-272.

PDF(659 KB)
PDF(659 KB)
系统科学与复杂性(英文) ›› 2024, Vol. 37 ›› Issue (1) : 253-272. DOI: 10.1007/s11424-024-3429-0

New Results in Cooperative Adaptive Optimal Output Regulation

    DONG Yuchen1, GAO Weinan1, JIANG Zhong-Ping2
作者信息 +

New Results in Cooperative Adaptive Optimal Output Regulation

    DONG Yuchen1, GAO Weinan1, JIANG Zhong-Ping2
Author information +
文章历史 +

摘要

This paper investigates the cooperative adaptive optimal output regulation problem of continuous-time linear multi-agent systems. As the multi-agent system dynamics are uncertain, solving regulator equations and the corresponding algebraic Riccati equations is challenging, especially for high-order systems. In this paper, a novel method is proposed to approximate the solution of regulator equations, i.e., gradient descent method. It is worth noting that this method obtains gradients through online data rather than model information. A data-driven distributed adaptive suboptimal controller is developed by adaptive dynamic programming, so that each follower can achieve asymptotic tracking and disturbance rejection. Finally, the effectiveness of the proposed control method is validated by simulations.

Abstract

This paper investigates the cooperative adaptive optimal output regulation problem of continuous-time linear multi-agent systems. As the multi-agent system dynamics are uncertain, solving regulator equations and the corresponding algebraic Riccati equations is challenging, especially for high-order systems. In this paper, a novel method is proposed to approximate the solution of regulator equations, i.e., gradient descent method. It is worth noting that this method obtains gradients through online data rather than model information. A data-driven distributed adaptive suboptimal controller is developed by adaptive dynamic programming, so that each follower can achieve asymptotic tracking and disturbance rejection. Finally, the effectiveness of the proposed control method is validated by simulations.

关键词

Adaptive dynamic programming / cooperative output regulation / gradient descent method / multi-agent systems

Key words

Adaptive dynamic programming / cooperative output regulation / gradient descent method / multi-agent systems

引用本文

导出引用
DONG Yuchen , GAO Weinan , JIANG Zhong-Ping. New Results in Cooperative Adaptive Optimal Output Regulation. 系统科学与复杂性(英文), 2024, 37(1): 253-272 https://doi.org/10.1007/s11424-024-3429-0
DONG Yuchen , GAO Weinan , JIANG Zhong-Ping. New Results in Cooperative Adaptive Optimal Output Regulation. Journal of Systems Science and Complexity, 2024, 37(1): 253-272 https://doi.org/10.1007/s11424-024-3429-0

参考文献

[1] Khan G D, Chen Z Y, and Zhu L J, A new approach for event-triggered stabilization and output regulation of nonlinear systems, IEEE Transactions on Automatic Control, 2020, 65(8): 3592– 3599.
[2] Liang D and Huang J, Robust output regulation of linear systems by event-triggered dynamic output feedback control, IEEE Transactions on Automatic Control, 2021, 66(5): 2415–2422.
[3] Gao W N, Deng C, Jiang Y, et al., Resilient reinforcement learning and robust output regulation under denial-of-service attacks, Automatica, 2022, 142: 110366.
[4] Gao W N, Mynuddin M, Wunsch D C, et al., Reinforcement learning-based cooperative optimal output regulation via distributed adaptive internal model, IEEE Transactions on Neural Networks and Learning Systems, 2022, 33(10): 5229–5240.
[5] Zhang Z, Chen S M, and Zheng Y S, Cooperative output regulation for linear multiagent systems via distributed fixed-time event-triggered control, IEEE Transactions on Neural Networks and Learning Systems, 2022, DOI: 10.1109/TNNLS.2022.3174416.
[6] Zhang D, Deng C, and Feng G, Resilient cooperative output regulation for nonlinear multiagent systems under DoS attacks, IEEE Transactions on Automatic Control, 2023, 68(4): 2521–2528.
[7] Xiang J, Wei W, and Li Y, Synchronized output regulation of networked linear systems, IEEE Transactions on Automatic Control, 2009, 54(6): 1336–1341.
[8] Qu Y H, Wang H Y, and Lin S, Smart grid system cooperative output control method based on distributed compensation algorithm, Proceedings of the 5th International Conference on Clean Energy and Electrical Systems, 2023, 1058: 35–47.
[9] Wang X, He Y, Gao D W, et al., Cooperative output regulation of large-scale wind turbines for power reserve control, IEEE Transactions on Energy Conversion, 2023, 38(2): 1166–1177.
[10] Deng C and Yang G H, Distributed adaptive fault-tolerant control approach to cooperative output regulation for linear multi-agent systems, Automatica, 2019, 103: 62–68.
[11] Huang C and Ye X D, Cooperative output regulation of heterogeneous multi-agent systems: An H criterion, IEEE Transactions on Automatic Control, 2014, 59(1): 267–273.
[12] Li Z K, Chen M Z Q, and Ding Z T, Distributed adaptive controllers for cooperative output regulation of heterogeneous agents over directed graphs, Automatica, 2016, 68: 179–183.
[13] Jiang Z P, Bian T, and Gao W N, Learning-based control: A tutorial and some recent results, Foundations and Trends in Systems and Control, 2020, 8(3): 176–284.
[14] Zhao F Y, Gao W N, Jiang Z P, et al., Event-triggered adaptive optimal control with output feedback: An adaptive dynamic programming approach, IEEE Transactions on Neural Networks and Learning Systems, 2021, 32(11): 5208–5221.
[15] Qasem O, Gao W N, and Vamvoudakis K G, Adaptive optimal control of continuous-time nonlinear affine systems via hybrid iteration, Automatica, 2023, 157: 111261.
[16] Gao W N and Jiang Z P, Adaptive dynamic programming and adaptive optimal output regulation of linear systems, IEEE Transactions on Automatic Control, 2016, 61(12): 4164–4169.
[17] Wu Y Z, Liang Q P, and Hu J P, Optimal output regulation for general linear systems via adaptive dynamic programming, IEEE Transactions on Cybernetics, 2022, 52(11): 11916–11926.
[18] Wang B J, Xu L, Yi X L, et al., Semiglobal suboptimal output regulation for heterogeneous multiagent systems with input saturation via adaptive dynamic programming, IEEE Transactions on Neural Networks and Learning Systems, 2022, DOI: 10.1109/TNNLS.2022.3191673.
[19] Gao W N and Jiang Z P, Adaptive optimal output regulation of time-delay systems via measurement feedback, IEEE Transactions on Neural Networks and Learning Systems, 2019, 30(3): 938–945.
[20] Li H Y and Wei Q L, Data-driven optimal output cluster synchronization control of heterogeneous multi-agent systems, IEEE Transactions on Automation Science and Engineering, 2023, DOI: 10.1109/TASE.2023.3289950.
[21] Gao W N, Jiang Z P, Lewis F L, et al., Cooperative optimal output regulation of multi-agent systems using adaptive dynamic programming, 2017 American Control Conference (ACC), Seattle, 2017, 2674–2679.
[22] Zhang W G and Yan J, Adaptive constrained output feedback optimal consensus tracking for uncertain nonlinear multi-agent systems and its application, International Journal of Control, 2022, DOI: 10.1080/00207179.2022.2160826.
[23] Yang Y, Xu C, Yue D, et al., Output feedback tracking control of a class of continuous-time nonlinear systems via adaptive dynamic programming approach, Information Sciences, 2018, 469: 1–13.
[24] Huang J, Nonlinear output regulation: Theory and applications, Philadelphia: Society for Industrial and Applied Mathematics, 2004, 16(8): 413–415.
[25] He S P, Fang H Y, Zhang M G, et al., Adaptive optimal control for a class of nonlinear systems: The online policy iteration approach, IEEE Transactions on Neural Networks and Learning Systems, 2020, 31(2): 549–558.
[26] Su Y F and Huang J, Cooperative output regulation of linear multi-agent systems, IEEE Transactions on Automatic Control, 2012, 57(4): 1062–1066.
[27] Kleinman D, On an iterative technique for Riccati equation computations, IEEE Transactions on Automatic Control, 1968, 13(1): 114–115.
[28] Boyd S and Vandenberghe L, Convex Optimization, Cambridge University Press, Cambridge, 2004.
[29] Shwartz S S and Ben D S, Understanding Machine Learning: From Theory to Algorithms, Cambridge University Press, Cambridge, 2014.
[30] Bottou L, Curtis F E, and Nocedal J, Optimization methods for large-scale machine learning, SIAM Review, 2016, 60(2): 223–311.

基金

The work was supported in part by the National Natural Science Foundation of China under Grant No. 62373090 and the U.S. National Science Foundation under Grant No. CNS-2227153.
PDF(659 KB)

91

Accesses

0

Citation

Detail

段落导航
相关文章

/