Journal Network of the Academy of Mathematics and Systems Science, Chinese Academy of Sciences
An Online Q-Learning Method for Linear-Quadratic Nonzero-Sum Stochastic Differential Games with Completely Unknown Dynamics
ZHANG Bao-Qiang, WANG Bing-Chang, CAO Ying
Journal of Systems Science and Complexity (English edition). 2024, (5): 1907-1922. DOI: 10.1007/s11424-024-3343-5