Journal Network of the Academy of Mathematics and Systems Science, Chinese Academy of Sciences
On-Policy and Off-Policy Value Iteration Algorithms for Stochastic Zero-Sum Dynamic Games
GUO Liangyuan, WANG Bing-Chang, ZHANG Ji-Feng
Journal of Systems Science & Complexity. 2025, (1): 421-435. DOI: 10.1007/s11424-025-4572-y