JMLR

Learning Global Nash Equilibrium in Team Competitive Games with Generalized Fictitious Cross-Play

Authors
Zelai Xu Chao Yu Yancheng Liang Yi Wu Yu Wang
Paper Information
  • Journal:
    Journal of Machine Learning Research
  • Added to Tracker:
    Jul 15, 2025
Abstract

Self-play (SP) is a popular multi-agent reinforcement learning framework for competitive games. Despite the empirical success, the theoretical properties of SP are limited to two-player settings. For team competitive games where two teams of cooperative agents compete with each other, we show a counter-example where SP cannot converge to a global Nash equilibrium (NE) with high probability. Policy-Space Response Oracles (PSRO) is an alternative framework that finds NEs by iteratively learning the best response (BR) to previous policies. PSRO can be directly extended to team competitive games with unchanged convergence properties by learning team BRs, but its repeated training from scratch makes it hard to scale to complex games. In this work, we propose Generalized Fictitious Cross-Play (GFXP), a novel algorithm that inherits benefits from both frameworks. GFXP simultaneously trains an SP-based main policy and a counter population. The main policy is trained by fictitious self-play and cross-play against the counter population, while the counter policies are trained as the BRs to the main policy's checkpoints. We evaluate GFXP in matrix games and gridworld domains where GFXP achieves the lowest exploitabilities. We further conduct experiments in a challenging football game where GFXP defeats SOTA models with over 94% win rate.

Author Details
Zelai Xu
Author
Chao Yu
Author
Yancheng Liang
Author
Yi Wu
Author
Yu Wang
Author
Citation Information
APA Format
Zelai Xu , Chao Yu , Yancheng Liang , Yi Wu & Yu Wang . Learning Global Nash Equilibrium in Team Competitive Games with Generalized Fictitious Cross-Play. Journal of Machine Learning Research .
BibTeX Format
@article{JMLR:v26:24-1503,
  author  = {Zelai Xu and Chao Yu and Yancheng Liang and Yi Wu and Yu Wang},
  title   = {Learning Global Nash Equilibrium in Team Competitive Games with Generalized Fictitious Cross-Play},
  journal = {Journal of Machine Learning Research},
  year    = {2025},
  volume  = {26},
  number  = {44},
  pages   = {1--30},
  url     = {http://jmlr.org/papers/v26/24-1503.html}
}
Related Papers