arXiv (CS.AI) 2026-06-15 12:00 DOI: arXiv:2606.14693

Learning Coordinated Preference for Multi-Objective Multi-Agent Reinforcement Learning

Abstract

Cooperative multi-objective multi-agent reinforcement learning (MOMARL) models team decision making under multiple, potentially conflicting objectives. In this setting, conflicts arise not only across objectives but also across agents with different observations, roles, and contributions. We propose Preference Coordinated Multi-agent Policy Optimization (PCMA), which learns coordinated agent-specific preferences to enable complementary trade-offs among agents. Theoretically, we formulate cooperative MOMARL as a team-optimal game and show that, under suitable conditions, preference diversity can induce team improvement through a first-order improvement decomposition. Experiments on multiple cooperative MOMA environments and a practical traffic-control scenario show that PCMA improves both performance and trade-off coordination.
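
To make the idea of agent-specific preferences concrete, the following is a minimal sketch of linear scalarization, a standard multi-objective technique, applied per agent. It is not PCMA itself: the abstract does not specify how preferences are learned or coordinated, so the preference vectors here are fixed by hand, and the function names (`normalize`, `scalarize`, `team_scalarized_rewards`), the agent names, and the two objectives are all assumptions made for illustration.

```python
from typing import Dict, List


def normalize(weights: List[float]) -> List[float]:
    """Rescale non-negative weights so they sum to 1 (a point on the simplex)."""
    if any(w < 0 for w in weights):
        raise ValueError("preference weights must be non-negative")
    total = sum(weights)
    if total <= 0:
        raise ValueError("at least one preference weight must be positive")
    return [w / total for w in weights]


def scalarize(reward_vector: List[float], preference: List[float]) -> float:
    """Linear scalarization: weighted sum of the per-objective rewards."""
    if len(reward_vector) != len(preference):
        raise ValueError("reward and preference must have the same length")
    return sum(w * r for w, r in zip(preference, reward_vector))


def team_scalarized_rewards(
    reward_vector: List[float],
    preferences: Dict[str, List[float]],
) -> Dict[str, float]:
    """Apply each agent's own (hypothetical) preference to a shared vector reward."""
    return {
        agent: scalarize(reward_vector, normalize(pref))
        for agent, pref in preferences.items()
    }


# Hypothetical two-objective team reward: (throughput, negative energy cost).
shared_reward = [1.0, -0.5]

# Hand-picked, unnormalized preferences; a learned method would produce these.
agent_preferences = {
    "agent_a": [3.0, 1.0],  # weights throughput more heavily
    "agent_b": [1.0, 3.0],  # weights energy cost more heavily
}

per_agent = team_scalarized_rewards(shared_reward, agent_preferences)
print(per_agent)
```

With different preference vectors, the two agents assign different scalar values to the same team outcome, which is the kind of complementary trade-off the abstract describes at a high level.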
