MA-Trace: An On-Policy Actor-Critic Algorithm for Multi-Agent Reinforcement Learning