Abstract
Ensuring trust, fairness, and long-term efficiency in multi-agent systems poses significant challenges, particularly under partially competitive and decentralized settings where strategic manipulation and collusion can arise. This paper proposes a blockchain-enhanced framework that integrates smart contracts with multi-agent reinforcement learning (MARL) to design incentive-compatible mechanisms for strategic agent coordination. The framework utilizes the decentralized and tamper-resistant nature of blockchain to record agent behaviors on-chain, enforce transparency, and implement automated penalty and reward mechanisms through smart contracts. We embed these mechanisms into a Multi-Agent Soft Actor-Critic (MASAC) algorithm, aligning local decision-making with global system objectives. Experimental validation in two representative domains-automated market bidding and intelligent traffic control-demonstrates that the proposed approach significantly improves social welfare, reduces collusion success rates, enhances fairness, and increases behavioral robustness under noise. Ablation studies further reveal the complementary contributions of each system component. This work lays the foundation for scalable, transparent, and incentive-aligned coordination in decentralized intelligent agent systems.