Robust Multi‐Agent Reinforcement Learning Against Adversarial Attacks for Cooperative Self‐Driving Vehicles
Wang, C. ORCID: 0009-0007-8060-6326, Wang, Z. & Aouf, N.
ORCID: 0000-0001-9291-4077 (2025).
Robust Multi‐Agent Reinforcement Learning Against Adversarial Attacks for Cooperative Self‐Driving Vehicles.
IET Radar, Sonar & Navigation, 19(1),
article number e70033.
doi: 10.1049/rsn2.70033
Abstract
Multi‐agent deep reinforcement learning (MARL) for self‐driving vehicles aims to address the complex challenge of coordinating multiple autonomous agents in shared road environments. MARL creates a more stable system and improves vehicle performance in typical traffic scenarios compared to single‐agent DRL systems. However, despite its sophisticated cooperative training, MARL remains vulnerable to unforeseen adversarial attacks. Perturbed observation states can lead one or more vehicles to make critical errors in decision‐making, triggering chain reactions that often result in severe collisions and accidents. To ensure the safety and reliability of multi‐agent autonomous driving systems, this paper proposes a robust constrained cooperative multi‐agent reinforcement learning (R‐CCMARL) algorithm for self‐driving vehicles, enabling robust driving policy to handle strong and unpredictable adversarial attacks. Unlike most existing works, our R‐CCMARL framework employs a universal policy for each agent, achieving a more practical, nontask‐oriented driving agent for real‐world applications. In this way, it enables us to integrate shared observations with Mean‐Field theory to model interactions within the MARL system. A risk formulation and a risk estimation network are developed to minimise the defined long‐term risks. To further enhance robustness, this risk estimator is then used to construct a constrained optimisation objective function with a regulariser to maximise long‐term rewards in worst‐case scenarios. Experiments conducted in the CARLA simulator in intersection scenarios demonstrate that our method remains robust against adversarial state perturbations while maintaining high performance, both with and without attacks.
Publication Type: | Article |
---|---|
Additional Information: | © 2025 The Author(s). IET Radar, Sonar & Navigation published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
Publisher Keywords: | decision making, multi-robot systems, neural nets |
Subjects: | T Technology > TA Engineering (General). Civil engineering (General) |
Departments: | School of Science & Technology School of Science & Technology > Engineering |
SWORD Depositor: |
Available under License Creative Commons Attribution.
Download (2MB) | Preview
Export
Downloads
Downloads per month over past year