City Research Online

Robust Multi‐Agent Reinforcement Learning Against Adversarial Attacks for Cooperative Self‐Driving Vehicles

Wang, C. ORCID: 0009-0007-8060-6326, Wang, Z. & Aouf, N. ORCID: 0000-0001-9291-4077 (2025). Robust Multi‐Agent Reinforcement Learning Against Adversarial Attacks for Cooperative Self‐Driving Vehicles. IET Radar, Sonar & Navigation, 19(1), article number e70033. doi: 10.1049/rsn2.70033

Abstract

Multi‐agent deep reinforcement learning (MARL) for self‐driving vehicles aims to address the complex challenge of coordinating multiple autonomous agents in shared road environments. MARL creates a more stable system and improves vehicle performance in typical traffic scenarios compared to single‐agent DRL systems. However, despite its sophisticated cooperative training, MARL remains vulnerable to unforeseen adversarial attacks. Perturbed observation states can lead one or more vehicles to make critical errors in decision‐making, triggering chain reactions that often result in severe collisions and accidents. To ensure the safety and reliability of multi‐agent autonomous driving systems, this paper proposes a robust constrained cooperative multi‐agent reinforcement learning (R‐CCMARL) algorithm for self‐driving vehicles, enabling robust driving policy to handle strong and unpredictable adversarial attacks. Unlike most existing works, our R‐CCMARL framework employs a universal policy for each agent, achieving a more practical, nontask‐oriented driving agent for real‐world applications. In this way, it enables us to integrate shared observations with Mean‐Field theory to model interactions within the MARL system. A risk formulation and a risk estimation network are developed to minimise the defined long‐term risks. To further enhance robustness, this risk estimator is then used to construct a constrained optimisation objective function with a regulariser to maximise long‐term rewards in worst‐case scenarios. Experiments conducted in the CARLA simulator in intersection scenarios demonstrate that our method remains robust against adversarial state perturbations while maintaining high performance, both with and without attacks.

Publication Type: Article
Additional Information: © 2025 The Author(s). IET Radar, Sonar & Navigation published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
Publisher Keywords: decision making, multi-robot systems, neural nets
Subjects: T Technology > TA Engineering (General). Civil engineering (General)
Departments: School of Science & Technology
School of Science & Technology > Engineering
SWORD Depositor:
[thumbnail of Robust Multi.pdf]
Preview
Text - Published Version
Available under License Creative Commons Attribution.

Download (2MB) | Preview

Export

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Downloads

Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login