Robust Multi‐Agent Reinforcement Learning Against Adversarial Attacks for Cooperative Self‐Driving Vehicles

Wang, C.; Wang, Z.; Aouf, N.

Robust Multi‐Agent Reinforcement Learning Against Adversarial Attacks for Cooperative Self‐Driving Vehicles

Wang, C. ORCID: 0009-0007-8060-6326, Wang, Z. & Aouf, N. ORCID: 0000-0001-9291-4077 (2025). Robust Multi‐Agent Reinforcement Learning Against Adversarial Attacks for Cooperative Self‐Driving Vehicles. IET Radar, Sonar & Navigation, 19(1), article number e70033. doi: 10.1049/rsn2.70033

Abstract

Multi‐agent deep reinforcement learning (MARL) for self‐driving vehicles aims to address the complex challenge of coordinating multiple autonomous agents in shared road environments. MARL creates a more stable system and improves vehicle performance in typical traffic scenarios compared to single‐agent DRL systems. However, despite its sophisticated cooperative training, MARL remains vulnerable to unforeseen adversarial attacks. Perturbed observation states can lead one or more vehicles to make critical errors in decision‐making, triggering chain reactions that often result in severe collisions and accidents. To ensure the safety and reliability of multi‐agent autonomous driving systems, this paper proposes a robust constrained cooperative multi‐agent reinforcement learning (R‐CCMARL) algorithm for self‐driving vehicles, enabling robust driving policy to handle strong and unpredictable adversarial attacks. Unlike most existing works, our R‐CCMARL framework employs a universal policy for each agent, achieving a more practical, nontask‐oriented driving agent for real‐world applications. In this way, it enables us to integrate shared observations with Mean‐Field theory to model interactions within the MARL system. A risk formulation and a risk estimation network are developed to minimise the defined long‐term risks. To further enhance robustness, this risk estimator is then used to construct a constrained optimisation objective function with a regulariser to maximise long‐term rewards in worst‐case scenarios. Experiments conducted in the CARLA simulator in intersection scenarios demonstrate that our method remains robust against adversarial state perturbations while maintaining high performance, both with and without attacks.

Publication Type:	Article
Additional Information:	© 2025 The Author(s). IET Radar, Sonar & Navigation published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
Publisher Keywords:	decision making, multi-robot systems, neural nets
Subjects:	T Technology > TA Engineering (General). Civil engineering (General)
Departments:	School of Science & Technology School of Science & Technology > Department of Engineering
SWORD Depositor:	Symplectic Administrator

Preview

Text - Published Version
Available under License Creative Commons Attribution.
Download (2MB) | Preview

Official URL: https://doi.org/10.1049/rsn2.70033

Export

Downloads

Downloads per month over past year

View more statistics

Metadata

Altmetric

View Altmetric information about this item.

CORE (COnnecting REpositories)

Actions (login required)

Admin Login

Creators:	Wang, C. ORCID: 0009-0007-8060-6326 Wang, Z. Aouf, N. ORCID: 0000-0001-9291-4077
Status:	Published
Refereed:	Yes
Journal or Publication Title:	IET Radar, Sonar & Navigation
Publisher:	Institution of Engineering and Technology (IET)
ISSN:	1751-8784
e-ISSN:	1751-8792
URI:	https://openaccess.city.ac.uk/id/eprint/35259
Date available in CRO:	29 May 2025 10:04
Date deposited:	29 May 2025
Dates:	Date Event 1 January 2025 Published 19 May 2025 Published Online 7 May 2025 Accepted