City Research Online

Machine Learning for Performance Prediction of Data Distribution Service (DDS)

Peeroo, K. ORCID: 0000-0001-8601-4750, Popov, P. T. ORCID: 0000-0002-3434-5272, Stankovic, V. ORCID: 0000-0002-8740-6526 & Weyde, T. ORCID: 0000-0001-8028-9905 (2024). Machine Learning for Performance Prediction of Data Distribution Service (DDS). Paper presented at the European Dependable Computing Conference, 8-11 Apr 2024, Leuven, Belgium.

Abstract

Networking middleware following the Data Distribution Service (DDS) specification is used in real-time missioncritical systems such as autonomous vehicles, energy management systems, and air traffic control. DDS follows the publishsubscribe communication pattern and offers a set of Quality of Service (QoS) parameters, allowing the users to align the data communication to the needs of the application. Configuring DDS to achieve the required performance is a difficult task, given the large space of QoS parameter values. Experimental evaluation of performance levels with a real DDS system for different QoS configurations can be complex and require substantial time and resources.

We propose the use of Machine Learning (ML) models to predict the performance metric distribution of DDS under different configurations. This is done by using performance measurements of some configurations to train an ML model. The trained model can then be used to predict the performance distribution of DDS under other system configurations. Since the prediction is computationally inexpensive, we can predict the performance of many different configurations to find a suitable one for given requirements. To the best of our knowledge, this is the first time this approach has been applied to DDS performance evaluation. We used random forests (RF) as an ML method and linear regression (LR) as a baseline. We selected thirteen performance metrics, and for each, we trained an RF model and tuned its hyperparameters. We tested the final models on system configurations unseen during training, both for parameter values within the training range (interpolation) and outside the training range (extrapolation).

The RF models show better predictive accuracy than the LR baseline. This paper focuses on the models for throughput and latency - the two well-established performance metrics. The models demonstrate coefficients of determination greater than 0.9 and 0.8, respectively, for different unseen system configurations in interpolation, but work less well in extrapolation cases. We conclude that the proposed ML models offer a way of predicting the performance distribution of a range of configurations when interpolation is used. Since model prediction is computationally much cheaper than relying on experimentation, it is a useful tool to guide DDS system parametrisation and design.

Publication Type: Conference or Workshop Item (Paper)
Additional Information: © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Publisher Keywords: Data Distribution Service, Machine Learning, Linear Regression, Random Forests, Performance Prediction
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments: School of Science & Technology
School of Science & Technology > Computer Science
School of Science & Technology > Computer Science > Software Reliability
SWORD Depositor:
[thumbnail of Machine_Learning_for_Performance_Prediction_of_DDS___Conference_Paper_Short_Version-1.pdf]
Preview
Text - Accepted Version
Download (283kB) | Preview

Export

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Downloads

Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login