City Research Online

Multi-modal Open World User Identification

Irfan, B., Garcia Ortiz, M. ORCID: 0000-0003-4729-7457, Lyubova, N. & Belpaeme, T. (2022). Multi-modal Open World User Identification. ACM Transactions on Human-Robot Interaction, 11(1), pp. 1-50. doi: 10.1145/3477963

Abstract

User identification is an essential step in creating a personalised long-term interaction with robots. This requires learning the users continuously and incrementally, possibly starting from a state without any known user. In this article, we describe a multi-modal incremental Bayesian network with online learning, which is the first method that can be applied in such scenarios. Face recognition is used as the primary biometric, and it is combined with ancillary information, such as gender, age, height, and time of interaction to improve the recognition. The Multi-modal Long-term User Recognition Dataset is generated to simulate various human-robot interaction (HRI) scenarios and evaluate our approach in comparison to face recognition, soft biometrics, and a state-of-the-art open world recognition method (Extreme Value Machine). The results show that the proposed methods significantly outperform the baselines, with an increase in the identification rate up to 47.9% in open-set and closed-set scenarios, and a significant decrease in long-term recognition performance loss. The proposed models generalise well to new users, provide stability, improve over time, and decrease the bias of face recognition. The models were applied in HRI studies for user recognition, personalised rehabilitation, and customer-oriented service, which showed that they are suitable for long-term HRI in the real world.

Publication Type: Article
Additional Information: This is the authors’ accepted manuscript. The final version of this work is published in October 2021 by ACM in Transactions on Human-Robot Interaction, available at DOI: 10.1145/3477963. This work is made available online in accordance with the publisher’s policies. Please refer to any applicable terms of use of the publisher.
Publisher Keywords: Open world recognition, Bayesian network, soft biometrics, incremental learning, online learning, multi-modal dataset, long-term user recognition, Human-Robot Interaction
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments: School of Science & Technology > Computer Science
SWORD Depositor:
[thumbnail of irfan2021multimodal-open.pdf]
Preview
Text - Accepted Version
Download (10MB) | Preview

Export

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Downloads

Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login