Multi-modal Open World User Identification
Irfan, B., Garcia Ortiz, M. ORCID: 0000-0003-4729-7457, Lyubova, N. & Belpaeme, T. (2022). Multi-modal Open World User Identification. ACM Transactions on Human-Robot Interaction, 11(1), pp. 1-50. doi: 10.1145/3477963
Abstract
User identification is an essential step in creating a personalised long-term interaction with robots. This requires learning the users continuously and incrementally, possibly starting from a state without any known user. In this article, we describe a multi-modal incremental Bayesian network with online learning, which is the first method that can be applied in such scenarios. Face recognition is used as the primary biometric, and it is combined with ancillary information, such as gender, age, height, and time of interaction to improve the recognition. The Multi-modal Long-term User Recognition Dataset is generated to simulate various human-robot interaction (HRI) scenarios and evaluate our approach in comparison to face recognition, soft biometrics, and a state-of-the-art open world recognition method (Extreme Value Machine). The results show that the proposed methods significantly outperform the baselines, with an increase in the identification rate up to 47.9% in open-set and closed-set scenarios, and a significant decrease in long-term recognition performance loss. The proposed models generalise well to new users, provide stability, improve over time, and decrease the bias of face recognition. The models were applied in HRI studies for user recognition, personalised rehabilitation, and customer-oriented service, which showed that they are suitable for long-term HRI in the real world.
Publication Type: | Article |
---|---|
Additional Information: | This is the authors’ accepted manuscript. The final version of this work is published in October 2021 by ACM in Transactions on Human-Robot Interaction, available at DOI: 10.1145/3477963. This work is made available online in accordance with the publisher’s policies. Please refer to any applicable terms of use of the publisher. |
Publisher Keywords: | Open world recognition, Bayesian network, soft biometrics, incremental learning, online learning, multi-modal dataset, long-term user recognition, Human-Robot Interaction |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Departments: | School of Science & Technology > Computer Science |
SWORD Depositor: |
Download (10MB) | Preview
Export
Downloads
Downloads per month over past year