City Research Online

Selectively augmented attention network for few-shot image classification

Li, X., Wang, X., Zhu, R. ORCID: 0000-0002-9944-0369 , Ma, Z. ORCID: 0000-0003-2950-2488, Cao, J. ORCID: 0000-0003-0481-5170 & Xue, J-H. (2024). Selectively augmented attention network for few-shot image classification. IEEE Transactions on Circuits and Systems for Video Technology, doi: 10.1109/tcsvt.2024.3480279

Abstract

Few-shot image classification is a challenging task that aims to learn from a limited number of labelled training images a classification model that can be generalised to unseen classes. Two strategies are usually taken to improve the classification performances of few-shot image classifiers: either applying data augmentation to enlarge the sample size of the training set and reduce overfitting, or involving attention mechanisms to highlight discriminative spatial regions or channels. However, naively applying them to few-shot classifiers directly and separately may lead to undesirable results; for example, some augmented images may focus majorly on the background rather than the object, which brings additional noises to the training process. In this paper, we propose a unified framework, the selectively augmented attention (SAA) network, that carefully integrates the best of the two approaches in an end-to-end fashion via a selective best match module to select the most representative images from the augmented training set. The selected images tend to concentrate on the objects with less irrelevant background, which can assist the subsequent calculation of attentions by alleviating the interference from background. Moreover, we design a joint attention module to jointly learn both the spatial and channel-wise attentions. Experimental results on four benchmark datasets showcase the superior classification performance of the proposed SAA network compared with the state-of-the-arts.

Publication Type: Article
Additional Information: © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Publisher Keywords: Few-shot image classification, Data augmentation, Attention mechanism, Metric-based methods
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
T Technology > TK Electrical engineering. Electronics Nuclear engineering
Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science
Departments: Bayes Business School
Bayes Business School > Actuarial Science & Insurance
SWORD Depositor:
[thumbnail of SAA.pdf]
Preview
Text - Accepted Version
Download (9MB) | Preview

Export

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Downloads

Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login