Selectively augmented attention network for few-shot image classification
Li, X., Wang, X., Zhu, R. ORCID: 0000-0002-9944-0369 , Ma, Z. ORCID: 0000-0003-2950-2488, Cao, J. ORCID: 0000-0003-0481-5170 & Xue, J-H. (2024). Selectively augmented attention network for few-shot image classification. IEEE Transactions on Circuits and Systems for Video Technology, doi: 10.1109/tcsvt.2024.3480279
Abstract
Few-shot image classification is a challenging task that aims to learn from a limited number of labelled training images a classification model that can be generalised to unseen classes. Two strategies are usually taken to improve the classification performances of few-shot image classifiers: either applying data augmentation to enlarge the sample size of the training set and reduce overfitting, or involving attention mechanisms to highlight discriminative spatial regions or channels. However, naively applying them to few-shot classifiers directly and separately may lead to undesirable results; for example, some augmented images may focus majorly on the background rather than the object, which brings additional noises to the training process. In this paper, we propose a unified framework, the selectively augmented attention (SAA) network, that carefully integrates the best of the two approaches in an end-to-end fashion via a selective best match module to select the most representative images from the augmented training set. The selected images tend to concentrate on the objects with less irrelevant background, which can assist the subsequent calculation of attentions by alleviating the interference from background. Moreover, we design a joint attention module to jointly learn both the spatial and channel-wise attentions. Experimental results on four benchmark datasets showcase the superior classification performance of the proposed SAA network compared with the state-of-the-arts.
Publication Type: | Article |
---|---|
Additional Information: | © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
Publisher Keywords: | Few-shot image classification, Data augmentation, Attention mechanism, Metric-based methods |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science T Technology > TK Electrical engineering. Electronics Nuclear engineering Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science |
Departments: | Bayes Business School Bayes Business School > Actuarial Science & Insurance |
SWORD Depositor: |
Download (9MB) | Preview
Export
Downloads
Downloads per month over past year