Multi-Branch Network for Few-shot Learning
Ren, K., Guo, Z., Zhang, Z. , Zhu, R. ORCID: 0000-0002-9944-0369 & Li, X. (2022). Multi-Branch Network for Few-shot Learning. In: Proceedings of 2022 Asia-Pacific signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 7-10 Nov 2022, Chiang Mai, Thailand. doi: 10.23919/APSIPAASC55919.2022.9980160
Abstract
Few-shot learning aims provide precise predictions for unseen data through learning from only one or few labelled samples of each class. However, it often suffers from the overfitting problem because of insufficient training data. In this paper, we propose a novel metric-based few-shot learning method, multi-branch network (MBN), with a new data augmentation module to improve the generalization ability of the model. Specifically, we generate different types of noise contaminated data through multiple branches in the network to simulate the real-world scenarios when noisy images are obtained. Following this novel data augmentation module, the feature embedding and similarities between the support and query samples are learned simultaneously through the embedding and metric modules, respectively. Moreover, to consider more details in the feature maps, we propose to utilize the average-pooling layer in the metric module rather than the commonly adopted max-pooling layer. The network is trained from end to end by the Kullback- Leibler (KL) divergence, to minimize the difference between the distributions of the ground truths and predictions. Extensive experiments on Standford-Dogs, Standford-Cars, CUB-200-2011 and mini-ImageNet in the 1-shot and 5-shot tasks demonstrate the superior classification performance of MBN.
Publication Type: | Conference or Workshop Item (Paper) |
---|---|
Additional Information: | © 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
Subjects: | H Social Sciences > HA Statistics Q Science > QA Mathematics Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Departments: | Bayes Business School > Actuarial Science & Insurance |
Download (302kB) | Preview
Export
Downloads
Downloads per month over past year