City Research Online

Various Active Learning Strategies Analysis in Image Labeling: Maximizing Performance with Minimum Labeled Data

Tyagi, A., Aditya, H., Shelke, N. A. , Khandelwal, R., Singh, J., Jadeja, Y. ORCID: 0000-0003-4790-3592 & Turukmane, A. V. (2024). Various Active Learning Strategies Analysis in Image Labeling: Maximizing Performance with Minimum Labeled Data. In: Recent Trends in Image Processing and Pattern Recognition. 6th International Conference, RTIP2R 2023, 7-8 Dec 2023, Derby, UK. doi: 10.1007/978-3-031-53082-1_15

Abstract

The use of active learning in supervised machine learning is proposed in this study to reduce the expenses associated with labeling data. Active learning is a technique that includes iteratively selecting the most informative unlabeled data points and asking a human expert to label them. Active learning can achieve high accuracy while utilizing fewer labeled examples than typical supervised learning algorithms by selecting the most informative data points. This study conducts and provides an in-depth examination and analysis of numerous active learning algorithms and their applications to various machine learning labeling problems, especially focusing on image classification. The experiments are carried out using Fashion MNIST as a benchmark dataset. This study compares the performance of five popular active learning methods BALD, DBAL, coreset, least confidence and ensemble varR for the given problem. The best performing algorithm was BALD with a mean classification accuracy of 91.31%, when 50% of the data is considered labeled, closely followed by all other techniques, making each suitable for specific use cases. The trials conducted by the study illustrates how active learning may lower the time and cost of data labeling while also maintaining high accuracy.

Publication Type: Conference or Workshop Item (Paper)
Additional Information: This paper has been accepted and published by Springer. The final version is available at: https://doi.org/10.1007/978-3-031-53082-1_15. © 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
Publisher Keywords: Active Learning, CNN, Data Labeling, Diversity Sampling, Ens-varR, Image Classification, Uncertainty Sampling.
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments: School of Science & Technology
School of Science & Technology > Engineering
SWORD Depositor:
[thumbnail of VariousActiveLearningStrategiesAnalysisinImageLabelingMaximizingPerformancewithMinimumLabeledData.pdf] Text - Accepted Version
This document is not freely accessible until 31 January 2025 due to copyright restrictions.

To request a copy, please use the button below.

Request a copy

Export

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Downloads

Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login