Various Active Learning Strategies Analysis in Image Labeling: Maximizing Performance with Minimum Labeled Data
Tyagi, A., Aditya, H., Shelke, N. A. , Khandelwal, R., Singh, J., Jadeja, Y. ORCID: 0000-0003-4790-3592 & Turukmane, A. V. (2024). Various Active Learning Strategies Analysis in Image Labeling: Maximizing Performance with Minimum Labeled Data. In: Recent Trends in Image Processing and Pattern Recognition. 6th International Conference, RTIP2R 2023, 7-8 Dec 2023, Derby, UK. doi: 10.1007/978-3-031-53082-1_15
Abstract
The use of active learning in supervised machine learning is proposed in this study to reduce the expenses associated with labeling data. Active learning is a technique that includes iteratively selecting the most informative unlabeled data points and asking a human expert to label them. Active learning can achieve high accuracy while utilizing fewer labeled examples than typical supervised learning algorithms by selecting the most informative data points. This study conducts and provides an in-depth examination and analysis of numerous active learning algorithms and their applications to various machine learning labeling problems, especially focusing on image classification. The experiments are carried out using Fashion MNIST as a benchmark dataset. This study compares the performance of five popular active learning methods BALD, DBAL, coreset, least confidence and ensemble varR for the given problem. The best performing algorithm was BALD with a mean classification accuracy of 91.31%, when 50% of the data is considered labeled, closely followed by all other techniques, making each suitable for specific use cases. The trials conducted by the study illustrates how active learning may lower the time and cost of data labeling while also maintaining high accuracy.
Publication Type: | Conference or Workshop Item (Paper) |
---|---|
Additional Information: | This paper has been accepted and published by Springer. The final version is available at: https://doi.org/10.1007/978-3-031-53082-1_15. © 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG |
Publisher Keywords: | Active Learning, CNN, Data Labeling, Diversity Sampling, Ens-varR, Image Classification, Uncertainty Sampling. |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Departments: | School of Science & Technology School of Science & Technology > Engineering |
SWORD Depositor: |
This document is not freely accessible until 31 January 2025 due to copyright restrictions.
To request a copy, please use the button below.
Request a copyExport
Downloads
Downloads per month over past year