On Hypothesis Testing for Comparing Image Quality Assessment Metrics [Tips & Tricks]
Zhu, R. ORCID: 0000-0002-9944-0369, Zhou, F., Yang, W. & Xue, J-H. (2018). On Hypothesis Testing for Comparing Image Quality Assessment Metrics [Tips & Tricks]. IEEE Signal Processing Magazine, 35(4), pp. 133-136. doi: 10.1109/msp.2018.2829209
Abstract
In developing novel image quality assessment (IQA) metrics, researchers should compare their proposed metrics with state-of-the-art metrics. A commonly adopted approach compares two sets of residuals, each computed between the nonlinearly mapped scores of one IQA metric and the difference mean opinion score; both sets are assumed to follow Gaussian distributions with zero means. An F-test is then used to test the equality of variances of the two sets of residuals. If the variances are significantly different, we conclude that the residuals come from different Gaussian distributions and that the two IQA metrics are significantly different. The F-test assumes that the two sets of residuals are independent. However, because the IQA metrics are calculated on the same database, the two sets of residuals are paired and may be correlated. We note this improper usage of the F-test by practitioners, which can lead to misleading comparisons of two IQA metrics. To solve this practical problem, we introduce the Pitman test to investigate the equality of variances for two sets of correlated residuals. Experiments on the Laboratory for Image and Video Engineering (LIVE) database show that the two tests can lead to different conclusions.
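The contrast drawn in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the synthetic residuals, and the use of NumPy and SciPy are all assumptions made for illustration. The Pitman test is written here in its common textbook form (often called the Pitman-Morgan test), which checks whether the sums and differences of the paired samples are uncorrelated; the article itself may formulate it differently.

```python
import numpy as np
from scipy import stats


def f_test_equal_var(res_a, res_b):
    """Two-sided F-test for equal variances; assumes INDEPENDENT samples."""
    res_a = np.asarray(res_a, dtype=float)
    res_b = np.asarray(res_b, dtype=float)
    f_stat = np.var(res_a, ddof=1) / np.var(res_b, ddof=1)
    cdf = stats.f.cdf(f_stat, res_a.size - 1, res_b.size - 1)
    p_value = min(1.0, 2.0 * min(cdf, 1.0 - cdf))
    return f_stat, p_value


def pitman_test_equal_var(res_a, res_b):
    """Pitman-Morgan test for equal variances of PAIRED samples.

    Uses the identity cov(a + b, a - b) = var(a) - var(b): the variances are
    equal exactly when the sums and differences are uncorrelated.
    """
    res_a = np.asarray(res_a, dtype=float)
    res_b = np.asarray(res_b, dtype=float)
    if res_a.shape != res_b.shape:
        raise ValueError("paired samples must have the same shape")
    n = res_a.size
    r = np.corrcoef(res_a + res_b, res_a - res_b)[0, 1]
    # t statistic for a zero-correlation test with n - 2 degrees of freedom
    t_stat = r * np.sqrt((n - 2) / (1.0 - r**2))
    p_value = 2.0 * stats.t.sf(abs(t_stat), n - 2)
    return t_stat, p_value


# Synthetic residuals (hypothetical data, not from the LIVE database):
# both sets share a common component, so they are strongly correlated.
demo_rng = np.random.default_rng(0)
shared = demo_rng.normal(size=500)
demo_a = shared + 0.1 * demo_rng.normal(size=500)
demo_b = 0.9 * shared + 0.1 * demo_rng.normal(size=500)

print("F-test (statistic, p-value):", f_test_equal_var(demo_a, demo_b))
print("Pitman test (statistic, p-value):", pitman_test_equal_var(demo_a, demo_b))
```

Because the F-test ignores the pairing, the two p-values can differ when the residuals are correlated, which is the situation the abstract describes for metrics evaluated on the same database.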
| Publication Type: | Article |
|---|---|
| Additional Information: | © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
| Subjects: | H Social Sciences > HA Statistics |
| Departments: | Bayes Business School > Actuarial Science & Insurance |