City Research Online

Interpreting Categorical Data Classifiers using Explanation-based Locality

Rasouli, P., Yu, I. C. & Jimenez-Ruiz, E. ORCID: 0000-0002-9083-4599 (2023). Interpreting Categorical Data Classifiers using Explanation-based Locality. In: IEEE International Conference on Data Mining Workshops, ICDMW. International Conference on Data Mining Workshops, ICDMW, 28 Nov - 01 Dec 2022, Orlando, FL, USA. doi: 10.1109/ICDMW58026.2022.00030

Abstract

Local surrogate explanation methods are a popular class of post-hoc interpretability approaches that explain the rationale of machine learning models in the locality of every particular instance. Fidelity, which refers to the accuracy of explanation methods in imitating the actual behavior of a model, is highly affected by their strategy for identifying the locality of instances. To find the locality of an instance, we need to calculate the distance between the instance and perturbed data points concerning categorical and numerical features. While the distance of numerical features can be measured precisely, the existing works usually adopt a coarse-grained or imprecise approach for comparing categorical features. This is especially problematic in the categorical data setting, where defining a representative locality demands fine-grained semantic similarity information between categories. In this paper, we propose a locality generation approach for categorical data classifiers that makes no assumption about domain knowledge and infers categorical similarities by relying on the model's explanations. Further, we devise a multi-centered sampling approach based on the derived similarity information that, compared to the conventional instance-centered technique, captures the local behavior of the model more effectively. Moreover, we develop a knowledge-based locality generation approach based on knowledge graphs to benchmark our explanation-based method against a scenario where the similarity information is provided by a domain expert. The experiments conducted on various data sets demonstrate the efficacy of our approach in generating faithful explanations.

Publication Type: Conference or Workshop Item (Paper)
Additional Information: © 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Publisher Keywords: Conferences, Semantics, Knowledge based systems, Machine learning, Benchmark testing, Data models, Behavioral sciences
Subjects: Q Science > QA Mathematics
Departments: School of Science & Technology > Computer Science
[thumbnail of Interpreting_Categorical_Data_Classifiers_using_Explanation_based_Locality__ICDM_Workshop_.pdf]
Preview
Text - Accepted Version
Download (530kB) | Preview

Export

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Downloads

Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login