Rule Extraction from Neural Networks and Other Classifiers Applied to XSS Detection
Mereani, F. & Howe, J. M. ORCID: 0000-0001-8013-6941 (2021). Rule Extraction from Neural Networks and Other Classifiers Applied to XSS Detection. In: Morelo, J. J., Garibaldi, J., Linares-Barranco, A. , Warwick, K. & Madani, K. (Eds.), Computational Intelligence. 11th International Joint Conference, IJCCI 2019, 17-19 Sep 2019, Vienna, Austria. doi: 10.1007/978-3-030-70594-7_15
Abstract
Explainable artificial intelligence (XAI) is concerned with creating artificial intelligence that is intelligible and interpretable by humans. Many AI techniques build classifiers, some of which result in intelligible models, some of which don’t. Rule extraction from classifiers treated as black boxes is an impor- tant topic in XAI, that aims to find rule sets that describe classifiers and that are understandable to humans. Neural networks provide one type of classifier where it is difficult to explain why the inputs map to the decision; support vector ma- chines provide a second example of this kind. A third type of classifier, k-nearest neighbour (k-NN), gives more interpretable classifiers, but suffers from perfor- mance problems as the model is little more than a representation of the training data. This work investigates a technique to extract rules from classifiers where the underlying problem’s feature space is Boolean, without looking at the in- ner structure of the classifier. For such a classifier with a small feature space, a Boolean function describing it can be directly calculated, whilst for a classifier with a larger feature space, a sampling method is investigated to produce rule- based approximations to the behaviour of the underlying classifier, with varying granularity, leading to XAI. The behaviour of the technique with neural network, support vector machine, and k-NN classifiers is experimentally assessed on a dataset of cross-site scripting (XSS) attacks, and proves to give very high accuracy and precision, often comparable to the classifier being approximated.
Publication Type: | Conference or Workshop Item (Paper) |
---|---|
Additional Information: | This is a post-peer-review, pre-copyedit version of a chapter published in Computation Intelligence. The final authenticated version is available online at: https://doi.org/10.1007/978-3-030-70594-7_15 |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science R Medicine > RC Internal medicine > RC0321 Neuroscience. Biological psychiatry. Neuropsychiatry |
Departments: | School of Science & Technology > Computer Science > Software Reliability |
Download (226kB) | Preview
Export
Downloads
Downloads per month over past year