Extracting deterministic finite automata from RNNs via hyperplane partitioning and learning

Wickramasinghe, S. Y.; Howe, J.; Daviaud, L.

Extracting deterministic finite automata from RNNs via hyperplane partitioning and learning

Wickramasinghe, S. Y., Howe, J. ORCID: 0000-0001-8013-6941 & Daviaud, L. (2025). Extracting deterministic finite automata from RNNs via hyperplane partitioning and learning. Paper presented at the International Conference on Explainable AI for Neural and Symbolic Methods, 22-24 Oct 2025, Marbella, Spain.

Abstract

Recurrent Neural Networks (RNNs) have achieved remarkable success in handling sequential data. However, they lack interpretability. Extracting Deterministic Finite Automata (DFAs) from black-box models can provide insight into their decision-making processes. This research focuses on extracting DFAs from RNNs trained on regular languages using an exact learning framework. The proposed approach employs the L algorithm to learn a DFA, and it demonstrates how a hyperplane-based method can be used to partition the RNN state space when evaluating equivalence queries.

Publication Type:	Conference or Workshop Item (Paper)
Additional Information:	This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record will be available online at: https://link.springer.com/series/7899
Publisher Keywords:	Recurrent Neural Networks, Finite State Automata, Explainable AI
Subjects:	Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments:	School of Science & Technology School of Science & Technology > Department of Computer Science
SWORD Depositor:	Symplectic Administrator

Text - Accepted Version
This document is not freely accessible due to copyright restrictions.

To request a copy, please use the button below.

Request a copy

Export

Downloads

Downloads per month over past year

View more statistics

Metadata

Funder Information

CORE (COnnecting REpositories)

Actions (login required)

Admin Login

Creators:	Wickramasinghe, S. Y. Howe, J. ORCID: 0000-0001-8013-6941 Daviaud, L.
Event Title:	International Conference on Explainable AI for Neural and Symbolic Methods
Event Type:	Conference
Event Location:	Marbella, Spain
Event Dates:	22-24 Oct 2025
Status:	In Press
Refereed:	Yes
Journal or Publication Title:	Communications in Computer and Information Science
Publisher:	Springer International Publishing
ISSN:	1865-0929
e-ISSN:	1865-0937
URI:	https://openaccess.city.ac.uk/id/eprint/35990
Date available in CRO:	15 Oct 2025 12:36
Date deposited:	9 October 2025
Dates:	Date Event 1 August 2025 Accepted