City Research Online - International Classification of Diseases Prediction from MIMIIC-III Clinical Text Using Pre-Trained ClinicalBERT and NLP Deep Learning Models Achieving State of the Art

International Classification of Diseases Prediction from MIMIIC-III Clinical Text Using Pre-Trained ClinicalBERT and NLP Deep Learning Models Achieving State of the Art

Aden, I., Child, C. H. T. ORCID: 0000-0001-5425-2308 & Reyes-Aldasoro, C. C. ORCID: 0000-0002-9466-2018 (2024). International Classification of Diseases Prediction from MIMIIC-III Clinical Text Using Pre-Trained ClinicalBERT and NLP Deep Learning Models Achieving State of the Art. Big Data and Cognitive Computing, 8(5), article number 47. doi: 10.3390/bdcc8050047

Abstract

The International Classification of Diseases (ICD) serves as a widely employed framework for assigning diagnosis codes to electronic health records of patients. These codes facilitate the encapsulation of diagnoses and procedures conducted during a patient’s hospitalisation. This study aims to devise a predictive model for ICD codes based on the MIMIC-III clinical text dataset. Leveraging natural language processing techniques and deep learning architectures, we constructed a pipeline to distill pertinent information from the MIMIC-III dataset: the Medical Information Mart for Intensive Care III (MIMIC-III), a sizable, de-identified, and publicly accessible repository of medical records. Our method entails predicting diagnosis codes from unstructured data, such as discharge summaries and notes encompassing symptoms. We used state-of-the-art deep learning algorithms, such as recurrent neural networks (RNNs), long short-term memory (LSTM) networks, bidirectional LSTM (BiLSTM) and BERT models after tokenizing the clinical test with Bio-ClinicalBERT, a pre-trained model from Hugging Face. To evaluate the efficacy of our approach, we conducted experiments utilizing the discharge dataset within MIMIC-III. Employing the BERT model, our methodology exhibited commendable accuracy in predicting the top 10 and top 50 diagnosis codes within the MIMIC-III dataset, achieving average accuracies of 88% and 80%, respectively. In comparison to recent studies by Biseda and Kerang, as well as Gangavarapu, which reported F1 scores of 0.72 in predicting the top 10 ICD-10 codes, our model demonstrated better performance, with an F1 score of 0.87. Similarly, in predicting the top 50 ICD-10 codes, previous research achieved an F1 score of 0.75, whereas our method attained an F1 score of 0.81. These results underscore the better performance of deep learning models over conventional machine learning approaches in this domain, thus validating our findings. The ability to predict diagnoses early from clinical notes holds promise in assisting doctors or physicians in determining effective treatments, thereby reshaping the conventional paradigm of diagnosis-then-treatment care. Our code is available online.

Publication Type:	Article
Additional Information:	© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Publisher Keywords:	ICD prediction; NLP; deep learning models (RNN, LSTM, BERT)
Subjects:	Q Science > QA Mathematics > QA75 Electronic computers. Computer science R Medicine > RC Internal medicine
Departments:	School of Science & Technology School of Science & Technology > Computer Science School of Science & Technology > Computer Science > giCentre
SWORD Depositor:	Symplectic Administrator

Preview

Text - Published Version
Available under License Creative Commons: Attribution International Public License 4.0.
Download (1MB) | Preview

Official URL: http://dx.doi.org/10.3390/bdcc8050047

Export

Downloads

Downloads per month over past year

View more statistics

Metadata

Altmetric

CORE (COnnecting REpositories)

Actions (login required)

Admin Login

Creators:	Aden, I. Child, C. H. T. ORCID: 0000-0001-5425-2308 Reyes-Aldasoro, C. C. ORCID: 0000-0002-9466-2018
Status:	Published
Refereed:	Yes
Journal or Publication Title:	Big Data and Cognitive Computing
Publisher:	MDPI AG
e-ISSN:	2504-2289
URI:	https://openaccess.city.ac.uk/id/eprint/33037
Date available in CRO:	04 Jun 2024 12:48
Date deposited:	3 June 2024
Dates:	Date Event 10 May 2024 Published 10 May 2024 Published Online 30 April 2024 Accepted