City Research Online

Lung cancer prediction using machine learning on data from a symptom e-questionnaire for never smokers, formers smokers and current smokers

Nemlander, E., Rosenblad, A., Abedi, E. , Ekman, S., Hasselström, J., Eriksson, L. E. ORCID: 0000-0001-5121-5325 & Carlsson, A. C. (2022). Lung cancer prediction using machine learning on data from a symptom e-questionnaire for never smokers, formers smokers and current smokers. PLoS One, 17(10), e0276703. doi: 10.1371/journal.pone.0276703


PURPOSE: The aim of the present study was to investigate the predictive ability for lung cancer of symptoms reported in an adaptive e-questionnaire, separately for never smokers, former smokers, and current smokers.

PATIENTS AND METHODS: Consecutive patients referred for suspected lung cancer were recruited between September 2014 and November 2015 from the lung clinic at the Karolinska University Hospital, Stockholm, Sweden. A total of 504 patients were later diagnosed with lung cancer (n = 310) or no cancer (n = 194). All participants answered an adaptive e-questionnaire with a maximum of 342 items, covering background variables and symptoms/sensations suspected to be associated with lung cancer. Stochastic gradient boosting, stratified on smoking status, was used to train and test a model for predicting the presence of lung cancer.

RESULTS: Among never smokers, 17 predictors contributed to predicting lung cancer with 82% of the patients being correctly classified, compared with 26 predictors with an accuracy of 77% among current smokers and 36 predictors with an accuracy of 63% among former smokers. Age, sex, and education level were the most important predictors in all models.

CONCLUSION: Methods or tools to assess the likelihood of lung cancer based on smoking status and to prioritize investigative and treatment measures among all patients seeking care with diffuse symptoms are much needed. Our study presents risk assessment models for patients with different smoking status that may be developed into clinical risk assessment tools that can help clinicians in assessing a patient's risk of having lung cancer.

Publication Type: Article
Additional Information: © 2022 Nemlander et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Subjects: R Medicine > RA Public aspects of medicine > RA0421 Public health. Hygiene. Preventive Medicine
R Medicine > RC Internal medicine > RC0254 Neoplasms. Tumors. Oncology (including Cancer)
Departments: School of Health & Psychological Sciences > Nursing
[thumbnail of journal.pone.0276703.pdf]
Text - Published Version
Available under License Creative Commons Attribution.

Download (823kB) | Preview


Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email


Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login