Towards survival prediction of cancer patients using medical images

View article
PeerJ Computer Science

Main article text

 

Introduction

  • We evaluate the performance of 156 image features from nine classes, six feature selection methods, and 10 machine learning models to select the best performing model and features for MRI and CT scanned datasets of BraTS and NSCLC, respectively.

  • A comprehensive analysis using the AUC score for each model shows that logistic regression is the best performing classifier and more ‘stable’ model with 0.769 and 0.751 AUC score for BraTS and NSCLC, respectively.

  • Our analysis on features highlights that shape-based and gray level contrast features are best performing features for MRI scanned data. While grey level symmetry features perform best for CT scanned images.

  • Experiments and analysis show that logistic regression and linear regression models are more suitable models for survival prediction purposes as compared to decision tree, multilayer perceptron, artificial neural network, random forest, and support vector machine models.

Literature review

Dataset

Methodology

Feature extraction and selection

Machine learning models and evaluation

Experimental setup

Results

BraTS dataset

NSCLC dataset

Stability analysis

Conclusion

Supplemental Information

Code and data files of the experiment

DOI: 10.7717/peerj-cs.1090/supp-1

ReadMe file for code

DOI: 10.7717/peerj-cs.1090/supp-2

Value of parameters used to train the model on Windows 10 using the python 3.9 installed using Anaconda 64 bit-version

DOI: 10.7717/peerj-cs.1090/supp-3

Additional Information and Declarations

Competing Interests

The authors declare there are no competing interests.

Author Contributions

Nazeef Ul Haq performed the experiments, analyzed the data, performed the computation work, prepared figures and/or tables, and approved the final draft.

Bilal Tahir analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Samar Firdous conceived and designed the experiments, authored or reviewed drafts of the article, and approved the final draft.

Muhammad Amir Mehmood conceived and designed the experiments, authored or reviewed drafts of the article, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The code and data are available in the Supplemental Files.

Funding

This research work was funded by Higher Education Commission (HEC) Pakistan and Ministry of Planning Development and Reforms under National Center in Big Data and Cloud Computing. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

3 Citations 1,279 Views 116 Downloads