Accurate stratification of patients with neuroblastoma by application of the reject option paradigm to outcome prediction
- Published
- Accepted
- Subject Areas
- Bioinformatics, Molecular Biology, Oncology, Pediatrics
- Keywords
- Reject option, Prognosis, Neuroblastoma, Hypoxia., Stratification, Pediatric
- Copyright
- © 2016 Cangelosi et al.
- Licence
- This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
- Cite this article
- 2016. Accurate stratification of patients with neuroblastoma by application of the reject option paradigm to outcome prediction. PeerJ Preprints 4:e2263v1 https://doi.org/10.7287/peerj.preprints.2263v1
Abstract
Motivation
Neuroblastoma (NB) is the most common solid pediatric tumor, deriving from ganglionic lineage precursors of the sympathetic nervous system. Half of the patients with high-risk NB die despite treatment and more accurate outcome prediction models are needed to direct effective therapies. To limit the therapy's side effects, a highly accurate classification of a limited number of patients is preferable to a broader classification with greater error. We aimed at improving the classifier's accuracy and establishing the criteria for confident selection of classification rules by applying the reject option technique (RO).
Methods
We evaluated outcome prediction by BFTree, ID3, J48, REPTree, SimpleCART decision-tree algorithms utilizing a 182 patients’ dataset. Fifty percent of the patients were utilized for model selection, and the remaining 50% for independent validation. The risk factors were: NB-hypo, age at diagnosis, stage and MYCN amplification. Accuracy of the classification was measured by Matthew’s Correlation Coefficient and assessed by 2 fold cross validation analysis repeated 1000 times. Trade-off between confidence and accuracy was estimated by the RO technique utilizing the accuracy/rejection plot. Kaplan-Meier estimate and log-rank test assessed overall survival.
Results
Every decision tree classified the patients' outcome with a confidence >0.6. Only ID3, utilizing all risk factors, stratified stage 4 patients and was chosen for further analysis. Application of RO raised the ID3 accuracy from 69% to 92%. This result was obtained because of accuracy/rejection plot identified a threshold confidence of 0.66 at which 71% of the patients, classified by highly reliable rules, were accepted. The trade-off was the exclusion from classification of 29% of patients falling in low represented and ambiguous rules. Kaplan-Meier curves showed a significant stratification of stage 4 patients following RO application. In conclusion, we demonstrated that application of the RO improves the classification performance of ID3 decision tree. Stage 4 patients can be stratified in significant groups characterized by the high confidence rules needed for making clinical decisions.
Author Comment
This is an abstract which has been accepted for the BITS2016 Meeting.