Clustering of Largely Right-Censored Oropharyngeal Head and Neck Cancer Patients for Discriminative Groupings to Improve Outcome Prediction

Joel Tosado, Luka Zdilar, Hesham Elhalawani, Baher Elgohari, David M. Vock, G. Elisabeta Marai, Clifton Fuller, Abdallah S.R. Mohamed, Guadalupe Canahuate

Research output: Contribution to journal (article, peer-reviewed)

21 Scopus citations

Abstract

Clustering is the task of identifying groups of similar subjects according to certain criteria. The AJCC staging system can be thought of as a clustering mechanism that groups patients based on their disease stage. This grouping drives prognosis and influences treatment. The goal of this work is to evaluate the efficacy of machine learning algorithms in clustering patients into discriminative groups to improve prognosis for overall survival (OS) and relapse-free survival (RFS) outcomes. We apply clustering to retrospectively collected data from 644 head and neck cancer patients, including both clinical and radiomic features. To incorporate outcome information into the clustering process and to deal with the large proportion of censored samples, the feature space was scaled using regression coefficients fitted to a proxy dependent variable, martingale residuals, instead of follow-up time. Two clusters were identified and evaluated using cross-validation. The Kaplan-Meier (KM) curves of the two clusters differ significantly for OS and RFS (p-value < 0.0001). Moreover, adding the cluster label to the clinical features gave a relative predictive improvement over using the clinical features alone: AUC increased by 5.7% and 13.0% for OS and RFS, respectively.
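The outcome-guided scaling step described in the abstract can be illustrated with a minimal sketch. The abstract does not state which model produced the martingale residuals, which regression was fitted, or which clustering algorithm was used, so everything below is an assumption made for illustration: residuals come from a null model (event indicator minus the Nelson-Aalen cumulative hazard), the coefficients come from ordinary least squares on standardized features, clustering is plain 2-means, and the data are synthetic. The function names (`martingale_residuals`, `outcome_weights`, `two_means`) are hypothetical and are not taken from the paper.

```python
import numpy as np

def martingale_residuals(time, event):
    """Null-model martingale residuals: event_i - H(t_i),
    where H is the Nelson-Aalen cumulative hazard estimate (assumed model)."""
    time = np.asarray(time, dtype=float)
    event = np.asarray(event, dtype=float)
    cum_hazard = np.zeros_like(time)
    for t in np.unique(time[event == 1]):
        at_risk = np.sum(time >= t)                  # subjects still under observation at t
        events_at_t = np.sum((time == t) & (event == 1))
        cum_hazard[time >= t] += events_at_t / at_risk
    return event - cum_hazard

def outcome_weights(X, residuals):
    """Standardize features, then fit least squares of residuals on them.
    Returns the standardized features and the slopes (intercept dropped)."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    design = np.column_stack([np.ones(len(Z)), Z])
    coef, *_ = np.linalg.lstsq(design, residuals, rcond=None)
    return Z, coef[1:]

def two_means(points, n_iter=100):
    """Lloyd's algorithm with k=2 and a deterministic farthest-point start."""
    c0 = points[np.argmax(np.linalg.norm(points - points.mean(axis=0), axis=1))]
    c1 = points[np.argmax(np.linalg.norm(points - c0, axis=1))]
    centers = np.vstack([c0, c1])
    for _ in range(n_iter):
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        new_centers = np.vstack([points[labels == k].mean(axis=0) for k in range(2)])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels

# Synthetic cohort (illustrative only, not the study data).
rng = np.random.default_rng(0)
n = 200
group = np.repeat([0, 1], n // 2)                    # hidden low/high risk group
X = np.column_stack([
    4.0 * group + rng.normal(size=n),                # feature related to risk
    rng.normal(size=n),                              # feature unrelated to risk
])
rate = np.where(group == 1, 1.0, 0.1)                # assumed event rates per group
event_time = rng.exponential(scale=1.0 / rate)
censor_time = 2.0                                    # administrative censoring
time = np.minimum(event_time, censor_time)
event = (event_time <= censor_time).astype(int)

residuals = martingale_residuals(time, event)
Z, weights = outcome_weights(X, residuals)
labels = two_means(Z * weights)                      # cluster in the outcome-scaled space
agreement = max(np.mean(labels == group), np.mean(labels != group))
```

Multiplying each standardized feature by its coefficient shrinks features that carry little outcome signal, so distances in the scaled space are dominated by outcome-relevant features. Because the residuals are defined for censored and uncensored patients alike, no sample has to be dropped. In a real analysis the residuals would more plausibly come from a fitted survival model, and the resulting clusters would be compared with Kaplan-Meier curves and a log-rank test, as the abstract reports.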

Original language: English (US)
Article number: 3811
Journal: Scientific Reports
Volume: 10
Issue number: 1
State: Published - Dec 1 2020

Bibliographical note

Publisher Copyright:
© 2020, The Author(s).
