Prediction of Pathologic Complete Response for Rectal Cancer Based on Pretreatment Factors Using Machine Learning

Kevin A. Chen, Paolo Goffredo, Logan R. Butler, Chinmaya U. Joisa, Jose G. Guillem, Shawn M. Gomez, Muneera R. Kapadia

Research output: Contribution to journalArticlepeer-review

Abstract

BACKGROUND: Pathologic complete response after neoadjuvant therapy is an important prognostic indicator for locally advanced rectal cancer and may give insights into which patients might be treated nonoperatively in the future. Existing models for predicting pathologic complete response in the pretreatment setting are limited by small data sets and low accuracy. OBJECTIVE: We sought to use machine learning to develop a more generalizable predictive model for pathologic complete response for locally advanced rectal cancer. DESIGN: Patients with locally advanced rectal cancer who underwent neoadjuvant therapy followed by surgical resection were identified in the National Cancer Database from years 2010 to 2019 and were split into training, validation, and test sets. Machine learning techniques included random forest, gradient boosting, and artificial neural network. A logistic regression model was also created. Model performance was assessed using an area under the receiver operating characteristic curve. SETTINGS: This study used a national, multicenter data set. PATIENTS: Patients with locally advanced rectal cancer who underwent neoadjuvant therapy and proctectomy. MAIN OUTCOME MEASURES: Pathologic complete response defined as T0/xN0/x. RESULTS: The data set included 53,684 patients. Pathologic complete response was experienced by 22.9% of patients. Gradient boosting showed the best performance with an area under the receiver operating characteristic curve of 0.777 (95% CI, 0.773-0.781), compared with 0.684 (95% CI, 0.68-0.688) for logistic regression. The strongest predictors of pathologic complete response were no lymphovascular invasion, no perineural invasion, lower CEA, smaller size of tumor, and microsatellite stability. A concise model including the top 5 variables showed preserved performance. LIMITATIONS: The models were not externally validated. CONCLUSIONS: Machine learning techniques can be used to accurately predict pathologic complete response for locally advanced rectal cancer in the pretreatment setting. After fine-tuning a data set including patients treated nonoperatively, these models could help clinicians identify the appropriate candidates for a watch-and-wait strategy. See Video Abstract.

Original languageEnglish (US)
Pages (from-to)387-397
Number of pages11
JournalDiseases of the colon and rectum
Volume67
Issue number3
DOIs
StatePublished - Mar 1 2024

Bibliographical note

Publisher Copyright:
© 2024 Lippincott Williams and Wilkins. All rights reserved.

Keywords

  • Artificial intelligence
  • Machine learning
  • Pathological complete response
  • Rectal cancer

PubMed: MeSH publication types

  • Video-Audio Media
  • Multicenter Study
  • Journal Article

Fingerprint

Dive into the research topics of 'Prediction of Pathologic Complete Response for Rectal Cancer Based on Pretreatment Factors Using Machine Learning'. Together they form a unique fingerprint.

Cite this