Automatic Segmentation of Kidneys and Kidney Tumors: The KiTS19 International Challenge

Niranjan J Sathianathen; Nicholas Heller; Resha Tejpaul; Bethany Stai; Arveen Kalapara; Jack Rickman; Joshua Dean; Makinna Oestreich; Paul Blake; Heather Kaluzniak; Shaneabbas Raza; Joel Rosenberg; Keenan Moore; Edward Walczak; Zachary Rengel; Zach Edgerton; Ranveer Vasdev; Matthew Peterson; Sean McSweeney; Sarah Peterson; Nikolaos Papanikolopoulos; Christopher J Weight

doi:10.3389/fdgth.2021.797607

Automatic Segmentation of Kidneys and Kidney Tumors: The KiTS19 International Challenge

Niranjan J Sathianathen, Nicholas Heller, Resha Tejpaul, Bethany Stai, Arveen Kalapara, Jack Rickman, Joshua Dean, Makinna Oestreich, Paul Blake, Heather Kaluzniak, Shaneabbas Raza, Joel Rosenberg, Keenan Moore, Edward Walczak, Zachary Rengel, Zach Edgerton, Ranveer Vasdev, Matthew Peterson, Sean McSweeney, Sarah PetersonNikolaos Papanikolopoulos, Christopher J Weight

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

Purpose: Clinicians rely on imaging features to calculate complexity of renal masses based on validated scoring systems. These scoring methods are labor-intensive and are subjected to interobserver variability. Artificial intelligence has been increasingly utilized by the medical community to solve such issues. However, developing reliable algorithms is usually time-consuming and costly. We created an international community-driven competition (KiTS19) to develop and identify the best system for automatic segmentation of kidneys and kidney tumors in contrast CT and report the results. Methods: A training and test set of CT scans that was manually annotated by trained individuals were generated from consecutive patients undergoing renal surgery for whom demographic, clinical and outcome data were available. The KiTS19 Challenge was a machine learning competition hosted on grand-challenge.org in conjunction with an international conference. Teams were given 3 months to develop their algorithm using a full-annotated training set of images and an unannotated test set was released for 2 weeks from which average Sørensen-Dice coefficient between kidney and tumor regions were calculated across all 90 test cases. Results: There were 100 valid submissions that were based on deep neural networks but there were differences in pre-processing strategies, architectural details, and training procedures. The winning team scored a 0.974 kidney Dice and a 0.851 tumor Dice resulting in 0.912 composite score. Automatic segmentation of the kidney by the participating teams performed comparably to expert manual segmentation but was less reliable when segmenting the tumor. Conclusion: Rapid advancement in automated semantic segmentation of kidney lesions is possible with relatively high accuracy when the data is released publicly, and participation is incentivized. We hope that our findings will encourage further research that would enable the potential of adopting AI into the medical field.

Original language	English (US)
Article number	797607
Journal	Frontiers in Digital Health
Volume	3
DOIs	https://doi.org/10.3389/fdgth.2021.797607
State	Published - Jan 4 2022
Externally published	Yes

Bibliographical note

Publisher Copyright:
Copyright © 2022 Sathianathen, Heller, Tejpaul, Stai, Kalapara, Rickman, Dean, Oestreich, Blake, Kaluzniak, Raza, Rosenberg, Moore, Walczak, Rengel, Edgerton, Vasdev, Peterson, McSweeney, Peterson, Papanikolopoulos and Weight.

Keywords

ct scans
kidney tumors
medical images
renal mass
semantic segmentation

PubMed: MeSH publication types

Journal Article

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access

10.3389/fdgth.2021.797607

OpenUrl availability

Full text

Cite this

Sathianathen, N. J., Heller, N., Tejpaul, R., Stai, B., Kalapara, A., Rickman, J., Dean, J., Oestreich, M., Blake, P., Kaluzniak, H., Raza, S., Rosenberg, J., Moore, K., Walczak, E., Rengel, Z., Edgerton, Z., Vasdev, R., Peterson, M., McSweeney, S., ... Weight, C. J. (2022). Automatic Segmentation of Kidneys and Kidney Tumors: The KiTS19 International Challenge. Frontiers in Digital Health, 3, Article 797607. https://doi.org/10.3389/fdgth.2021.797607

Sathianathen, NJ, Heller, N, Tejpaul, R, Stai, B, Kalapara, A, Rickman, J, Dean, J, Oestreich, M, Blake, P, Kaluzniak, H, Raza, S, Rosenberg, J, Moore, K, Walczak, E, Rengel, Z, Edgerton, Z, Vasdev, R, Peterson, M, McSweeney, S, Peterson, S, Papanikolopoulos, N & Weight, CJ 2022, 'Automatic Segmentation of Kidneys and Kidney Tumors: The KiTS19 International Challenge', Frontiers in Digital Health, vol. 3, 797607. https://doi.org/10.3389/fdgth.2021.797607

@article{d46ecf7c6bfb4038a3370349801f1ae2,

title = "Automatic Segmentation of Kidneys and Kidney Tumors: The KiTS19 International Challenge",

abstract = "Purpose: Clinicians rely on imaging features to calculate complexity of renal masses based on validated scoring systems. These scoring methods are labor-intensive and are subjected to interobserver variability. Artificial intelligence has been increasingly utilized by the medical community to solve such issues. However, developing reliable algorithms is usually time-consuming and costly. We created an international community-driven competition (KiTS19) to develop and identify the best system for automatic segmentation of kidneys and kidney tumors in contrast CT and report the results. Methods: A training and test set of CT scans that was manually annotated by trained individuals were generated from consecutive patients undergoing renal surgery for whom demographic, clinical and outcome data were available. The KiTS19 Challenge was a machine learning competition hosted on grand-challenge.org in conjunction with an international conference. Teams were given 3 months to develop their algorithm using a full-annotated training set of images and an unannotated test set was released for 2 weeks from which average S{\o}rensen-Dice coefficient between kidney and tumor regions were calculated across all 90 test cases. Results: There were 100 valid submissions that were based on deep neural networks but there were differences in pre-processing strategies, architectural details, and training procedures. The winning team scored a 0.974 kidney Dice and a 0.851 tumor Dice resulting in 0.912 composite score. Automatic segmentation of the kidney by the participating teams performed comparably to expert manual segmentation but was less reliable when segmenting the tumor. Conclusion: Rapid advancement in automated semantic segmentation of kidney lesions is possible with relatively high accuracy when the data is released publicly, and participation is incentivized. We hope that our findings will encourage further research that would enable the potential of adopting AI into the medical field.",

keywords = "ct scans, kidney tumors, medical images, renal mass, semantic segmentation",

author = "Sathianathen, {Niranjan J} and Nicholas Heller and Resha Tejpaul and Bethany Stai and Arveen Kalapara and Jack Rickman and Joshua Dean and Makinna Oestreich and Paul Blake and Heather Kaluzniak and Shaneabbas Raza and Joel Rosenberg and Keenan Moore and Edward Walczak and Zachary Rengel and Zach Edgerton and Ranveer Vasdev and Matthew Peterson and Sean McSweeney and Sarah Peterson and Nikolaos Papanikolopoulos and Weight, {Christopher J}",

note = "Publisher Copyright: Copyright {\textcopyright} 2022 Sathianathen, Heller, Tejpaul, Stai, Kalapara, Rickman, Dean, Oestreich, Blake, Kaluzniak, Raza, Rosenberg, Moore, Walczak, Rengel, Edgerton, Vasdev, Peterson, McSweeney, Peterson, Papanikolopoulos and Weight.",

year = "2022",

month = jan,

day = "4",

doi = "10.3389/fdgth.2021.797607",

language = "English (US)",

volume = "3",

journal = "Frontiers in Digital Health",

issn = "2673-253X",

publisher = "Frontiers Media S. A.",

}

TY - JOUR

T1 - Automatic Segmentation of Kidneys and Kidney Tumors

T2 - The KiTS19 International Challenge

AU - Sathianathen, Niranjan J

AU - Heller, Nicholas

AU - Tejpaul, Resha

AU - Stai, Bethany

AU - Kalapara, Arveen

AU - Rickman, Jack

AU - Dean, Joshua

AU - Oestreich, Makinna

AU - Blake, Paul

AU - Kaluzniak, Heather

AU - Raza, Shaneabbas

AU - Rosenberg, Joel

AU - Moore, Keenan

AU - Walczak, Edward

AU - Rengel, Zachary

AU - Edgerton, Zach

AU - Vasdev, Ranveer

AU - Peterson, Matthew

AU - McSweeney, Sean

AU - Peterson, Sarah

AU - Papanikolopoulos, Nikolaos

AU - Weight, Christopher J

N1 - Publisher Copyright: Copyright © 2022 Sathianathen, Heller, Tejpaul, Stai, Kalapara, Rickman, Dean, Oestreich, Blake, Kaluzniak, Raza, Rosenberg, Moore, Walczak, Rengel, Edgerton, Vasdev, Peterson, McSweeney, Peterson, Papanikolopoulos and Weight.

PY - 2022/1/4

Y1 - 2022/1/4

N2 - Purpose: Clinicians rely on imaging features to calculate complexity of renal masses based on validated scoring systems. These scoring methods are labor-intensive and are subjected to interobserver variability. Artificial intelligence has been increasingly utilized by the medical community to solve such issues. However, developing reliable algorithms is usually time-consuming and costly. We created an international community-driven competition (KiTS19) to develop and identify the best system for automatic segmentation of kidneys and kidney tumors in contrast CT and report the results. Methods: A training and test set of CT scans that was manually annotated by trained individuals were generated from consecutive patients undergoing renal surgery for whom demographic, clinical and outcome data were available. The KiTS19 Challenge was a machine learning competition hosted on grand-challenge.org in conjunction with an international conference. Teams were given 3 months to develop their algorithm using a full-annotated training set of images and an unannotated test set was released for 2 weeks from which average Sørensen-Dice coefficient between kidney and tumor regions were calculated across all 90 test cases. Results: There were 100 valid submissions that were based on deep neural networks but there were differences in pre-processing strategies, architectural details, and training procedures. The winning team scored a 0.974 kidney Dice and a 0.851 tumor Dice resulting in 0.912 composite score. Automatic segmentation of the kidney by the participating teams performed comparably to expert manual segmentation but was less reliable when segmenting the tumor. Conclusion: Rapid advancement in automated semantic segmentation of kidney lesions is possible with relatively high accuracy when the data is released publicly, and participation is incentivized. We hope that our findings will encourage further research that would enable the potential of adopting AI into the medical field.

AB - Purpose: Clinicians rely on imaging features to calculate complexity of renal masses based on validated scoring systems. These scoring methods are labor-intensive and are subjected to interobserver variability. Artificial intelligence has been increasingly utilized by the medical community to solve such issues. However, developing reliable algorithms is usually time-consuming and costly. We created an international community-driven competition (KiTS19) to develop and identify the best system for automatic segmentation of kidneys and kidney tumors in contrast CT and report the results. Methods: A training and test set of CT scans that was manually annotated by trained individuals were generated from consecutive patients undergoing renal surgery for whom demographic, clinical and outcome data were available. The KiTS19 Challenge was a machine learning competition hosted on grand-challenge.org in conjunction with an international conference. Teams were given 3 months to develop their algorithm using a full-annotated training set of images and an unannotated test set was released for 2 weeks from which average Sørensen-Dice coefficient between kidney and tumor regions were calculated across all 90 test cases. Results: There were 100 valid submissions that were based on deep neural networks but there were differences in pre-processing strategies, architectural details, and training procedures. The winning team scored a 0.974 kidney Dice and a 0.851 tumor Dice resulting in 0.912 composite score. Automatic segmentation of the kidney by the participating teams performed comparably to expert manual segmentation but was less reliable when segmenting the tumor. Conclusion: Rapid advancement in automated semantic segmentation of kidney lesions is possible with relatively high accuracy when the data is released publicly, and participation is incentivized. We hope that our findings will encourage further research that would enable the potential of adopting AI into the medical field.

KW - ct scans

KW - kidney tumors

KW - medical images

KW - renal mass

KW - semantic segmentation

UR - http://www.scopus.com/inward/record.url?scp=85131254408&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85131254408&partnerID=8YFLogxK

U2 - 10.3389/fdgth.2021.797607

DO - 10.3389/fdgth.2021.797607

M3 - Article

C2 - 35059687

AN - SCOPUS:85131254408

SN - 2673-253X

VL - 3

JO - Frontiers in Digital Health

JF - Frontiers in Digital Health

M1 - 797607

ER -

Automatic Segmentation of Kidneys and Kidney Tumors: The KiTS19 International Challenge

Abstract

Bibliographical note

Keywords

PubMed: MeSH publication types

UN SDGs

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this