Big data cohort extraction for personalized statin treatment and machine learning

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Abstract

The creation of big clinical data cohorts for machine learning and data analysis requires a number of steps from the beginning to successful completion. As with data set preprocessing in other fields, an initial data quality evaluation is needed; with large, heterogeneous clinical data sets, however, it is also important to standardize the data in order to facilitate dimensionality reduction. This is particularly important for clinical data sets that include medications as a core data component, because of the complexity of coded medication data. Data integration at the individual subject level is essential for medication-related machine learning applications, since it is difficult to accurately identify drug exposures, therapeutic effects, and adverse drug events without high-quality integration of insurance, medication, and medical data. Successful data integration and standardization efforts can substantially improve the ability to identify and replicate personalized treatment pathways that optimize drug therapy.
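The chapter itself details these steps; as a rough illustration of the kind of medication standardization and subject-level integration the abstract describes, the Python sketch below maps pharmacy-claim NDC codes to ingredient names (collapsing many product codes into a few features, a simple form of dimensionality reduction), derives per-patient exposure windows, and joins diagnoses that fall inside those windows as a crude screen for potential adverse drug events. The table layouts, NDC values, and the NDC-to-ingredient map are hypothetical stand-ins, not the chapter's actual data model; in practice the crosswalk would come from a terminology resource such as RxNorm.

import pandas as pd

# Hypothetical NDC-to-ingredient crosswalk (illustrative codes only).
NDC_TO_INGREDIENT = {
    "00071-0155-23": "atorvastatin",
    "00071-0156-23": "atorvastatin",
    "00006-0749-54": "simvastatin",
}

# Toy pharmacy-claims and medical-claims tables keyed on a shared patient ID.
pharmacy = pd.DataFrame({
    "patient_id": [1, 1, 2, 3],
    "fill_date": pd.to_datetime(["2016-01-05", "2016-02-04", "2016-01-10", "2016-03-01"]),
    "ndc": ["00071-0155-23", "00071-0156-23", "00006-0749-54", "00071-0155-23"],
    "days_supply": [30, 30, 90, 30],
})
medical = pd.DataFrame({
    "patient_id": [1, 2, 2, 4],
    "dx_date": pd.to_datetime(["2016-02-20", "2016-03-15", "2016-04-02", "2016-01-01"]),
    "icd10": ["E78.5", "M62.82", "E11.9", "I10"],  # M62.82 = rhabdomyolysis, a known statin adverse event
})

# Step 1: standardize coded medications -- many NDCs collapse to one ingredient,
# shrinking the feature space of any downstream model.
pharmacy["ingredient"] = pharmacy["ndc"].map(NDC_TO_INGREDIENT)

# Step 2: derive per-patient exposure windows (first fill through end of last supply).
exposure = (
    pharmacy.assign(end=pharmacy["fill_date"]
                    + pd.to_timedelta(pharmacy["days_supply"], unit="D"))
    .groupby(["patient_id", "ingredient"])
    .agg(start=("fill_date", "min"), end=("end", "max"))
    .reset_index()
)

# Step 3: subject-level integration -- keep diagnoses recorded during an
# exposure window, flagging candidate drug-event associations for review.
merged = exposure.merge(medical, on="patient_id", how="inner")
on_drug_dx = merged[(merged["dx_date"] >= merged["start"])
                    & (merged["dx_date"] <= merged["end"])]
print(on_drug_dx[["patient_id", "ingredient", "icd10", "dx_date"]])

In this toy run, patient 2's M62.82 diagnosis falls inside the simvastatin exposure window and would surface as a candidate adverse event, while diagnoses outside any window are dropped.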

Original language: English (US)
Title of host publication: Methods in Molecular Biology
Publisher: Humana Press Inc.
Pages: 255-272
Number of pages: 18
DOIs
State: Published - 2019

Publication series

Name: Methods in Molecular Biology
Volume: 1939
ISSN (Print): 1064-3745

Bibliographical note

Publisher Copyright:
© Springer Science+Business Media, LLC, part of Springer Nature 2019.

Keywords

  • Clinical comorbidity evaluation
  • Clinical data integration
  • Medication safety
  • Personalized medication therapy
