TY - CHAP
T1 - Cancer phenotype development
T2 - A literature review
AU - Wang, Pei
AU - Garza, Maryam
AU - Zozus, Meredith
PY - 2019/1/1
Y1 - 2019/1/1
N2 - EHR-based, computable phenotypes can be leveraged by healthcare organizations and researchers to improve the cohort identification process. The ability to identify patient cohorts using aspects of care and outcomes based on clinical characteristics or diagnostic conditions and/or risk factors presents opportunities to researchers targeting specific populations for drug development and disease interventions. The objective of this review was to summarize the literature describing the development and use of phenotypes for cohort identification of cancer patients. A survey of the literature indexed in PubMed was performed to identify studies using EHR-based phenotypes for use in cancer studies. Specific search criteria were formulated by leveraging a phenotype identification guideline developed by the Phenotypes, Data Standards, and Data Quality Core of the NIH Health Care Systems Research Collaboratory. The final set of articles was examined further to identify 1) the cancer of interest and 2) the different approaches used for phenotype development, validation and implementation. The articles reviewed were specific to breast cancer, colorectal cancer, ovarian cancer, and lung cancer. The approaches taken for phenotype development and validation varied slightly among the relevant publications. Four studies relied on chart review, three utilized machine learning techniques, one took an ontological approach, and one utilized natural language processing (NLP).
AB - EHR-based, computable phenotypes can be leveraged by healthcare organizations and researchers to improve the cohort identification process. The ability to identify patient cohorts using aspects of care and outcomes based on clinical characteristics or diagnostic conditions and/or risk factors presents opportunities to researchers targeting specific populations for drug development and disease interventions. The objective of this review was to summarize the literature describing the development and use of phenotypes for cohort identification of cancer patients. A survey of the literature indexed in PubMed was performed to identify studies using EHR-based phenotypes for use in cancer studies. Specific search criteria were formulated by leveraging a phenotype identification guideline developed by the Phenotypes, Data Standards, and Data Quality Core of the NIH Health Care Systems Research Collaboratory. The final set of articles was examined further to identify 1) the cancer of interest and 2) the different approaches used for phenotype development, validation and implementation. The articles reviewed were specific to breast cancer, colorectal cancer, ovarian cancer, and lung cancer. The approaches taken for phenotype development and validation varied slightly among the relevant publications. Four studies relied on chart review, three utilized machine learning techniques, one took an ontological approach, and one utilized natural language processing (NLP).
KW - cancer phenotypes
KW - computable phenotypes
KW - electronic health records
KW - phenotype development
KW - secondary data use
UR - http://www.scopus.com/inward/record.url?scp=85061290806&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85061290806&partnerID=8YFLogxK
U2 - 10.3233/978-1-61499-951-5-468
DO - 10.3233/978-1-61499-951-5-468
M3 - Chapter
C2 - 30741241
AN - SCOPUS:85061290806
T3 - Studies in Health Technology and Informatics
SP - 468
EP - 472
BT - Improving Usability, Safety and Patient Outcomes with Health Information Technology
A2 - Kuo, Alex Mu-Hsing
A2 - Kushniruk, Andre
A2 - Lau, Francis
A2 - Borycki, Elizabeth M.
A2 - Bliss, Gerry
A2 - Monkman, Helen
A2 - Roudsari, Abdul Vahabpour
A2 - Bartle-Clar, John A.
A2 - Courtney, Karen L.
PB - IOS Press
ER -