TY - JOUR
T1 - Assessing longitudinal housing status using Electronic Health Record data
T2 - a comparison of natural language processing, structured data, and patient-reported history
AU - Chapman, Alec B.
AU - Cordasco, Kristina
AU - Chassman, Stephanie
AU - Panadero, Talia
AU - Agans, Dylan
AU - Jackson, Nicholas
AU - Clair, Kimberly
AU - Nelson, Richard
AU - Montgomery, Ann Elizabeth
AU - Tsai, Jack
AU - Finley, Erin
AU - Gabrielian, Sonya
N1 - Publisher Copyright:
Copyright © 2023 Chapman, Cordasco, Chassman, Panadero, Agans, Jackson, Clair, Nelson, Montgomery, Tsai, Finley and Gabrielian.
PY - 2023
Y1 - 2023
N2 - Introduction: Measuring long-term housing outcomes is important for evaluating the impacts of services for individuals with homeless experience. However, assessing long-term housing status using traditional methods is challenging. The Veterans Affairs (VA) Electronic Health Record (EHR) provides detailed data for a large population of patients with homeless experiences and contains several indicators of housing instability, including structured data elements (e.g., diagnosis codes) and free-text clinical narratives. However, the validity of each of these data elements for measuring housing stability over time is not well-studied. Methods: We compared VA EHR indicators of housing instability, including information extracted from clinical notes using natural language processing (NLP), with patient-reported housing outcomes in a cohort of homeless-experienced Veterans. Results: NLP achieved higher sensitivity and specificity than standard diagnosis codes for detecting episodes of unstable housing. Other structured data elements in the VA EHR showed promising performance, particularly when combined with NLP. Discussion: Evaluation efforts and research studies assessing longitudinal housing outcomes should incorporate multiple data sources of documentation to achieve optimal performance.
AB - Introduction: Measuring long-term housing outcomes is important for evaluating the impacts of services for individuals with homeless experience. However, assessing long-term housing status using traditional methods is challenging. The Veterans Affairs (VA) Electronic Health Record (EHR) provides detailed data for a large population of patients with homeless experiences and contains several indicators of housing instability, including structured data elements (e.g., diagnosis codes) and free-text clinical narratives. However, the validity of each of these data elements for measuring housing stability over time is not well-studied. Methods: We compared VA EHR indicators of housing instability, including information extracted from clinical notes using natural language processing (NLP), with patient-reported housing outcomes in a cohort of homeless-experienced Veterans. Results: NLP achieved higher sensitivity and specificity than standard diagnosis codes for detecting episodes of unstable housing. Other structured data elements in the VA EHR showed promising performance, particularly when combined with NLP. Discussion: Evaluation efforts and research studies assessing longitudinal housing outcomes should incorporate multiple data sources of documentation to achieve optimal performance.
KW - electronic health records
KW - homelessness
KW - natural language processing
KW - social determinants of health
KW - veterans affairs
UR - http://www.scopus.com/inward/record.url?scp=85161150554&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85161150554&partnerID=8YFLogxK
U2 - 10.3389/frai.2023.1187501
DO - 10.3389/frai.2023.1187501
M3 - Article
C2 - 37293237
AN - SCOPUS:85161150554
SN - 2624-8212
VL - 6
JO - Frontiers in Artificial Intelligence
JF - Frontiers in Artificial Intelligence
M1 - 1187501
ER -