TY - JOUR
T1 - Privacy-protecting multivariable-adjusted distributed regression analysis for multi-center pediatric study
AU - on behalf of the PCORnet Antibiotics and Childhood Growth Study Group
AU - Toh, Sengwee
AU - Rifas-Shiman, Sheryl L.
AU - Lin, Pi I.D.
AU - Bailey, L. Charles
AU - Forrest, Christopher B.
AU - Horgan, Casie E.
AU - Lunsford, Douglas
AU - Moyneur, Erick
AU - Sturtevant, Jessica L.
AU - Young, Jessica G.
AU - Block, Jason P.
AU - Appelhans, Brad
AU - Arterburn, David
AU - Boone-Heinenon, Janne
AU - Brickman, Andrew L.
AU - Bunnell, H. Timothy
AU - Cole, F. Sessions
AU - Daley, Matthew F.
AU - Dempsey, Amanda
AU - Finkelstein, Jonathan
AU - Fitzpatrick, Stephanie L.
AU - Heerman, William
AU - Horberg, Michael
AU - Isasi, Carmen R.
AU - Jay, Melanie
AU - Kharbanda, Elyse
AU - Khare, Ritu
AU - Lemas, Dominick
AU - Lin, Simon M.
AU - Messito, Mary Jo
AU - O’Neill, Allison
AU - Peay, Holly Landrum
AU - Prochaska, Micah
AU - Ranade, Daksha
AU - Rao, Goutham
AU - Rayas, Maria
AU - Reynolds, Juliane S.
AU - Rosenman, Marc
AU - Taylor, Bradley
AU - Willis, Zachary
N1 - Funding Information:
Brad Appelhans6, David Arterburn7, Janne Boone-Heinenon8, Andrew L. Brickman9, H. Timothy Bunnell10, F. Sessions Cole, III11, Matthew F. Daley12, Amanda Dempsey13, Jonathan Finkelstein14, Stephanie L. Fitzpatrick15, William Heerman16, Michael Horberg17, Carmen R. Isasi18, Melanie Jay19, Elyse Kharbanda20, Ritu Khare21, Dominick Lemas22, Simon M. Lin23, Mary Jo Messito24, Allison O’Neill25, Holly Landrum Peay26, Micah Prochaska27, Daksha Ranade28, Goutham Rao29, Maria Rayas30, Juliane S. Reynolds31, Marc Rosenman32, Bradley Taylor33, Zachary Willis34 6Rush University Medical Center, Chicago, IL, USA; 7Washington Permanente Medical Group, Internal Medicine, Kaiser Permanente Washington Health Research Institute, Seattle, WA, USA; 8Oregon Health & Science University, Portland, OR, USA; 9Strategic Clinical Initiatives, Health Choice Network, Doral, FL, USA;10Nemours Children’s Health System, Wilmington, DE, USA; 11Edward Mallinckrodt Department of Pediatrics, Washington University School of Medicine/St. Louis Children’s Hospital, St. Louis, MO, USA; 12Institute for Health Research, Kaiser Permanente Colorado, Denver, CO, USA; 13Department of Pediatrics, University of Colorado School of Medicine, Denver, CO, USA; 14Department of Pediatrics, Harvard Medical School, Boston, MA, USA; 15Kaiser Permanente Center for Health Research, Portland, OR, USA; 16Vanderbilt University Medical Center, Nashville, TN, USA; 17Kaiser Permanente Mid-Atlantic Permanente Research Institute, Rockville, MD, USA; 18Department of Epidemiology, Albert Einstein College of Medicine, Bronx, NY, USA; 19Department of Population Health, New York University School of Medicine, New York, NY, USA; 20HealthPartners Institute, Bloomington, MN, USA; 21Center for Applied Clinical Research, Children’s Hospital of Philadelphia, Philadelphia, PA, USA;22Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA; 23The Research Institute, Nationwide Children’s Hospital, Columbus, OH, USA; 24Department of Pediatrics, New York University School of Medicine, New York, NY, USA; 25OCHIN Inc, Portland, OR, USA; 26RTI International, Research Triangle Park, Triangle Park, NC, USA; 27Department of Medicine, University of Chicago, Chicago, IL, USA; 28Research Informatics, PEDSnet, Seattle Children’s, Seattle, WA, USA; 29Case Western Reserve University and University Hospitals of Cleveland, Cleveland, OH, USA; 30University of Texas Health Science Center at San Antonio, San Antonio, TX, USA; 31Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, MA, USA; 32Ann & Robert H. Lurie Children’s Hospital of Chicago and Northwestern University Feinberg School of Medicine, Chicago, IL, USA; 33Medical College of Wisconsin, Milwaukee, WI, USA and 34University of North Carolina School of Medicine, Chapel Hill, NC, USA
Publisher Copyright:
© 2019, International Pediatric Research Foundation, Inc.
PY - 2020/5/1
Y1 - 2020/5/1
N2 - Background: Privacy-protecting analytic approaches without centralized pooling of individual-level data, such as distributed regression, are particularly important for vulnerable populations, such as children, but these methods have not yet been tested in multi-center pediatric studies. Methods: Using the electronic health data from 34 healthcare institutions in the National Patient-Centered Clinical Research Network (PCORnet), we fit 12 multivariable-adjusted linear regression models to assess the associations of antibiotic use <24 months of age with body mass index z-score at 48 to <72 months of age. We ran these models using pooled individual-level data and conventional multivariable-adjusted regression (reference method), as well as using the more privacy-protecting pooled summary-level intermediate statistics and distributed regression technique. We compared the results from these two methods. Results: Pooled individual-level and distributed linear regression analyses produced virtually identical parameter estimates and standard errors. Across all 12 models, the maximum difference in any of the parameter estimates or standard errors was 4.4833 × 10−10. Conclusions: We demonstrated empirically the feasibility and validity of distributed linear regression analysis using only summary-level information within a large multi-center study of children. This approach could enable expanded opportunities for multi-center pediatric research, especially when sharing of granular individual-level data is challenging.
AB - Background: Privacy-protecting analytic approaches without centralized pooling of individual-level data, such as distributed regression, are particularly important for vulnerable populations, such as children, but these methods have not yet been tested in multi-center pediatric studies. Methods: Using the electronic health data from 34 healthcare institutions in the National Patient-Centered Clinical Research Network (PCORnet), we fit 12 multivariable-adjusted linear regression models to assess the associations of antibiotic use <24 months of age with body mass index z-score at 48 to <72 months of age. We ran these models using pooled individual-level data and conventional multivariable-adjusted regression (reference method), as well as using the more privacy-protecting pooled summary-level intermediate statistics and distributed regression technique. We compared the results from these two methods. Results: Pooled individual-level and distributed linear regression analyses produced virtually identical parameter estimates and standard errors. Across all 12 models, the maximum difference in any of the parameter estimates or standard errors was 4.4833 × 10−10. Conclusions: We demonstrated empirically the feasibility and validity of distributed linear regression analysis using only summary-level information within a large multi-center study of children. This approach could enable expanded opportunities for multi-center pediatric research, especially when sharing of granular individual-level data is challenging.
UR - http://www.scopus.com/inward/record.url?scp=85074628913&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85074628913&partnerID=8YFLogxK
U2 - 10.1038/s41390-019-0596-0
DO - 10.1038/s41390-019-0596-0
M3 - Article
C2 - 31578038
AN - SCOPUS:85074628913
SN - 0031-3998
VL - 87
SP - 1086
EP - 1092
JO - Pediatric Research
JF - Pediatric Research
IS - 6
ER -