TY - JOUR
T1 - BinomiRare
T2 - A robust test for association of a rare genetic variant with a binary outcome for mixed models and any case-control proportion
AU - NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium
AU - Sofer, Tamar
AU - Lee, Jiwon
AU - Kurniansyah, Nuzulul
AU - Jain, Deepti
AU - Laurie, Cecelia A.
AU - Gogarten, Stephanie M.
AU - Conomos, Matthew P.
AU - Heavner, Ben
AU - Hu, Yao
AU - Kooperberg, Charles
AU - Haessler, Jeffrey
AU - Vasan, Ramachandran S.
AU - Cupples, L. Adrienne
AU - Coombes, Brandon J.
AU - Seyerle, Amanda
AU - Gharib, Sina A.
AU - Chen, Han
AU - O'Connell, Jeffrey R.
AU - Zhang, Man
AU - Gottlieb, Daniel J.
AU - Psaty, Bruce M.
AU - Longstreth, W. T.
AU - Rotter, Jerome I.
AU - Taylor, Kent D.
AU - Rich, Stephen S.
AU - Guo, Xiuqing
AU - Boerwinkle, Eric
AU - Morrison, Alanna C.
AU - Pankow, James S.
AU - Johnson, Andrew D.
AU - Pankratz, Nathan
AU - Reiner, Alex P.
AU - Redline, Susan
AU - Smith, Nicholas L.
AU - Rice, Kenneth M.
AU - Schifano, Elizabeth D.
N1 - Publisher Copyright:
© 2021 The Author(s)
PY - 2021/7/8
Y1 - 2021/7/8
N2 - Whole-genome sequencing (WGS) and whole-exome sequencing studies have become increasingly available and are being used to identify rare genetic variants associated with health and disease outcomes. Investigators routinely use mixed models to account for genetic relatedness or other clustering variables (e.g., family or household) when testing genetic associations. However, no existing tests of the association of a rare variant with a binary outcome in the presence of correlated data control the type 1 error where there are (1) few individuals harboring the rare allele, (2) a small proportion of cases relative to controls, and (3) covariates to adjust for. Here, we address all three issues in developing a framework for testing rare variant association with a binary trait in individuals harboring at least one risk allele. In this framework, we estimate outcome probabilities under the null hypothesis and then use them, within the individuals with at least one risk allele, to test variant associations. We extend the BinomiRare test, which was previously proposed for independent observations, and develop the Conway-Maxwell-Poisson (CMP) test and study their properties in simulations. We show that the BinomiRare test always controls the type 1 error, while the CMP test sometimes does not. We then use the BinomiRare test to test the association of rare genetic variants in target genes with small-vessel disease (SVD) stroke, short sleep, and venous thromboembolism (VTE), in whole-genome sequence data from the Trans-Omics for Precision Medicine (TOPMed) program.
AB - Whole-genome sequencing (WGS) and whole-exome sequencing studies have become increasingly available and are being used to identify rare genetic variants associated with health and disease outcomes. Investigators routinely use mixed models to account for genetic relatedness or other clustering variables (e.g., family or household) when testing genetic associations. However, no existing tests of the association of a rare variant with a binary outcome in the presence of correlated data control the type 1 error where there are (1) few individuals harboring the rare allele, (2) a small proportion of cases relative to controls, and (3) covariates to adjust for. Here, we address all three issues in developing a framework for testing rare variant association with a binary trait in individuals harboring at least one risk allele. In this framework, we estimate outcome probabilities under the null hypothesis and then use them, within the individuals with at least one risk allele, to test variant associations. We extend the BinomiRare test, which was previously proposed for independent observations, and develop the Conway-Maxwell-Poisson (CMP) test and study their properties in simulations. We show that the BinomiRare test always controls the type 1 error, while the CMP test sometimes does not. We then use the BinomiRare test to test the association of rare genetic variants in target genes with small-vessel disease (SVD) stroke, short sleep, and venous thromboembolism (VTE), in whole-genome sequence data from the Trans-Omics for Precision Medicine (TOPMed) program.
KW - diverse populations
KW - genome sequencing
KW - mixed models
KW - rare variant assocaition testing
KW - targeted gene analysis
UR - http://www.scopus.com/inward/record.url?scp=85123070300&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85123070300&partnerID=8YFLogxK
U2 - 10.1016/j.xhgg.2021.100040
DO - 10.1016/j.xhgg.2021.100040
M3 - Article
C2 - 34337551
AN - SCOPUS:85123070300
SN - 2666-2477
VL - 2
JO - Human Genetics and Genomics Advances
JF - Human Genetics and Genomics Advances
IS - 3
M1 - 100040
ER -