A Kernel of Truth. Statistical Advances in Polygenic Variance Component Models for Complex Human Pedigrees.

John Blangero, Vincent P. Diego, Thomas D. Dyer, Marcio Almeida, Juan Peralta, Jack W. Kent, Jeff T. Williams, Laura Almasy, Harald H H GÖring

Research output: Contribution to journalArticle

43 Citations (Scopus)

Abstract

Statistical genetic analysis of quantitative traits in large pedigrees is a formidable computational task due to the necessity of taking the nonindependence among relatives into account. With the growing awareness that rare sequence variants may be important in human quantitative variation, heritability and association study designs involving large pedigrees will increase in frequency due to the greater chance of observing multiple copies of rare variants among related individuals. Therefore, it is important to have statistical genetic test procedures that utilize all available information for extracting evidence regarding genetic association. Optimal testing for marker/phenotype association involves the exact calculation of the likelihood ratio statistic which requires the repeated inversion of potentially large matrices. In a whole genome sequence association context, such computation may be prohibitive. Toward this end, we have developed a rapid and efficient eigensimplification of the likelihood that makes analysis of family data commensurate with the analysis of a comparable sample of unrelated individuals. Our theoretical results which are based on a spectral representation of the likelihood yield simple exact expressions for the expected likelihood ratio test statistic (ELRT) for pedigrees of arbitrary size and complexity. For heritability, the ELRT is-∑ln1+h2λgi-1,where h2 and λgi are, respectively, the heritability and eigenvalues of the pedigree-derived genetic relationship kernel (GRK). For association analysis of sequence variants, the ELRT is given byELRThq2>0:unrelateds-ELRTht2>0:pedigrees-ELRThr2>0:pedigrees,where ht 2, hq 2, and hr 2 are the total, quantitative trait nucleotide, and residual heritabilities, respectively. Using these results, fast and accurate analytical power analyses are possible, eliminating the need for computer simulation. Additional benefits of eigensimplification include a simple method for calculation of the exact distribution of the ELRT under the null hypothesis which turns out to differ from that expected under the usual asymptotic theory. Further, when combined with the use of empirical GRKs-estimated over a large number of genetic markers-our theory reveals potential problems associated with nonpositive semidefinite kernels. These procedures are being added to our general statistical genetic computer package, SOLAR.

Original languageEnglish (US)
Pages (from-to)1-31
Number of pages31
JournalAdvances in Genetics
Volume81
DOIs
StatePublished - 2013
Externally publishedYes

Fingerprint

Pedigree
Genetic Markers
Computer Simulation
Sequence Analysis
Nucleotides
Genome
Phenotype

Keywords

  • Association analysis
  • Eigensimplification
  • Eigenvalues
  • Heritability
  • Human pedigrees
  • Power analysis
  • Variance component models

ASJC Scopus subject areas

  • Genetics

Cite this

Blangero, J., Diego, V. P., Dyer, T. D., Almeida, M., Peralta, J., Kent, J. W., ... GÖring, H. H. H. (2013). A Kernel of Truth. Statistical Advances in Polygenic Variance Component Models for Complex Human Pedigrees. Advances in Genetics, 81, 1-31. https://doi.org/10.1016/B978-0-12-407677-8.00001-4

A Kernel of Truth. Statistical Advances in Polygenic Variance Component Models for Complex Human Pedigrees. / Blangero, John; Diego, Vincent P.; Dyer, Thomas D.; Almeida, Marcio; Peralta, Juan; Kent, Jack W.; Williams, Jeff T.; Almasy, Laura; GÖring, Harald H H.

In: Advances in Genetics, Vol. 81, 2013, p. 1-31.

Research output: Contribution to journalArticle

Blangero, J, Diego, VP, Dyer, TD, Almeida, M, Peralta, J, Kent, JW, Williams, JT, Almasy, L & GÖring, HHH 2013, 'A Kernel of Truth. Statistical Advances in Polygenic Variance Component Models for Complex Human Pedigrees.', Advances in Genetics, vol. 81, pp. 1-31. https://doi.org/10.1016/B978-0-12-407677-8.00001-4
Blangero, John ; Diego, Vincent P. ; Dyer, Thomas D. ; Almeida, Marcio ; Peralta, Juan ; Kent, Jack W. ; Williams, Jeff T. ; Almasy, Laura ; GÖring, Harald H H. / A Kernel of Truth. Statistical Advances in Polygenic Variance Component Models for Complex Human Pedigrees. In: Advances in Genetics. 2013 ; Vol. 81. pp. 1-31.
@article{faecd44d8c094d668fd0a68789db2c57,
title = "A Kernel of Truth. Statistical Advances in Polygenic Variance Component Models for Complex Human Pedigrees.",
abstract = "Statistical genetic analysis of quantitative traits in large pedigrees is a formidable computational task due to the necessity of taking the nonindependence among relatives into account. With the growing awareness that rare sequence variants may be important in human quantitative variation, heritability and association study designs involving large pedigrees will increase in frequency due to the greater chance of observing multiple copies of rare variants among related individuals. Therefore, it is important to have statistical genetic test procedures that utilize all available information for extracting evidence regarding genetic association. Optimal testing for marker/phenotype association involves the exact calculation of the likelihood ratio statistic which requires the repeated inversion of potentially large matrices. In a whole genome sequence association context, such computation may be prohibitive. Toward this end, we have developed a rapid and efficient eigensimplification of the likelihood that makes analysis of family data commensurate with the analysis of a comparable sample of unrelated individuals. Our theoretical results which are based on a spectral representation of the likelihood yield simple exact expressions for the expected likelihood ratio test statistic (ELRT) for pedigrees of arbitrary size and complexity. For heritability, the ELRT is-∑ln1+h2λgi-1,where h2 and λgi are, respectively, the heritability and eigenvalues of the pedigree-derived genetic relationship kernel (GRK). For association analysis of sequence variants, the ELRT is given byELRThq2>0:unrelateds-ELRTht2>0:pedigrees-ELRThr2>0:pedigrees,where ht 2, hq 2, and hr 2 are the total, quantitative trait nucleotide, and residual heritabilities, respectively. Using these results, fast and accurate analytical power analyses are possible, eliminating the need for computer simulation. Additional benefits of eigensimplification include a simple method for calculation of the exact distribution of the ELRT under the null hypothesis which turns out to differ from that expected under the usual asymptotic theory. Further, when combined with the use of empirical GRKs-estimated over a large number of genetic markers-our theory reveals potential problems associated with nonpositive semidefinite kernels. These procedures are being added to our general statistical genetic computer package, SOLAR.",
keywords = "Association analysis, Eigensimplification, Eigenvalues, Heritability, Human pedigrees, Power analysis, Variance component models",
author = "John Blangero and Diego, {Vincent P.} and Dyer, {Thomas D.} and Marcio Almeida and Juan Peralta and Kent, {Jack W.} and Williams, {Jeff T.} and Laura Almasy and G{\"O}ring, {Harald H H}",
year = "2013",
doi = "10.1016/B978-0-12-407677-8.00001-4",
language = "English (US)",
volume = "81",
pages = "1--31",
journal = "Advances in Genetics",
issn = "0065-2660",
publisher = "Academic Press Inc.",

}

TY - JOUR

T1 - A Kernel of Truth. Statistical Advances in Polygenic Variance Component Models for Complex Human Pedigrees.

AU - Blangero, John

AU - Diego, Vincent P.

AU - Dyer, Thomas D.

AU - Almeida, Marcio

AU - Peralta, Juan

AU - Kent, Jack W.

AU - Williams, Jeff T.

AU - Almasy, Laura

AU - GÖring, Harald H H

PY - 2013

Y1 - 2013

N2 - Statistical genetic analysis of quantitative traits in large pedigrees is a formidable computational task due to the necessity of taking the nonindependence among relatives into account. With the growing awareness that rare sequence variants may be important in human quantitative variation, heritability and association study designs involving large pedigrees will increase in frequency due to the greater chance of observing multiple copies of rare variants among related individuals. Therefore, it is important to have statistical genetic test procedures that utilize all available information for extracting evidence regarding genetic association. Optimal testing for marker/phenotype association involves the exact calculation of the likelihood ratio statistic which requires the repeated inversion of potentially large matrices. In a whole genome sequence association context, such computation may be prohibitive. Toward this end, we have developed a rapid and efficient eigensimplification of the likelihood that makes analysis of family data commensurate with the analysis of a comparable sample of unrelated individuals. Our theoretical results which are based on a spectral representation of the likelihood yield simple exact expressions for the expected likelihood ratio test statistic (ELRT) for pedigrees of arbitrary size and complexity. For heritability, the ELRT is-∑ln1+h2λgi-1,where h2 and λgi are, respectively, the heritability and eigenvalues of the pedigree-derived genetic relationship kernel (GRK). For association analysis of sequence variants, the ELRT is given byELRThq2>0:unrelateds-ELRTht2>0:pedigrees-ELRThr2>0:pedigrees,where ht 2, hq 2, and hr 2 are the total, quantitative trait nucleotide, and residual heritabilities, respectively. Using these results, fast and accurate analytical power analyses are possible, eliminating the need for computer simulation. Additional benefits of eigensimplification include a simple method for calculation of the exact distribution of the ELRT under the null hypothesis which turns out to differ from that expected under the usual asymptotic theory. Further, when combined with the use of empirical GRKs-estimated over a large number of genetic markers-our theory reveals potential problems associated with nonpositive semidefinite kernels. These procedures are being added to our general statistical genetic computer package, SOLAR.

AB - Statistical genetic analysis of quantitative traits in large pedigrees is a formidable computational task due to the necessity of taking the nonindependence among relatives into account. With the growing awareness that rare sequence variants may be important in human quantitative variation, heritability and association study designs involving large pedigrees will increase in frequency due to the greater chance of observing multiple copies of rare variants among related individuals. Therefore, it is important to have statistical genetic test procedures that utilize all available information for extracting evidence regarding genetic association. Optimal testing for marker/phenotype association involves the exact calculation of the likelihood ratio statistic which requires the repeated inversion of potentially large matrices. In a whole genome sequence association context, such computation may be prohibitive. Toward this end, we have developed a rapid and efficient eigensimplification of the likelihood that makes analysis of family data commensurate with the analysis of a comparable sample of unrelated individuals. Our theoretical results which are based on a spectral representation of the likelihood yield simple exact expressions for the expected likelihood ratio test statistic (ELRT) for pedigrees of arbitrary size and complexity. For heritability, the ELRT is-∑ln1+h2λgi-1,where h2 and λgi are, respectively, the heritability and eigenvalues of the pedigree-derived genetic relationship kernel (GRK). For association analysis of sequence variants, the ELRT is given byELRThq2>0:unrelateds-ELRTht2>0:pedigrees-ELRThr2>0:pedigrees,where ht 2, hq 2, and hr 2 are the total, quantitative trait nucleotide, and residual heritabilities, respectively. Using these results, fast and accurate analytical power analyses are possible, eliminating the need for computer simulation. Additional benefits of eigensimplification include a simple method for calculation of the exact distribution of the ELRT under the null hypothesis which turns out to differ from that expected under the usual asymptotic theory. Further, when combined with the use of empirical GRKs-estimated over a large number of genetic markers-our theory reveals potential problems associated with nonpositive semidefinite kernels. These procedures are being added to our general statistical genetic computer package, SOLAR.

KW - Association analysis

KW - Eigensimplification

KW - Eigenvalues

KW - Heritability

KW - Human pedigrees

KW - Power analysis

KW - Variance component models

UR - http://www.scopus.com/inward/record.url?scp=84873926637&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84873926637&partnerID=8YFLogxK

U2 - 10.1016/B978-0-12-407677-8.00001-4

DO - 10.1016/B978-0-12-407677-8.00001-4

M3 - Article

VL - 81

SP - 1

EP - 31

JO - Advances in Genetics

JF - Advances in Genetics

SN - 0065-2660

ER -