TY - JOUR
T1 - Gene Set Enrichment Analyses
T2 - Lessons learned from the heart failure phenotype
AU - Tragante, Vinicius
AU - Gho, Johannes M.I.H.
AU - Felix, Janine F.
AU - Vasan, Ramachandran S.
AU - Smith, Nicholas L.
AU - Voight, Benjamin F.
AU - Palmer, Colin
AU - Van Der Harst, Pim
AU - Moore, Jason H.
AU - Asselbergs, Folkert W.
N1 - Publisher Copyright:
© 2017 The Author(s).
PY - 2017
Y1 - 2017
N2 - Background: Genetic studies for complex diseases have predominantly discovered main effects at individual loci, but have not focused on genomic and environmental contexts important for a phenotype. Gene Set Enrichment Analysis (GSEA) aims to address this by identifying sets of genes or biological pathways contributing to a phenotype, through gene-gene interactions or other mechanisms, which are not the focus of conventional association methods. Results: Approaches that utilize GSEA can now take input from array chips, either gene-centric or genome-wide, but are highly sensitive to study design, SNP selection and pruning strategies, SNP-to-gene mapping, and pathway definitions. Here, we present lessons learned from our experience with GSEA of heart failure, a particularly challenging phenotype due to its underlying heterogeneous etiology. Conclusions: This case study shows that proper data handling is essential to avoid false-positive results. Well-defined pipelines for quality control are needed to avoid reporting spurious results using GSEA.
AB - Background: Genetic studies for complex diseases have predominantly discovered main effects at individual loci, but have not focused on genomic and environmental contexts important for a phenotype. Gene Set Enrichment Analysis (GSEA) aims to address this by identifying sets of genes or biological pathways contributing to a phenotype, through gene-gene interactions or other mechanisms, which are not the focus of conventional association methods. Results: Approaches that utilize GSEA can now take input from array chips, either gene-centric or genome-wide, but are highly sensitive to study design, SNP selection and pruning strategies, SNP-to-gene mapping, and pathway definitions. Here, we present lessons learned from our experience with GSEA of heart failure, a particularly challenging phenotype due to its underlying heterogeneous etiology. Conclusions: This case study shows that proper data handling is essential to avoid false-positive results. Well-defined pipelines for quality control are needed to avoid reporting spurious results using GSEA.
KW - Coronary artery disease
KW - Gene set enrichment analyses
KW - Heart failure
UR - http://www.scopus.com/inward/record.url?scp=85020280698&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85020280698&partnerID=8YFLogxK
U2 - 10.1186/s13040-017-0137-5
DO - 10.1186/s13040-017-0137-5
M3 - Article
C2 - 28559929
AN - SCOPUS:85020280698
SN - 1756-0381
VL - 10
JO - BioData Mining
JF - BioData Mining
IS - 1
M1 - 18
ER -