TY - JOUR
T1 - A gene-set approach to analyze copy number alterations in breast cancer
AU - Hsu, Yu Ching
AU - Chiu, Yu Chiao
AU - Chen, Yidong
AU - Hsiao, Tzu Hung
AU - Chuang, Eric Y.
N1 - Funding Information:
The study was supported partly by the National Health Research Institutes of Taiwan (NHRI-EX104-10419BI) and the Ministry of Science and Technology of Taiwan (103-2917-I-002-166). The authors thank the Center of Genomic Medicine, National Taiwan University, for providing financial support and computing facilities. The authors also thank Melissa Stauffer, PhD, of Scientific Editing Solutions, for editing the manuscript.
Publisher Copyright:
© 2011 - 2016 Translational Cancer Research. All rights reserved.
PY - 2015/6/1
Y1 - 2015/6/1
N2 - Copy number alterations (CNAs) have been widely reported as an oncogenic or tumor suppressive feature in cancers. Since CNAs simultaneously affect a large number of genes, previous single gene-based methods are limited in revealing the landscape of CNAs. A systematic method to explore the influence of CNAs on cancer progression is needed. In the present study, a total of 1,045 genome-wide array comparative genomic hybridization (aCGH) data sets and 529 gene expression profiles of breast tumors were collected from The Cancer Genome Atlas (TCGA). We devised an algorithm (called Gene Set analysis for Copy number Alteration, or GSCA) to identify functional gene sets exhibiting significant enrichment in CNAs based on Fisher's exact test. Gene expression profiles of the enriched gene sets were analyzed to evaluate the influence of CNAs on gene expression changes. We also integrated survival analysis to pinpoint prognostic CNA-affected gene sets. Thirty-five and ten gene sets were identified with significant enrichment in copy number gains and losses, respectively. Forty-four out of the 45 (98%) gene sets showed concordant significant gene expression changes with the CNAs. In addition, survival analysis discovered 31 gene sets in which copy number enrichment was associated with patient survival, including several important transcriptional factor target gene sets, such as MYC. The results indicate that CNAs play essential roles in breast tumor progression and lead to differential clinical outcomes. In conclusion, here we devised a novel method for analyzing and interpreting CNA data at the level of functional gene sets. We demonstrated its capability of identifying CNA-affected, as well as CNA-driven, biological functions and pathways in breast cancer. The analysis workflow can be widely applied to other cancers and provides biological insights into complex mechanisms governing tumor progression.
AB - Copy number alterations (CNAs) have been widely reported as an oncogenic or tumor suppressive feature in cancers. Since CNAs simultaneously affect a large number of genes, previous single gene-based methods are limited in revealing the landscape of CNAs. A systematic method to explore the influence of CNAs on cancer progression is needed. In the present study, a total of 1,045 genome-wide array comparative genomic hybridization (aCGH) data sets and 529 gene expression profiles of breast tumors were collected from The Cancer Genome Atlas (TCGA). We devised an algorithm (called Gene Set analysis for Copy number Alteration, or GSCA) to identify functional gene sets exhibiting significant enrichment in CNAs based on Fisher's exact test. Gene expression profiles of the enriched gene sets were analyzed to evaluate the influence of CNAs on gene expression changes. We also integrated survival analysis to pinpoint prognostic CNA-affected gene sets. Thirty-five and ten gene sets were identified with significant enrichment in copy number gains and losses, respectively. Forty-four out of the 45 (98%) gene sets showed concordant significant gene expression changes with the CNAs. In addition, survival analysis discovered 31 gene sets in which copy number enrichment was associated with patient survival, including several important transcriptional factor target gene sets, such as MYC. The results indicate that CNAs play essential roles in breast tumor progression and lead to differential clinical outcomes. In conclusion, here we devised a novel method for analyzing and interpreting CNA data at the level of functional gene sets. We demonstrated its capability of identifying CNA-affected, as well as CNA-driven, biological functions and pathways in breast cancer. The analysis workflow can be widely applied to other cancers and provides biological insights into complex mechanisms governing tumor progression.
KW - Breast cancer
KW - Copy number alteration (CNA)
KW - Copy number variation
KW - Gene set analysis
UR - http://www.scopus.com/inward/record.url?scp=84962723167&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84962723167&partnerID=8YFLogxK
U2 - 10.3978/j.issn.2218-676X.2015.05.03
DO - 10.3978/j.issn.2218-676X.2015.05.03
M3 - Article
AN - SCOPUS:84962723167
SN - 2218-676X
VL - 4
SP - 291
EP - 302
JO - Translational Cancer Research
JF - Translational Cancer Research
IS - 3
ER -