CrossLink: A novel method for cross-condition classification of cancer subtypes

Chifeng Ma, Konduru S. Sastry, Mario Flore, Salah Gehani, Issam Al-Bozom, Yusheng Feng, Erchin Serpedin, Lotfi Chouchane, Yidong Chen, Yufei Huang

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Background: We considered the prediction of cancer classes (e.g. subtypes) using patient gene expression profiles that contain both systematic and condition-specific biases when compared with the training reference dataset. The conventional normalization-based approaches cannot guarantee that the gene signatures in the reference and prediction datasets always have the same distribution for all different conditions as the class-specific gene signatures change with the condition. Therefore, the trained classifier would work well under one condition but not under another. Methods: To address the problem of current normalization approaches, we propose a novel algorithm called CrossLink (CL). CL recognizes that there is no universal, condition-independent normalization mapping of signatures. In contrast, it exploits the fact that the signature is unique to its associated class under any condition and thus employs an unsupervised clustering algorithm to discover this unique signature. Results: We assessed the performance of CL for cross-condition predictions of PAM50 subtypes of breast cancer by using a simulated dataset modeled after TCGA BRCA tumor samples with a cross-validation scheme, and datasets with known and unknown PAM50 classification. CL achieved prediction accuracy >73 %, highest among other methods we evaluated. We also applied the algorithm to a set of breast cancer tumors derived from Arabic population to assign a PAM50 classification to each tumor based on their gene expression profiles. Conclusions: A novel algorithm CrossLink for cross-condition prediction of cancer classes was proposed. In all test datasets, CL showed robust and consistent improvement in prediction performance over other state-of-the-art normalization and classification algorithms.

Original languageEnglish (US)
Article number549
JournalBMC genomics
Volume17
DOIs
StatePublished - Aug 22 2016

ASJC Scopus subject areas

  • Biotechnology
  • Genetics

Fingerprint Dive into the research topics of 'CrossLink: A novel method for cross-condition classification of cancer subtypes'. Together they form a unique fingerprint.

Cite this