Interactive semisupervised learning for microarray analysis

Yijuan Lu, Qi Tian, Feng Liu, Maribel Sanchez, Yufeng Wang

Research output: Contribution to journalArticlepeer-review

11 Scopus citations


Microarray technology has generated vast amounts of gene expression data with distinct patterns. Based on the premise that genes of correlated functions tend to exhibit similar expression patterns, various machine learning methods have been applied to capture these specific patterns in microarray data. However, the discrepancy between the rich expression profiles and the limited knowledge of gene functions has been a major hurdle to the understanding of cellular networks. To bridge this gap so as to properly comprehend and interpret expression data, we introduce Relevance Feedback to microarray analysis and propose an interactive learning framework to incorporate the expert knowledge into the decision module. In order to find a good learning method and solve two intrinsic problems in microarray data, high dimensionality and small sample size, we also propose a semisupervised learning algorithm: Kernel Discriminant-EM (KDEM). This algorithm efficiently utilizes a large set of unlabeled data to compensate for the insufficiency of a small set of labeled data and it extends the linear algorithm in Discrimlnant-EM (DEM) to a kernel algorithm to handle nonlinearly separable data in a lower dimensional space. The Relevance Feedback technique and KDEM together construct an efficient and effective interactive semisupervised learning framework for microarray analysis. Extensive experiments on the yeast cell cycle regulation data set and Plasmodium falciparum red blood cell cycle data set show the promise of this approach.

Original languageEnglish (US)
Pages (from-to)190-202
Number of pages13
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Issue number2
StatePublished - Apr 2007


  • Kernel DEM
  • Microarray analysis
  • Relevance feedback
  • Semisupervised learning

ASJC Scopus subject areas

  • Biotechnology
  • Genetics
  • Applied Mathematics


Dive into the research topics of 'Interactive semisupervised learning for microarray analysis'. Together they form a unique fingerprint.

Cite this