Comparing multiple ChIP-sequencing experiments.

Hatice Gulcin Ozer, Yi Wen Huang, Jiejun Wu, Jeffrey D. Parvin, Tim Hui Ming Huang, Kun Huang

Research output: Contribution to journalArticlepeer-review

1 Scopus citations


New high-throughput sequencing technologies can generate millions of short sequences in a single experiment. As the size of the data increases, comparison of multiple experiments on different cell lines under different experimental conditions becomes a big challenge. In this paper, we investigate ways to compare multiple ChIP-sequencing experiments. We specifically studied epigenetic regulation of breast cancer and the effect of estrogen using 50 ChIP-sequencing data from Illumina Genome Analyzer II. First, we evaluate the correlation among different experiments focusing on the total number of reads in transcribed and promoter regions of the genome. Then, we adopt the method that is used to identify the most stable genes in RT-PCR experiments to understand background signal across all of the experiments and to identify the most variable transcribed and promoter regions of the genome. We observed that the most variable genes for transcribed regions and promoter regions are very distinct. Gene ontology and function enrichment analysis on these most variable genes demonstrate the biological relevance of the results. In this study, we present a method that can effectively select differential regions of the genome based on protein-binding profiles over multiple experiments using real data points without any normalization among the samples.

Original languageEnglish (US)
Pages (from-to)269-282
Number of pages14
JournalJournal of bioinformatics and computational biology
Issue number2
StatePublished - Apr 2011
Externally publishedYes

ASJC Scopus subject areas

  • Molecular Biology
  • Biochemistry
  • Computer Science Applications


Dive into the research topics of 'Comparing multiple ChIP-sequencing experiments.'. Together they form a unique fingerprint.

Cite this