Chromatin signature analysis and prediction of genome-wide novel promoters using finite mixture model

Cenny Taslim, Shili Lin, Kun Huang, Tim Huang

Producción científica: Conference contribution

Resumen

Regulation of gene expression has been shown to involve not only binding of transcription factor in target gene promoters but also characterization of histone around which DNA is wrapped around. Some histone modification, for example di-methylated histone H3 at lysine 4 (H3K4me2), has been shown to be associated with gene activation. However, no clear pattern has been shown to predict human promoters. This paper proposed a novel quantitative approach to characterize chromatin signature and patterns of promoters, which are then used to predict novel (alternative) promoters. In this paper, chromatin immunoprecipitation methods followed by massive parallel sequencing (ChIP-seq) data against RNA Polymerase II (Pol II) and H3K4me2 are used to identify common patterns of promoter regions. These patterns were then used to search for similar patterns over the entire genome to find novel promoters. Common patterns of promoter regions are modeled using a mixture model involving double-exponential and uniform distributions. Regions with high correlations with the common patterns are identified as putative novel promoters. We used this proposed algorithm and RNA-seq data to identify novel promoters in the MCF7 cell line. We found 4,392 high-confidence regions that display the identified promoter patterns (referred to as putative novel promoters). Of these, 875 regions (20%) overlap with RNA transcripts. Around 70% of these putative novel promoters have overlapped with RNA transcripts, EST and/or non-coding RNA suggesting that these putative novel promoters might be promoters which are currently undiscovered.

Idioma originalEnglish (US)
Título de la publicación alojadaProceedings 2011 IEEE International Workshop on Genomic Signal Processing and Statistics, GENSIPS'11
Páginas13-16
Número de páginas4
EstadoPublished - 2011
Publicado de forma externa
Evento2011 IEEE International Workshop on Genomic Signal Processing and Statistics, GENSIPS'11 - San Antonio, TX, United States
Duración: dic 4 2011dic 6 2011

Serie de la publicación

NombreProceedings - IEEE International Workshop on Genomic Signal Processing and Statistics
ISSN (versión impresa)2150-3001
ISSN (versión digital)2150-301X

Other

Other2011 IEEE International Workshop on Genomic Signal Processing and Statistics, GENSIPS'11
País/TerritorioUnited States
CiudadSan Antonio, TX
Período12/4/1112/6/11

ASJC Scopus subject areas

  • Biochemistry, Genetics and Molecular Biology (miscellaneous)
  • Computational Theory and Mathematics
  • Signal Processing
  • Biomedical Engineering

Huella

Profundice en los temas de investigación de 'Chromatin signature analysis and prediction of genome-wide novel promoters using finite mixture model'. En conjunto forman una huella única.

Citar esto