Email updates

Keep up to date with the latest news and content from BMC Bioinformatics and BioMed Central.

This article is part of the supplement: UT-ORNL-KBRIN Bioinformatics Summit 2008

Open Access Poster presentation

De novo identification of in vivo binding sites from ChIP-chip data

Victor Jin1*, Alina Rabinovich2, Henny O'Geen2, Sushma Iyengar2 and Peggy Farnham2

Author Affiliations

1 Bioinformatics Program and Department of Biology, University of Memphis, Memphis, TN 38152, USA

2 The Genome Center, University of California, Davis, Davis, CA 95617, USA

For all author emails, please log on.

BMC Bioinformatics 2008, 9(Suppl 7):P24  doi:10.1186/1471-2105-9-S7-P24

The electronic version of this article is the complete one and can be found online at:

Published:8 July 2008

© 2008 Jin et al; licensee BioMed Central Ltd.

Poster presentation

Advances in high-throughput technologies such as ChIP-chip and the completion of human genomic sequences allow analysis of the mechanisms of gene regulation on a systems level. In this study, we have developed a computational genomics approach (ChIPModules) and a motif discovery approach (ChIPMotifs) to mine the ChIP-chip data. The ChIPModules approach begins with experimentally determined binding sites and integrates positional weight matrices, a comparative genomics approach, and statistical learning methods to identify transcriptional regulatory modules. Using E2F1 ChIP-chip data performed on ENCODE regions in both HeLa and MCF7 cells, we have identified five regulatory modules for E2F1. One of modules was validated by using ChIP-chip with arrays containing ~14,000 human promoters. The ChIPMotifs approach incorporates a bootstrap re-sampling method to statistically infer the optimal cutoff threshold for a position weight matrix (PWM) of a motif identified from ChIP-chip data by ab initio motif discovery programs. Using OCT4 ChIP-chip data, we developed an in vivo OCT4 PWM. We then used this PWM and our ChIPModules to identify transcription factors co-localizing with OCT4 in a testicular germ cell tumor (Ntera2 cells).


This work was supported in part by Public Health Service grant CA45250, HG003129, and DK067889 to P.J.F. and a bioinformatics start-up funding to V.X.J at the University of Memphis. As part of our analyses, we used ChIP-chip data collected as part of the ENCODE Project Consortium.