Log on / register
Feedback | Support | My details
Open AccessHighly AccessMethodology article

Applying dynamic Bayesian networks to perturbed gene expression data

Norbert Dojer1 email, Anna Gambin1 email, Andrzej Mizera2 email, Bartek Wilczyński3 email and Jerzy Tiuryn1 email

1Institute of Informatics, Warsaw University, Banacha 2, 02-097 Warszawa, Poland

2Institute of Fundamental Technological Research, Polish Academy of Sciences, Świętokrzyska 21, 00-049 Warszawa, Poland

3Institute of Mathematics, Polish Academy of Sciences, Śniadeckich 8, 00-956 Warszawa, Poland

author email corresponding author email

BMC Bioinformatics 2006, 7:249doi:10.1186/1471-2105-7-249

Published: 8 May 2006

Abstract

Background

A central goal of molecular biology is to understand the regulatory mechanisms of gene transcription and protein synthesis. Because of their solid basis in statistics, allowing to deal with the stochastic aspects of gene expressions and noisy measurements in a natural way, Bayesian networks appear attractive in the field of inferring gene interactions structure from microarray experiments data. However, the basic formalism has some disadvantages, e.g. it is sometimes hard to distinguish between the origin and the target of an interaction. Two kinds of microarray experiments yield data particularly rich in information regarding the direction of interactions: time series and perturbation experiments. In order to correctly handle them, the basic formalism must be modified. For example, dynamic Bayesian networks (DBN) apply to time series microarray data. To our knowledge the DBN technique has not been applied in the context of perturbation experiments.

Results

We extend the framework of dynamic Bayesian networks in order to incorporate perturbations. Moreover, an exact algorithm for inferring an optimal network is proposed and a discretization method specialized for time series data from perturbation experiments is introduced. We apply our procedure to realistic simulations data. The results are compared with those obtained by standard DBN learning techniques. Moreover, the advantages of using exact learning algorithm instead of heuristic methods are analyzed.

Conclusion

We show that the quality of inferred networks dramatically improves when using data from perturbation experiments. We also conclude that the exact algorithm should be used when it is possible, i.e. when considered set of genes is small enough.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.