Bayesian estimation of genomic copy number with single nucleotide polymorphism genotyping arrays
-
* Corresponding author: Rudy Guerra rguerra@rice.edu
1 Department of Statistics, Rice University, 6100 Main, Houston, TX 77005-1827, USA
2 Department of Statistics, University of Connecticut, Storrs, CT 06269-4120, USA
3 Department of Pediatrics, Division of Hematology-Oncology, Baylor College of Medicine, Texas Children's Hospital, 6621 Fannin St., MC 3-3320 Houston, TX 77030, USA
4 Structural and Computational Biology and Molecular Biophysics Program, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
BMC Research Notes 2010, 3:350 doi:10.1186/1756-0500-3-350
Published: 30 December 2010Abstract
Background
The identification of copy number aberration in the human genome is an important area in cancer research. We develop a model for determining genomic copy numbers using high-density single nucleotide polymorphism genotyping microarrays. The method is based on a Bayesian spatial normal mixture model with an unknown number of components corresponding to true copy numbers. A reversible jump Markov chain Monte Carlo algorithm is used to implement the model and perform posterior inference.
Results
The performance of the algorithm is examined on both simulated and real cancer data, and it is compared with the popular CNAG algorithm for copy number detection.
Conclusions
We demonstrate that our Bayesian mixture model performs at least as well as the hidden Markov model based CNAG algorithm and in certain cases does better. One of the added advantages of our method is the flexibility of modeling normal cell contamination in tumor samples.