Array Comparative Genomic Hybridizations: Assessing the ability to recapture evolutionary relationships using an in silico approach
1 Laboratoire de Recherche en Sciences Végétales, UMR CNRS-Université Paul Sabatier 5546, Chemin de Borde Rouge - Auzeville 31326, Castanet Tolosan, France
2 Department of Plant Biology, Carnegie Institution for Science, Stanford, California 94305, USA
3 USDA ARS, Plant Pathology Department, UC Davis, Davis, CA, 95616, USA
4 Department of Plant and Microbial Biology, University of California Berkeley, Berkeley CA 94720, USA
BMC Genomics 2011, 12:456 doi:10.1186/1471-2164-12-456Published: 21 September 2011
Comparative Genomic Hybridization (CGH) with DNA microarrays has many biological applications including surveys of copy number changes in tumorogenesis, species detection and identification, and functional genomics studies among related organisms. Array CGH has also been used to infer phylogenetic relatedness among species or strains. Although the use of the entire genome can be seen as a considerable advantage for use in phylogenetic analysis, few such studies have questioned the reliability of array CGH to correctly determine evolutionary relationships. A potential flaw in this application lies in the fact that all comparisons are made to a single reference species. This situation differs from traditional DNA sequence, distance-based phylogenetic analyses where all possible pairwise comparisons are made for the isolates in question. By simulating array data based on the Neurospora crassa genome, we address this potential flaw and other questions regarding array CGH phylogeny.
Our simulation data indicates that having a single reference can, in some cases, be a serious limitation when using this technique. Additionally, the tree building process with a single reference is sensitive to many factors including tree topology, choice of tree reconstruction method, and the distance metric used.
Without prior knowledge of the topology and placement of the reference taxon in the topology, the outcome is likely to be wrong and the error undetected. Given these limitations, using CGH to reveal phylogeny based on sequence divergence does not offer a robust alternative to traditional phylogenetic analysis.