<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2008-9-3-r49</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Analysis of 142 genes resolves the rapid diversification of the rice genus</p>
         </title>
         <aug>
            <au id="A1" ce="yes">
               <snm>Zou</snm>
               <fnm>Xin-Hui</fnm>
               <insr iid="I1"/>
               <email>zouxh@ibcas.ac.cn</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Zhang</snm>
               <fnm>Fu-Min</fnm>
               <insr iid="I1"/>
               <email>zhangfm@ibcas.ac.cn</email>
            </au>
            <au id="A3" ce="yes">
               <snm>Zhang</snm>
               <fnm>Jian-Guo</fnm>
               <insr iid="I2"/>
               <email>zhangjg@genomics.org.cn</email>
            </au>
            <au id="A4">
               <snm>Zang</snm>
               <fnm>Li-Li</fnm>
               <insr iid="I1"/>
               <email>zangli@ibcas.ac.cn</email>
            </au>
            <au id="A5">
               <snm>Tang</snm>
               <fnm>Liang</fnm>
               <insr iid="I1"/>
               <email>ecologytang@yahoo.com.cn</email>
            </au>
            <au id="A6">
               <snm>Wang</snm>
               <fnm>Jun</fnm>
               <insr iid="I2"/>
               <email>wangj@genomics.org.cn</email>
            </au>
            <au id="A7">
               <snm>Sang</snm>
               <fnm>Tao</fnm>
               <insr iid="I3"/>
               <email>sang@msu.edu</email>
            </au>
            <au id="A8" ca="yes">
               <snm>Ge</snm>
               <fnm>Song</fnm>
               <insr iid="I1"/>
               <insr iid="I4"/>
               <email>gesong@ibcas.ac.cn</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China</p>
            </ins>
            <ins id="I2">
               <p>Beijing Genomics Institute, Beijing, 101300, China</p>
            </ins>
            <ins id="I3">
               <p>Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA</p>
            </ins>
            <ins id="I4">
               <p>The Graduate School, Chinese Academy of Sciences, Beijing, 100039, China</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>3</issue>
         <fpage>R49</fpage>
         <url>http://genomebiology.com/2008/9/3/R49</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18315873</pubid>
               <pubid idtype="doi">10.1186/gb-2008-9-3-r49</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>21</day>
               <month>12</month>
               <year>2007</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>18</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>3</day>
               <month>3</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>03</day>
               <month>03</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Zou et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Diversification of rice</p>
      </shorttitle>
      <shortabs>
         <p>The relationships among all diploid genome types of the rice genus were clarified using 142 single-copy genes</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The completion of rice genome sequencing has made rice and its wild relatives an attractive system for biological studies. Despite great efforts, phylogenetic relationships among genome types and species in the rice genus have not been fully resolved. To take full advantage of rice genome resources for biological research and rice breeding, we will benefit from the availability of a robust phylogeny of the rice genus.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Through screening rice genome sequences, we sampled and sequenced 142 single-copy genes to clarify the relationships among all diploid genome types of the rice genus. The analysis identified two short internal branches around which most previous phylogenetic inconsistency emerged. These represent two episodes of rapid speciation that occurred approximately 5 and 10 million years ago (Mya) and gave rise to almost the entire diversity of the genus. The known chromosomal distribution of the sampled genes allowed the documentation of whole-genome sorting of ancestral alleles during the rapid speciation, which was responsible primarily for extensive incongruence between gene phylogenies and persisting phylogenetic ambiguity in the genus. Random sample analysis showed that 120 genes with an average length of 874 bp were needed to resolve both short branches with 95% confidence.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our phylogenomic analysis successfully resolved the phylogeny of rice genome types, which lays a solid foundation for comparative and functional genomic studies of rice and its relatives. This study also highlights that organismal genomes might be mosaics of conflicting genealogies because of rapid speciation and demonstrates the power of phylogenomics in the reconstruction of rapid diversification.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010019">Plant biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Rice is one of the most important crops in the world, providing the staple food for more than one-half of the world's population <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. The completion of rice genome sequencing has made rice and its wild relatives an increasingly attractive system for biological studies at the genomic level <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Considerable insights have been recently gained into comparative genomics between rice and other cereal crops of the grass family <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> and between the species of the rice genus, <it>Oryza </it><abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. To take full advantage of rice genome resources for basic biological research and rice breeding, we will benefit from the availability of a robust phylogeny of the rice genus.</p>
         <p>The genus <it>Oryza </it>consists of 2 cultivated and approximately 22 wild species distributed in a diverse range of habitats in tropics and subtropics of the world <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. By assessing the degree of meiotic pairing in interspecific hybrids, traditional genome analyses grouped the majority of <it>Oryza </it>species into five diploid and two allotetraploid genome types: A-, B-, C-, E-, F-, BC-, and CD-genomes <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Because of the difficulties in obtaining hybrids with presumably more distantly related species, three additional genomes, G-, HJ-, and HK-genomes, were later recognized based on total genomic DNA hybridization <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> and molecular phylogenetics <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. In <it>Oryza</it>, one-third of extant species are allotetraploids that originated through hybridization between diploid genomes, and, in particular, four (B-, E-, F-, and G-genomes) out of the six diploid genomes each have a single species <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B9">9</abbr><abbr bid="B13">13</abbr></abbrgrp>. Consequently, elucidating the phylogenetic relationships of the diploid rice genomes is critically important for understanding the evolutionary history of the entire genus.</p>
         <p>Despite extensive studies on evolutionary relationships among rice genomes and species <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B12">12</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>, the phylogenetic relationships among genomes remained elusive until a study that sampled all recognized <it>Oryza </it>species and utilized sequences of two nuclear and one chloroplast genes <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. This study supported the monophyly of each of the previously recognized genome types and reconstructed the origins of tetraploid species. Nevertheless, two areas of the phylogeny were left unresolved due to incongruence between gene trees. These included the relationship among A-, B-, and C-genomes and that among the F-genome, G-genome, and the rest of the genus <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The incongruence was highlighted in the rice phylogenetic literature, where all three possible relationships among A-, B-, and C-genomes were suggested <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B12">12</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. More remarkable is the position of the F-genome, which varied from being the most basal lineage of the entire genus <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B19">19</abbr></abbrgrp> to being nested within the recently diverged A-genome <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B20">20</abbr></abbrgrp>.</p>
         <p>The recent decade has witnessed the successful utilization of large quantities of DNA sequences in solving long-standing phylogenetic problems <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. As a growing number of genomes are decoded, phylogenetic reconstruction using genome-wide markers, or phylogenomics <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>, will provide unprecedented opportunities to elucidate the previously controversial evolutionary relationships at all taxonomic levels <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B33">33</abbr></abbrgrp>. In this study, we screened the genome sequences of two rice cultivars and sampled 142 single-copy genes as markers for reconstructing the phylogeny of all diploid rice genomes. This phylogenomic analysis, for the first time, fully resolved the relationships of the rice genome types. It further revealed that two episodes of rapid diversification in the rice genus were responsible for the phylogenetic incongruence that persisted in the previous studies. We suggest that rapid diversification might be widespread in organismal evolution and caution that under rapid speciation, large data sets or phylogenomic approach are required to resolve phylogenetic relationships with a high degree of confidence.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Phylogeny inferred from concatenated sequences of 142 genes</p>
            </st>
            <p>After an extensive screen of rice genome sequences, we identified and sequenced 142 single-copy genes that were most likely free of the paralogy problem for reconstructing the phylogeny of all diploid genome types of <it>Oryza </it>(Table <tblr tid="T1">1</tblr>; see Materials and methods for details of gene screening). These genes are distributed throughout the 12 rice chromosomes and represent a genome-wide sampling of phylogenetic markers (Additional data files 1 and 2). After removing regions with ambiguous alignment, we concatenated the 142 genes into a data matrix of 124,079 bp, with exons accounting for 43% of the total sequence. The concatenated alignment contained 26,838 (21.6%) variable sites, of which 6,753 (5.4%) were phylogenetically informative (Additional data file 2).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Information on the materials used in this study</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Species</p>
                     </c>
                     <c ca="center">
                        <p>Genome</p>
                     </c>
                     <c ca="center">
                        <p>Accession number*</p>
                     </c>
                     <c ca="left">
                        <p>Origin</p>
                     </c>
                     <c ca="center">
                        <p>No. of genes sequenced</p>
                     </c>
                     <c ca="center">
                        <p>No. of sites aligned</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Oryza sativa</it>
                           <sup>&#8224;</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>93-11</p>
                     </c>
                     <c ca="left">
                        <p>China</p>
                     </c>
                     <c ca="center">
                        <p>62</p>
                     </c>
                     <c ca="center">
                        <p>52,092</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>O. rufipogon</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>105480</p>
                     </c>
                     <c ca="left">
                        <p>India</p>
                     </c>
                     <c ca="center">
                        <p>142</p>
                     </c>
                     <c ca="center">
                        <p>124,079</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>O. barthii</it>
                           <sup>&#8224;</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>104132</p>
                     </c>
                     <c ca="left">
                        <p>Cameroon</p>
                     </c>
                     <c ca="center">
                        <p>62</p>
                     </c>
                     <c ca="center">
                        <p>52,092</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>O. punctata</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>B</p>
                     </c>
                     <c ca="center">
                        <p>103903</p>
                     </c>
                     <c ca="left">
                        <p>Tanzania</p>
                     </c>
                     <c ca="center">
                        <p>141</p>
                     </c>
                     <c ca="center">
                        <p>124,079</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>O. officinalis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>C</p>
                     </c>
                     <c ca="center">
                        <p>104972</p>
                     </c>
                     <c ca="left">
                        <p>China</p>
                     </c>
                     <c ca="center">
                        <p>142</p>
                     </c>
                     <c ca="center">
                        <p>124,079</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>O. rhizomatis</it>
                           <sup>&#8224;</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>C</p>
                     </c>
                     <c ca="center">
                        <p>103410</p>
                     </c>
                     <c ca="left">
                        <p>Sri Lanka</p>
                     </c>
                     <c ca="center">
                        <p>62</p>
                     </c>
                     <c ca="center">
                        <p>52,092</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>O. eichingeri</it>
                           <sup>&#8224;</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>C</p>
                     </c>
                     <c ca="center">
                        <p>105415</p>
                     </c>
                     <c ca="left">
                        <p>Sri Lanka</p>
                     </c>
                     <c ca="center">
                        <p>62</p>
                     </c>
                     <c ca="center">
                        <p>52,092</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>O. australiensis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>E</p>
                     </c>
                     <c ca="center">
                        <p>105263, 101410</p>
                     </c>
                     <c ca="left">
                        <p>Australia</p>
                     </c>
                     <c ca="center">
                        <p>135</p>
                     </c>
                     <c ca="center">
                        <p>124,079</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>O. brachyantha</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>F</p>
                     </c>
                     <c ca="center">
                        <p>105151</p>
                     </c>
                     <c ca="left">
                        <p>Sierra Leone</p>
                     </c>
                     <c ca="center">
                        <p>124</p>
                     </c>
                     <c ca="center">
                        <p>124,079</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>O. granulata</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>G</p>
                     </c>
                     <c ca="center">
                        <p>M8-15, 106469</p>
                     </c>
                     <c ca="left">
                        <p>China, Vietnam</p>
                     </c>
                     <c ca="center">
                        <p>124</p>
                     </c>
                     <c ca="center">
                        <p>124,079</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Leersia tisserantti</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>105610</p>
                     </c>
                     <c ca="left">
                        <p>Cameroon</p>
                     </c>
                     <c ca="center">
                        <p>122</p>
                     </c>
                     <c ca="center">
                        <p>124,079</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*All accession numbers were obtained from the International Rice Research Institute at Los Banos, Philippines, except for M8-15, which was collected by the authors. <sup>&#8224;</sup>Sixty-two genes were sequenced for these species and used only for testing the effect of dense sampling. Sequences of <it>O. sativa </it>(<it>93-11</it>) were retrieved from the BGI-RIS database.</p>
               </tblfn>
            </tbl>
            <p>Phylogenetic analyses of the concatenated sequences using maximum likelihood (ML), maximum parsimony (MP) and Bayesian inference (BI) all yielded a single fully resolved tree with high bootstrap support or Bayesian posterior probability (PP) for all internal branches (Figure <figr fid="F1">1</figr>). We labeled these branches as I, II, III, and IV. The relationships between A-, B-, and C-genomes are finally resolved, with the sister relationship between A- and B-genomes supported by 99-100% bootstrap support or PP. The F-genome, which jumped all over the previously reported phylogenies, is firmly placed between the basal G-genome and the rest of the genome types.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>ML tree inferred from the concatenated sequences of 142 genes using the GTR+&#915; model</p>
               </caption>
               <text>
                  <p>ML tree inferred from the concatenated sequences of 142 genes using the GTR+&#915; model. The same topology was obtained from MP and BI. The letters A, B, C, E, F, and G represent all recognized diploid genome types of <it>Oryza</it>, and L represents the outgroup. The names of the species that represent the genome types and outgroup are in parentheses. Numbers above branches indicate bootstrap support of ML and MP, and posterior probability of BI, respectively. Four internal branches of <it>Oryza </it>genome types are indicated with I, II, III, and IV. Branch length is proportional to the number of substitutions measured by the scale bar.</p>
               </text>
               <graphic file="gb-2008-9-3-r49-1"/>
            </fig>
            <p>Because the increase in sequence length or the number of sampled genes does not guarantee the elimination of systematic errors <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>, it is necessary to investigate the potential impact of systematic bias on our phylogenetic reconstruction. First, we tested homogeneity of base composition across species for total, intron, exon, and three codon sites of the concatenated data set. The results indicated that four nucleotide bases occurred in almost equal proportions and the GC content varied little among species for all data partitions (&#967;<sup>2 </sup>tests, <it>P </it>= 0.346-1.0; Additional data file 3). The potential compositional bias was also examined with analysis using log-determinant (LogDet) distance <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. This yielded the same topology as ML, MP, and BI (Table <tblr tid="T2">2</tblr>). These tests suggest that the concatenated data set did not contain compositional signals that could have biased the phylogenetic reconstruction.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Bootstrap support from 1,000 replicates for the four internal branches of phylogenetic trees based on the concatenated sequences using different methods</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4" ca="center">
                        <p>Bootstrap support (%)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Method</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>II</p>
                     </c>
                     <c ca="center">
                        <p>III</p>
                     </c>
                     <c ca="center">
                        <p>IV</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RY-coding strategy (ML)</p>
                     </c>
                     <c ca="center">
                        <p>93</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p>73</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>LogDet distance (NJ)</p>
                     </c>
                     <c ca="center">
                        <p>93</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p>90</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Gene bootstrap (ML)</p>
                     </c>
                     <c ca="center">
                        <p>72</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p>88</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>NJ, neighbor-joining.</p>
               </tblfn>
            </tbl>
            <p>Second, we analyzed rate constancy among lineages using Tajima's relative rate test <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. When the concatenated sequences were considered, results showed that the null hypothesis of rate constancy was rejected in almost all pairs of contrasts (<it>P </it>&lt; 0.01). It is noteworthy that the F-genome evolved at a faster rate and the G-genome evolved at a slower rate than other genomes (Additional data file 4). To explore the potential impact of rate heterogeneity on tree reconstruction, we adopted the RY-coding strategy that discards fast-evolving transitions and consequently makes phylogenetic reconstructions less susceptible to uneven occurrence of multiple hits among lineages <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B38">38</abbr></abbrgrp>. The tree obtained from the re-coded data set was topologically identical to that shown in Figure <figr fid="F1">1</figr> (Table <tblr tid="T2">2</tblr>). To further test the potential long-branch attraction effect of the fast-evolving F-genome, we identified genes that evolved more rapidly in the F-genome than in the A-, B-, and C-genomes. We calculated the ratio of the mean distance between the F-genome and each of A-, B-, and C-genomes to the mean distances among A-, B-, and C-genomes for each gene. We then progressively excluded fast-evolving genes of the F-genome in a decreasing order of the ratios. The topology based on the remaining genes did not change until more than 50 genes were excluded (Additional data file 5). These suggest that rate heterogeneity was not severe enough to cause significant systematic bias.</p>
            <p>Third, to examine the potential systematic errors caused by model misspecification <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B32">32</abbr><abbr bid="B35">35</abbr></abbrgrp>, we applied a series of homogeneous and mixed models in BI and evaluated the relative merits of competing models by Bayes factors. Although Bayes factor comparisons showed that all mixed models outperformed the homogeneous models significantly (Additional data files 6 and 7), analyses with all 14 alternative models, including ones incorporating the covarion model, which accounts for heterotachous signal, yielded the same topology as shown in Figure <figr fid="F1">1</figr>, with all internal branches supported by 100% PP. Taken together, the above analyses indicate that the phylogeny inferred from the concatenated gene sequences was not biased by systematic errors.</p>
            <p>We next tested whether the resulting phylogeny could have been influenced by a subset of the genes. A gene bootstrap analysis was performed with 1,000 replicates <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. For each replicate, we randomly drew 142 genes with replacement from the entire pool. The sampled genes were concatenated and analyzed using ML. The results strongly supported the topology shown in Figure <figr fid="F1">1</figr> (Table <tblr tid="T2">2</tblr>), indicating that the phylogenetic reconstruction was not dominated by a subset of the 142 genes.</p>
            <p>Finally, we investigated whether within-genome sampling would influence the phylogenetic reconstruction. Because the A- and C-genomes have more than one species, while each of the remaining genomes has only one species <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B9">9</abbr><abbr bid="B13">13</abbr></abbrgrp>, we sequenced 62 genes for an additional A-genome species, <it>O. barthii</it>, and two additional C-genome species, <it>O. rhizomatis </it>and <it>O. eichingeri </it>(Table <tblr tid="T1">1</tblr>). Sequences of cultivated rice, <it>O. sativa</it>, belonging to the A-genome were also retrieved from the BGI-RIS Database <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> and added to the data set. Phylogenetic analyses of the 11 species generated the same inter-genome relationship as the one shown in Figure <figr fid="F1">1</figr> (Figure <figr fid="F2">2</figr>). This indicates that one species sampled from each genome was sufficiently representative for the reconstruction of the genome relationships.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>ML tree inferred based on concatenation of 62 genes from 11 species using the GTR+&#915; model</p>
               </caption>
               <text>
                  <p>ML tree inferred based on concatenation of 62 genes from 11 species using the GTR+&#915; model. Numbers above branches indicate bootstrap support of ML and MP, and posterior probability of BI analyses, respectively. Capital letters (A to G) beside the tree specify the genome type of the species. For the species in bold, 142 genes were sequenced and used in the analyses as shown in Figure 1.</p>
               </text>
               <graphic file="gb-2008-9-3-r49-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic incongruence and network analyses</p>
            </st>
            <p>When phylogenetic analyses were done for each of the 142 genes separately, more than 40 different optimal trees were generated, indicative of extensive incongruence among gene phylogenies. To gain insight into the extent of incongruence, we constructed consensus networks from ML trees of the 106-gene data set without missing data. Figure <figr fid="F3">3a</figr> shows the network at a threshold of 0.15, which presents branches appearing at a frequency of 15% or higher of all gene trees. Two boxes are evident in the network, indicating that topological incongruence is concentrated on branch I involving the relationships between A-, B-, and C-genomes and branch IV involving the relationships between the F-genome, G-genome, and the rest of the genome types (R). We also explored the features of consensus networks by increasing the threshold from 0.05 and found that the boxes were collapsed when the threshold reached 0.3 and it ended up with the topology identical to that shown in Figure <figr fid="F1">1</figr> (Additional data file 8). These results further support the phylogenetic relationships revealed by the concatenated data set and highlight the incongruence involving branches I and IV.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Genome-wide incongruence</p>
               </caption>
               <text>
                  <p>Genome-wide incongruence. A, B, C, E, F, and G represent <it>Oryza </it>genome types and L represents the outgroup, <it>Leersia</it>. <b>(a) </b>Consensus network constructed from ML trees at a threshold of 0.15. The two boxes indicate the relatively high levels of incongruence among gene trees associated with internal branches I and IV. Branch length is proportional to the frequency of occurrence of a particular split of all gene trees. R represents the rest of the genome types, including A-, B-, C-, and E-genomes. Color schemes: for the box associated with branch I, blue, orange, and purple illustrate splits supporting alternative topologies, (AB)C, (BC)A, and (AC)B, respectively; for the box associated with branch IV, blue, orange, and purple illustrate splits supporting alternative topologies, (RF)G, (FG)R, and (RG)F, respectively. <b>(b) </b>Pie graphs indicate the proportions of gene trees that support alternative splits in the corresponding boxes at the left. Histograms at the right illustrate the distribution of ML bootstrap support for the corresponding split (in the corresponding colors). <b>(c) </b>Illustration of the relative physical locations of the 142 sampled genes on the 12 rice chromosomes based on rice genome sequences. The colors indicate genes supporting a split or topology coded in the same color in the corresponding boxes on the consensus network. Genes coded in gray are those that had no input in the topology illustrated in the pie graphs and those not included for the construction of the consensus network because of missing data.</p>
               </text>
               <graphic file="gb-2008-9-3-r49-3"/>
            </fig>
            <p>In the first box, the length of parallel edges supporting split AB|CEFGL is longer than those supporting splits AC|BEFGL and BC|AEFGL (Figure <figr fid="F3">3a</figr>), suggesting that a higher proportion of consensus signal groups A and B together. This is in agreement with the result that a larger number of gene trees (53%) support the sister relationship of A and B than those supporting the alternative sister relationship between B and C (26%) or between A and C (21%) (Figure <figr fid="F3">3b</figr>). For the second box, the length of parallel edges supporting split ABCEF|GL is longer than those supporting the two alternative splits. This is also consistent with the result that 45% of gene trees support the sister relationship between R and F while 30% and 25% of gene trees support the sister relationships between R and G or between F and G, respectively (Figure <figr fid="F3">3b</figr>).</p>
            <p>To further explore the incongruence among gene trees, we performed the incongruence length difference (ILD) test based on two partitioning strategies and found that there was no significant incongruence between any pair of the process partitions (intron and three codon positions; Additional data file 9). In contrast, significant heterogeneity was found among gene partitions, including tests among all gene partitions as a whole (<it>P </it>&lt; 0.01) and between pairwise comparisons and between each gene and the remaining genes combined (Additional data file 10). These results were consistent with the distributions of bootstrap support for alternative topologies at the two boxes (Figure <figr fid="F3">3b</figr>). For each box, there is a substantial proportion of high bootstrap support for alternative topologies, suggesting that the competing topologies are well supported on the respective gene trees. Remarkably, genes supporting any given topology are distributed randomly among the 12 chromosomes (&#967;<sup>2 </sup>test, <it>P </it>= 0.233-0.823), indicative of a genome-wide incongruence (Figure <figr fid="F3">3c</figr>).</p>
            <p>To address the question of whether the incongruence among genes is attributed to different evolutionary histories of genes or merely systematic errors <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>, we conducted tests for systematic bias for each of the 142 genes. The Chi-square test revealed that there was no heterogeneity of base composition for any gene. However, rate heterogeneity was detected for some genes by the relative rate test. We then conducted phylogenetic analyses for each gene using different strategies, including ML, MP, RY-coding, and LogDet distance. The comparison of bootstrap 75% majority-rule consensus trees showed that only 4 out of 142 genes yielded incompatible topologies between different methods of analyses (Additional data file 11). This indicates that there are few systematic errors involved in individual genes and the incongruence among gene partitions is governed mainly by different evolutionary histories of genes.</p>
         </sec>
         <sec>
            <st>
               <p>Short branches and their resolution</p>
            </st>
            <p>Different evolutionary histories of genes can be attributed to three major factors, including paralogy, hybridization, and lineage sorting <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. We have largely ruled out the potential effect of paralogy by carefully screening gene markers (see Materials and methods). The pattern of incongruence also does not support hybrid speciation because hybridization would have led to two major incongruent topologies rather than the presence of a leading topology with two alternative topologies occurring at nearly equal frequencies for both clades I and IV. The random distribution on chromosomes of the genes that support a given topology (Figure <figr fid="F3">3c</figr>) does not support the hybridization hypothesis either because related or linked loci should share gene trees if the species have a history of introgression or hybridization <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. Therefore, we are left with the hypothesis of lineage sorting as the primary explanation for the incongruence.</p>
            <p>Population genetic theory suggests that lineage sorting is more likely to occur at an internal branch of a species tree that is short (few in generations) and wide (large in effective population sizes) <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. Based on estimation by the ML method, branches I and IV were the shortest internal braches on the concatenation tree and obtained relatively low support values in analyses with different methodologies (Figure <figr fid="F1">1</figr> and Table <tblr tid="T2">2</tblr>). For branch I, there is a sufficient amount of published data that allow us to estimate the probability to obtain the species tree from a given gene. That is, <it>P </it>= 1 - 2/3exp(-<it>t</it>) under the coalescent model, where <it>t </it>is the time between two speciation events in the unit of generations/2<it>Ne </it>and <it>Ne </it>is the effective population size <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>.</p>
            <p>Using the previously reported nucleotide diversity at silent sites (&#952;<sub>sil </sub>= 0.0038-0.0095) for the A- and C-genome species <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp> and a substitution rate for grasses (5.9 &#215; 10<sup>-9 </sup>substitutions per synonymous site per year) <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>, we estimated that the effective population sizes of these <it>Oryza </it>species ranged from 1.6 &#215; 10<sup>5 </sup>to 4.0 &#215; 10<sup>5</sup>. A speciation model test on three C-genome species suggested that their ancestral population sizes were approximately ten-fold larger than those of each species <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Thus, the ancestral population size of A-, B-, and C-genomes (<it>Ne</it>) should be at least 1.6 &#215; 10<sup>6</sup>. Because the A-genome species began to diverge approximately 2 Mya <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> and divergence between B- and C-genomes occurred approximately 3.8 Mya <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, the time between two speciation events should be less than 1.8 million years. Given the generation time of 1-2 years in wild rice species <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, the number of generations between the two speciation events is at most 1.8 &#215; 10<sup>6</sup>. The estimated upper limit of generations together with the lower limit of <it>Ne </it>led to the calculation of the upper limit of <it>P </it>as 0.62. This implies that there is less than a 62% chance for any given gene tree to be the same as the species tree or less than 62% of gene trees from the sampled genes will be congruent with the species tree. Our finding that 53% of gene trees support the sister relationship of A- and B-genomes agrees with the theoretical expectation (Figure <figr fid="F3">3b</figr>), which further supports the lineage sorting hypothesis.</p>
            <p>For branch IV, the divergence happened at greater depth in the tree and thus homoplasy resulting from mutational saturation might be a factor to cause incongruent gene phylogenies <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B51">51</abbr></abbrgrp>. However, analyses of saturation plots did not reveal any mutational saturation for the concatenated data set (Additional data file 12), suggesting that lineage sorting is still the most plausible explanation for the incongruence.</p>
            <p>To assess how much of the data set might be needed to resolve such short branches, we explored the relationship between the number of genes or nucleotide sites and the proportion of gene trees that support the topology or clades shown in Figure <figr fid="F1">1</figr>. The results demonstrated that the probability of getting identical topology or clades as in Figure <figr fid="F1">1</figr> steadily increased with the number of genes or sites sampled, regardless of methods used, although ML generally performed better than MP (Figure <figr fid="F4">4</figr>). Using 95% of identical gene trees or clades in 500 replicates as a criterion, about 120 genes were needed for both ML and MP methods to resolve branch I and more than 80 (ML) and 120 (MP) genes were needed to resolve branch IV. Additionally, 120 (ML) or more genes (MP) were needed to resolve both branches simultaneously (Figure <figr fid="F4">4</figr> and Additional data file 13).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>The proportions of topologies (or clades) that are identical to those shown in Figure 1 based on resampling of 142 gene sequences at various scales</p>
               </caption>
               <text>
                  <p>The proportions of topologies (or clades) that are identical to those shown in Figure 1 based on resampling of 142 gene sequences at various scales. Results of ML and MP analyses are indicated by blue and red, respectively. Genome types are represented with the same capital letters as in Figure 3.</p>
               </text>
               <graphic file="gb-2008-9-3-r49-4"/>
            </fig>
            <p>When nucleotide sites rather than genes were the unit of resampling, about 40 kb of nucleotides were sufficient to resolve branch I with both methods. This is equivalent to 46 sampled genes in length given the average length of 874 bp per gene. It took approximately 40 kb and 80 kb (approximately 92 genes) for ML and MP, respectively, to resolve branch IV. A total of 50 kb (approximately 57 genes) for ML and 80 kb for MP were sufficient to resolve both branches simultaneously (Figure <figr fid="F4">4</figr> and Additional data file 13). These results indicate that random sampling of unlinked nucleotides has a higher power of phylogenetic resolution than sampling contiguous nucleotides such as those within a gene.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>This study fully resolved the phylogeny of the rice genomes. Through extensive tests and analyses, we demonstrate that the phylogenetic reconstruction based on the sequences of 142 genes was not biased by systematic or sampling errors and was insensitive to phylogenetic methods or model specification. We identified across the genome a remarkable level of incongruence of gene phylogenies at the two shortest internal branches (Figures <figr fid="F1">1</figr> and <figr fid="F3">3</figr>). Our analyses clearly indicated that lineage sorting was a primary cause for the difficulty of resolving two branches of the rice phylogeny that underwent rapid diversification. Even more remarkably, lineage sorting occurred for genes distributed randomly across all 12 rice chromosomes (Figure <figr fid="F3">3c</figr>). This study thus documents a case of genome-wide lineage sorting that gave rise to species with the mosaic of ancestral genomes <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B52">52</abbr></abbrgrp>. One implication of our findings is that special caution must be taken in interpreting phylogenetic relationships of rapidly diverged lineages even though the relationships are strongly supported on a single gene phylogeny. Our results also imply that although it may not be feasible to have a large number of genes to resolve a short branch for groups with limited genomic resources, utilization of a few genes should provide a clue to the extent to which lineage sorting may lead to erroneous phylogenies <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p>The biological implications for the presence of two short branches (I and IV) that reflect two episodes of rapid diversification of the rice genus are profound. Based on a molecular clock estimate, the first event occurred approximately 10 Mya <abbrgrp><abbr bid="B53">53</abbr></abbrgrp> and led to a rapid diversification of the G-genome, F-genome and a lineage that subsequently diversified into the rest of the rice genomes. Additionally, the H-, J-, and K-genomes that are now only present in extant tetraploid species, including <it>O. longiglumis </it>and <it>O. ridleyi </it>with the HJ-genome and <it>O. schlechteri </it>and <it>O. coarctata </it>with the HK-genome, also diverged around this time <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B53">53</abbr></abbrgrp>. The second event led to the diversification of A-, B-, and C-genomes approximately 5 Mya <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B53">53</abbr></abbrgrp>. Therefore, the two episodes of rapid diversification gave rise to almost the entire diversity of the genus. Because the <it>Oryza </it>species are distributed in distinct habitats across four continents <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B50">50</abbr></abbrgrp>, it would be interesting to further investigate whether the rapid diversification was coupled with adaptive radiation under certain geological and ecological conditions <abbrgrp><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr></abbrgrp>.</p>
         <p>Rapid speciation, particularly ancient radiation, featured by the short internal branches in phylogenetic trees, poses an extraordinary challenge to systematic and evolutionary biologists <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B51">51</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>. It has been observed at a variety of time depths ranging from as early as the Cambrian explosion of animal phyla over 550 Mya <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> to as recent as the divergence between human, chimpanzee, and gorilla a few Mya <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>. In many cases, phylogenetic relationships seemed to be an irresolvable polytomy <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B56">56</abbr><abbr bid="B59">59</abbr></abbrgrp> because of the rapid radiations. Such closely spaced series of speciation events was accordingly considered to be "bushes in the Tree of Life" <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. To date, rapid evolutionary radiations have been proposed to be the most plausible explanation for the poorly resolved phylogenies or polytomies in many organisms such as aphids, black flies, bees, birds, turtles, mammals, and higher plants <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B33">33</abbr><abbr bid="B51">51</abbr><abbr bid="B60">60</abbr></abbrgrp>. However, a growing body of evidence showed that many assumed polytomies were 'soft' and could be resolved into sequential bifurcations with additional data and proper methods of phylogenetic analysis <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B59">59</abbr><abbr bid="B61">61</abbr><abbr bid="B62">62</abbr></abbrgrp>. In a study of phylogenetic relationships among tetrapod, coelacanth, and lungfish, Takezaki <it>et al</it>. <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> obtained an irresolvable trichotomy although sequences of 44 nuclear genes were analyzed. Using computer simulation, they concluded that more than 200 loci would have to be analyzed to resolve the relationships among the three lineages if the fish-to-tetrapod transition interval was 10-20 million years long. The once unresolved relationship among human, chimpanzee, and gorilla is a typical example of soft polytomies. Recent analyses with an increased amount of molecular data resolved human and chimpanzee into a sister group <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>. Our results exemplify that rapid speciation within an angiosperm genus can be reliably resolved as long as a sufficient amount of unlinked DNA sequences is available.</p>
         <p>However, we should also realize that the increase in the amount of data alone may not provide a universal solution to all short branches on the Tree of Life. It is theoretically possible that certain branches are not resolvable even with whole genome sequences if time intervals between speciation were extremely short and the speciation events were sufficiently ancient <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B33">33</abbr><abbr bid="B51">51</abbr></abbrgrp>. These branches are considered to be 'hard' polytomies <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B61">61</abbr></abbrgrp>. Nevertheless, both soft and hard polytomies provide historical information on evolutionary processes and a phylogenetic analysis with genome-wide information can be most helpful for understanding the evolutionary histories behind these seemingly problematic, but perhaps intriguing, branches of the Tree of Life.</p>
         <p>For soft polytomies, an obviously interesting question is how many DNA sequences would be needed to resolve rapid speciation considering that DNA sequences have been, and will remain, major sources of biological data <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. The mosaic genome or different evolutionary histories of genes under rapid speciation, in conjunction with other factors associated with species divergence (for example, selection and high homoplasy of ancient speciation <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B51">51</abbr></abbrgrp>), brings about difficulties in resolving speciation events when using a small number of regions/genes or limited characters <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B59">59</abbr></abbrgrp>. This study shows that as many as 120 genes with an average length of 874 bp or 50 kb of randomly sampled nucleotides from 142 genes are needed to resolve clades I and IV simultaneously with over 95% confidence (Figure <figr fid="F4">4</figr>). Clearly, blocks of contiguous nucleotide sites were less powerful in phylogenetic resolution than samples consisting of sites drawn randomly from the genome because nucleotides within genes do not evolve independently <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B63">63</abbr></abbrgrp>. This implies that for the same amount of sequence data, a larger number of unlinked shorter DNA fragments are preferred over a smaller number of larger fragments for resolving short branches.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>As the speed of genome sequencing continues to accelerate, phylogenomics is becoming a growing field of evolutionary biology. The potential of phylogenomics to address fundamental evolutionary questions has yet to be realized with the accumulation of phylogenomic studies for diverse groups of organisms <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. The successful resolution of the rice phylogeny demonstrates the power of phylogenomics in the reconstruction of rapid evolutionary diversification. This study also highlights that organismal genomes might be mosaics of conflicting genealogies because of rapid speciation and exemplifies that phylogenetic relationships of organisms that undergo explosive or rapid diversification can be reliably resolved with increasing amounts of data and improved analytical methodology. A fully resolved rice phylogeny lays a solid foundation for comparative and functional genomic studies of rice and its related species and genera. Combined with the availability of rice genome sequences <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B64">64</abbr></abbrgrp> and the BAC libraries of <it>Oryza </it>species representing all rice genome types <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, this phylogenetic framework will play an important role in the studies of genome evolution, speciation and adaptation, and crop domestication.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Sampling single-copy genes</p>
            </st>
            <p>We used the BGI-RIS Database <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> for gene screening. Similar to the strategy used by Yu <it>et al. </it><abbrgrp><abbr bid="B64">64</abbr></abbrgrp>, we extracted the protein sequences with nr-KOME cDNA <abbrgrp><abbr bid="B65">65</abbr></abbrgrp> evidence and then conducted extensive searches against the genomic sequences of indica rice (<it>93-11</it>) in all six reading frames using TBLASTN at E-values of 10<sup>-7</sup>. To ensure that single-copy genes were used in our analysis, we applied a stringent similarity criterion of 50% in our searches; that is, only protein-coding genes that have no counterpart over 50% similar to themselves in the rice genome were selected for further analyses. Excluding those sequences without syntenic counterparts in the japonica (<it>Nipponbare</it>) genome <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, we got a total of 943 genes as candidates for phylogenetic markers. Using coding sequences of these candidates, we performed BLAST searches against the GenBank database to obtain the gene counterparts from barley, maize, sorghum, wheat, or other species of Poaceae as targets for primer design. On this basis, we designed 162 pairs of primers for amplifying orthologous segments from <it>Oryza </it>species and the outgroup <it>Leersia tisserantti</it>. Finally, 118 genes were kept according to the following criteria: they were sampled randomly from all the 12 rice chromosomes; the amplifying length ranged from 0.5-2.0 kb with an intron length of 30-70% so that adequate information is available at different taxonomic levels; and clear and strong amplified fragments were obtained from the <it>Oryza </it>species and the outgroup. Moreover, we sequenced 24 additional genes that were single copies demonstrated by previous studies (Additional data file 2). All the 142 genes used in this study were mapped onto the chromosomes of indica rice (<it>93-11</it>) (Additional data file 1).</p>
         </sec>
         <sec>
            <st>
               <p>Species sampling, amplification, and sequencing</p>
            </st>
            <p>We sampled six <it>Oryza </it>species, representing all six diploid genomes in the genus, and one <it>Leersia </it>species (<it>L. tisserantti</it>) as outgroup because <it>Leersia </it>is most closely related to <it>Oryza </it><abbrgrp><abbr bid="B12">12</abbr><abbr bid="B53">53</abbr></abbrgrp>. Information on the materials used in this study is listed in Table <tblr tid="T1">1</tblr>. Primers for PCR of all 142 genes are listed in Additional data file 14. Missing or partial sequences of some genes were present in some species because of the amplifying difficulty (Table <tblr tid="T1">1</tblr>). However, missing data in our case did not impact the tree constructions no matter what methods were used because our data set contained sufficient information, consistent with previous computer simulation and empirical investigation <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B25">25</abbr><abbr bid="B66">66</abbr></abbrgrp>.</p>
            <p>PCR amplifications and purification of the products were performed by standard methods. Purified products were sequenced either directly or after cloning into pGEM T-easy vectors (Promega, Madison, WI, USA) if the direct sequencing failed. Sequencing was carried out on an ABI 3730 automated sequencer (Applied Biosystems, Foster City, CA, USA). All sequences obtained in this study have been deposited in the GenBank database (accession numbers <ext-link ext-link-type="gen" ext-link-id="EF577518">EF577518</ext-link> to <ext-link ext-link-type="gen" ext-link-id="EF578433">EF578433</ext-link>, and <ext-link ext-link-type="gen" ext-link-id="EU503348">EU503348</ext-link> to <ext-link ext-link-type="gen" ext-link-id="EU503533">EU503533</ext-link>; Additional data file 14).</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic reconstructions</p>
            </st>
            <p>Individual genes were aligned using T-Coffee <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> and then manually adjusted. Phylogenetic trees were reconstructed by ML, MP and BI methods. ML and MP were implemented with PAUP 4.0b10 <abbrgrp><abbr bid="B68">68</abbr></abbrgrp> and the branch-and-bound algorithm was used for tree searching. A non-parametric bootstrap strategy <abbrgrp><abbr bid="B69">69</abbr></abbrgrp> was used for assessing tree reliability, with 1,000 replicates for MP analysis and with 100 and 500 replicates for ML analysis of the concatenated sequence and single genes, respectively.</p>
            <p>BI was attempted with MrBayes 3.1.2 <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>. Given the sensitivity of the Bayesian method to model misspecification, we explored a series of homogeneous models by combining model components in different ways, including substitution rates among nucleotides (Nst = 1, 2, 6), rate variations across sites (Rates = Equal, Gamma, Propinv, Invgamma), and rate variations across the tree (Covarion = Yes, No) (Additional data file 6). Furthermore, we explored mixed models that accommodate heterogeneity across data partitions by specifying partition-specific substitution models <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>. We applied mixed models to our partitioned data by two schemes (see 'Analysis of systematic bias and congruence tests' below). Mixed models were implemented with separate models for each data partition selected by the program Modeltest 3.7 <abbrgrp><abbr bid="B71">71</abbr></abbrgrp> and model parameters separately estimated, and a rate multiplier (ratepr = variable) was also employed to allow the overall rate to be different across partitions. In all the BI analyses, three independent Markov Chain Monte Carlo runs were executed, each starting with randomly choosing topologies for the four simultaneous chains, one cold and three incrementally heated. The four chains were run for at least 1,000,000 generations until stationarity in Markov chains was achieved, sampling trees every 100 generations with the first 10% of trees sampled discarded as burn-in, and then the posterior probabilities were calculated from the remaining samples.</p>
            <p>We used Bayes factors <abbrgrp><abbr bid="B72">72</abbr></abbrgrp> to evaluate the relative merits of two competing models, with the intention of detecting the effect of model components on our data. This method does not require alternative models to be hierarchically nested, and so it makes possible the comparison of any pair of distinctly different models. A Bayes factor in favor of one model (model 1) over another model (model 0) was calculated as the ratio of their marginal likelihoods and the natural logarithm of marginal likelihood can be approximated by the harmonic mean of the likelihoods of Markov Chain Monte Carlo samples with MrBayes <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>. We calculated twice the natural logarithm of the Bayes factors for the competing model pairs, and interpreted the results according to the rule suggested by Kass and Ratery <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>, which states that a result of 2 to 6 is 'positive' evidence in favor of model 1, a result of 6 to 10 is 'strong' evidence, and a result of >10 is 'very strong' evidence; conversely, a result of &lt;0 provides evidence in favor of model 0.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic network analysis</p>
            </st>
            <p>To combine evidence from different loci without losing the information on independent gene histories, which might be drowned out by suppressing them into a bifurcating tree, several phylogenetic network approaches have been proposed and proven to be useful alternatives when using multi-gene data sets <abbrgrp><abbr bid="B74">74</abbr><abbr bid="B75">75</abbr><abbr bid="B76">76</abbr></abbrgrp>. Consensus network, which is applied to multiple trees with the same set of taxa, is one commonly used network approach and can display simultaneously the conflicting evolutionary hypotheses based on multiple loci in a network fashion <abbrgrp><abbr bid="B74">74</abbr><abbr bid="B76">76</abbr></abbrgrp>. Such conflict or uncertainty might arise from stochastic errors, systematic bias, or biological processes <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. Therefore, phylogenetic networks provide a more inclusive approach than analysis of the concatenated data set because weak or conflicting signals are hidden when genes are concatenated before phylogenetic analysis <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>.</p>
            <p>In the consensus network, areas where all trees have compatible splits (that is, a split is a bipartition of the taxa) will be tree-like (that is, a single branch); in contrast, areas with incompatible splits will be represented by bands of parallel edges, thus forming a potentially hyper-dimensional graph. The degree of denseness of boxes in networks reflects the intensity of contradictory evidence for grouping certain taxa, and the length of an edge is determined by the weight assigned to it <abbrgrp><abbr bid="B74">74</abbr><abbr bid="B75">75</abbr></abbrgrp>. The phylogenetic networks can range from one extreme, a structure of high-dimensional hypercubes in the absence of any common phylogenetic patterns among gene trees, to the other extreme, a unique bifurcating tree in the absence of stochasticity associated with bifurcating evolutionary process <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. By employing the threshold value, we can reduce the visual complexity of resulting graphs by using only the splits that occur in more than a given proportion of all trees.</p>
            <p>In the present study, we constructed consensus networks from optimal ML trees for a 106-gene data set in which sequences of all six diploid genomes and the outgroup were available and included in our consensus network all splits that occurred above a threshold value ranging from 0.05-0.3. In our case, branch lengths were not considered when using optimal ML trees as source trees because we were only interested in the conflict between topologies of gene trees. Thus, edge lengths in the final network are proportional to the number of trees in which a particular split appears. Consensus network was performed by the method described by Holland <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>, in which Python scripts (kindly offered by BR Holland) was first implemented to create Nexus files and then the resulting network was visualized by Spectronet <abbrgrp><abbr bid="B77">77</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of systematic bias and congruence tests</p>
            </st>
            <p>Systematic errors such as compositional signal, rate signal and heterotachous signal might be reinforced as more and more data are considered <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. We first tested the compositional bias resulting from the heterogeneity of nucleotide compositions among lineages by Chi-square test. The LogDet distance <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> was also used to account for compositional bias with the neighbor-joining method. Then Tajima's relative rate test <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> was employed with each pair of <it>Oryza </it>species, using <it>L. tisserantti </it>as outgroup, to test rate constancy. Sequence data were also analyzed under the RY-coding strategy (A and G = R, C and T = Y), which maintains only transversions and thus efficiently reduces saturations by excluding more frequently occurring transitions <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B38">38</abbr></abbrgrp>. In addition, the effect of heterotachous signal was explored by implementing a covarion model in BI.</p>
            <p>Substitutional saturation of the data set was evaluated by plotting observed pairwise distance (uncorrected P-distance) for transitions and transversions against the ML pairwise distances for each pair of taxa. Saturation plots were constructed for total, exon, intron and third codon positions, respectively. Second order polynomial regression lines were fitted to all saturation plots and if the slope of this regression line was zero or negative, the data were considered saturated <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>.</p>
            <p>The ILD test <abbrgrp><abbr bid="B79">79</abbr></abbrgrp>, a character-based test for homogeneity, was used to explore the difference in phylogenetic signal between data partitions. We partitioned the data set by two schemes: four process partitions including intron and each codon positions <abbrgrp><abbr bid="B80">80</abbr></abbrgrp>; and 142 gene partitions along gene boundaries, which may reveal variation in allelic histories that the concatenated data might obscure <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B76">76</abbr></abbrgrp>. Then, we performed three kinds of ILD tests for each type of partition: a test among all partitions simultaneously; a test between all possible pairwise partitions; and a test between single partitions and the rest of the data set combined.</p>
         </sec>
         <sec>
            <st>
               <p>Amount of sequence and phylogenetic resolution</p>
            </st>
            <p>To explore the relationship between the number of genes or nucleotides in a sample and the probability to infer the species tree in our case, we drew random samples of different sizes from the original 142-gene data set without replacement and concatenated each sample for phylogenetic analyses. When sampling genes, we generated samples consisting of 20, 40, 60, ..., 120 genes each for 500 replicates. Similarly, samples with randomly sampled sites in a total length of 10, 20, 30, ...100 kb were generated each for 500 replicates. ML and MP methods were used to determine whether or not the sampling results were affected by reconstruction methods. The branch-and-bound search was used in both methods, with the General Time Reversible (GTR)+&#915; model for ML. The proportion of trees (or clades) identical to that in Figure <figr fid="F1">1</figr> was calculated as the probability that a correct phylogenetic hypothesis will be obtained at a specific data size <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>BI, Bayesian inference; GTR, General Time Reversible; ILD, incongruence length difference; kb, kilo base pairs; LogDet, log-determinant; ML, maximum likelihood; MP, maximum parsimony; Mya, million years ago; PP, Bayesian posterior probability.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> is a figure showing the relative location on rice chromosomes of the 142 genes sampled in this study. Additional file <supplr sid="S2">2</supplr> is a table listing the detailed information on each of 142 loci. Additional file <supplr sid="S3">3</supplr> is a table listing the GC content variation among lineages and the result of Chi-square test for the concatenated data set. Additional file <supplr sid="S4">4</supplr> is a table summarizing the Tajima's relative rate test for concatenated sequences using <it>Leersia </it>as outgroup, with estimates of the ratio of substitution rate between lineages. Additional file <supplr sid="S5">5</supplr> includes figures showing the results of testing the effect of rate bias caused by the fast-evolving genes of the F-genome. Additional file <supplr sid="S6">6</supplr> is a table summarizing 14 alternative models used in BI analyses. Additional file <supplr sid="S7">7</supplr> is a table indicating the effect of model components on model fit judged by Bayes factor comparisons of competing models. Additional file <supplr sid="S8">8</supplr> is a figure showing the consensus networks of a collection of 106 optimal ML trees from the 106 genes with the complete set of seven species, applying thresholds of 0.05, 0.1, 0.15, 0.2, 0.25 and 0.3, respectively. Additional file <supplr sid="S9">9</supplr> is a table presenting the results of the ILD test for pairwise comparisons of process partitions. Additional file <supplr sid="S10">10</supplr> is a table summarizing the number of genes that failed the ILD test with the target gene at <it>P </it>&lt; 0.01 for total, intron, exon and the third codon sites, respectively, and the <it>P </it>value of the ILD test between the target gene and all the rest of genes. Additional file <supplr sid="S11">11</supplr> is a table presenting the topologies of bootstrap 75% majority-rule consensus trees by different methods of analyses for each gene. Additional file <supplr sid="S12">12</supplr> is a figure showing saturation analyses in the concatenated datasets of total, intron, exon, and third codon positions, respectively. Additional file <supplr sid="S13">13</supplr> is a table summarizing the proportions of topology (or clades) identical to those shown in Figure <figr fid="F1">1</figr> inferred from randomly sampled genes or sites in 500 replicates. Additional file <supplr sid="S14">14</supplr> is a table listing the primers for PCR amplification and the GenBank accession numbers of the sequences of 142 loci sampled in this present study.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>The relative location on rice chromosomes of the 142 genes sampled in this study</p>
            </caption>
            <text>
               <p>The relative location on rice chromosomes of the 142 genes sampled in this study.</p>
            </text>
            <file name="gb-2008-9-3-r49-S1.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Detailed information on each of 142 loci</p>
            </caption>
            <text>
               <p>Detailed information on each of 142 loci.</p>
            </text>
            <file name="gb-2008-9-3-r49-S2.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>GC content variation among lineages and the result of Chi-square test for the concatenated data set</p>
            </caption>
            <text>
               <p>GC content variation among lineages and the result of Chi-square test for the concatenated data set.</p>
            </text>
            <file name="gb-2008-9-3-r49-S3.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>Summary of the Tajima's relative rate test for concatenated sequences using <it>Leersia </it>as outgroup, with estimates of the ratio of substitution rate between lineages</p>
            </caption>
            <text>
               <p>Summary of the Tajima's relative rate test for concatenated sequences using <it>Leersia </it>as outgroup, with estimates of the ratio of substitution rate between lineages.</p>
            </text>
            <file name="gb-2008-9-3-r49-S4.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>Results of testing the effect of rate bias caused by the fast-evolving genes of the F-genome</p>
            </caption>
            <text>
               <p>Results of testing the effect of rate bias caused by the fast-evolving genes of the F-genome.</p>
            </text>
            <file name="gb-2008-9-3-r49-S5.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional data file 6</p>
            </title>
            <caption>
               <p>The 14 alternative models used in BI analyses</p>
            </caption>
            <text>
               <p>The 14 alternative models used in BI analyses.</p>
            </text>
            <file name="gb-2008-9-3-r49-S6.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S7">
            <title>
               <p>Additional data file 7</p>
            </title>
            <caption>
               <p>The effect of model components on model fit judged by Bayes factor comparisons of competing models</p>
            </caption>
            <text>
               <p>The effect of model components on model fit judged by Bayes factor comparisons of competing models</p>
            </text>
            <file name="gb-2008-9-3-r49-S7.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S8">
            <title>
               <p>Additional data file 8</p>
            </title>
            <caption>
               <p>Consensus networks of a collection of 106 optimal ML trees from the 106 genes with the complete set of seven species, applying thresholds of 0.05, 0.1, 0.15, 0.2, 0.25 and 0.3, respectively</p>
            </caption>
            <text>
               <p>Consensus networks of a collection of 106 optimal ML trees from the 106 genes with the complete set of seven species, applying thresholds of 0.05, 0.1, 0.15, 0.2, 0.25 and 0.3, respectively.</p>
            </text>
            <file name="gb-2008-9-3-r49-S8.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S9">
            <title>
               <p>Additional data file 9</p>
            </title>
            <caption>
               <p>Results of the ILD test for pairwise comparisons of process partitions</p>
            </caption>
            <text>
               <p>Results of the ILD test for pairwise comparisons of process partitions.</p>
            </text>
            <file name="gb-2008-9-3-r49-S9.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S10">
            <title>
               <p>Additional data file 10</p>
            </title>
            <caption>
               <p>The number of genes that failed the ILD test with the target gene at <it>P </it>&lt; 0.01 for total, intron, exon and the third codon sites, respectively, and the <it>P </it>value of the ILD test between the target gene and all the rest of genes</p>
            </caption>
            <text>
               <p>The number of genes that failed the ILD test with the target gene at <it>P </it>&lt; 0.01 for total, intron, exon and the third codon sites, respectively, and the <it>P </it>value of the ILD test between the target gene and all the rest of genes.</p>
            </text>
            <file name="gb-2008-9-3-r49-S10.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S11">
            <title>
               <p>Additional data file 11</p>
            </title>
            <caption>
               <p>Topologies of bootstrap 75% majority-rule consensus trees by different methods of analyses for each gene</p>
            </caption>
            <text>
               <p>Topologies of bootstrap 75% majority-rule consensus trees by different methods of analyses for each gene.</p>
            </text>
            <file name="gb-2008-9-3-r49-S11.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S12">
            <title>
               <p>Additional data file 12</p>
            </title>
            <caption>
               <p>Saturation analyses in the concatenated datasets of total, intron, exon, and third codon positions, respectively</p>
            </caption>
            <text>
               <p>Saturation analyses in the concatenated datasets of total, intron, exon, and third codon positions, respectively.</p>
            </text>
            <file name="gb-2008-9-3-r49-S12.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S13">
            <title>
               <p>Additional data file 13</p>
            </title>
            <caption>
               <p>Proportions of topology (or clades) identical to those shown in Figure <figr fid="F1">1</figr> inferred from randomly sampled genes or sites in 500 replicates</p>
            </caption>
            <text>
               <p>Proportions of topology (or clades) identical to those shown in Figure <figr fid="F1">1</figr> inferred from randomly sampled genes or sites in 500 replicates.</p>
            </text>
            <file name="gb-2008-9-3-r49-S13.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S14">
            <title>
               <p>Additional data file 14</p>
            </title>
            <caption>
               <p>Primers for PCR amplification and the GenBank accession numbers of the sequences of 142 loci sampled</p>
            </caption>
            <text>
               <p>Primers for PCR amplification and the GenBank accession numbers of the sequences of 142 loci sampled.</p>
            </text>
            <file name="gb-2008-9-3-r49-S14.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>SG, XHZ, and TS designed the study; XHZ, FMZ, LLZ, and SG performed the research; FMZ, JGZ, and JW contributed the gene screening; XHZ, FMZ, LT, SG, and TS analyzed the data; SG, TS, and XHZ interpreted the data and wrote the paper.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank L-B Zhang, Q Zhu, Y-H Yang, H-Z Sun, H-Z Kong and other members of the Ge laboratory (Institute of Botany, CAS) and J Zhang (Beijing Genomics Institute) for technical assistance. We also thank Z Yang (University College London, UK), BS Gaut (University of California at Irvine, USA), and L-B Zhang (Missouri Botanical Garden, USA) for discussions and suggestions. We are grateful to BR Holland (Massey University, New Zealand) for providing Python scripts for creating consensus networks and J Savard (University of Koln, Germany) for providing Perl scripts for gene bootstrapping. We acknowledge the International Rice Research Institute (Los Banos, Philippines) for providing seed samples. This work was supported by the National Basic Research Program of China (2007CB815704), the National Natural Science Foundation of China (30121003, 30430030, 30025005), and grants from the Chinese Academy of Sciences to GS, and the National Science Foundation of USA to TS.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Phylogeny of the genus <it>Oryza </it>as revealed by molecular approaches.</p>
            </title>
            <aug>
               <au>
                  <snm>Ge</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sang</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Hong</snm>
                  <fnm>DY</fnm>
               </au>
            </aug>
            <source>Rice Genetics IV Proceedings of the Fourth International Rice Genetics Symposium: 25 October, 2000; Los Ba&#241;os, Laguna, Philippines</source>
            <publisher>Los Banos, Philippines: International Rice Research Institute</publisher>
            <editor>Khush GS, Brar DS, Hardy B</editor>
            <pubdate>2001</pubdate>
            <fpage>89</fpage>
            <lpage>105</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The map-based sequence of the rice genome.</p>
            </title>
            <aug>
               <au>
                  <cnm>International Rice Genome Sequencing Project</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>436</volume>
            <fpage>793</fpage>
            <lpage>800</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03895</pubid>
                  <pubid idtype="pmpid" link="fulltext">16100779</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The grasses: a case study in macroevolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Kellogg</snm>
                  <fnm>EA</fnm>
               </au>
            </aug>
            <source>Annu Rev Ecol Syst</source>
            <pubdate>2000</pubdate>
            <volume>31</volume>
            <fpage>217</fpage>
            <lpage>238</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1146/annurev.ecolsys.31.1.217</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Rice as a model for comparative genomics of plants.</p>
            </title>
            <aug>
               <au>
                  <snm>Shimamoto</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kyozuka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Annu Rev Plant Biol</source>
            <pubdate>2002</pubdate>
            <volume>53</volume>
            <fpage>399</fpage>
            <lpage>419</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.arplant.53.092401.134447</pubid>
                  <pubid idtype="pmpid" link="fulltext">12221982</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The <it>Oryza </it>Map Alignment Project: the golden path to unlocking the genetic potential of wild rice species.</p>
            </title>
            <aug>
               <au>
                  <snm>Wing</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Ammiraju</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Luo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kudrna</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Goicoechea</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Brar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Mackill</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Soderlund</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>SanMiguel</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>59</volume>
            <fpage>53</fpage>
            <lpage>62</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s11103-004-6237-x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16217601</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Patterns in grass genome evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Curr Opin Plant Biol</source>
            <pubdate>2007</pubdate>
            <volume>10</volume>
            <fpage>176</fpage>
            <lpage>181</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.pbi.2007.01.010</pubid>
                  <pubid idtype="pmpid" link="fulltext">17291821</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The <it>Oryza </it>bacterial artificial chromosome library resource: construction and analysis of 12 deep-coverage large-insert BAC libraries that represent the 10 genome types of the genus <it>Oryza</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Ammiraju</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Luo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goicoechea</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Kudrna</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Mueller</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Talag</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sisneros</snm>
                  <fnm>NB</fnm>
               </au>
               <au>
                  <snm>Blackmon</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fang</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Tomkins</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Brar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>MacKill</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>McCouch</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kurata</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Lambert</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Galbraith</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Arumuganathan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Walling</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Gill</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>SanMiguel</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Soderlund</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wing</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>140</fpage>
            <lpage>147</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1356138</pubid>
                  <pubid idtype="pmpid" link="fulltext">16344555</pubid>
                  <pubid idtype="doi">10.1101/gr.3766306</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Doubling genome size without polyploidization: dynamics of retrotransposition-driven genomic expansions in <it>Oryza australiensis</it>, a wild relative of rice.</p>
            </title>
            <aug>
               <au>
                  <snm>Piegu</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Guyot</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Picault</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Roulin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Saniyal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Collura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Brar</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wing</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Panaud</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>1262</fpage>
            <lpage>1269</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1581435</pubid>
                  <pubid idtype="pmpid" link="fulltext">16963705</pubid>
                  <pubid idtype="doi">10.1101/gr.5290206</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>The genus <it>Oryza </it>L.: current status of taxonomy.</p>
            </title>
            <aug>
               <au>
                  <snm>Vaughan</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>IRRI Res Pap Ser</source>
            <pubdate>1989</pubdate>
            <volume>138</volume>
            <fpage>1</fpage>
            <lpage>21</lpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Origin and cytogenetics of rice.</p>
            </title>
            <aug>
               <au>
                  <snm>Nayar</snm>
                  <fnm>NM</fnm>
               </au>
            </aug>
            <source>Adv Genet</source>
            <pubdate>1973</pubdate>
            <volume>17</volume>
            <fpage>153</fpage>
            <lpage>292</lpage>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Two new genomes in the <it>Oryza </it>complex identified on the basis of molecular divergence analysis using total genomic DNA hybridization.</p>
            </title>
            <aug>
               <au>
                  <snm>Aggarwal</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Brar</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Khush</snm>
                  <fnm>GS</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1997</pubdate>
            <volume>254</volume>
            <fpage>1</fpage>
            <lpage>12</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004380050384</pubid>
                  <pubid idtype="pmpid">9108284</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Phylogeny of rice genomes with emphasis on origins of allotetraploid species.</p>
            </title>
            <aug>
               <au>
                  <snm>Ge</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sang</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Hong</snm>
                  <fnm>DY</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>14400</fpage>
            <lpage>14405</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">24448</pubid>
                  <pubid idtype="pmpid" link="fulltext">10588717</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.25.14400</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>A biosystematic study of the <it>Oryza meyeriana </it>complex (Poaceae).</p>
            </title>
            <aug>
               <au>
                  <snm>Gong</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Borromeo</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>BR</fnm>
               </au>
            </aug>
            <source>Plant Syst Evol</source>
            <pubdate>2000</pubdate>
            <volume>224</volume>
            <fpage>139</fpage>
            <lpage>151</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF00986339</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>A new insight into the genome differentiation in <it>Oryza </it>L. through isozymic studies.</p>
            </title>
            <aug>
               <au>
                  <snm>Second</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Advances in Chromosome and Cell Genetics</source>
            <publisher>New Delhi: Oxford and IBH</publisher>
            <editor>Sharma AK, Sharma A</editor>
            <pubdate>1985</pubdate>
            <fpage>45</fpage>
            <lpage>78</lpage>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Polymorphism and phylogenetic relationships among species in the genus <it>Oryza </it>as determined by analysis of nuclear RFLPs.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>ZY</fnm>
               </au>
               <au>
                  <snm>Second</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tanksley</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>Theor Appl Genet</source>
            <pubdate>1992</pubdate>
            <volume>83</volume>
            <fpage>565</fpage>
            <lpage>581</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF00226900</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Sequence variation in the gene encoding the 10-kDa prolamin in <it>Oryza </it>(Poaceae). I. Phylogenetic implications.</p>
            </title>
            <aug>
               <au>
                  <snm>Mullins</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hilu</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Theor Appl Genet</source>
            <pubdate>2002</pubdate>
            <volume>105</volume>
            <fpage>841</fpage>
            <lpage>846</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00122-002-1056-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">12582908</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Evolutionary trends in genus <it>Oryza</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Sharma</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>Rice Genetics</source>
            <publisher>Los Banos, Philippines: International Rice Research Institute</publisher>
            <pubdate>1986</pubdate>
            <fpage>59</fpage>
            <lpage>67</lpage>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Genetic diversity and phylogenetic relationship as revealed by inter simple sequence repeat (ISSR) polymorphism in the genus <it>Oryza</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Joshi</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>VS</fnm>
               </au>
               <au>
                  <snm>Aggarwal</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Ranjekar</snm>
                  <fnm>PK</fnm>
               </au>
               <au>
                  <snm>Brar</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Theor Appl Genet</source>
            <pubdate>2000</pubdate>
            <volume>100</volume>
            <fpage>1311</fpage>
            <lpage>1320</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/s001220051440</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Phylogenetic analysis of the tribe Oryzeae: total chloroplast DNA restriction fragment analysis (a preliminary report).</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Second</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Rice Genet Newsl</source>
            <pubdate>1989</pubdate>
            <volume>6</volume>
            <fpage>76</fpage>
            <lpage>80</lpage>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Independent amplification of two classes of <it>Tourists </it>in some <it>Oryza </it>species.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Kochert</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genetica</source>
            <pubdate>1997</pubdate>
            <volume>101</volume>
            <fpage>145</fpage>
            <lpage>152</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1018328423736</pubid>
                  <pubid idtype="pmpid" link="fulltext">9692224</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The analysis of 100 genes supports the grouping of three highly divergent amoebae: <it>Dictyostelium</it>, <it>Entamoeba</it>, and <it>Mastigamoeba</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Bapteste</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Brinkmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>DV</fnm>
               </au>
               <au>
                  <snm>Sensen</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Durufle</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Gaasterland</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>1414</fpage>
            <lpage>1419</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">122205</pubid>
                  <pubid idtype="pmpid" link="fulltext">11830664</pubid>
                  <pubid idtype="doi">10.1073/pnas.032662799</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Genome-scale approaches to resolving incongruence in molecular phylogenies.</p>
            </title>
            <aug>
               <au>
                  <snm>Rokas</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Carroll</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>425</volume>
            <fpage>798</fpage>
            <lpage>804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02053</pubid>
                  <pubid idtype="pmpid" link="fulltext">14574403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Genome-scale evidence of the nematode-arthropod clade.</p>
            </title>
            <aug>
               <au>
                  <snm>Dopazo</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Dopazo</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>R41</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1175953</pubid>
                  <pubid idtype="pmpid" link="fulltext">15892869</pubid>
                  <pubid idtype="doi">10.1186/gb-2005-6-5-r41</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Placing paleopolyploidy in relation to taxon divergence: a phylogenetic analysis in legumes using 39 gene families.</p>
            </title>
            <aug>
               <au>
                  <snm>Pfeil</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Schlueter</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Shoemaker</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2005</pubdate>
            <volume>54</volume>
            <fpage>441</fpage>
            <lpage>454</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150590945359</pubid>
                  <pubid idtype="pmpid" link="fulltext">16012110</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Animal evolution and the molecular signature of radiations compressed in time.</p>
            </title>
            <aug>
               <au>
                  <snm>Rokas</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kruger</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Carroll</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>310</volume>
            <fpage>1933</fpage>
            <lpage>1938</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1116759</pubid>
                  <pubid idtype="pmpid" link="fulltext">16373569</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Widespread discordance of gene trees with species tree in <it>Drosophila</it>: evidence for incomplete lineage sorting.</p>
            </title>
            <aug>
               <au>
                  <snm>Pollard</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>VN</fnm>
               </au>
               <au>
                  <snm>Moses</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>e173</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1626107</pubid>
                  <pubid idtype="pmpid" link="fulltext">17132051</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.0020173</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Phylogenomic analysis reveals bees and wasps (Hymenoptera) at the base of the radiation of Holometabolous insects.</p>
            </title>
            <aug>
               <au>
                  <snm>Savard</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tautz</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Richards</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Weinstock</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Gibbs</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Werren</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lercher</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>1334</fpage>
            <lpage>1338</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1626634</pubid>
                  <pubid idtype="pmpid" link="fulltext">17065606</pubid>
                  <pubid idtype="doi">10.1101/gr.5204306</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Detecting and overcoming systematic errors in genome-scale phylogenies.</p>
            </title>
            <aug>
               <au>
                  <snm>Rodriguez-Ezpeleta</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Brinkmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Roure</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lartillot</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Lang</snm>
                  <fnm>BF</fnm>
               </au>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2007</pubdate>
            <volume>56</volume>
            <fpage>389</fpage>
            <lpage>399</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150701397643</pubid>
                  <pubid idtype="pmpid" link="fulltext">17520503</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Rasmussen</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Kellis</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>1932</fpage>
            <lpage>1942</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2099600</pubid>
                  <pubid idtype="pmpid" link="fulltext">17989260</pubid>
                  <pubid idtype="doi">10.1101/gr.7105007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Rooting the eutherian tree: the power and pitfalls of phylogenomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Nishihara</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R199</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2007-8-9-r199</pubid>
                  <pubid idtype="pmpid" link="fulltext">17883877</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Phylogenomics and the reconstruction of the tree of life.</p>
            </title>
            <aug>
               <au>
                  <snm>Delsuc</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Brinkmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>361</fpage>
            <lpage>375</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1603</pubid>
                  <pubid idtype="pmpid" link="fulltext">15861208</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Phylogenomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Delsuc</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Brinkmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lartillot</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Annu Rev Ecol Evol Syst</source>
            <pubdate>2005</pubdate>
            <volume>36</volume>
            <fpage>541</fpage>
            <lpage>562</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1146/annurev.ecolsys.35.112202.130205</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Bushes in the tree of life.</p>
            </title>
            <aug>
               <au>
                  <snm>Rokas</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Carroll</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>e352</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1637082</pubid>
                  <pubid idtype="pmpid" link="fulltext">17105342</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0040352</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Genome-scale phylogeny and the detection of systematic biases.</p>
            </title>
            <aug>
               <au>
                  <snm>Phillips</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Delsuc</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>1455</fpage>
            <lpage>1458</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh137</pubid>
                  <pubid idtype="pmpid" link="fulltext">15084674</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Phylogenomics: the beginning of incongruence?</p>
            </title>
            <aug>
               <au>
                  <snm>Jeffroy</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Brinkmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Delsuc</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>225</fpage>
            <lpage>231</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2006.02.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">16490279</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Recovering evolutionary trees under a more realistic model of sequence evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Lockhart</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Steel</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Hendy</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1994</pubdate>
            <volume>11</volume>
            <fpage>605</fpage>
            <lpage>612</lpage>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Simple methods for testing the molecular evolutionary clock hypothesis.</p>
            </title>
            <aug>
               <au>
                  <snm>Tajima</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1993</pubdate>
            <volume>135</volume>
            <fpage>599</fpage>
            <lpage>607</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1205659</pubid>
                  <pubid idtype="pmpid" link="fulltext">8244016</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Archaeal phylogeny: reexamination of the phylogenetic position of <it>Archaeoglobus fulgidus </it>in light of certain composition-induced artifacts.</p>
            </title>
            <aug>
               <au>
                  <snm>Woese</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Achenbach</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rouviere</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mandelco</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Syst Appl Microbiol</source>
            <pubdate>1991</pubdate>
            <volume>14</volume>
            <fpage>364</fpage>
            <lpage>371</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11540072</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>BGI-RIS: an integrated information resource and comparative analysis workbench for rice genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhao</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Jiao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ren</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ni</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>D377</fpage>
            <lpage>382</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308819</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681438</pubid>
                  <pubid id