<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-6-13</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Adaptive evolution of chloroplast genome structure inferred using a parametric bootstrap approach</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Cui</snm>
               <fnm>Liying</fnm>
               <insr iid="I1"/>
               <email>liying@psu.edu</email>
            </au>
            <au id="A2">
               <snm>Leebens-Mack</snm>
               <fnm>Jim</fnm>
               <insr iid="I1"/>
               <email>jhl10@psu.edu</email>
            </au>
            <au id="A3">
               <snm>Wang</snm>
               <fnm>Li-San</fnm>
               <insr iid="I2"/>
               <email>lswang@med.upenn.edu</email>
            </au>
            <au id="A4">
               <snm>Tang</snm>
               <fnm>Jijun</fnm>
               <insr iid="I3"/>
               <email>jtang@cse.sc.edu</email>
            </au>
            <au id="A5">
               <snm>Rymarquis</snm>
               <fnm>Linda</fnm>
               <insr iid="I4"/>
               <email>lar24@cornell.edu</email>
            </au>
            <au id="A6">
               <snm>Stern</snm>
               <mi>B</mi>
               <fnm>David</fnm>
               <insr iid="I4"/>
               <email>ds28@cornell.edu</email>
            </au>
            <au id="A7" ca="yes">
               <snm>dePamphilis</snm>
               <mi>W</mi>
               <fnm>Claude</fnm>
               <insr iid="I1"/>
               <email>cwd3@psu.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Biology, Institute of Molecular Evolutionary Genetics, and Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USA</p>
            </ins>
            <ins id="I4">
               <p>Boyce Thompson Institute, Cornell University, Ithaca, NY 14853, USA</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2006</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>13</fpage>
         <url>http://www.biomedcentral.com/1471-2148/6/13</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16469102</pubid>
               <pubid idtype="doi">10.1186/1471-2148-6-13</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>15</day>
               <month>6</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>09</day>
               <month>2</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>09</day>
               <month>2</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Cui et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Genome rearrangements influence gene order and configuration of gene clusters in all genomes. Most land plant chloroplast DNAs (cpDNAs) share a highly conserved gene content and with notable exceptions, a largely co-linear gene order. Conserved gene orders may reflect a slow intrinsic rate of neutral chromosomal rearrangements, or selective constraint. It is unknown to what extent observed changes in gene order are random or adaptive. We investigate the influence of natural selection on gene order in association with increased rate of chromosomal rearrangement. We use a novel parametric bootstrap approach to test if directional selection is responsible for the clustering of functionally related genes observed in the highly rearranged chloroplast genome of the unicellular green alga <it>Chlamydomonas reinhardtii</it>, relative to ancestral chloroplast genomes.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Ancestral gene orders were inferred and then subjected to simulated rearrangement events under the random breakage model with varying ratios of inversions and transpositions. We found that adjacent chloroplast genes in <it>C. reinhardtii </it>were located on the same strand much more frequently than in simulated genomes that were generated under a random rearrangement processes (increased sidedness; p &lt; 0.0001). In addition, functionally related genes were found to be more clustered than those evolved under random rearrangements (p &lt; 0.0001). We report evidence of co-transcription of neighboring genes, which may be responsible for the observed gene clusters in <it>C. reinhardtii </it>cpDNA.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Simulations and experimental evidence suggest that both selective maintenance and directional selection for gene clusters are determinants of chloroplast gene order.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The influence of genotype on phenotype is not limited to the coding of peptides and functional RNAs by nucleotide sequences. An organism's phenotype is also affected by the chromosomal arrangement of genes and the interaction of gene products. Comparative genomics has revealed a variety of gene clusters and chromosomal segments that have remained intact over hundreds of millions of years <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Selection for clustering of co-transcribed genes has been hypothesized to influence gene order within bacterial and organelle genomes, where gene clusters typically encode multiple components of a functional pathway <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. For example, the ribosomal proteins are encoded by similar operons in archaebacteria, eubacteria and plastids <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. In eukaryotic genomes, co-expression of neighboring genes is significantly associated with the functional roles of the genes (such as housekeeping genes or genes in the same metabolic pathway) <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. One way that those genes become clustered is through tandem duplication, which usually results in functionally related genes being adjacent. On the other hand, unrelated genes may also be brought together through chromosome rearrangements (recombination, inversion and transposition).</p>
         <p>Unless selection is acting to maintain or promote gene clusters, gene orders in genomes subjected to rearrangements should become randomized with respect to function or co-expression profiles. Significant clustering has been inferred using permutation tests that compare observed physical distances between pairs or blocks of co-expressed or functionally related genes to a null distribution constructed from randomized gene orders <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. However, this approach is limited since the evolutionary history of the genome was not considered. When comparing gene orders among related species, it is possible to estimate the ancestral genome and to simulate a null distribution for changes in gene order using a model. This evolutionary approach can be used to test directly the influence of selection on genome structure, that is, whether present-day genome structure has been influenced by directional selection for clustering of functionally related genes.</p>
         <p>Small genomes, especially those of organelles and bacteria, are well suited to global comparisons of gene order. Like eukaryotic genomes, they are subject to structural changes such as inversion, transposition or translocation, as well as gene loss and (more rarely) gene gain. Chloroplast DNAs in most land plants share a highly conserved gene content and similar gene orders <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Most cpDNAs include two identical regions in opposite orientations called the inverted repeat (IR), flanked by large single copy (LSC) and small single copy (SSC) regions. The IRs generally contain the bacterial-like rRNA gene clusters, and the genes involved in photosynthesis (photosystem I/II, cytochrome b<sub>6</sub>/f, and ATP synthase) are arranged similarly in chloroplast and cyanobacterial genomes <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B7">7</abbr></abbrgrp>. Despite these well-characterized patterns, it is unknown to what extent the conserved gene order reflects a slow intrinsic rate of neutral chromosomal rearrangements, rather than selection against alternative gene orders. A model of neutral rearrangement of gene order is required to test formally whether gene orders evolve under selection which prefers some gene arrangements over others.</p>
         <p>Nadeau and Taylor first proposed a model for the neutral evolution of gene order in comparisons of mouse and human chromosomes <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. This "random breakage model" provides a null hypothesis for the evolution of gene order. It assumes a random distribution of break points and allows all possible gene orders without restrictions. The random breakage model has been used to infer organismal phylogenies from gene order data <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. The gene order difference can be measured using the inversion distance, which is the minimal number of inversions necessary to transform one gene order to another. Currently, the most accurate heuristic approach is implemented in the GRAPPA software <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, which is generally suitable for small taxon sets because the algorithm scores inversion medians for all nodes iteratively across all possible phylogenies. Algorithms for genomes with arbitrary rearrangements, a few deletions and duplications have been developed <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, and the capacity of GRAPPA can be scaled up with the discovering method (DCM) to potentially very large data sets <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p>The random breakage model does not account for recombination hotspots, which have been reported from human-mouse genome comparisons <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. However, at this time it may be difficult to model these hotspots, because the precise locations of reused breakpoints are unknown due to insufficient resolution of gene orders and potential errors in homology assessment given the scale of eukaryotic chromosomes <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Thus, the fragile breakage model <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, as an alternative to the random breakage model, has not been well established.</p>
         <p>Whereas gene order is generally conserved among land plant cpDNAs, very little synteny is observed between this group and cpDNAs of the chlorophytic green algae <it>C. reinhardtii </it><abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp> and <it>Chlorella vulgaris </it><abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. The apparently increased rearrangement rate is associated with invasion by a large number of short dispersed repeats (SDRs), for which the evolutionary distribution is still poorly defined. The large number of rearrangements provides an excellent opportunity to test whether natural selection has preferred some changes in gene order. Here we present novel statistics and parametric tests that lead us to reject the models of random rearrangement in favor of directional selection for clustering of functionally related genes in <it>C. reinhardtii</it>. We also present experimental evidence that adaptive evolution of chloroplast genome structure could be driven by the advantage of concerted regulation conferred by polycistronic transcription.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Functional clusters are not randomly distributed</p>
            </st>
            <p>We compared gene orders of representative cpDNAs from land plants, including tobacco (<it>Nicotiana tabacum</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="NC_001879">NC_001879</ext-link>]) <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and liverwort (<it>Marchantia polymorpha</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="NC_001319">NC_001319</ext-link>]) <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, a charophytic green alga (<it>Chaetosphaeridium globosum </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="NC_004115">NC_004115</ext-link>]) <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, chlorophytic green algae (<it>Nephroselmis olivacea </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="NC_000927">NC_000927</ext-link>] <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, <it>C. vulgaris </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="NC_001865">NC_001865</ext-link>] <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, <it>C. reinhardtii </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="BK000554">BK000554</ext-link>] <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>), a green flagellate alga with uncertain affinities (<it>Mesostigma viride </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="NC_002186">NC_002186</ext-link>]) <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, and the plastid of <it>Cyanophora paradoxa </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="NC_001675">NC_001675</ext-link>] <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> (Figure <figr fid="F1">1</figr>) (<supplr sid="S1">Additional file 1</supplr>, <supplr sid="S3">Additional file 3</supplr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Extensive rearrangement in <it>Chlamydomonas reinhardtii </it>and <it>Chlorella vulgaris </it>cpDNAs</p>
               </caption>
               <text>
                  <p><b>Extensive rearrangement in <it>Chlamydomonas reinhardtii </it>and <it>Chlorella vulgaris </it>cpDNAs</b>. Representative cpDNAs from land plants and green algae are arranged to reflect their phylogenetic relationships. The scale bar indicates 20 kb. Each genome is linearized and drawn as a grey bar. Genes are drawn as colored rectangles and with those encoded on the positive strand above the genome bar. Colored lines connect the homologs included in this study and the functional category is shown by specific colors.</p>
               </text>
               <graphic file="1471-2148-6-13-1"/>
            </fig>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p><b>Gene coding and functional categories</b>. Text file, lists the names of 85 genes included in the study and corresponding functional categories.</p>
               </text>
               <file name="1471-2148-6-13-S1.txt">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional File 3</p>
               </title>
               <text>
                  <p><b>The gene order data set</b>. Text file, contains gene orders of seven chloroplast genomes, computed C<sub>s </sub>and C<sub>f </sub>indices, and the inferred rearrangement phylogeny.</p>
               </text>
               <file name="1471-2148-6-13-S3.txt">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>To measure the genome structure in terms of clustering by chromosome locations and gene functions, we defined "sided blocks" as contiguous genes coded on the same strand of the plastid chromosome, and "functional clusters" as blocks of functionally related genes (see Methods). The randomness in the observed distribution of shared genes in chloroplast genomes with respect to gene function was assessed using a Kolmogorov-Smirnov test. The null hypothesis was rejected in all seven cpDNAs investigated for genes in functional categories such as ATP synthases and electron transport (p &lt;&lt; 0.05, Table <tblr tid="T1">1</tblr>). While this test suggests some degree of functional clustering in all chloroplast genomes, it does not take into account the phylogenetic relationship of these organisms, so it is unclear whether functional clustering in chloroplast genomes is a legacy of genome organization in a cyanobacteria-like ancestor, or the product of selection on gene order in the face of genome rearrangements.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>The Kolmogorov-Smirnov test of gene clustering by the functional category in cpDNAs &#167;</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>cpDNA</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p><it>D</it><sub><it>n </it></sub>(p-value)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Translation and transcription</p>
                     </c>
                     <c ca="left">
                        <p>Photosystem I and II</p>
                     </c>
                     <c ca="left">
                        <p>Electron Transport</p>
                     </c>
                     <c ca="left">
                        <p>ATP synthase</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chlorella</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.214(.6418)</p>
                     </c>
                     <c ca="left">
                        <p>0.488(.0066)</p>
                     </c>
                     <c ca="left">
                        <p>0.750(.0000)</p>
                     </c>
                     <c ca="left">
                        <p>0.833(.0000)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chlamydomonas</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.198(.6866)</p>
                     </c>
                     <c ca="left">
                        <p>0.473(.0060)</p>
                     </c>
                     <c ca="left">
                        <p>0.780(.0000)</p>
                     </c>
                     <c ca="left">
                        <p>0.769(.0000)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Nephroselmis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.209(.6207)</p>
                     </c>
                     <c ca="left">
                        <p>0.484(.0046)</p>
                     </c>
                     <c ca="left">
                        <p>0.703(.0000)</p>
                     </c>
                     <c ca="left">
                        <p>0.846(.0000)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mesostigma</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.275(.2786)</p>
                     </c>
                     <c ca="left">
                        <p>0.549(.0008)</p>
                     </c>
                     <c ca="left">
                        <p>0.769(.0000)</p>
                     </c>
                     <c ca="left">
                        <p>0.846(.0000)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chaetosphaeridium</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.242(.4388)</p>
                     </c>
                     <c ca="left">
                        <p>0.484(.0046)</p>
                     </c>
                     <c ca="left">
                        <p>0.714(.0000)</p>
                     </c>
                     <c ca="left">
                        <p>0.846(.0000)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Marchantia</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.341(.0986)</p>
                     </c>
                     <c ca="left">
                        <p>0.473(.0060)</p>
                     </c>
                     <c ca="left">
                        <p>0.714(.0000)</p>
                     </c>
                     <c ca="left">
                        <p>0.846(.0000)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Nicotiana</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.264(.3283)</p>
                     </c>
                     <c ca="left">
                        <p>0.473(.0060)</p>
                     </c>
                     <c ca="left">
                        <p>0.769(.0000)</p>
                     </c>
                     <c ca="left">
                        <p>0.846(.0000)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>&#167;The test statistic <it>D</it><sub><it>n </it></sub>measures whether the distribution of functionally related genes is random in gene clusters. Total 85 shared genes between seven cpDNAs were included.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Extensive rearrangements from the ancestral chloroplast genome to <it>C. reinhardtii</it></p>
            </st>
            <p>In order to investigate evolutionary changes of gene order, we constructed a phylogeny of seven representative cpDNAs and rooted with the sequence of <it>C. paradoxa </it><abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Maximum parsimony, neighbor joining and maximum likelihood analyses of an alignment of 50 concatenated protein sequences including a total of 19,836 aligned sites (<supplr sid="S2">Additional file 2</supplr>), all yielded identical fully resolved topologies with high bootstrap support (Figure <figr fid="F2">2A</figr>). <it>Mesostigma </it>was placed as a basal charophyte lineage in one previous analysis <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The unrooted phylogeny of seven cpDNAs (Figure <figr fid="F2">2B</figr>) is congruent with the alternative placement of <it>Mesostigma </it>either as a basal charophyte <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> or basal to both charophyte and chlorophyte lineages <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. This tree was used as the reference phylogeny for gene order inference.</p>
            <suppl id="S2">
               <title>
                  <p>Additional File 2</p>
               </title>
               <text>
                  <p><b>Protein alignment matrix</b>. Text file, with a NEXUS format data matrix of concatenated proteins from seven chloroplast genomes and the outgroup, <it>Cyanophora paradoxa</it>.</p>
               </text>
               <file name="1471-2148-6-13-S2.txt">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>The phylogeny of cpDNAs</p>
               </caption>
               <text>
                  <p><b>The phylogeny of cpDNAs</b>. (A) The cpDNA phylogeny based on analysis of 50 concatenated proteins. The phylogeny includes major green plant and algal lineages and the outgroup <it>Cyanophora</it>. The bootstrap support values from maximum parsimony/neighbor joining/maximum likelihood analyses are labeled near each node. (B) Estimated inversion distances considering 85 common genes on the cpDNA phylogeny. There is an increase of rearrangements on branches leading to <it>C. reinhardtii</it>. and <it>C. vulgaris</it>, from a common ancestor indicated by an arrow.</p>
               </text>
               <graphic file="1471-2148-6-13-2"/>
            </fig>
            <p>We scored the orders of 85 genes shared in the seven genomes (Gene orders are in <supplr sid="S3">additional file 3</supplr>). Then we used modified versions of GRAPPA <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B25">25</abbr></abbrgrp> to compute the inversion distance between ancestral nodes and each terminal node (Figure <figr fid="F2">2B</figr>; see Methods). The branches leading to two chlorophytic green algae, <it>C. reinhardtii </it>and <it>C. vulgaris</it>, are much longer than the branches leading to the other taxa, and that many more steps were inferred on the <it>C. reinhardtii </it>lineage relative to the <it>C. vulgaris </it>lineage. Gene duplications or deletions were mapped before scoring the ancestral genomes with inversions, and were not counted as rearrangements. IRs were present in all inferred ancestral nodes, and one copy was lost in <it>C. vulgaris</it>. Ancestral gene orders were scored on all the phylogenies using a two-step approach (see Methods). Due to the computational time limit (the full search for ancestral gene orders may require months), we stopped scoring all possible ancestral gene orders with the data set after 25 days and took the best scored ancestral gene orders at that time (<supplr sid="S4">Additional file 4</supplr>).</p>
            <suppl id="S4">
               <title>
                  <p>Additional File 4</p>
               </title>
               <text>
                  <p><b>Comparison of gene clusters</b>. Text file, shows gene clusters shared between the inferred ancestral genome of <it>C. reinhardtii </it>and <it>C. vulgaris </it>to the cpDNA of <it>C. vulgaris </it>and <it>N. olivacea</it>.</p>
               </text>
               <file name="1471-2148-6-13-S4.txt">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The cpDNAs of two land plants, <it>N. tabacum </it>and <it>M. polymorpha</it>, were separated by an estimated 7 inversions based on the data set. One large inversion (~30 kb) in the LSC region has long been recognized to separate the two genomes <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Additional rearrangements are directly observable through comparison of gene order files for the two species (see <supplr sid="S5">additional file 5</supplr> for the sequences of gene order rearrangements). Using GRAPPA, all rearrangements were inferred as inversions, but the total number of inversion events estimated by GRAPPA may be greater than the true (but unknown) mixture of inversions and transpositions because one transposition could result in the same change in gene order as two or three inversions.</p>
            <suppl id="S5">
               <title>
                  <p>Additional File 5</p>
               </title>
               <text>
                  <p><b>Inversions separating <it>N. tabacum </it>and <it>M. polymorpha </it>cpDNA</b>. Text file, shows the possible scenarios to transform the chloroplast gene order of <it>N. tabacum </it>to <it>M. polymorpha </it>cpDNA through inversions.</p>
               </text>
               <file name="1471-2148-6-13-S5.txt">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Increased order in the genome structure after rearrangements</p>
            </st>
            <p>Two genomic structural characteristics were measured: the propensity of adjacent genes to be clustered on the same strand (using the sidedness index C<sub>s</sub>) and the clustering of functionally related genes (using the functional cluster index, C<sub>f</sub>) (see Methods). Both indices were calculated for the inferred ancestral gene orders and extant daughter lineages. Among land plants and charophytes, the inferred sidedness among ancestral genomes was similar to extant lineages, however, among the chlorophytes an opposite trend was observed, especially in the <it>C. reinhardtii </it>lineage (<supplr sid="S3">Additional file 3</supplr>). The large number of rearrangements in the <it>C. reinhardtii </it>cpDNA lineage resulted in dramatically increased sidedness relative to the inferred most recent common ancestor of <it>C. reinhardtii </it>and <it>C. vulgaris </it>(C<sub>s </sub>ancestor = 0.6966, C<sub>s </sub>observed = 0.8710; Figure <figr fid="F3">3A</figr>). A small increase of C<sub>s </sub>was found in the <it>N. olivacea </it>lineage and there was almost no change in the lineage leading to <it>C. vulgaris</it>. A large increase was also observed in the functional clustering index, C<sub>f</sub>, for <it>C. reinhardti </it>(C<sub>f </sub>ancestor = 0.01674, C<sub>f </sub>observed = 0.03397; Figure <figr fid="F3">3B</figr>), whereas the trend was less profound in other lineages (<supplr sid="S3">Additional file 3</supplr>). Thus, even if the ancestral genome already had a "sided" structure, sidedness increased with genome rearrangements as <it>C. reinhardtii </it>cpDNA evolved. The inferred increase in sidedness and functional clustering in the face of the large number of rearrangements on the lineage leading to <it>C. reinhardtii </it>might be adaptive, if such increases were not expected under random rearrangements.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Comparison of sidedness and functional cluster indices in <it>C. reinhardtii </it>cpDNA to those of simulated genomes</p>
               </caption>
               <text>
                  <p><b>Comparison of sidedness and functional cluster indices in <it>C. reinhardtii </it>cpDNA to those of simulated genomes</b>. (A)The sidedness index C<sub>s </sub>observed in <it>C. reinhardtii </it>(indicated by an arrow) is significantly larger than C<sub>s </sub>of gene orders simulated under the random breakage model (inversion only) and the estimated ancestral genome indicated in Figure 2B. (B) The functional cluster index C<sub>f </sub>for <it>C. reinhardtii </it>(indicated by a solid horizontal line) is greater than that for the inferred ancestral genome (dashed line), in contrast to the decrease predicted by three sets of simulations under the random breakage model. Models 1, 2 and 3 specified the inversion/transposition ratios to be 1:0, 10:1 and 1:1, respectively, in simulations with 10,000 replicates. The box section of the box plot indicates the first quartile, median and the third quartile of the distribution.</p>
               </text>
               <graphic file="1471-2148-6-13-3"/>
            </fig>
            <p>To test the null hypothesis that the changes in C<sub>s </sub>and C<sub>f </sub>were consequence of random genome rearrangements rather than a consequence of directional selection (H<sub>0</sub>: random rearrangement; H<sub>A</sub>: constraints in rearrangements), we simulated random rearrangements starting with the inferred ancestral genome along the branch leading to <it>C. reinhardtii</it>. Although inversions are the most abundant type of rearrangement in cpDNAs <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, we also considered the contribution of transpositions under three inversion to transposition ratios, while the total number of rearrangements was fixed according to the branch length inferred using GRAPPA (Figure <figr fid="F2">2B</figr>). Three simulations with 10,000 replicates were conducted with inversion to transposition ratios of 1:0, 10:1 and 1:1 under the random breakage model. The mean C<sub>s </sub>values for the three sets were 0.5929, 0.6084 and 0.5948, respectively, and the 95% confidence intervals were (0.5056,0.6742), (0.5281, 0.6854) and (0.5169, 0.6742), respectively. All datasets simulated under the random breakage model showed a significant decrease of sidedness from the ancestral level (p &lt; 0.0001). In contrast, the C<sub>s </sub>value calculated for <it>C. reinhardtii </it>increased significantly to 0.8710 (Figure <figr fid="F3">3A</figr>), greatly exceeding the sidedness that would be expected in a genome that had undergone this much evolutionary change relative to its ancestor. Simulations using inferred ancestral genomes for land plant lineages (e.g. <it>N. tabacum</it>) also strongly reject the null hypothesis of random rearrangements (results not shown).</p>
            <p>Given the large number of rearrangements observed in the <it>C. reinhardtii </it>lineage, C<sub>f </sub>was also predicted to decrease significantly under the random breakage model, but C<sub>f </sub>did not decrease as observed in <it>C. reinhardtii </it>(Figure <figr fid="F3">3B</figr>). The simulations with three models described above (all inversions, a small fraction of transpositions, and equal inversions and transpositions) all yielded a large decrease in clustering as expected (the observed C<sub>f </sub>in <it>C. reinhardtii </it>is 0.03397, and the 95% confidence intervals for C<sub>f </sub>in simulated genomes were 0.00744&#8211;0.01401, 0.00812&#8211;0.014299 and 0.0750-0.01418, respectively). When transposition was included in simulations, decreases of C<sub>f </sub>were on a similar scale to the inversion-only simulations. Taken together, these results indicate that the remarkable increase in sidedness and functional clustering observed in <it>C. reinhardtii </it>cpDNA has not been the outcome of solely chance events. Instead, the strong deviation from the range of outcomes expected under various random breakage models implies that the genome structure is the outcome of a directional selective process.</p>
            <p>The increased level of organization in <it>C. reinhardtii </it>cpDNA was associated with both maintenance of ancestral clusters and growth of new clusters. There were six conserved blocks containing 19 of the 85 genes shared between the <it>C. reinhardtii </it>and the <it>C. vulgaris </it>cpDNAs. These blocks include concentrations of genes from a single functional category, such as ribosomal proteins (<it>rpl</it>23-<it>rpl</it>12-<it>rps</it>19, <it>rpl</it>16-<it>rpl</it>14-<it>rps</it>8), Photosystem II (<it>psb</it>L-<it>psb</it>F, <it>psb</it>B-<it>psb</it>T-<it>psb</it>N-<it>psb</it>H), translation apparatus (<it>rrn</it>16-<it>trn</it>I-GAU &#8211; <it>trn</it>A-UGC &#8211; <it>rrn</it>23-<it>rrn</it>5), and ATP synthase subunits (<it>atp</it>F-<it>atp</it>H). Moreover, a number of small clusters of functionally related genes inferred in the ancestral genome were brought together in <it>C. reinhardtii </it>("rearranged clusters" in Figure <figr fid="F4">4B</figr>). These include transcription/translation genes (<it>trn</it>H-M-F; <it>rpl</it>/<it>rps</it>; <it>rps</it>3-<it>rpo</it>C2), electron transport genes (<it>pet</it>A-<it>pet</it>D), and photosynthetic genes (<it>psb</it>D-<it>psa</it>A exon 2-<it>psb</it>J) (Figure <figr fid="F4">4B</figr>). The new clusters contributed to the increase of C<sub>f </sub>in the <it>C. reinhardtii </it>chloroplast genome.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Selected functional clusters from <it>C. reinhardtii </it>cpDNA</p>
               </caption>
               <text>
                  <p><b>Selected functional clusters from <it>C. reinhardtii </it>cpDNA</b>. (A) Evidence for co-transcription of the genes <it>rpl</it>36-<it>rpl</it>23-<it>rpl</it>2-<it>rps</it>19. The gel was loaded with total RNA from wild-type cells, and shows new evidence for co-transcription (see text). The top left lane is an over-exposure of the <it>rpl</it>36 gel. Transcripts 1 and 2 (3.5 and 3.3 kb) are tricistronic <it>rpl</it>36-<it>rpl</it>23-<it>rpl</it>2, transcript 3A (2.5 kb) is <it>rpl</it>36-<it>rpl</it>23, and transcript 3B (2.5 kb) is probably <it>rpl</it>2-<it>rps</it>19. Single gene transcripts are labeled "mono". (B) Rearranged functional clusters, which were absent from the inferred common ancestor of <it>C. reinhardtii </it>and <it>C. vulgaris</it>, were identified in <it>C. reinhardtii </it>(genes connected by bold black lines). Cyan lines connect conserved clusters retained from the ancestor cpDNA. The genes are displayed in the coding direction, and from top to bottom relative to their order in the genome. The exception is <it>psb</it>N, which is on the opposite strand relative to other genes shown (<it>psb</it>T-B-N-H). A scale bar of 1 kb is shown below and at the left of each gene cluster.</p>
               </text>
               <graphic file="1471-2148-6-13-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Coordinated expression of genes in functional clusters</p>
            </st>
            <p>Co-transcription of several clusters shown in Figure <figr fid="F4">4B</figr> has been previously documented, including <it>psb</it>D-<it>psa</it>A exon 2-<it>psb</it>J-<it>atp</it>I <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, <it>psb</it>F-<it>psb</it>L <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, <it>pet</it>A-<it>pet</it>D <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, and <it>psb</it>M-<it>psb</it>Z <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Co-transcription of <it>rpl </it>and <it>rps </it>genes has been found in land plant chloroplasts <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. We documented co-transcription for an additional novel functional cluster, shown in Figure <figr fid="F4">4A</figr>. Using RNA gel blots, tricistronic transcripts of <it>rpl</it>36-<it>rpl</it>23-<it>rpl</it>2 and possibly dicistronic <it>rpl</it>2-<it>rps</it>19 species could be detected. Taken together, it appears that the clusters of functionally related genes observed in <it>C. reinhardtii </it>cpDNA may be frequently co-transcribed.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>By reconstructing the possible ancestral gene order in chloroplast genomes and simulating rearrangements, we have been able to formally test and reject the null hypothesis that <it>C. reinhardtii </it>cpDNA has evolved through random rearrangements. Instead, we found that its observed gene order deviates strongly from the degree of sidedness and clustering expected under a random breakage model. <it>Euglena gracilis </it>cpDNA also has a high degree of sidedness <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, however, the asymmetry of its coding strand is concentrated in one half of the genome and associated with GC content, which could be influenced by asymmetrical replication of the chromosome <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. In <it>C. reinhardtii</it>, the sidedness is not associated with GC content and we hypothesize that it is driven by co-transcription of genes in a functional cluster. Whereas some clusters of co-transcribed genes (e.g. <it>rp</it>l23-<it>rpl</it>2-<it>rps</it>19, <it>rpl</it>16-<it>rpl</it>14-<it>rps</it>8) were maintained in both <it>C. reinhardtii </it>and <it>C. vulgaris</it>, novel clusters clearly formed in the <it>C. reinhardtii </it>lineage (Figure <figr fid="F4">4B</figr>).</p>
         <p>Co-transcription of neighboring genes in the <it>C. reinhardtii </it>chloroplast is a widely documented phenomenon. We demonstrated that in addition to the ribosomal protein clusters, global analyses support the elevated level of clustering of other functionally related genes. The aggregate of genes in clusters include most essential genes involved in translation and transcription, and some photosynthetic genes. Coordinated transcription may play a crucial role in the regulation of plastid gene expression in response to light or circadian rhythms <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>. It is also possible that some clusters contain <it>cis</it>-elements, similar to the artificial polydeoxyadenosine sequences <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, which enhance transcription efficiency. Moreover, most of the putative co-transcription units are not conserved across chlorophytes. Therefore, the majority of functional clusters observed in <it>C. reinhardtii </it>represent new gene arrangements.</p>
         <p>In the chloroplast gene order phylogeny (Figure <figr fid="F2">2B</figr>), the <it>C. reinhardtii </it>lineage resides on a long branch compared to the <it>C. vulgaris </it>lineage, and both genomes are more rearranged than that of <it>N. olivacea</it>, relative to the common ancestral genome of the three chlorophyte lineages. The elevated rate of chloroplast genome rearrangement in <it>C. reinhardtii </it>is associated with invasion of SDRs, which heavily populate the non-coding regions, increasing the total length of the intergenic regions compared to <it>C. vulgaris </it>cpDNA by one-third <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Although simple sequence repeats are common to microbial genomes <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, such elements are rare in most sequenced chloroplast genomes. Within the <it>Chlamydomonas </it>genus (Chlorophyceae), <it>C. reinhardtii </it>and <it>C. gelatinosa </it>cpDNAs exhibit a prevalence of repetitive DNA and a high degree of gene order variation compared to the <it>C. moewusii</it>/<it>C. pitschmannii </it>lineage <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. The sister lineage to <it>C. reinhardtii </it>in our study, <it>C. vulgaris </it>(Trebouxiophyceae), contains numerous cpDNA repeat sequences. Besides chlorophyte algae, members of angiosperm families, including Campanulaceae <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>, Fabaceae <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp> and Geraniaceae <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, also contain repeat elements in rearranged cpDNAs, albeit of a much lower copy number <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. These repeat elements may act as molecular "grease" that facilitates non-homologous recombination and creates a pool of diverse genome structures subject to selective retention. Future investigations will test whether the increased rates of rearrangement in plastid genomes with dispersed repeats typically lead to increased sidedness and functional clustering as we infer for <it>C. reinhardtii</it>.</p>
         <p>Gene order changes reflect relatively rare evolutionary events and are expected to result in much less homoplasy than substitution events in nucleotide or protein sequences over a deep time scale <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Phylogeny reconstruction using GRAPPA is highly accurate even for divergent genomes <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, and thus the ancestral gene orders inferred in our study contained sufficient phylogenetic information. The only other software for genome rearrangement phylogeny, BADGER <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, performed poorly on this data set (results not shown). GRAPPA usually inferred unique ancestral gene orders on many data sets we tested. Furthermore, analyses on simulated data have shown that the inferred gene orders scored almost as well as true ancestral gene orders <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. In our simulation tests of three genomes with 85 genes each, and branch lengths of 50, 20 and 20 (roughly corresponding to the branches leading to <it>C. reinhardtii</it>, <it>C. vulgaris </it>and <it>N. olivacea</it>; see Methods), the average score for ancestral gene orders computed by GRAPPA was only about 7% less than the true scores. In practice, we observed that the less optimal gene orders generally required more rearrangements. Therefore, it is quite likely that any error in our estimation of ancestral gene order has resulted in a downward bias in the inferred number of rearrangements on the branch leading to <it>C. reinhardtii</it>. Increasing the number of rearrangements on this branch would only lead to a more certain rejection of the neutrality of rearrangements.</p>
         <p>The accuracy of ancestral genome reconstruction also depends on the degree of divergence among extant taxa and taxon sampling. For example, accurate reconstruction of ancestral genomes at the mammalian CFTR locus was achieved at the DNA level <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. The high-quality reconstruction was attributed to a dense sampling of syntenic genome sequences from eutherian mammals, and the lack of gene order rearrangement at the locus. Because the <it>C. reinhardtii </it>cpDNA is one of the most rearranged chloroplast genomes sequenced to date, we included all available chlorophyte chloroplast genomes for evolutionary distance estimation and ancestral gene order reconstruction. The accuracy of our ancestral gene order estimation may improve with inclusion of additional chlorophyte plastid gene orders as they become available, but we do not foresee a substantial reduction in the inferred number of rearrangements separating <it>C. reinhardtii </it>and <it>C. vulgaris </it>from their common ancestor.</p>
         <p>Inversions are thought to be much more common than transpositions in chloroplast genome evolution <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, and our estimation of ancestral genome order was made with the assumption that all rearrangements were inversions. However, we did consider the contribution of inversions and transpositions under different scenarios in the simulation from the ancestral genome. It should be noted that there is not a unique phylogeny distance measure using transposition only, because computationally one transposition is equivalent to two or three inversions <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. For this reason, we designed our simulations to allow for various ratios of inversion and transposition events. The results of our simulation study did not vary significantly under these scenarios.</p>
         <p>The GRAPPA-IR algorithm was developed to account for the inverted repeat (IR) region found in most plastid genomes The IR region seems to evolve at a slower rate in both nucleotide sequence and gene order than the single copy regions <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, and frequent intra-molecular recombination homogenizes the two copies <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B51">51</abbr></abbrgrp>. The most conserved gene set in the IR region is the rRNA operon. In IR-containing green plastids, the order of rRNA genes is conserved, but the IR boundaries can vary greatly even within one genus <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. The IR may restrict rearrangements that cross the boundary of single copy regions, and thus concentrate gene order changes within single copy regions. However, this hypothetical constraint of the IR on genome rearrangements seems to have been lost in the <it>C. reinhardtii</it>/<it>C. vulgaris </it>lineage. Notably, both lineages have undergone extensive rearrangements since their divergence from a common ancestor, and they contain only a few conserved clusters encoding rRNA or ribosomal proteins. In either genome, genes that typically reside together in the LSC region have often been scrambled and scattered. When comparing the ancestral genome to the <it>C. vulgaris </it>gene order, there was no distinction of LSC and SSC regions although many large clusters were still shared (<supplr sid="S4">additional file 4</supplr>). If there were constraints on the breakpoint locations, as experimentally identified in bacterial inversion mutants <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, it would limit the possible paths of evolution, and these constraints on the ancestral gene orders would increase the number of rearrangements relative to the estimations derived from GRAPPA. Therefore, as discussed above, our approach of detecting strong deviation from expectation is conservative in that the number of rearrangements may be underestimated.</p>
         <p>Recent studies of plant, animal and fungal genomes have shown that genes involved in the same pathways or genes sharing similar expression patterns are often spatially clustered <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B5">5</abbr><abbr bid="B54">54</abbr></abbrgrp>. In eukaryotes, the operon structure has only been demonstrated in the nematode <it>Caenorhabditis </it><abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. Comparative analyses of yeast genomes indicate that rearrangements brought together duplicate genes forming the <it>DAL </it>cluster involved in allantoin metabolism <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. In this study, we demonstrated that positive selection for increased clustering has influenced gene order in the chloroplast. Gene clusters, as opposed to separated genes, permit polycistronic transcription and thus fewer transcriptional regulation units. Co-transcription may be facilitated by close spacing of genes in cpDNA because transcription termination is inefficient <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. Although post-transcriptional RNA processing often creates multiple single-gene transcripts, co-transcription foments an initial stoichiometric accumulation of RNA corresponding to each gene in a cluster. Thus, large clusters can be advantageous in coordinating gene expression on this level.</p>
         <p>Experimental approaches are necessary to understand whether these gene clusters function as operons. Because chloroplast primary transcripts are heavily processed &#8211; as just one example, the <it>psb</it>B cluster in maize accumulates as at least 15 distinct mRNA species with varying translational capacities <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> &#8211; direct analysis of the functional advantages of clustering in chloroplasts is challenging. Indeed, <it>Chlamydomonas </it>may be a special case, since unlike land plants it has a single rather than multiple RNA polymerases <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. This situation does not allow differential expression by promoter selectivity, and may therefore serve as a selective force that favors physical grouping of genes rather than evolution of promoter sequences of dispersed genes.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>In conclusion, we infer that gene order in the <it>C. reinhardtii </it>plastid evolved in a non-random fashion, and hypothesize that genome structure has been influenced by directional selection acting on variation generated by an increased rate of rearrangement. Our results provide strong evidence that genetic responses to natural selection occur at the level of genome organization. By estimating the ancestral gene order and simulating rearrangements under a null model, we provide a formal demonstration that the chloroplast genome of <it>C. reinhardtii </it>has been shaped by natural selection. Although the model of natural selection on gene order remains to be developed, application of our methods to sequences of additional chlorophyte plastid genomes would help to improve the accuracy of the ancestral genome reconstruction and inferred branch lengths. The complex process of gene duplication and loss in bacterial and eukaryotic nuclear genomes presents challenges to reconstruction of ancestral gene orders. Still, the development of new comparative tools <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> gives us hope that the type of analysis presented in this paper will soon be applicable to eukaryotic genomes.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Functional clustering of chloroplast genes</p>
            </st>
            <p>We defined a "functional cluster" as contiguous genes encoded on one strand from one of the following categories: transcription/translation, photosystem I and II, electron transport (cytochrome b6/f complex), and ATP synthase (See <supplr sid="S1">additional file 1</supplr>).</p>
         </sec>
         <sec>
            <st>
               <p>Kolmogorov-Smirnov test of random clusters</p>
            </st>
            <p>A random cluster consists of genes from any functional category. The n = 85 genes shared in the seven chloroplast genomes shown in Figure <figr fid="F1">1</figr> were divided into 11 equal sized blocks of <it>r</it><sub><it>j </it></sub>= 7 genes and one block of 8 genes so that the block sizes and number of blocks are equal. If <it>m</it><sub><it>ij </it></sub>genes were from the functional category <it>i </it>(total <it>T</it><sub><it>i </it></sub>genes) in the <it>j</it>th block, the observed cumulative frequency was <it>u</it><sub><it>i </it></sub>= &#8721;<sub><it>i</it></sub><it>m</it><sub><it>ij</it></sub>/<it>r</it><sub><it>j</it></sub>. The Kolmogorov-Smirnov test measures the deviation of the observed <it>u</it><sub><it>i </it></sub>from the expected from the random breakage model <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. The test statistic <it>D</it><sub><it>n </it></sub>was calculated for each functional category separately.</p>
            <p>
               <m:math name="1471-2148-6-13-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:msub>
                           <m:mi>D</m:mi>
                           <m:mi>n</m:mi>
                        </m:msub>
                        <m:mo>=</m:mo>
                        <m:mi>max</m:mi>
                        <m:mo>&#8289;</m:mo>
                        <m:mo stretchy="false">[</m:mo>
                        <m:mi>max</m:mi>
                        <m:mo>&#8289;</m:mo>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mfrac>
                           <m:mrow>
                              <m:msub>
                                 <m:mi>T</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                           </m:mrow>
                           <m:mi>n</m:mi>
                        </m:mfrac>
                        <m:mo>&#8722;</m:mo>
                        <m:msub>
                           <m:mi>u</m:mi>
                           <m:mi>i</m:mi>
                        </m:msub>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>,</m:mo>
                        <m:mi>max</m:mi>
                        <m:mo>&#8289;</m:mo>
                        <m:mo stretchy="false">(</m:mo>
                        <m:msub>
                           <m:mi>u</m:mi>
                           <m:mi>i</m:mi>
                        </m:msub>
                        <m:mo>&#8722;</m:mo>
                        <m:mfrac>
                           <m:mrow>
                              <m:msub>
                                 <m:mi>T</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:mo>&#8722;</m:mo>
                              <m:mn>1</m:mn>
                           </m:mrow>
                           <m:mi>n</m:mi>
                        </m:mfrac>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo stretchy="false">]</m:mo>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGebardaWgaaWcbaGaemOBa4gabeaakiabg2da9iGbc2gaTjabcggaHjabcIha4jabcUfaBjGbc2gaTjabcggaHjabcIha4jabcIcaOmaalaaabaGaemivaq1aaSbaaSqaaiabdMgaPbqabaaakeaacqWGUbGBaaGaeyOeI0IaemyDau3aaSbaaSqaaiabdMgaPbqabaGccqGGPaqkcqGGSaalcyGGTbqBcqGGHbqycqGG4baEcqGGOaakcqWG1bqDdaWgaaWcbaGaemyAaKgabeaakiabgkHiTmaalaaabaGaemivaq1aaSbaaSqaaiabdMgaPbqabaGccqGHsislcqaIXaqmaeaacqWGUbGBaaGaeiykaKIaeiyxa0faaa@55C1@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
         </sec>
         <sec>
            <st>
               <p>Phylogeny of chloroplast genomes</p>
            </st>
            <p>Alignments of 50 proteins shared in the 8 chloroplast genomes shown in Figure <figr fid="F2">2A</figr> were concatenated into one data matrix (<supplr sid="S2">Additional file 2</supplr>). 1,000 bootstrap replicates were conducted on the data set using PAUP* 4.0b10 with maximum parsimony and using MEGA with neighbor-joining methods and the Poisson-corrected distance. Maximum likelihood analysis with 100 bootstrap replicates was performed using PHYLIP3.6 with JTT distance and gamma = 0.5. GRAPPA was not used to construct the reference phylogeny.</p>
         </sec>
         <sec>
            <st>
               <p>Inferring ancestral gene orders</p>
            </st>
            <p>The ancestral gene order was inferred from the gene orders of extant genomes on the best-scored tree following two steps. First, the gene contents for the LSC, SSC and IR regions of ancestral genomes of IR-containing cpDNAs were inferred based on parsimony. Changes in gene copy number due to IR expansion or contraction were considered the last step of gene order changes, and thus the gene contents of ancestral genomes were determined. The ancestral gene orders on the phylogeny for five genomes (excluding <it>C. vulgaris </it>and <it>C. reinhardtii</it>) were computed using GRAPPA-IR <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, which is a modified version of GRAPPA that scores rearrangements independently within LSC, SSC or IR. Second, the chlorophyte algal gene orders (the extant chloroplast gene orders of <it>N. olivacea</it>, <it>C. reinhardtii </it>and <it>C. vulgaris </it>and the inferred ancestral genome of <it>N. olivacea </it>from step one) and the gene order of <it>M. viride </it>were used for the inference of the common ancestral gene order of <it>C. vulgaris </it>and <it>C. reinhardtii</it>. The data set contains duplicated <it>trn</it>V-UAC and <it>trn</it>G-GCC in <it>C. vulgaris</it>, <it>trn</it>E-UUC and <it>psb</it>A in <it>C. reinhardtii </it>and three trans-splicing <it>psa</it>A exons in C. <it>reinhardtii</it>. The IR regions contained rRNA genes in the same order and orientation in each genome except that one copy was lost in the lineage leading to <it>C. vulgaris</it>. To score the genomes with gene duplications and deletions, multiple data sets were created each containing genomes with equalized gene contents by the following assignment rules: one copy of each duplicate genes outside the typical IR was chosen; the IR region lost in <it>C. vulgaris </it>was inserted to all possible locations in that genome. Preferably, we should test all these datasets (3,936 total) with inversion medians; however, such computation on one dataset alone will take more than a month. To overcome this limitation, these datasets were computed using breakpoint medians, and the assignment yielded the shortest tree was chosen for a full evaluation by GRAPPA. Because the gene contents of LSC and SSC in <it>C. reinhardtii </it>were different from other chloroplast genomes in the study, we allows free rearrangements such that genes in LSC or SSC could commute across the IR.</p>
         </sec>
         <sec>
            <st>
               <p>Ancestral gene order simulation</p>
            </st>
            <p>A set of simulation experiments were conducted to evaluate the accuracy of ancestral genome reconstruction with long branches. Three genomes with 85 genes each were generated from a defined ancestral gene order, and the branch lengths (inversion distances) were 50, 20 and 20, respectively. The true gene order score was 90 (equals the tree length). The scores were computed for inferred ancestral gene orders by GRAPPA using inversion medians and the random breakage model and then compared to the true score. The experiment was repeated on 30 data sets.</p>
         </sec>
         <sec>
            <st>
               <p>Random genome rearrangement simulation</p>
            </st>
            <p>Gene orders were simulated under the assumption that the rearrangements involve random breakpoints placed between genes. Initial gene orders were set based on the inferred ancestral gene orders estimated. Random rearrangement operations on the initial genomes were performed for the number of replicates according to the number of rearrangements inferred by GRAPPA. The parameters input to the model were the ratios of inversion and transposition (1:0, 10:1, 1:1) to test the sensitivity of the findings to the specific rearrangement model. The simulated genomes had identical gene content but scrambled gene orders relative to those observed in extant genomes, with the exception that inverted repeats were maintained. Test statistics (below) were calculated for each simulated replicate of 10,000 total and the frequency distributions were used to test the null hypothesis of random rearrangement.</p>
         </sec>
         <sec>
            <st>
               <p>Sidedness index (C<sub>s</sub>)</p>
            </st>
            <p>We designed the sidedness index (C<sub>s</sub>) to measure the degree to which neighboring genes are clustered on the same strand (side) of the chromosome. A "sided block" includes only adjacent genes on one strand, and the number of sided blocks in a genome is designated as <it>n</it><sub><it>SB</it></sub>, while the total number of genes is <it>n</it>. C<sub>s </sub>is defined as</p>
            <p>C<sub>s </sub>= (<it>n</it>-<it>n</it><sub><it>SB</it></sub>)/(<it>n</it>-1).</p>
            <p>When C<sub>s </sub>reaches the maximum of 1, all genes are located on one side. If every gene resides on the strand opposite its neighbors, C<sub>s </sub>approaches a minimum of zero.</p>
         </sec>
         <sec>
            <st>
               <p>Functional cluster index (C<sub>f</sub>)</p>
            </st>
            <p>We divided a genome of total <it>n </it>genes to <it>J </it>sided blocks (<it>r</it><sub>1</sub>, <it>r</it><sub>2</sub>,...<it>r</it><sub><it>J</it></sub>). In a block, we assigned genes to functional categories. Let the numbers of genes in the <it>i</it>th functional category and the <it>j</it>th block be <it>m</it><sub><it>ij</it></sub>, the functional cluster index C<sub>f </sub>is</p>
            <p>
               <m:math name="1471-2148-6-13-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:msub>
                           <m:mtext>C</m:mtext>
                           <m:mtext>f</m:mtext>
                        </m:msub>
                        <m:mo>=</m:mo>
                        <m:mfrac>
                           <m:mn>1</m:mn>
                           <m:mi>J</m:mi>
                        </m:mfrac>
                        <m:mstyle displaystyle="true">
                           <m:msubsup>
                              <m:mo>&#8721;</m:mo>
                              <m:mrow>
                                 <m:mi>j</m:mi>
                                 <m:mo>=</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                              <m:mi>J</m:mi>
                           </m:msubsup>
                           <m:mrow>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>r</m:mi>
                                       <m:mi>j</m:mi>
                                    </m:msub>
                                 </m:mrow>
                                 <m:mi>n</m:mi>
                              </m:mfrac>
                              <m:mstyle displaystyle="true">
                                 <m:msubsup>
                                    <m:mo>&#8721;</m:mo>
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                    <m:mn>4</m:mn>
                                 </m:msubsup>
                                 <m:mrow>
                                    <m:mrow>
                                       <m:mrow>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mtable>
                                                   <m:mtr>
                                                      <m:mtd>
                                                         <m:mrow>
                                                            <m:msub>
                                                               <m:mi>m</m:mi>
                                                               <m:mrow>
                                                                  <m:mi>i</m:mi>
                                                                  <m:mi>j</m:mi>
                                                               </m:mrow>
                                                            </m:msub>
                                                         </m:mrow>
                                                      </m:mtd>
                                                   </m:mtr>
                                                   <m:mtr>
                                                      <m:mtd>
                                                         <m:mn>2</m:mn>
                                                      </m:mtd>
                                                   </m:mtr>
                                                </m:mtable>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                       </m:mrow>
                                       <m:mo>/</m:mo>
                                       <m:mrow>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mtable>
                                                   <m:mtr>
                                                      <m:mtd>
                                                         <m:mrow>
                                                            <m:msub>
                                                               <m:mi>r</m:mi>
                                                               <m:mi>j</m:mi>
                                                            </m:msub>
                                                         </m:mrow>
                                                      </m:mtd>
                                                   </m:mtr>
                                                   <m:mtr>
                                                      <m:mtd>
                                                         <m:mn>2</m:mn>
                                                      </m:mtd>
                                                   </m:mtr>
                                                </m:mtable>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                       </m:mrow>
                                    </m:mrow>
                                    <m:mo>.</m:mo>
                                 </m:mrow>
                              </m:mstyle>
                           </m:mrow>
                        </m:mstyle>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqqGdbWqdaWgaaWcbaGaeeOzaygabeaakiabg2da9maalaaabaGaeGymaedabaGaemOsaOeaamaaqadabaWaaSaaaeaacqWGYbGCdaWgaaWcbaGaemOAaOgabeaaaOqaaiabd6gaUbaadaaeWaqaamaalyaabaWaaeWaaeaafaqabeGabaaabaGaemyBa02aaSbaaSqaaiabdMgaPjabdQgaQbqabaaakeaacqaIYaGmaaaacaGLOaGaayzkaaaabaWaaeWaaeaafaqabeGabaaabaGaemOCai3aaSbaaSqaaiabdQgaQbqabaaakeaacqaIYaGmaaaacaGLOaGaayzkaaaaaiabc6caUaWcbaGaemyAaKMaeyypa0JaeGymaedabaGaeGinaqdaniabggHiLdaaleaacqWGQbGAcqGH9aqpcqaIXaqmaeaacqWGkbGsa0GaeyyeIuoaaaa@50F0@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>A larger value of C<sub>f </sub>indicates that functionally related genes are more clustered into blocks.</p>
         </sec>
         <sec>
            <st>
               <p>RNA analysis</p>
            </st>
            <p>Wild-type CC-124 cells were grown in Tris-Acetate-Phosphate medium <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> under continuous light to mid-log phase. RNA was isolated from 10 mL of cells as previously described <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. For filter hybridization, 5 &#956;g of total RNA was fractionated in 1.2% agarose and 6% formaldehyde gels, transferred to nylon membranes, and probed with gene-specific PCR products labeled by random priming according to Church and Gilbert <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>List of Abbreviations</p>
         </st>
         <p>cpDNA, chloroplast DNA; IR, inverted repeat; SDR, short dispersed repeat</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>LC conducted the analysis and drafted the manuscript. JLM and CWD conceived the study, helped with the analyses and contributed to the text. LSW contributed the code for the genome simulator. JT carried out the ancestral genome reconstruction. LR conducted the RNA analysis. DBS provided further experimental data review and revision of the draft. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank A. Jarosz, H. Ma, J. Marden, W. Martin, W. Miller, and D. Schemske for valuable suggestions and comments. This work was supported by NSF awards DBI 0115684 and DEB 0120709 to CWD. JT is supported by the University of South Carolina and part of the work was done while he was visiting the National Evolutionary Synthesis Center. <it>Chlamydomonas </it>genomics work at BTI was supported by NSF awards MCB 9975765 and MCB 0091020 to DBS.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The evolutionary dynamics of eukaryotic gene order</p>
            </title>
            <aug>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Pal</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lercher</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>299</fpage>
            <lpage>310</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1319</pubid>
                  <pubid idtype="pmpid" link="fulltext">15131653</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Primary structure of cotranscribed genes encoding the Rieske Fe-S and cytochrome f proteins of the cyanobacterium Nostoc PCC 7906</p>
            </title>
            <aug>
               <au>
                  <snm>Kallas</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Spiller</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Malkin</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1988</pubdate>
            <volume>85</volume>
            <fpage>5794</fpage>
            <lpage>5798</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">281851</pubid>
                  <pubid idtype="pmpid" link="fulltext">2842748</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Gene-cluster analysis in chloroplast genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Stoebe</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kowallik</snm>
                  <fnm>KV</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <fpage>344</fpage>
            <lpage>347</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(99)01815-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">10461201</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Clustering of housekeeping genes provides a unified model of gene order in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Lercher</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Urrutia</snm>
                  <fnm>AO</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <fpage>180</fpage>
            <lpage>183</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng887</pubid>
                  <pubid idtype="pmpid" link="fulltext">11992122</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Genomic gene clustering analysis of pathways in eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>875</fpage>
            <lpage>882</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">430880</pubid>
                  <pubid idtype="pmpid" link="fulltext">12695325</pubid>
                  <pubid idtype="doi">10.1101/gr.737703</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Evolution of chloroplast and mitochondrial DNA in plants and algae</p>
            </title>
            <aug>
               <au>
                  <snm>Palmer</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Molecular Evolutionary Genetics</source>
            <publisher>New York, Plenum Press</publisher>
            <editor>MacIntyre RJ</editor>
            <series>
               <title>
                  <p>Monographs in evolutionary biology</p>
               </title>
               <editor>Hecht M K, Wallace B, Prance G T</editor>
            </series>
            <pubdate>1985</pubdate>
            <fpage>131</fpage>
            <lpage>240</lpage>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Chloroplast ATPase genes in the diatom Odontella sinensis reflect cyanobacterial characters in structure and arrangement</p>
            </title>
            <aug>
               <au>
                  <snm>Pancic</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Strotmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kowallik</snm>
                  <fnm>KV</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1992</pubdate>
            <volume>224</volume>
            <fpage>529</fpage>
            <lpage>536</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0022-2836(92)91017-J</pubid>
                  <pubid idtype="pmpid" link="fulltext">1532839</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Length of chromosome segments conserved since divergence of man and mouse</p>
            </title>
            <aug>
               <au>
                  <snm>Nadeau</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1984</pubdate>
            <volume>81</volume>
            <fpage>814</fpage>
            <lpage>818</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">344928</pubid>
                  <pubid idtype="pmpid" link="fulltext">6583681</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>A new fast heuristic for computing the breakpoint phylogeny and experimental phylogenetic analyses of real and synthetic data</p>
            </title>
            <aug>
               <au>
                  <snm>Cosner</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Jansen</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Moret</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Raubeson</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Warnow</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Wyman</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc Int Conf Intell Syst Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>8</volume>
            <fpage>104</fpage>
            <lpage>115</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10977071</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>New approaches for reconstructing phylogenies from gene order data</p>
            </title>
            <aug>
               <au>
                  <snm>Moret</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Warnow</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Wayman</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>S165</fpage>
            <lpage>S173</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11473006</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Phylogenetic reconstruction from gene-rearrangement data with unequal gene content</p>
            </title>
            <aug>
               <au>
                  <snm>Tang</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Moret</snm>
                  <fnm>BME</fnm>
               </au>
            </aug>
            <source>Lecture Notes in Computer Science</source>
            <publisher>Berlin, Springer-Verlag</publisher>
            <pubdate>2003</pubdate>
            <volume>2748</volume>
            <fpage>37</fpage>
            <lpage>46</lpage>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Scaling up accurate phylogenetic reconstruction from gene-order data</p>
            </title>
            <aug>
               <au>
                  <snm>Tang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Moret</snm>
                  <fnm>BM</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19 Suppl 1</volume>
            <fpage>i305</fpage>
            <lpage>12</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg1042</pubid>
                  <pubid idtype="pmpid" link="fulltext">12855474</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Human and mouse genomic sequences reveal extensive breakpoint reuse in mammalian evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Pevzner</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Tesler</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>7672</fpage>
            <lpage>7677</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">164646</pubid>
                  <pubid idtype="pmpid" link="fulltext">12810957</pubid>
                  <pubid idtype="doi">10.1073/pnas.1330369100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Genomic features in the breakpoint regions between syntenic blocks</p>
            </title>
            <aug>
               <au>
                  <snm>Trinh</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McLysaght</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sankoff</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20 Suppl 1</volume>
            <fpage>I318</fpage>
            <lpage>I325</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth934</pubid>
                  <pubid idtype="pmpid" link="fulltext">15262815</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Extensive gene rearrangements in the chloroplast DNAs of Chlamydomonas species featuring multiple dispersed repeats</p>
            </title>
            <aug>
               <au>
                  <snm>Boudreau</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1996</pubdate>
            <volume>13</volume>
            <fpage>233</fpage>
            <lpage>243</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8583896</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Chlamydomonas chloroplast chromosome: islands of genes in a sea of repeats</p>
            </title>
            <aug>
               <au>
                  <snm>Maul</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Lilly</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>dePamphilis</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>EH</fnm>
               </au>
               <au>
                  <snm>Stern</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2002</pubdate>
            <volume>14</volume>
            <fpage>2659</fpage>
            <lpage>2679</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">153795</pubid>
                  <pubid idtype="pmpid" link="fulltext">12417694</pubid>
                  <pubid idtype="doi">10.1105/tpc.006155</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Complete nucleotide sequence of the chloroplast genome from the green alga Chlorella vulgaris: the existence of genes possibly involved in chloroplast division</p>
            </title>
            <aug>
               <au>
                  <snm>Wakasugi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nagai</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kapoor</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sugita</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ito</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ito</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tsudzuki</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nakashima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tsudzuki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hamada</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ohta</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Inamura</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yoshinaga</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sugiura</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <fpage>5967</fpage>
            <lpage>5972</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">20890</pubid>
                  <pubid idtype="pmpid" link="fulltext">9159184</pubid>
                  <pubid idtype="doi">10.1073/pnas.94.11.5967</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The Complete Nucleotide-Sequence of the Tobacco Chloroplast Genome - Its Gene Organization and Expression</p>
            </title>
            <aug>
               <au>
                  <snm>Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ohme</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wakasugi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hayashida</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Matsubayashi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Zaita</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Chunwongse</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Obokata</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yamaguchishinozaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ohto</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Torazawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Meng</snm>
                  <fnm>BY</fnm>
               </au>
               <au>
                  <snm>Sugita</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Deno</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kamogashira</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yamada</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kusuda</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Takaiwa</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kato</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tohdoh</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Shimada</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sugiura</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Embo Journal</source>
            <pubdate>1986</pubdate>
            <volume>5</volume>
            <fpage>2043</fpage>
            <lpage>2049</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1167080</pubid>
                  <pubid idtype="pmpid">16453699</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Structure and organization of Marchantia polymorpha chloroplast genome. I. Cloning and gene identification</p>
            </title>
            <aug>
               <au>
                  <snm>Ohyama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Fukuzawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kohchi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sano</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shirai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Umesono</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shiki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Takeuchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>Z</fnm>
               </au>
               <etal/>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1988</pubdate>
            <volume>203</volume>
            <fpage>281</fpage>
            <lpage>298</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0022-2836(88)90001-0</pubid>
                  <pubid idtype="pmpid">2462054</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>The chloroplast and mitochondrial genome sequences of the charophyte Chaetosphaeridium globosum: insights into the timing of the events that restructured organelle DNAs within the green algal lineage that led to land plants</p>
            </title>
            <aug>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Otis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>11275</fpage>
            <lpage>11280</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">123247</pubid>
                  <pubid idtype="pmpid" link="fulltext">12161560</pubid>
                  <pubid idtype="doi">10.1073/pnas.162203299</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The complete chloroplast DNA sequence of the green alga Nephroselmis olivacea: insights into the architecture of ancestral chloroplast genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Otis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>10248</fpage>
            <lpage>10253</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">17874</pubid>
                  <pubid idtype="pmpid" link="fulltext">10468594</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.18.10248</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Ancestral chloroplast genome in Mesostigma viride reveals an early branch of green plant evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Otis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>649</fpage>
            <lpage>652</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35001059</pubid>
                  <pubid idtype="pmpid" link="fulltext">10688199</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The cyanelles of Cyanophora paradoxa</p>
            </title>
            <aug>
               <au>
                  <snm>Loffelhardt</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Bohnert</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Crit Rev Plant Sci</source>
            <pubdate>1997</pubdate>
            <volume>16</volume>
            <fpage>393</fpage>
            <lpage>413</lpage>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The closest living relatives of land plants</p>
            </title>
            <aug>
               <au>
                  <snm>Karol</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>McCourt</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Cimino</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Delwiche</snm>
                  <fnm>CF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>294</volume>
            <fpage>2351</fpage>
            <lpage>2353</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1065156</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743201</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Inferring ancestral chloroplast genomes with duplications</p>
            </title>
            <aug>
               <au>
                  <snm>Cui</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Moret</snm>
                  <fnm>BME</fnm>
               </au>
               <au>
                  <snm>dePamphilis</snm>
                  <fnm>CW</fnm>
               </au>
            </aug>
            <source>TR-CS-2005-08</source>
            <publisher>, University of New Mexico</publisher>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Contrasting modes and tempos of genome evolution in land plant organelles</p>
            </title>
            <aug>
               <au>
                  <snm>Palmer</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1990</pubdate>
            <volume>6</volume>
            <fpage>115</fpage>
            <lpage>120</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0168-9525(90)90125-P</pubid>
                  <pubid idtype="pmpid">2132730</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Gene rearrangements in Chlamydomonas chloroplast DNAs are accounted for by inversions and by the expansion/contraction of the inverted repeat</p>
            </title>
            <aug>
               <au>
                  <snm>Boudreau</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>1995</pubdate>
            <volume>27</volume>
            <fpage>351</fpage>
            <lpage>364</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00020189</pubid>
                  <pubid idtype="pmpid">7888624</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Mutant phenotypes support a trans-splicing mechanism for the expression of the tripartite psaA gene in the C. reinhardtii chloroplast</p>
            </title>
            <aug>
               <au>
                  <snm>Choquet</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Goldschmidt-Clermont</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Girard-Bascou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kuck</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Bennoun</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rochaix</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1988</pubdate>
            <volume>52</volume>
            <fpage>903</fpage>
            <lpage>914</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(88)90432-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">3280139</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>An unusual organization of the genes encoding cytochrome b559 in Chlamydomonas reinhardtii: psbE and psbF genes are separately transcribed from different regions of the plastid chromosome</p>
            </title>
            <aug>
               <au>
                  <snm>Mor</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Ohad</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Hirschberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pakrasi</snm>
                  <fnm>HB</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1995</pubdate>
            <volume>246</volume>
            <fpage>600</fpage>
            <lpage>604</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00298966</pubid>
                  <pubid idtype="pmpid">7700232</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The petD gene is transcribed by functionally redundant promoters in Chlamydomonas reinhardtii chloroplasts</p>
            </title>
            <aug>
               <au>
                  <snm>Sturm</snm>
                  <fnm>NR</fnm>
               </au>
               <au>
                  <snm>Kuras</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Buschlen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sakamoto</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Kindle</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Stern</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Wollman</snm>
                  <fnm>FA</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>1994</pubdate>
            <volume>14</volume>
            <fpage>6171</fpage>
            <lpage>6179</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">359144</pubid>
                  <pubid idtype="pmpid">8065350</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Inversions in the Chlamydomonas chloroplast genome suppress a petD 5' untranslated region deletion by creating functional chimeric mRNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Higgs</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Kuras</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kindle</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Wollman</snm>
                  <fnm>FA</fnm>
               </au>
               <au>
                  <snm>Stern</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>663</fpage>
            <lpage>671</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-313x.1998.00165.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">9681031</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>RNA editing in hornwort chloroplasts makes more than half the genes functional</p>
            </title>
            <aug>
               <au>
                  <snm>Kugita</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yamamoto</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Fujikawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Matsumoto</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yoshinaga</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>2417</fpage>
            <lpage>2423</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">154213</pubid>
                  <pubid idtype="pmpid" link="fulltext">12711687</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg327</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Strand asymmetry and codon usage bias in the chloroplast genome of Euglena gracilis</p>
            </title>
            <aug>
               <au>
                  <snm>Morton</snm>
                  <fnm>BR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>5123</fpage>
            <lpage>5128</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">21827</pubid>
                  <pubid idtype="pmpid" link="fulltext">10220429</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.9.5123</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Light affects the structure of Chlamydomonas chloroplast chromosomes</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Mosig</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1990</pubdate>
            <volume>18</volume>
            <fpage>2625</fpage>
            <lpage>2631</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">330745</pubid>
                  <pubid idtype="pmpid">2339053</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Searching limiting steps in the expression of chloroplast-encoded proteins: relations between gene copy number, transcription, transcript abundance and translation rate in the chloroplast of Chlamydomonas reinhardtii</p>
            </title>
            <aug>
               <au>
                  <snm>Eberhard</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Drapier</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wollman</snm>
                  <fnm>FA</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <fpage>149</fpage>
            <lpage>160</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-313X.2002.01340.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">12121445</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Insertion of polydeoxyadenosine-rich sequences into an intergenic region increases transcription in Chlamydomonas reinhardtii chloroplasts</p>
            </title>
            <aug>
               <au>
                  <snm>Lisitsky</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Rott</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Planta</source>
            <pubdate>2001</pubdate>
            <volume>212</volume>
            <fpage>851</fpage>
            <lpage>857</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004250000449</pubid>
                  <pubid idtype="pmpid" link="fulltext">11346961</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Simple sequence repeats in the Helicobacter pylori genome</p>
            </title>
            <aug>
               <au>
                  <snm>Saunders</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Peden</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Hood</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Moxon</snm>
                  <fnm>ER</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1998</pubdate>
            <volume>27</volume>
            <fpage>1091</fpage>
            <lpage>1098</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1998.00768.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">9570395</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Chloroplast DNA variation in Chlamydomonas and its potential application to the systematics of this genus</p>
            </title>
            <aug>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Biosystems</source>
            <pubdate>1985</pubdate>
            <volume>18</volume>
            <fpage>293</fpage>
            <lpage>298</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0303-2647(85)90029-2</pubid>
                  <pubid idtype="pmpid">4084678</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Conserved gene clusters in the highly rearranged chloroplast genomes of Chlamydomonas moewusii and Chlamydomonas reinhardtii</p>
            </title>
            <aug>
               <au>
                  <snm>Boudreau</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Otis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>1994</pubdate>
            <volume>24</volume>
            <fpage>585</fpage>
            <lpage>602</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00023556</pubid>
                  <pubid idtype="pmpid">8155879</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>The highly rearranged chloroplast genome of Trachelium caeruleum (Campanulaceae): multiple inversions, inverted repeat expansion and contraction, transposition, insertions/deletions, and several repeat families</p>
            </title>
            <aug>
               <au>
                  <snm>Cosner</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Jansen</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Palmer</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Downie</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Curr Genet</source>
            <pubdate>1997</pubdate>
            <volume>31</volume>
            <fpage>419</fpage>
            <lpage>429</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s002940050225</pubid>
                  <pubid idtype="pmpid" link="fulltext">9162114</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Evolutionary re-organisation of a large operon in adzuki bean chloroplast DNA caused by inverted repeat movement</p>
            </title>
            <aug>
               <au>
                  <snm>Perry</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Brennan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Kavanagh</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>2002</pubdate>
            <volume>9</volume>
            <fpage>157</fpage>
            <lpage>162</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/dnares/9.5.157</pubid>
                  <pubid idtype="pmpid" link="fulltext">12465715</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Dispersed repeats and structural reorganization in subclover chloroplast DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Milligan</snm>
                  <fnm>BG</fnm>
               </au>
               <au>
                  <snm>Hampton</snm>
                  <fnm>JN</fnm>
               </au>
               <au>
                  <snm>Palmer</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1989</pubdate>
            <volume>6</volume>
            <fpage>355</fpage>
            <lpage>368</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2615639</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Chloroplast DNA variation in the Geraniaceae - a preliminary report</p>
            </title>
            <aug>
               <au>
                  <snm>Price</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Calie</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Downie</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Logsdon</snm>
                  <fnm>JJM</fnm>
               </au>
               <au>
                  <snm>Palmer</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Proc Int Geraniaceae Symp</source>
            <publisher>Monvilla, South Africa, University of Stellenbosch</publisher>
            <editor>Vorster P</editor>
            <pubdate>1990</pubdate>
            <fpage>235</fpage>
            <lpage>244</lpage>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Rare genomic changes as a tool for phylogenetics</p>
            </title>
            <aug>
               <au>
                  <snm>Rokas</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Holland</snm>
                  <fnm>PW</fnm>
               </au>
            </aug>
            <source>Trends Ecol Evol</source>
            <pubdate>2000</pubdate>
            <volume>15</volume>
            <fpage>454</fpage>
            <lpage>459</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0169-5347(00)01967-4</pubid>
                  <pubid idtype="pmpid">11050348</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Fast phylogenetic methods for the analysis of genome rearrangement data: an empirical study</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>LSJRKMBMRLAWT</fnm>
               </au>
            </aug>
            <source>Pac Symp Biocomput</source>
            <publisher>River Edge, World Scientific Pub.</publisher>
            <pubdate>2002</pubdate>
            <fpage>524</fpage>
            <lpage>535</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11928504</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Bayesian Analysis to Describe Genomic Evolution by Rearrangement (BADGER), version 1.01 beta</p>
            </title>
            <aug>
               <au>
                  <snm>Simon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Larget</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Finding an Optimal Inversion Median: Experimental Results</p>
            </title>
            <aug>
               <au>
                  <snm>Siepel</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Moret</snm>
                  <fnm>BME</fnm>
               </au>
            </aug>
            <source>Lecture Notes in Computer Science</source>
            <publisher>, Springer-Verlag</publisher>
            <pubdate>2001</pubdate>
            <volume>2149</volume>
            <fpage>189</fpage>
            <lpage>203</lpage>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Reconstructing large regions of an ancestral mammalian genome in silico</p>
            </title>
            <aug>
               <au>
                  <snm>Blanchette</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>2412</fpage>
            <lpage>2423</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">534665</pubid>
                  <pubid idtype="pmpid" link="fulltext">15574820</pubid>
                  <pubid idtype="doi">10.1101/gr.2800104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Exact-IEBP: A New Technique for Estimating Evolutionary Distances between Whole Genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>LS</fnm>
               </au>
            </aug>
            <source>Lecture Notes in Computer Science</source>
            <publisher>, Springer-Verlag</publisher>
            <pubdate>2001</pubdate>
            <volume>2149</volume>
            <fpage>175</fpage>
            <lpage>188</lpage>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
               <au>
                  <snm>Sharp</snm>
                  <fnm>PM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1987</pubdate>
            <volume>84</volume>
            <fpage>9054</fpage>
            <lpage>9058</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">299690</pubid>
                  <pubid idtype="pmpid" link="fulltext">3480529</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Recombination of Chlamydomonas Chloroplast DNA Occurs More Frequently in the Large Inverted Repeat Sequence Than in the Single-Copy Regions</p>
            </title>
            <aug>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Theoretical and Applied Genetics</source>
            <pubdate>1990</pubdate>
            <volume>79</volume>
            <fpage>17</fpage>
            <lpage>27</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF00223781</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Ebb and flow of the chloroplast inverted repeat</p>
            </title>
            <aug>
               <au>
                  <snm>Goulding</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Olmstead</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Morden</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1996</pubdate>
            <volume>252</volume>
            <fpage>195</fpage>
            <lpage>206</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF02173220</pubid>
                  <pubid idtype="pmpid">8804393</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Recombination between homologies in direct and inverse orientation in the chromosome of Salmonella: intervals which are nonpermissive for inversion formation</p>
            </title>
            <aug>
               <au>
                  <snm>Segall</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1989</pubdate>
            <volume>122</volume>
            <fpage>737</fpage>
            <lpage>747</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2547692</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Coexpression of neighboring genes in the genome of Arabidopsis thaliana</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Bowles</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>1060</fpage>
            <lpage>1067</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419784</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173112</pubid>
                  <pubid idtype="doi">10.1101/gr.2131104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>A global analysis of Caenorhabditis elegans operons</p>
            </title>
            <aug>
               <au>
                  <snm>Blumenthal</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Link</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Guffanti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lawson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Thierry-Mieg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Thierry-Mieg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chiu</snm>
                  <fnm>WL</fnm>
               </au>
               <au>
                  <snm>Duke</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kiraly</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>417</volume>
            <fpage>851</fpage>
            <lpage>854</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature00831</pubid>
                  <pubid idtype="pmpid" link="fulltext">12075352</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Birth of a metabolic gene cluster in yeast by adaptive gene relocation</p>
            </title>
            <aug>
               <au>
                  <snm>Wong</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <fpage>777</fpage>
            <lpage>782</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1584</pubid>
                  <pubid idtype="pmpid" link="fulltext">15951822</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Processing and degradation of chloroplast mRNA.</p>
            </title>
            <aug>
               <au>
                  <snm>Monde</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Stern</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Biochimie</source>
            <pubdate>2000</pubdate>
            <volume>82</volume>
            <fpage>573</fpage>
            <lpage>582</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0300-9084(00)00606-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">10946108</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Proteins encoded by a complex chloroplast transcription unit are each translated from both monocistronic and polycistronic RNAs.</p>
            </title>
            <aug>
               <au>
                  <snm>Barkan</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>1988</pubdate>
            <fpage>2637</fpage>
            <lpage>2644</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">457051</pubid>
                  <pubid idtype="pmpid">2460341</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Evolution's cauldron: duplication, deletion, and rearrangement in the mouse and human genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Baertsch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>11484</fpage>
            <lpage>11489</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">208784</pubid>
                  <pubid idtype="pmpid" link="fulltext">14500911</pubid>
                  <pubid idtype="doi">10.1073/pnas.1932072100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>The Chlamydomonas sourcebook: A comprehensivve guide to biology and laboratory use</p>
            </title>
            <aug>
               <au>
                  <snm>Harris</snm>
                  <fnm>EH</fnm>
               </au>
            </aug>
            <publisher>San Diego, Academic Press</publisher>
            <pubdate>1989</pubdate>
         </bibl>
         <bibl id="B61">
            <title>
               <p>5' to 3' exoribonucleolytic activity is a normal component of chloroplast mRNA decay pathways</p>
            </title>
            <aug>
               <au>
                  <snm>Drager</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Higgs</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Kindle</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Stern</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>1999</pubdate>
            <volume>19</volume>
            <fpage>521</fpage>
            <lpage>531</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-313X.1999.00546.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10504574</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Genomic sequencing</p>
            </title>
            <aug>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1984</pubdate>
            <volume>81</volume>
            <fpage>1991</fpage>
            <lpage>1995</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">345422</pubid>
                  <pubid idtype="pmpid" link="fulltext">6326095</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
