<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2091-2-13</ui>
   <ji>1471-2091</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>The intein of the Thermoplasma A-ATPase A subunit: Structure, evolution and expression in <it>E. coli</it></p>
         </title>
         <aug>
            <au id="A1">
               <snm>Senejani</snm>
               <mi>G</mi>
               <fnm>Alireza</fnm>
               <insr iid="I1"/>
               <email>ali@carrot.mcb.uconn.edu</email>
            </au>
            <au id="A2">
               <snm>Hilario</snm>
               <fnm>Elena</fnm>
               <insr iid="I2"/>
               <email>elena@carrot.mcb.uconn.edu</email>
            </au>
            <au id="A3" ca="yes">
               <snm>Gogarten</snm>
               <fnm>J Peter</fnm>
               <insr iid="I3"/>
               <email>gogarten@uconn.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269-3044, USA</p>
            </ins>
            <ins id="I2">
               <p>Current address: HortResearch, 120 Mt Albert Road, Private Bag 92, 169 Mt Albert, Auckland, New Zealand</p>
            </ins>
            <ins id="I3">
               <p>Department of Molecular and Cell Biology, University of Connecticut, 75 North Eagleville Rd. Storrs, CT 06269-3044, USA</p>
            </ins>
         </insg>
         <source>BMC Biochemistry</source>
         <issn>1471-2091</issn>
         <pubdate>2001</pubdate>
         <volume>2</volume>
         <issue>1</issue>
         <fpage>13</fpage>
         <url>http://www.biomedcentral.com/1471-2091/2/13</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/1471-2091-2-13</pubid>
               <pubid idtype="pmpid">11722801</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>17</day>
               <month>10</month>
               <year>2001</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>14</day>
               <month>11</month>
               <year>2001</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>14</day>
               <month>11</month>
               <year>2001</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2001</year>
         <collab>Senejani et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Inteins are selfish genetic elements that excise themselves from the host protein during post translational processing, and religate the host protein with a peptide bond. In addition to this splicing activity, most reported inteins also contain an endonuclease domain that is important in intein propagation.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>The gene encoding the <it>Thermoplasma acidophilum</it> A-ATPase catalytic subunit A is the only one in the entire <it>T. acidophilum</it> genome that has been identified to contain an intein. This intein is inserted in the same position as the inteins found in the ATPase A-subunits encoding gene in <it>Pyrococcus abyssi, P. furiosus and P. horikoshii</it> and is found 20 amino acids upstream of the intein in the homologous <it>vma-1</it> gene in <it>Saccharomyces cerevisiae.</it> In contrast to the other inteins in catalytic ATPase subunits, the <it>T. acidophilum</it> intein does not contain an endonuclease domain.</p>
               <p><it>T. acidophilum</it> has different codon usage frequencies as compared to <it>Escherichia coli.</it> Initially, the low abundance of rare tRNAs prevented expression of the <it>T. acidophilum</it> A-ATPase A subunit in <it>E. coli.</it> Using a strain of <it>E. coli</it> that expresses additional tRNAs for rare codons, the <it>T. acidophilum</it> A-ATPase A subunit was successfully expressed in <it>E. coli.</it></p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>Despite differences in pH and temperature between the <it>E. coli</it> and the <it>T. acidophilum</it> cytoplasms, the <it>T. acidophilum</it> intein retains efficient self-splicing activity when expressed in <it>E. coli.</it> The small intein in the <it>Thermoplasma</it> A-ATPase is closely related to the endonuclease containing intein in the <it>Pyrococcus</it> A-ATPase. Phylogenetic analyses suggest that this intein was horizontally transferred between <it>Pyrococcus</it> and <it>Thermoplasma,</it> and that the small intein has persisted in <it>Thermoplasma</it> apparently without homing.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="old_arx_id">1472-2091-2-13</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>During the last decade several genes have been found to be interrupted by selfish genetic elements translated in frame with their host proteins. During post translational processing these elements excise themselves out of the host protein (see <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> for recent reviews). The sequences removed during splicing are called inteins (short for internal protein); the portions of the host protein are termed exteins (external protein) <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>.</p>
         <p>Inteins facilitate their excision out of the host protein without the help of any known host specific activity. This phenomenon, called protein splicing, was first discovered about a decade ago in the <it>Saccharomyces cerevisiae</it> V-ATPase catalytic subunit A <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. Intein excision depends on the splicing domain of the intein and the first amino acid residue of the C-extein <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. The inteins known to date are between 134 and 608 amino acids long, and they have been reported from all three domains of life: eukaryotes, eubacteria and archaea. Pietrokovski's webpage on inteins <url>http://blocks.fhcrc.org/~pietro/inteins/</url> currently lists more than 100 inteins in 34 different types of proteins <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. The host proteins are diverse in function, including metabolic enzymes, DNA and RNA polymerases, gyrases, proteases, ribonucleotide reductases, and vacuolar and archaeal type ATPases. Common features suggested for these proteins are their expression during DNA replication <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and their low substitution rate during evolution <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
         <p>Most reported inteins are composed of two domains: one is responsible for protein splicing, and the other has endonuclease activity <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. The function of the endonuclease is to spread the intein to intein-free homologs of the host protein. During this process, called homing, the gene encoding the intein-free homolog is cleaved by the endonuclease at or close to the intein integration site. During the repair of the cleaved gene, the intein is copied to the previously intein-free homolog. Gimble and Thorner <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> demonstrated intein homing in <it>Saccharomyces cerevisiae</it> using engineered V-ATPase genes from which the intein encoding portion had been previously removed. However, some inteins lack the endonuclease domain. Inteins without this domain perform autocatalytic splicing <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. Homing endonucleases and the process of homing have been more intensively studied in self splicing introns <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, and the process is assumed to be similar for inteins.</p>
         <p><it>Thermoplasma acidophilum</it> is among sixteen archaea for which inteins have been reported to date (Intein database <url>http://www.neb.com/inteins/int_reg.html</url><abbrgrp><abbr bid="B18">18</abbr></abbrgrp>). Members of the genus <it>Thermoplasma</it> lack cell walls and possess a cytoskeleton. They live in hot and acidic environments, and are often found adhering to sulfur particles <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. <it>T. acidophilum</it> grows optimally at 59&#176;C and at an external pH between 1&#8211;2 <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. A cytoplasmic pH of 5.5 has been measured indirectly <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
         <p>Proton pumping ATPases/ATPsynthases are found in all groups of present day organisms <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. The typical archaeal ATPsynthase is homologous to the eukaryotic vacuolar ATPase. Because of the high degree of sequence similarity the archaeal ATP synthase (A-type ATPase) is sometimes labeled as vacuolar or V-type ATPase. The archaeal and the vacuolar ATPase are both homologous to the bacterial F-ATPases, but the level of sequence similarity with the F-ATPases is much lower than between the V- and the A-ATPases. To date seven species have been identified to harbor inteins in their ATPase catalytic subunits. The first intein was discovered in the <it>vma</it>-1 gene of <it>Saccharomyces</it><abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The yeast <it>Candida tropicalis</it><abbrgrp><abbr bid="B23">23</abbr></abbrgrp> possesses an intein in the same location. Inteins are also present in the A subunit of the A-type ATPases of <it>Thermoplasma acidophilum, T. volcanium, Pyrococcus abysii, P. horikoshii</it> and <it>P. furiosus.</it></p>
         <p>Here we present data on the cloning and expression of the <it>T. acidophilum</it> A-ATPase A subunit in <it>E. coli,</it> and we discuss implications for the location, propagation and distribution of inteins among organisms.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Sequence analysis of the <it>T. acidophilum</it> intein</p>
            </st>
            <p>The <it>T. acidophilum</it> intein was discovered while sequencing the catalytic subunit of the archaeal ATPase/ATPsynthase from <it>T. acidophilum</it> for systematic purposes <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. More recently, the complete genome sequences of <it>T. acidophilum</it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp> and <it>T. volcanii</it><abbrgrp><abbr bid="B26">26</abbr></abbrgrp> have been reported. In both instances, the catalytic subunit of the ATPsynthase is the only gene in the entire genome for which an intein was reported. Using PSI-BLAST <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> with different inteins as seeds, divergent inteins with less than ten percent sequence identity as compared to the query sequence are recovered (data not shown); however, using this approach we did not discover any additional intein or homing endonuclease encoding genes in <it>T. acidophilum.</it></p>
            <p>Multiple sequence alignments of diverse intein sequences identified eight motifs composed of moderately conserved residues <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B28">28</abbr></abbrgrp>. The A-ATPase A subunit of the <it>Thermoplasma</it> and <it>Pyrococcus</it> intein multiple sequence alignment (with manual modification) is shown in figure <figr fid="F1">1</figr>. The <it>T. acidophilum</it> intein (173 amino acids long) is among the shortest inteins known, and the alignment with other inteins reveals the absence of sequences homologous to the typical endonuclease motifs. Only the motifs characteristic for the self splicing domain are present in the <it>Thermoplasma</it> intein (see figure <figr fid="F1">1</figr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Alignment of archaeal ATPase A-subunit intein sequences.</p>
               </caption>
               <text>
                  <p>Alignment of archaeal ATPase A-subunit intein sequences. The large gap in the <it>Thermoplasma</it> sequences indicates that the <it>Thermoplasma acidophilum (Tac)</it> and <it>Thermoplasma volcanium (Tvo)</it> inteins do not contain an endonuclease domain and only consist only of the self splicing domain, while the <it>Pyrococcus abyssi (Pab), P. furiosus (Pfu)</it> and <it>P. horikoshii (Pho)</it> A-ATPase A-subunits inteins are bifunctional. Regions corresponding to conserved blocks <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> are indicated in bold and labeled A-H. Blocks A, B, F and G are part of the splicing domain, whereas blocks C, D, E, H are typical for endonucleases of the LAGDIDAG type <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
               </text>
               <graphic file="1471-2091-2-13-1"/>
            </fig>
            <p>The significance of the match between the <it>T. acidophilum</it> and the three pyrococcal ATPase inteins was assessed using PRSS at <url>http://fasta.bioch.virginia.edu/fasta/prss.htm</url><abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. The P-value for this match, i.e. the probability of obtaining a match of this quality by chance alone, was calculated to be below 10<sup>-10</sup>. This indicates that not only the exteins, but also the inteins themselves are recognizably homologous. In contrast, a sequence alignment of all known inteins shows intein sequences to be much more divergent (not shown). For example, the P-value for the comparison between the <it>T. acidophilum</it> and the <it>Saccharomyces cerevisiae</it> intein was 0.28, i.e. no significant similarity between the yeast and the <it>Thermoplasma</it> inteins was detectable using pairwise alignments only.</p>
         </sec>
         <sec>
            <st>
               <p>Location of the intein within the host protein</p>
            </st>
            <p>The vacuolar and the archaeal ATPases are homologous to the bacterial and organellar F-type ATPases. The structure of the bovine mitochondrial F<sub>1</sub>-ATPase has been determined by X-ray crystallography <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. The intein insertion points in the yeasts' V-ATPase and the <it>Thermoplasma</it> and <it>Pyrococcus</it> A-ATPases correspond to the catalytic site where the ATP binds and is hydrolyzed during the catalytic cycle. Figure <figr fid="F2">2</figr> shows that the inteins are located in the regions of these very conserved proteins <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> that have the lowest substitution rates.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Divergence along the archaeal and vacuolar ATPase A subunits.</p>
               </caption>
               <text>
                  <p>Divergence along the archaeal and vacuolar ATPase A subunits. The diagram depicts the number of substitutions per site calculated for a sliding window of 10 amino acid residues along an alignment of 62 vacuolar and archaeal ATPase catalytic subunits. The ordinate gives the number of substitutions observed in each position in the alignment. Arrows indicate the locations of the inteins. The first (A) gives the location of the inteins in <it>Thermoplasma</it> and <it>Pyrococcus,</it> the second (B) gives the location in the yeasts' vacuolar ATPases.</p>
               </text>
               <graphic file="1471-2091-2-13-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Comparison of A-ATPase catalytic subunit and 16S rRNA phylogeny</p>
            </st>
            <p>The phylogenies of archaeal ATPase catalytic subunits and small subunit ribosomal RNAs are shown in Figure <figr fid="F3">3</figr>. Both phylogenies were calculated for a similar set of species. While the two phylogenies show some significant differences, neither of them groups the molecules from <it>Thermoplasma</it> with the <it>Pyrococcus</it> homologs. In both phylogenies several other Archaea, i.e., <it>M. jannaschii, M. thermolithotrophicus, Thermococcus sp., Halobacterium sp., Methanosarcina</it> and <it>Methanobacterium thermoautotrophicus</it> branch off between the two groups that carry inteins in their ATPase catalytic subunits. All of these intervening archaeal ATPases do not carry an intein.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Comparison of vacuolar/archaeal ATPase A subunit and small subunit ribosomal RNA phylogenies.</p>
               </caption>
               <text>
                  <p>Comparison of vacuolar/archaeal ATPase A subunit and small subunit ribosomal RNA phylogenies. The phylogeny depicted in (A) was calculated from the extein portions of the genes only; (B) was calculated from small subunit ribosomal RNAs. The trees were calculated as unrooted, but are depicted as rooted using the eukaryotes as an outgroup to the Archaea (A), or the Crenarchaeota as an outgroup to the Euryarchaeota (B). Numbers give bootstrap values calculated from 100 bootstrapped samples analyzed using parsimony analysis. Values are only given for groups with more than 50% support. Asterisks denote branches that in the maximum likelihood evaluation were not at least 3 or 2 times larger, respectively, than their estimated standard error. All other branches were at least three times larger than their standard deviation. See experimental procedures for further details on the phylogenetic reconstruction methods used. Species whose A-subunit contains an intein are indicated by arrows.</p>
               </text>
               <graphic file="1471-2091-2-13-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Codon usage comparison</p>
            </st>
            <p>Codon usage varies among organisms. The production of tRNAs corresponds to the frequencies with which the different codons are present in their protein coding genes. The exact causes for tRNA regulation and codon usage are not completely understood; however, expression of foreign genes in an organism is often prevented by a different codon usage of the foreign gene <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Many Archaea have codon usage frequencies and tRNA compositions different from <it>E. coli.</it> The <it>T. acidophilum</it> A-ATPase A subunit has 763 codons. Codon AUA (Ile) is the most frequent (39/1000). In <it>E. coli</it> the same codon (AUA) is a rare codon present at a frequency of only 5.5 per 1000. The other two major differences in codon usage between the <it>T. acidophilum</it> A-ATPase A subunit and <it>E. coli</it> are AGG (Arg) and AGA (Arg) which are present in <it>T. acidophilum</it> A-ATPase A subunit at frequencies of 41 and 16 per 1000 respectively. In <it>E. coli</it> however, these codons are considered rare and occur with frequencies of 1.7 and 2.8 per 1000 respectively.</p>
         </sec>
         <sec>
            <st>
               <p>Expression and intein splicing of <it>T. acidophilum</it> A-ATPase A subunit</p>
            </st>
            <p>The gene encoding the <it>T. acidophilum</it> A-ATPase A subunit was cloned into the expression vector pET-11a (Stratagene) and transformed into <it>E. coli</it> Bl21(DE3) and <it>E. coli</it> Bl21-CodonPlus(DE3)-RIL strain for protein expression. When <it>E. coli</it> Bl21(DE3), a strain that did not express additional rare tRNAs, was transformed with the cloned <it>T. acidophilum</it> ATPase A subunit, no additional protein bands were observed in extracts of induced cells (not shown). However, the <it>E. coli</it> Bl21-CodonPlus (DE3)-RIL strain transformed with the same plasmid expresses two additional proteins of 20 and 65 kDalton upon induction (Fig. <figr fid="F4">4</figr>). No additional band at 85 kDa, indicative of an unprocessed intein, was visible after induction. This demonstrates that autocatalytic splicing occurred efficiently in <it>E. coli.</it> Efficient autocatalytic splicing was also observed when the <it>E. coli</it> were grown and induced at 42&#176;C, or when the <it>E. coli</it> were induced at 16&#176;C for 16 hours (Fig. <figr fid="F4">4A</figr>). During preparation for SDS gel electrophoreses the samples are heated to 72&#176;C in the presence of DTT. With respect to temperature these conditions are more similar to the conditions in the <it>T. acidophilum</it> cytoplasm than the conditions in <it>E. coli.</it> Intein excision might occur only during this high temperature treatment. Therefore, proteins from induced <it>E. coli</it> were also separated under non-denaturing conditions with or without addition of DTT. The induced bands were excised from the non-denaturating gel and separated using denaturing SDS gel electrophoresis (Fig. <figr fid="F5">5</figr>). Both of the slower migrating bands visible after induction (A and B in Fig. <figr fid="F5">5</figr>), revealed only one major band corresponding to the spliced and religated A-subunit upon separation in an SDS denaturing gel. Presumably the slower migrating band was a dimer or higher aggregate of the A-subunit monomer. In none of these experiments did we find any indications for unprocessed or incompletely spliced inteins.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>SDS polyacrylamide gel electrophoresis of proteins from induced <it>E. coli.</it></p>
               </caption>
               <text>
                  <p>SDS polyacrylamide gel electrophoresis of proteins from induced <it>E. coli.</it> Panel A, Lanes 1 and 3: <it>E. coli</it> Bl21-CodonPlus(DE3)-RIL strain transformed with empty pET-11a vector (negative control); lanes 2, 4, 5 and 6: <it>E. coli</it> Bl21-CodonPlus(DE3)-RIL strain transformed with the <it>Thermoplasma</it> A-ATPase cloned into pET-11a; lane 1-4: cells were induced for 4 hours; lane 5: cells were induced at 16&#176;C for 16 hours; lane 6: cells were induced at 42&#176;C for 2 hours; Sup.: supernatant. Panel B depicts the splicing process schematically. The unprocessed <it>T. acidophilum</it> A-ATPase A subunit has a calculated size of 85 kDalton. The expected molecular weight of the intein is 20 kDalton, and the religated host protein weighs approximately 65 kDalton.</p>
               </text>
               <graphic file="1471-2091-2-13-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Non-denaturing polyacrylamide gel electrophoresis of proteins from induced <it>E. coli.</it></p>
               </caption>
               <text>
                  <p>Non-denaturing polyacrylamide gel electrophoresis of proteins from induced <it>E. coli.</it> Lanes 1 and 2: non-denaturing polyacrylamide gel electrophoresis of proteins from the supernatant of the cell lysate. Arrows point towards bands that are present in induced <it>E. coli</it> transformed with the <it>Thermoplasma</it> A-ATPase, but absent in the negative control. The sample buffer did not contain DTT or SDS. Lanes 3, 4, 5 and 6 are separations in denaturing SDS polyacrylamide gels. Lane 3: electro-eluted protein from lane 2 (arrow A); lane 4: electro-eluted protein from lane 2 (arrow B); lanes 1 and 5: <it>E. coli</it> Bl21-CodonPlus(DE3)-RIL strain transformed with empty pET-11a vector (negative control); lanes 2 and 6: <it>E. coli</it> Bl21-CodonPlus(DE3)-RIL strain transformed with the <it>Thermoplasma</it> A-ATPase cloned into pET-11a and induced for 8 hours at 37&#176;C.</p>
               </text>
               <graphic file="1471-2091-2-13-5"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The <it>Daucus carota</it><abbrgrp><abbr bid="B32">32</abbr></abbrgrp> and <it>Saccharomyces cerevisiae</it> (Alireza Senejani unpublished) catalytic V-ATPase subunits form inclusion bodies when expressed in <it>E. coli.</it> In contrast, the <it>T. acidophilum</it> subunit is expressed as a soluble protein in the <it>E. coli</it> cytoplasm. Despite the chemical and physical differences between the <it>E. coli</it> cytoplasm and the environment in which the <it>T. acidophilum</it> intein is functioning <it>in vivo,</it> we found no indications that self-splicing of the intein was inefficient in <it>E. coli.</it> Even when <it>E. coli</it> was grown at lower temperatures, processing of the intein and religation of the exteins appeared 100% efficient. Complete processing was also observed when the <it>E. coli</it> proteins were separated in non-denaturing and non-reducing gels. Autocatalytic splicing of the <it>T. acidophilum</it> A-ATPase catalytic subunit occurs efficiently in the <it>E. coli</it> cytoplasm. The <it>T. acidophilum</it> A-ATPase intein appears to splice out efficiently at very different pHs (7.2 versus 5.5 <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B33">33</abbr></abbrgrp>) and temperatures (16 to 37&#176; versus 55&#176;C <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>).</p>
         <p>The <it>T. acidophilum</it> intein shows significant sequence similarity to the inteins found in the A-ATPase catalytic subunits of <it>Pyrococcus.</it> Moreover, these inteins are inserted into the same highly conserved sequence in the ATP binding site. This indicates that the inteins in <it>Thermoplasma</it> and <it>Pyrococcus</it> are homologous in the evolutionary sense, <it>i.e.,</it> they are derived from a common ancestral gene. However, <it>Pyrococcus</it> and <it>Thermoplasma</it> are not considered closely related Archaea. Based on their 16S rRNA they are considered distantly related Euryarchaeotes (See Fig. <figr fid="F3">3b</figr>).</p>
         <p>One explanation for the discrepancy between phylogenetic classification and distribution of the A-ATPase intein is horizontal gene transfer: the intein was not present in the last common ancestor of <it>Thermoplasma</it> and <it>Pyrococcus,</it> rather the intein invaded one of the lineages after their split and was more recently horizontally transferred to the other lineage. The horizontal transfer scenario is more parsimonious than the assumption of presence in the shared common ancestor because the latter requires several independently occurring losses of the intein together with its long-term persistence in the <it>Pyrococcus</it> and <it>Thermoplasma</it> lineages (<it>cf.</it> Fig. <figr fid="F3">3b</figr>). Horizontal transfer of whole genes and operons between divergent species is a frequent event <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. Even house keeping genes are transferred between divergent species <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>.</p>
         <p>Two possibilities exist for the horizontal transfer scenario. Either the whole A-ATPase catalytic subunits was transferred, or the intein alone spread as a selfish genetic element. To discriminate between these two scenarios, we constructed the phylogeny of the host protein (Fig. <figr fid="F3">3a</figr>). The resulting phylogeny is in reasonable agreement with the ribosomal rRNA phylogeny, the main exception being the placement of <it>Desulfurococcus.</it> According to its 16S rRNA this organisms is clearly classified as a Crenarcheote; however, its ATPase catalytic subunit groups with <it>Thermococcus</it> sp. The finding that the host protein itself does not group the genus <it>Thermoplasma</it> with the <it>Pyrococci</it> suggests that the intein alone was transferred between <it>Thermoplasma</it> and <it>Pyrococcus,</it> and that the sequence similarity between the <it>Thermoplasma</it> and <it>Pyrococcus</it> catalytic subunits was sufficient to allow homing into the same insertion site.</p>
         <p>The dispersion of the intein as a selfish genetic element is consistent with the work of Goddard and Burt <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> on the persistence of an intron with homing endonuclease in yeast mitochondria. These authors studied the distribution of empty target sites, and introns with and without a functioning endonuclease gene among different yeasts. They concluded that the long term persistence of the intron depends on a cycle that begins with invasion of the empty target site by an intron with the help of a homing endonuclease encoded by an open reading frame within the intron. However, once the intron containing allele is fixed in the population, the endonuclease, which itself had been the reason for the rapid spread of the intron in the population, is no longer under selection, the endonuclease becomes non-functional and is lost, resulting in an intron without homing endonuclease activity. However, once the endonuclease is lost the intron containing allele becomes more likely to be replaced with an allele that has lost the intron altogether and the cycle of invasion and successive loss begins anew.</p>
         <p>The process of intein homing is likely to depend on a similar cycle; however, the time intervals for loss of the intein are likely to be longer than the loss of the intron. The A-ATPase and the V-ATPase inteins are located in the most conserved part of the host gene. Any deletion of the intein from the gene itself needs to be precise, because any alteration of the amino acid sequence in the catalytic site is likely to result in a non-functioning enzyme. Comparative sequence analysis (Fig. <figr fid="F1">1</figr>) shows that the <it>T. acidophilum</it> and <it>T. volcanium</it> inteins do not contain an endonuclease domain. Our search of the genomes of these Archaea did not identify any endonucleases that might function in homing. While the failure to identify a homing endonuclease is not proof of absence, the presence of a homing endonuclease acting in trans would be unprecedented and has to be regarded as improbable. The cyclic reinvasion model for long term persistence of a selfish genetic element through homing <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> suggests that the small <it>Thermoplasma</it> intein evolved through reduction from a large endonuclease containing intein similar to the one found in the pyrococcal A-ATPase. Apparently, the small intein has been persisting in the <it>Thermoplasma</it> A-ATPase since the split between <it>T. acidophilum</it> and <it>T. volcanium</it> without the help of a homing endonuclease.</p>
         <p>The cyclic reinvasion model also explains why the insertion site is in a region of very low substitution rates: The high degree of sequence similarity surrounding the integration point facilitates the intein transfer between different populations and species using the homing endonuclease. A more variable sequence surrounding the integration point would restrict homing to members of the same species, and would thus lower the chances for long term survival of the intein.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The small intein in the <it>Thermoplasma</it> A-ATPase is closely related to the endonuclease containing intein in the <it>Pyrococcus</it> A-ATPase. Phylogenies constructed with the host protein (A-ATPase catalytic subunit) and with 16S rRNA do not group these two organisms together, suggesting that the A-ATPase intein spread through horizontal gene transfer. The small intein has persisted in <it>Thermoplasma</it> apparently without homing. The <it>T. acidophilum</it> intein retains efficient self-splicing activity when expressed in <it>E. coli.</it> This activity does not depend on the physicochemical conditions in the <it>T. acidophilum</it> cytoplasm.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and Methods</p>
         </st>
         <p><it>Thermoplasma acidophilum</it> cultures were obtained from Denis Searcy at the University of Massachusetts in Amherst. Organisms were grown and their DNA isolated as described <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. DNA manipulation, sequencing and cloning followed standard procedures as described in Sambrook <it>et al</it>. <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> and Ausubel <it>et al</it>. <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>.</p>
         <sec>
            <st>
               <p>Plasmid constructs</p>
            </st>
            <p>The <it>T. acidophilum</it> A-ATPase A subunit encoding gene was amplified from genomic DNA using primers Ta-4 (ATGGATCCTTCTCAACGAAGAGCAGTG) and Ta-5 (GAGGTGAACATATGGGAAAGATAATCAG). These primers match the coding sequence of the <it>Thermoplasma acidophilum</it> A-ATPase A subunit (gene identification number 9369337) and introduce restriction sites useful in subcloning. Initially, the PCR product was cloned into pCR<sup>R</sup> 2.1 (TA cloning Vector, <aff id="AFF1">Invitrogen</aff>). After digestion with <it>Nde</it>I and <it>BamH</it>I the coding sequence was subcloned to the vector pET-11a (Stratagene) for gene expression experiments.</p>
         </sec>
         <sec>
            <st>
               <p>Protein expression and determination</p>
            </st>
            <p><it>E. coli</it> Bl21(DE3) and Bl21-CodonPlus(DE3)-RIL strain (Stratagene) were used for protein expression. If not stated otherwise, transformed <it>E. coli</it> were grown overnight in a culture wheel at 37&#176;C in Luria-Bertani broth with ampicillin (100 &#956;g/mL). One mL of the broth was used to inoculate 10 mL of LB broth containing ampicillin and incubated at 37&#176;C (200 rpm). After 2 hours, isopropyl thio-&#946;-D-galactoside (IPTG) was added to a final concentration of 1 mM to induce gene expression and the cultures. Cells were harvested 4 hours after induction by centrifugation at 7000 rpm for 10 min and washed with TEP buffer (0.1 M Tris-HCl, pH 7.4, 0.01 M EDTA and 1 mM phenyl methyl sulfonyl fluoride). Cells were resuspended in 500 &#956;L of TEP buffer and sonicated with a Braun-Sonic U with micro-tip for 4 minutes (0.5 duty cycle; power output approximately 120). The disrupted cells were centrifuged at 8000 rpm and the pellet was resuspended in 500 &#956;L of fresh TEP buffer. Both the supernatant and the pellet were diluted by adding 6X sample buffer (10% SDS, 1.2 mg/mL bromphenol blue, 0.6 M DTT, 30% glycerol, .1 M Tris/HCl pH 6.8) and heated to 80&#176;C for 15 minutes to denature the protein. Five to fifty microliters of this preparation were run in a 10% denaturing Tris/Tricine SDS polyacrylamide gel electrophoresis system as described by Schagger and von Jagow <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>.</p>
            <p>Non-denatured protein was prepared with the same procedure except that SDS was omitted from the buffers and that the samples were not heated. Proteins were electroeluted from the non-denaturing gel as describe here <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Gels were fixed with fixing solution (50% methanol, 10% acetic acid) for 15 minutes, followed by staining in Coomassie staining solution (20% acetic acid, 0.025% Coomassie blue G-250) for one hour, followed by destaining in destaining solution (5% methanol, 10% acetic acid) <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>DNA sequencing and cloning</p>
            </st>
            <p>DNA was sequenced using the ABI PRISM BigDye Terminator Cycle Sequencing (PE Applied Biosystems). Sequencing gels were ran and processed in the Biotech Center (University of Connecticut)</p>
         </sec>
         <sec>
            <st>
               <p>Codon usage</p>
            </st>
            <p>The program Codon Usage Tabulated from GenBank (CUTG) at <url>http://www.kazusa.or.jp/codon/</url><abbrgrp><abbr bid="B43">43</abbr></abbrgrp> was used to calculate the codon usage of individual genes and genomes.</p>
         </sec>
         <sec>
            <st>
               <p>Sequence retrieval, alignment and phylogenetic reconstruction</p>
            </st>
            <p>The <it>Pyrococcus furiosus</it> A-ATPase sequence was retrieved via blastp from the unfinished genome using the web page <url>http://combdna.umbi.umd.edu/bags.html</url>. The aligned small subunit ribosomal RNA sequences were retrieved from the Ribosomal Database Project II <url>http://rdp.cme.msu.edu</url><abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Francine Perler (New England Biolabs) the curator of the intein database <url>http://www.neb.com/inteins/int_reg.html</url> kindly provided the sequences of all known inteins. All the other sequences were retrieved from the NCBI databank.</p>
            <p>Sequences were aligned using CLUSTAL X 1.8 <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. The number of substitutions per site in the aligned data set were calculated using a JAVA program written by Olga Zhaxybayeva (University of Connecticut). This program calculates and plots the number of substitutions in a sliding window of 10 aligned positions. The window is moved through the alignment one position at a time. For positions where less than 50% of the sequences have a gap, the average number of substitutions was calculated considering the gap as an additional character. Positions with gaps in more than 50% of the aligned sequences were skipped.</p>
            <p>Phylogenies were reconstructed from amino acid sequences aligned using CLUSTAL X 1.8 <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. The topologies of the depicted trees were calculated using neighbor joining as implemented in CLUSTAL X with correction for multiple substitutions. Branch lengths and their confidence intervals were calculated with TREE-PUZZLE 5.0 <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> using the JTT or the HKY model for substitution respectively, and assuming an among site rate variation described by a gamma distribution. Bootstrap samples were analyzed using parsimony as implemented in PAUP* 4.0 beta 8 <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> treating gaps as missing data. Each bootstrapped sample was analyzed using 10 different starting trees built through random addition, tree-branch-reconnection (TBR) branch swapping, and considering gaps as missing data. The following sequences were used for the 16S rRNA phylogeny (the sequences are available under these names from the Ribosomal Database Project II <url>http://rdp.cme.msu.edu/</url> : Sul.solfa4, Sul.acalda, Ap.pernixl, Mc.thlitho, Mc.janrrnA, Mb.tautot2, Pc.furios2, Pc.abyssi, AP000001, AB016298, Tpl.acidop, Arg.fulgid p, Hf.volcani, Hb.spCh2_2, Hb.salina2, Msr.mazei5, Msr.barke2, Dco.mobili.</p>
         </sec>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The research was supported in part through the NASA Exobiology Program. The authors thank Torsten Schadtle for help with DNA sequencing, Ryan Murphy for constructing a <it>Thermoplasma acidophilum</it> genomic library, Olga Zhaxybayeva for providing computer programs, Lorraine Olendzenski for reading the manuscript and assistance with experimental procedures, and Denis Searcy for help with culturing <it>Thermoplasma acidophilum.</it> A.S. thanks the Mannheim University of Applied Sciences and the University of Connecticut for support through fellowships.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Protein-splicing intein: Genetic mobility, origin, and evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>XQ</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>2000</pubdate>
            <volume>34</volume>
            <fpage>61</fpage>
            <lpage>76</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genet.34.1.61</pubid>
                  <pubid idtype="pmpid" link="fulltext">11092822</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Protein splicing and related forms of protein autoprocessing.</p>
            </title>
            <aug>
               <au>
                  <snm>Paulus</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Annu Rev Biochem</source>
            <pubdate>2000</pubdate>
            <volume>69</volume>
            <fpage>447</fpage>
            <lpage>496</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.biochem.69.1.447</pubid>
                  <pubid idtype="pmpid" link="fulltext">10966466</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Group I introns as mobile genetic elements: facts and mechanistic speculations &#8211; a review.</p>
            </title>
            <aug>
               <au>
                  <snm>Dujon</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>1989</pubdate>
            <volume>82</volume>
            <fpage>91</fpage>
            <lpage>114</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0378-1119(89)90034-6</pubid>
                  <pubid idtype="pmpid">2555264</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Protein splicing elements: inteins and exteins &#8211; a definition of terms and recommended nomenclature.</p>
            </title>
            <aug>
               <au>
                  <snm>Perler</snm>
                  <fnm>FB</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>EO</fnm>
               </au>
               <au>
                  <snm>Dean</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Gimble</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Jack</snm>
                  <fnm>WE</fnm>
               </au>
               <au>
                  <snm>Neff</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Noren</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Thorner</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Belfort</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <fpage>1125</fpage>
            <lpage>1127</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">523631</pubid>
                  <pubid idtype="pmpid">8165123</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Protein splicing: self-splicing of genetically mobile elements at the protein level.</p>
            </title>
            <aug>
               <au>
                  <snm>Cooper</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Stevens</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1995</pubdate>
            <volume>20</volume>
            <fpage>351</fpage>
            <lpage>356</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(00)89075-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">7482702</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Molecular structure of a gene, VMA1, encoding the catalytic subunit of H(+)-translocating adenosine triphosphatase from vacuolar membranes of Saccharomyces cerevisiae.</p>
            </title>
            <aug>
               <au>
                  <snm>Hirata</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ohsumk</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nakano</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kawasaki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Anraku</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1990</pubdate>
            <volume>265</volume>
            <fpage>6726</fpage>
            <lpage>6733</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2139027</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Protein splicing converts the yeast TFP1 gene product to the 69-kD subunit of the vacuolar H(+)-adenosine triphosphatase.</p>
            </title>
            <aug>
               <au>
                  <snm>Kane</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Yamashiro</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Wolczyk</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Neff</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Goebl</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stevens</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1990</pubdate>
            <volume>250</volume>
            <fpage>651</fpage>
            <lpage>657</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2146742</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Protein splicing in the absence of an intein penultimate histidine.</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Benner</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Perler</snm>
                  <fnm>FB</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Intein spread and extinction in evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Pietrokovski</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>465</fpage>
            <lpage>472</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(01)02365-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">11485819</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Modular organization of inteins and C-terminal autocatalytic domains.</p>
            </title>
            <aug>
               <au>
                  <snm>Pietrokovski</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>1998</pubdate>
            <volume>7</volume>
            <fpage>64</fpage>
            <lpage>71</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9514260</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Compilation and analysis of intein sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Perler</snm>
                  <fnm>FB</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Adam</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>1087</fpage>
            <lpage>1093</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146560</pubid>
                  <pubid idtype="pmpid" link="fulltext">9092614</pubid>
                  <pubid idtype="doi">10.1093/nar/25.6.1087</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Statistical modeling, phylogenetic analysis and structure prediction of a protein splicing domain common to inteins and hedgehog proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Dalgaard</snm>
                  <fnm>JZ</fnm>
               </au>
               <au>
                  <snm>Moser</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Hughey</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mian</snm>
                  <fnm>IS</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>1997</pubdate>
            <volume>4</volume>
            <fpage>193</fpage>
            <lpage>214</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9228618</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Crystal structure of PI-SceI, a homing endonuclease with protein splicing activity.</p>
            </title>
            <aug>
               <au>
                  <snm>Duan</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Gimble</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Quiocho</snm>
                  <fnm>FA</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1997</pubdate>
            <volume>89</volume>
            <fpage>555</fpage>
            <lpage>564</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(00)80237-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">9160747</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Homing of a DNA endonuclease gene by meiotic gene conversion in Saccharomyces cerevisiae.</p>
            </title>
            <aug>
               <au>
                  <snm>Gimble</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Thorner</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1992</pubdate>
            <volume>357</volume>
            <fpage>301</fpage>
            <lpage>306</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/357301a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">1534148</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Protein splicing of the Saccharomyces cerevisiae VMA intein without the endonuclease motifs.</p>
            </title>
            <aug>
               <au>
                  <snm>Chong</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>MQ</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1997</pubdate>
            <volume>272</volume>
            <fpage>15587</fpage>
            <lpage>15590</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.272.25.15587</pubid>
                  <pubid idtype="pmpid" link="fulltext">9188443</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Genetic definition of a protein-splicing domain: functional mini-inteins support structure predictions and a model for intein evolution [published erratum appears in Proc Natl Acad Sci USA 1998 Jan 20;95(2):762].</p>
            </title>
            <aug>
               <au>
                  <snm>Derbyshire</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Dansereau</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Dalgaard</snm>
                  <fnm>JZ</fnm>
               </au>
               <au>
                  <snm>Belfort</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <fpage>11466</fpage>
            <lpage>11471</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">23508</pubid>
                  <pubid idtype="pmpid" link="fulltext">9326633</pubid>
                  <pubid idtype="doi">10.1073/pnas.94.21.11466</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Homing endonucleases: structure, function and evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Jurica</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Stoddard</snm>
                  <fnm>BL</fnm>
               </au>
            </aug>
            <source>Cell Mol Life Sci</source>
            <pubdate>1999</pubdate>
            <volume>55</volume>
            <fpage>1304</fpage>
            <lpage>1326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s000180050372</pubid>
                  <pubid idtype="pmpid">10487208</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>InBase, the Intein Database.</p>
            </title>
            <aug>
               <au>
                  <snm>Perler</snm>
                  <fnm>FB</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>344</fpage>
            <lpage>345</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102441</pubid>
                  <pubid idtype="pmpid" link="fulltext">10592269</pubid>
                  <pubid idtype="doi">10.1093/nar/28.1.344</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Phylogenetic affinities between eukaryotic cells and a thermophilic mycoplasma.</p>
            </title>
            <aug>
               <au>
                  <snm>Searcy</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>GR</fnm>
               </au>
            </aug>
            <source>Biosystems</source>
            <pubdate>1978</pubdate>
            <volume>10</volume>
            <fpage>19</fpage>
            <lpage>28</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0303-2647(78)90024-2</pubid>
                  <pubid idtype="pmpid">656568</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>A thermophilic, acidophilic mycoplasma isolated from a coal refuse pile.</p>
            </title>
            <aug>
               <au>
                  <snm>Darland</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Brock</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Samsonoff</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Conti</snm>
                  <fnm>SF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1970</pubdate>
            <volume>170</volume>
            <fpage>1416</fpage>
            <lpage>1418</lpage>
            <xrefbib>
               <pubid idtype="pmpid">5481857</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Thermoplasma acidophilum: intracellular pH and potassium concentration.</p>
            </title>
            <aug>
               <au>
                  <snm>Searcy</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Biochim Biophys Acta</source>
            <pubdate>1976</pubdate>
            <volume>451</volume>
            <fpage>278</fpage>
            <lpage>286</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0304-4165(76)90278-6</pubid>
                  <pubid idtype="pmpid">12804</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Evolution and isoforms of V-ATPase subunits.</p>
            </title>
            <aug>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Starke</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kibak</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fishman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Taiz</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>J Exp Biol</source>
            <pubdate>1992</pubdate>
            <volume>172</volume>
            <fpage>137</fpage>
            <lpage>147</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1491221</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Peptide splicing in the vacuolar ATPase subunit A from Candida tropicalis.</p>
            </title>
            <aug>
               <au>
                  <snm>Gu</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gallagher</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dean</snm>
                  <fnm>GE</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1993</pubdate>
            <volume>268</volume>
            <fpage>7372</fpage>
            <lpage>7381</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8463270</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The vacuolar H+-ATPase from Giardia lamblia: a potential model for the study of the evolution of the first eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Hilario</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>University of Connecticut, Storrs, CT</source>
            <pubdate>1998</pubdate>
         </bibl>
         <bibl id="B25">
            <title>
               <p>The genome sequence of the thermoacidophilic scavenger Thermoplasma acidophilum.</p>
            </title>
            <aug>
               <au>
                  <snm>Ruepp</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Graml</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Santos-Martinez</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Koretke</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Volker</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Frishman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Stocker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lupas</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Baumeister</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>407</volume>
            <fpage>508</fpage>
            <lpage>513</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35035069</pubid>
                  <pubid idtype="pmpid" link="fulltext">11029001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Archaeal adaptation to higher temperatures revealed by genomic sequence of Thermoplasma volcanium.</p>
            </title>
            <aug>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Amano</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Koike</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Makino</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Higuchi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kawashima-Ohya</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yamazaki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kanehori</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kawamoto</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nunoshiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yamamoto</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Aramaki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Makino</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>14257</fpage>
            <lpage>14262</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">18905</pubid>
                  <pubid idtype="pmpid" link="fulltext">11121031</pubid>
                  <pubid idtype="doi">10.1073/pnas.97.26.14257</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.</p>
            </title>
            <aug>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Shavirin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Spouge</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>2994</fpage>
            <lpage>3005</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">55814</pubid>
                  <pubid idtype="pmpid" link="fulltext">11452024</pubid>
                  <pubid idtype="doi">10.1093/nar/29.14.2994</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Conserved sequence features of inteins (protein introns) and their use in identifying new inteins and related proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Pietrokovski</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>1994</pubdate>
            <volume>3</volume>
            <fpage>2340</fpage>
            <lpage>2350</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7756989</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Empirical statistical estimates for sequence similarity searches.</p>
            </title>
            <aug>
               <au>
                  <snm>Pearson</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1998</pubdate>
            <volume>276</volume>
            <fpage>71</fpage>
            <lpage>84</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1997.1525</pubid>
                  <pubid idtype="pmpid" link="fulltext">9514730</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Structure at 2.8 A resolution of F1-ATPase from bovine heart mitochondria.</p>
            </title>
            <aug>
               <au>
                  <snm>Abrahams</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Leslie</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Lutter</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1994</pubdate>
            <volume>370</volume>
            <fpage>621</fpage>
            <lpage>628</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/370621a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8065448</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Strategies for achieving high-level expression of genes in Escherichia coli.</p>
            </title>
            <aug>
               <au>
                  <snm>Makrides</snm>
                  <fnm>SC</fnm>
               </au>
            </aug>
            <source>Microbiol Rev</source>
            <pubdate>1996</pubdate>
            <volume>60</volume>
            <fpage>512</fpage>
            <lpage>538</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">239455</pubid>
                  <pubid idtype="pmpid">8840785</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>The cDNA sequence of the 69-kDa subunit of the carrot vacuolar H+- ATPase. Homology to the beta-chain of F0F1-ATPases.</p>
            </title>
            <aug>
               <au>
                  <snm>Zimniak</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dittrich</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Kibak</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Taiz</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1988</pubdate>
            <volume>263</volume>
            <fpage>9102</fpage>
            <lpage>9112</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2897965</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The proton electrochemical gradient in Escherichia coli cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Padan</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Zilberstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rottenberg</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Eur J Biochem</source>
            <pubdate>1976</pubdate>
            <volume>63</volume>
            <fpage>533</fpage>
            <lpage>541</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1432-1033.1976.tb10257.x</pubid>
                  <pubid idtype="pmpid">4325</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Lateral genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Trends Cell Biol</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>M5</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0962-8924(99)01664-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">10611671</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Orthologs, paralogs and genome comparisons.</p>
            </title>
            <aug>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Olendzenski</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>630</fpage>
            <lpage>636</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-437X(99)00029-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">10607614</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Gene transfer, speciation, and the evolution of bacterial genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>2</volume>
            <fpage>519</fpage>
            <lpage>523</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1369-5274(99)00010-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">10508729</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Horizontal transfer of archaeal genes into the deinococcaceae: detection by molecular and computer-based approaches.</p>
            </title>
            <aug>
               <au>
                  <snm>Olendzenski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhaxybayeva</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Murphey</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Shin</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2000</pubdate>
            <volume>51</volume>
            <fpage>587</fpage>
            <lpage>599</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11116332</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Recurrent invasion and extinction of a selfish gene.</p>
            </title>
            <aug>
               <au>
                  <snm>Goddard</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Burt</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>13880</fpage>
            <lpage>13885</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">24159</pubid>
                  <pubid idtype="pmpid" link="fulltext">10570167</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.24.13880</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Thermophiles: Culture of Thermoplasma acidophilum.</p>
            </title>
            <aug>
               <au>
                  <snm>Searcy</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Amherst, MA: Biology Dept., Univ. of Massachusetts, available through the author</source>
            <pubdate>1993</pubdate>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Molecular Cloning: A Laboratory Manual,</p>
            </title>
            <aug>
               <au>
                  <snm>Sambrook</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fritsch</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Maniatis</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Cold Spring Harbor Laboratory, Cold Spring Harbor, NY</source>
            <pubdate>1989</pubdate>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Current Protocols in Molecular Biology.</p>
            </title>
            <aug>
               <au>
                  <snm>Ausubel</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Brent</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kingston</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Seidman</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Struhl</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>John Wiley and Sons Inc</source>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Tricine-sodium dodecyl sulfate-polyacrylamide gel electrophoresis for the separation of proteins in the range from 1 to 100 kDa.</p>
            </title>
            <aug>
               <au>
                  <snm>Schagger</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>von Jagow</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Anal Biochem</source>
            <pubdate>1987</pubdate>
            <volume>166</volume>
            <fpage>368</fpage>
            <lpage>379</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0003-2697(87)90587-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">2449095</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Codon usage tabulated from international DNA sequence databases: status for the year 2000.</p>
            </title>
            <aug>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ikemura</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>292</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102460</pubid>
                  <pubid idtype="pmpid" link="fulltext">10592250</pubid>
                  <pubid idtype="doi">10.1093/nar/28.1.292</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>The RDP-II (Ribosomal Database Project).</p>
            </title>
            <aug>
               <au>
                  <snm>Maidak</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Cole</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Lilburn</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>CT</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Saxman</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Farris</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Garrity</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Tiedje</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>173</fpage>
            <lpage>174</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29785</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125082</pubid>
                  <pubid idtype="doi">10.1093/nar/29.1.173</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools.</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Plewniak</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Jeanmougin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>4876</fpage>
            <lpage>4882</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">147148</pubid>
                  <pubid idtype="pmpid" link="fulltext">9396791</pubid>
                  <pubid idtype="doi">10.1093/nar/25.24.4876</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Quartet puzzling: A quartet maximum-likelihood method for reconstructing tree topologies.</p>
            </title>
            <aug>
               <au>
                  <snm>Strimmer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>von Haeseler</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Molecular Biology and Evolution</source>
            <pubdate>1996</pubdate>
            <fpage>964</fpage>
            <lpage>969</lpage>
         </bibl>
         <bibl id="B47">
            <title>
               <p>PAUP* 4.0 beta version, Phylogenetic Analysis Using Parsimony (and Other Methods):</p>
            </title>
            <aug>
               <au>
                  <snm>Swofford</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Sinauer Associates, Inc., Sunderland, MA, USA</source>
            <pubdate>1998</pubdate>
         </bibl>
      </refgrp>
   </bm>
</art>

