<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-7-140</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Multiple domain insertions and losses in the evolution of the Rab prenylation complex</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Rasteiro</snm>
               <fnm>Rita</fnm>
               <insr iid="I1"/>
               <email>rrasteiro@igc.gulbenkian.pt</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Pereira-Leal</snm>
               <mi>B</mi>
               <fnm>Jose</fnm>
               <insr iid="I1"/>
               <email>jleal@igc.gulbenkian.pt</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Instituto Gulbenkian de Ci&#234;ncia, Apartado 14, P-2781-901 Oeiras Portugal</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2007</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>140</fpage>
         <url>http://www.biomedcentral.com/1471-2148/7/140</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17705859</pubid>
               <pubid idtype="doi">10.1186/1471-2148-7-140</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>16</day>
               <month>6</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>17</day>
               <month>8</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>17</day>
               <month>8</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Rasteiro and Pereira-Leal; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Rab proteins are regulators of vesicular trafficking, requiring a lipid modification for proper function, prenylation of C-terminal cysteines. This is catalysed by a complex of a catalytic heterodimer (Rab Geranylgeranyl Transferase &#8211; RabGGTase) and an accessory protein (Rab Escort Protein. REP). Components of this complex display domain insertions relative to paralogous proteins. The function of these inserted domains is unclear.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We profiled the domain architecture of the components of the Rab prenylation complex in evolution. We identified the orthologues of the components of the Rab prenylation machinery in 43 organisms, representing the crown eukaryotic groups. We characterize in detail the domain structure of all these components and the phylogenetic relationships between the individual domains.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We found different domain insertions in different taxa, in &#945;-subunits of RGGTase and REP. Our results suggest that there were multiple insertions, expansions and contractions in the evolution of this prenylation complex.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Protein prenyl transferases are a family of protein complexes that catalyze the lipid modification of proteins with isoprenoid groups. There are covalently attached to cysteine residues near or at the C-termini of intracellular proteins via tioether linkages (reviewed in <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>). The family includes Protein Farnesyl Transferase (FTase), Geranylgeranyl Transferase I (GGTase I) and RabGeranylgeranyl Transferase (RabGTTase). FTase and GGTase I modify C-terminal cystein residues in the context of a CAAX motif with a 15 or 20 carbon isoprenoid, respectively, and are thus termed CAAX prenyltransferases (A stands for aliphatic residue, X for any residue). Substrates of the FTase include Ras family small GTPases, nuclear lamins, centromeric proteins among others, whereas substrates of the latter include the Rho family of small GTPases and heterotrimeric G protein &#947; subunits (reviewed in reference <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>). RabGGTase in contrast is specific to the Rab family of small GTPases, and catalyses the modification of these with two or sometimes one 20 carbon isoprenoid. It also differs from the CAXX prenyltransferases by the absolute requirement for an accessory protein, termed Rab Escort Protein (REP) for proper catalysis (reviewed in <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>). Protein prenylation affects proteins involved in a multitude of cellular processes, is involved in a variety of human diseases and therepaeutic approaches and this is extensively reviewed, for example in references <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>.</p>
         <p>Protein prenyl transferases are heterodimeric complexes of a &#945; and a &#946; chain, and in the case of RGGTase a third subunit, the REP protein(s). The genes coding for these enzymes have been cloned in a variety of species and shown to be essential for life <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. FTase and GGTaseI share the &#945;-subunit, but have distinct &#946;-subunits, whereas the &#945;- and &#946;-subunits of RabGGTase are coded by distinct genes. The three-dimensional structure of the three enzymes has been solved, and revealed that the three enzymes are structurally homologous (see Figure <figr fid="F1">1</figr>). The &#945;-subunit is a right-handed, crescent shaped, super-helix composed by 15 &#945;-helices wrapped around the &#945;-&#945; barrel of the &#946;-subunit <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. RabGGTase is a similar &#945;&#946; heterodimer but the rat protein displays the insertion of two additional domains in the &#945;-subunit relative to the other prenyltransferases, and to the yeast protein, a Ig-like domain and a Leucine Rich Repeat domain <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. In contrast with the CAXX prenyltransferases that recognize the C-terminal motif of the substrate proteins via the &#946;, catalytic subunit, RabGGTase recognizes the substrate by interactions of the &#945;-subunit with an extra protein, REP. The rat REP, like the &#945;-subunit, displays an insertion relative to the yeast orthologue and the paralogous RabGDI <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp> (Figure <figr fid="F1">1</figr>). Protein prenyltransferases are believed to have evolved from an ancestral heterodimer, which by gene duplication gave rise to the current constellation of subunits <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Cartoon representing the three dimensional structure of Rat RabGGTase complexed with REP-1 (<ext-link ext-link-type="pdb" ext-link-id="1LTX">1LTX</ext-link>) [29]</p>
            </caption>
            <text>
               <p>Cartoon representing the three dimensional structure of Rat RabGGTase complexed with REP-1 (<ext-link ext-link-type="pdb" ext-link-id="1LTX">1LTX</ext-link>) [29]. The &#945; subunit is shown in green with a yellow and red parts, representing a Leucine Rich Repeat and a C2-like domain, respectively. REP1 is shown in blue with dotted blue lines representing disordered regions. The &#946; subunit is shown in gray. The &#945;-subunit and REP are shown in isolation on the left and right side respectively, next to cartoons representing the three dimensional structure of the paralogous &#945;-subunit of FTase and of RabGDI (PDB codes <ext-link ext-link-type="pdb" ext-link-id="1FT1">1FT1</ext-link> [11] and <ext-link ext-link-type="pdb" ext-link-id="1LV0">1LV0</ext-link> [72], respectively).</p>
            </text>
            <graphic file="1471-2148-7-140-1"/>
         </fig>
         <p>The role of gene duplication in the evolution of protein complexes has been subject of recent interest <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. Individual gene duplications can contribute to specialization of protein complexes, and can also accumulate to the point where two or more fully independent protein complexes exist <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B19">19</abbr></abbrgrp>. Less well understood is the role of domain insertions and deletions in the context of protein complexes. Protein domains are the basic evolutionary units in protein evolution <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. Proteins can gain, rearrange and loose domains in the course of evolution. Domain gains result mostly from fusions of independent genes that contain one or more domains and it is perhaps the most frequent mechanism of protein evolution after gene duplication <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. It can also be a consequence of duplication followed by in-frame fusion, resulting in tandem duplications. Less frequently, domain insertions may happen, situations where one or more domains are inserted into another domain, usually in loop regions <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. Domain losses are less well understood, but recent evidence suggests that they are most abundant at C-termini and due to the premature stop codons <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Rearrangements or circular permutations can be achieved by complete duplication with in-frame fusion followed by partial deletion of domains at the termini, the most frequent mechanism <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, but also by independent fusion <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and also by a mechanism termed "cut and paste" where a gene is fragmented, for example by the action of endonucleases, and then reassembles the resulting fragments in a different order <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>.</p>
         <p>Here we characterize the domain architectures of the components of the Rab prenylation complex, <it>i.e. </it>the heterodimer &#945;&#946; of RGGTase and the accessory protein REP. We find that orthologous proteins display diverse domain architectures, suggestive of multiple independent events involving gains and losses of domains.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>&#945; subunit &#8211; tetratricopeptide repeats</p>
            </st>
            <p>The rat &#945;-subunit of RGGTase is very similar to the corresponding &#945;-subunit of FTase, containing 15 &#945;-helices arranged in a crescent-shaped, double layered right-handed superhelix, enveloping the &#946;-subunit <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. Its structural architecture, together with statistically significant sequence similarity with the tetratricopeptide repeat (TPR) motif indicates that it belongs of the TPR superfamily <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. An individual TPR is a pair of anti-parallel &#945;-helices, with consensus residues mediating the packing of these helices <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. In the rat RGGTase &#945;-subunit these 7 TPR are helices 2 to 15. The crystal structure of the Rat RGGTase revealed two additional domains, a C-terminal Leucine Rich Repeat domain (LRR), and an Ig-like domain inserted between helices 11 and 12, <it>i.e. </it>between TPR 5 and 6. This is the exact same loop where in the FTase &#945;-subunit there is a short 3<sub>10 </sub>helix between &#945;-helices 11 and 12 <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
            <p>We investigated the complete and partial genome sequences of 43 organisms, covering the crown eukaryotic groups discicristates, excavates, alveolata, heterokonts, plants, amoebozoa and the opisthokonts (metazoa, fungi and choanoflagelates) <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. In all the organisms with a complete genome sequence available, we detect a putative RGGTase &#945;-subunit (Figure S1). Thus we can conclude that the last eukaryotic common ancestor already possessed this subunit. FTase &#945;-subunits were also found in the majority of the organisms studied (Figure S1), suggesting that the duplication that gave rise to the two distinct &#945;-subunits preceded the split of the eukaryotic crown groups.</p>
            <p>Detailed phylogenetic reconstruction of the evolutionary relationships between &#945;-subunits based solely on the TPRs shows that the &#945;-subunits of each enzyme are monophyletic (Figure S1). It is then most likely that there was one and only one gene duplication event that created the two paralogous subunits, and that this happened at the base of the eukaryotic tree. Furthermore, the fact that the secondary structure composed of 15 repeating helices with TPRs is conserved suggests that the ancestral of prenyltransferases already had this configuration.</p>
            <p>We find that the secondary structure of this subunit varies in evolution, displaying multiple domain insertions in different organisms, which we characterize below.</p>
         </sec>
         <sec>
            <st>
               <p>&#945; subunit &#8211; C2-like domain</p>
            </st>
            <p>Rat RGGTase a-subunit displays an inserted globular domain between helices 11 and 12 <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The function of this domain is unclear, but it is clearly not involved in contacts with REP nor the Rab substrates <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. This globular domain is a &#946; sandwich composed of eight strands in two sheets. It is a domain broadly related to the Ig fold, and in the two papers describing the structure of the complex it was termed an Ig-like domain. This is consistent with the CATH hierarchy, a fully automated classification of protein structures <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Here we will consider instead the SCOP classification of evolutionary relationships between proteins structures, as it complements automated classification with manual curation <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B31">31</abbr></abbrgrp>. SCOP 1.71 classifies this domain as an independent superfamily, included in the C2 domain-like fold. From now on we will refer to this domain as the C2 domain-like.</p>
            <p>The full SCOP hierarchy for this domain is shown in additional file <supplr sid="S1">1</supplr> (Figure S2). This fold includes superfamilies such as the C2 domain. This is a domain that is found in multiple eukaryotic proteins and is involved in signaling, vesicular transport, modification of lipids, among other functions <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. C2 domains usually regulate their respective protein function by establishing Ca<sup>2+</sup>-dependent and Ca<sup>2+</sup>-independent phospholipids complexes. One class of C2 domains can bind Ca<sup>2+ </sup>without binding phospholipids <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Recently, the C2 domain of PKCd was shown to mediate protein-protein interactions by binding directly to phosphotyrosine peptides in a sequence-specific manner <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. It is unclear if the RabGGTase C2-lke domain displays any of these functions. Since the role of Rab isoprenylation is to allow hydrophilic Rab proteins to associated with cellular membranes, it is plausible to think that such modification should occur in proximity to those membranes. If this C2 domain-like is a phospholipid-binding domain, then it could play a role in bringing the prenylation reaction next to membranes. Ultra-structural studies could be used to test this hypothesis.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Supplementary material. supplementary figures and tables</p>
               </text>
               <file name="1471-2148-7-140-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>It was previously observed that this domain was not present in the yeast orthologue of &#945;-subunit RGGTase, but that it would be present in worm also <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. We analyzed in detail the sequences between helices 11 and 12 of RGGTase, where the Rat C2-like domain is found <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, and also where the FTase a-subunit displays an inserted 3<sub>10 </sub>helix <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. We found that only a restricted number of branches on the eukaryotic tree display insertions in this region (Figure <figr fid="F2">2</figr>). These include metazoa, plants and alveolata. All other branches of the tree have a predicted secondary structure similar to that of Bet2 in <it>S. cerevisae</it>, which does no display any insertion between the two TPRs.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p><b>(A) </b>taxonomic tree of the organisms analyzed in this study, adapted from the NCBI's taxonomy database [73] and from reference [28] <b>(B) </b>N- to C-terminal domain organization of the &#945;-subunit of RGGTase</p>
               </caption>
               <text>
                  <p><b>(A) </b>taxonomic tree of the organisms analyzed in this study, adapted from the NCBI's taxonomy database [73] and from reference [28] <b>(B) </b>N- to C-terminal domain organization of the &#945;-subunit of RGGTase. Boxes with the same shape represent the same domain, whereas different shapes represent distinct domains. Green boxes represent tetratricopeptide repeats (TPRs). In between the 5<sup>th </sup>and 6<sup>th </sup>TPR some proteins display an inserted domain. In deuterostomes this is a C2-like domain (inverted orange triangle), but in other organisms the inserted sequences are not recognized as any known domain. They are represented by colored lines. Inserts of the same color are similar to each other. Ellipses represent LRR domains &#8211; distinct colors represent sequence similarity. The red circle with 2&#215; on the left indicates that this protein is duplicated in that organism. <b>(C) </b>N- to C-terminal domain organization of REP. The blue boxes represent the Sequence Conserved Regions (SCRs) between SCR1 and SCR2 there is an inserted domain of unknown structure. This domain is only conserved within taxa: all the inserted sequences represented in yellow are similar to each other but distinct from all other inserted sequences represented in other colors. <b>(D) </b>N- to C-terminal domain organization of the &#946;-subunit The gray boxes represent the prenyltransferase and squalene oxidase repeat.</p>
               </text>
               <graphic file="1471-2148-7-140-2"/>
            </fig>
            <p>Next we investigated whether the insert we found on the three distinct branches of the tree of life are similar, and hence likely to represent one single insertion event in the ancestor of all RGGTases that was subsequently lost in other branches. Or instead whether they are distinct domains, resulting from independent insertion or expansion events. Our results favor the second hypothesis (Figure <figr fid="F2">2</figr>). We find that the inserted domains are similar within taxonomical groups, but different beyond recognition across taxa. By difference beyond recognition we mean BLAST sequence searches <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, Pfam <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> and Superfamily <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp> domain assignments and secondary structure predictions <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>. Thus, in Metazoa, all Deuterostomes have a clearly defined C2-like domain, but the insects and the nematodes have a distinct insertion conserved solely in their taxonomical group. In the plants, angiosperms have a conserved domain in the same region of the &#945;-subunit, but different beyond recognition from any other insert of the orthologous &#945;-subunits. The same is true for alveolata. The size of the insertion is also conserved with taxonomic group but not across groups. For example, whereas deuterostomes have an insert of near 125 amino residues long, plants have a larger one, nearing 200 residues (Figure S3). Our results are thus compatible at least with up to five distinct insertion and/or expansion events in the same position of the &#945;-subunit of RGGTase.</p>
            <p>One exciting possibility is that all these insertions and or expansions represent the same function accomplished by different sequences, as this would expand our repertoire of sequence-function relationships.</p>
            <p>Multiple sequence insertions/expansions in the same site suggest that this site is capable of accommodating structural variations more easily than others. Thus there seems to be a structural constraint in place. The fact that the paralogous &#945;-subunit of FTase contains an inserted region in between these helices, in the form of a 3<sub>10 </sub>helix adds support to this hypothesis. However, if the function of these distinct insertions and/or expansions is the same, then this could be the only place where this function is possible, and thus the recurrent use of the same site reflect functional rather than structural constraint. In the absence of information regarding the role played by these insertions/expansions it is not possible to resolve this question.</p>
         </sec>
         <sec>
            <st>
               <p>&#945; subunit &#8211; LRR domain</p>
            </st>
            <p>At the C-terminus of rat RGGTase &#945;-subunit there is a Leucine Rich Repeat domain (Figure <figr fid="F1">1</figr>). This domain in not present in FTase (Figure <figr fid="F1">1</figr>). LRRs belong to the SCOP fold of the same name, which groups proteins forming a right-handed &#946;-&#945; superhelix <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B31">31</abbr></abbrgrp>. It is formed by three superfamilies, the RNI-like, Outer arm dynein light chain and the L domain-like which includes the RGGTase LRR domain. LRRs are involved in a variety of biological processes, both in eukaryotes and prokaryotes. Their common role is the establishment of complexes with other proteins <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>.</p>
            <p>We now investigate whether this LRR domain is a feature of all RGGTase &#945;-subunits, which would suggest that it was present in the ancestral eukaryote, or if it is a recent acquisition restricted to a specific taxon or set of taxa. It is clear from the results shown in Figure <figr fid="F2">2</figr> that the LRR domain is not universal, as we can only detect it in some animals, in angiosperms and in alveolata. This phylogenetic profile is consistent with two evolutionary scenarios &#8211; independent domain fusions or a single domain fusion at the base of the eukaryotic tree followed by a specific domain loss in multiple branches of the eukaryotic tree. Both scenarios seem equally unlikely, so we investigated this further using phylogenetic reconstruction based on the LRR domain sequences only. Our hypotheses is that there is enough phylogenetic signal in these sequences to solve this puzzle. In Figure <figr fid="F3">3</figr> we show three phylogenies of the LRRs, including sequences from other superfamilies as reference, reconstructed by three distinct methods. Plant and vertebrate LRR are consistently monophyletic, suggesting a common origin. In contrast, the LRR of <it>Ciona</it>, of nematodes and of alveolata segregate with different reference sequences. This varies according to the method used to reconstruct phylogeny. It is thus impossible at this stage to resolve the question regarding the common or independent origin of the LRR sequences in the &#945;-subunit. However, the recent observation that convergent evolution of domain architectures is very rare, with an estimated frequency of 0.4% to 4% <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, is more supportive of the first scenario.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Maximum Likelihood reconstruction of the evolutionary relationships between the LRR domains found in RGGTase a-subunit (red branches) and reference sequences representing other superfamilies of LRR domains &#8211; L-domain-like, internalin LRR (<ext-link ext-link-type="pdb" ext-link-id="106v">106v</ext-link>) &#8211; green; RNI-like, 28-residue LRR, Ribonucelase inhibitor (<ext-link ext-link-type="pdb" ext-link-id="1A4Y">1A4Y</ext-link>) &#8211; blue; RNI-like, RanGAP1, N-terminal domain(<ext-link ext-link-type="pdb" ext-link-id="1K5D">1K5D</ext-link>) &#8211; purple; outer arm dynein light chain(<ext-link ext-link-type="pdb" ext-link-id="1M9L">1M9L</ext-link>) &#8211; orange; L-domain-like, RabGGTase &#945;-subunit, C-terminal domain (<ext-link ext-link-type="pdb" ext-link-id="1LTX">1LTX</ext-link>) &#8211; red</p>
               </caption>
               <text>
                  <p>Maximum Likelihood reconstruction of the evolutionary relationships between the LRR domains found in RGGTase a-subunit (red branches) and reference sequences representing other superfamilies of LRR domains &#8211; L-domain-like, internalin LRR (<ext-link ext-link-type="pdb" ext-link-id="106v">106v</ext-link>) &#8211; green; RNI-like, 28-residue LRR, Ribonucelase inhibitor (<ext-link ext-link-type="pdb" ext-link-id="1A4Y">1A4Y</ext-link>) &#8211; blue; RNI-like, RanGAP1, N-terminal domain(<ext-link ext-link-type="pdb" ext-link-id="1K5D">1K5D</ext-link>) &#8211; purple; outer arm dynein light chain(<ext-link ext-link-type="pdb" ext-link-id="1M9L">1M9L</ext-link>) &#8211; orange; L-domain-like, RabGGTase &#945;-subunit, C-terminal domain (<ext-link ext-link-type="pdb" ext-link-id="1LTX">1LTX</ext-link>) &#8211; red. Inset boxes show the topologies of Bayesian and Neighbor Joining trees of the same sequences, showing contradicting topologies. Species codes are Aaeg &#8211; Aedes aegypti; Atha &#8211; Arabidopsis thaliana; Bthr-Bacillus thrugiensis; Cbri &#8211; Caenorhabditis briggsae; Cele &#8211; Caenorhabditis elegans; Cint &#8211; Ciona intestinalis; Crei &#8211; Chlamydomonas reinhardtii; Dmel &#8211; Drosophila melanogaster; Hsap &#8211; Homo sapiens; Liva &#8211; Listeria ivanovii, Lmon &#8211; Listeria monocytogenes; Mmus &#8211; Mus musculus; Osat &#8211; Oryza sativa; Pfal-Plasmodium falciparum; Ptri &#8211; Populus trichocarpa; Ptro &#8211; Pan troglodytes; Pyoe-Plamodium yoelii; Rnor &#8211; Rattus norvegicus; Scer &#8211; Saccharomyces cerevisae;Spom &#8211; Schizosaccharomyces pombe; Tbru &#8211; Trypanosoma brucei</p>
               </text>
               <graphic file="1471-2148-7-140-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>&#946;-subunit domain architecture</p>
            </st>
            <p>The &#946;-subunit of rat RGGTase is a &#945;-&#945; barrel composed of 12 &#945; helices. It is very similar to the &#945;-&#945; barrel in the &#946;-subunit of FTase <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The &#946;-subunits of prenyltransferases are more conserved than the &#945; subunits, but identification and classification of RGGTase &#946;-subunit was simple using a combination of BLAST searches of sequences databases followed by phylogenetic analysis (Figure S4).</p>
            <p>We discussed above that the &#945;-subunit display multiple sequence insertions in different species. Below we will show that in some species REP also display inserted sequences. The &#946;-subunit in contrast appears to have an invariable domain architecture throughout evolution. We investigated its structural relatives in order to gain insight whether this is due to structural constraints. In order to do so, we investigated the SCOP hierarchical classification of protein structures. Prenyl transferase's &#946;-subunits belong to the "&#945;-&#945; toroid" fold (SCOP: 48207), Terpenoid cyclases/Protein prenyltransferases superfamily (SCOP: 48239). This fold consists of multihelical proteins displaying up to seven alpha-hairpins arranged in a closed circular array. Therpene synthases are classified into the same superfamily. These proteins are characterized by two &#945;-&#945; domains in the same peptide chain; the first is an &#945;6-&#945;6 barrel of two concentric rings, whereas the second is a barrel with 10 &#945; helices, one 3<sub>10 </sub>helix and at least two &#946;-strands inserted in between helical elements <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. It appears then that this superfamily can have varying number of helical elements as well as accommodate extra structural elements. It is then plausible that the invariable nature of the domain architecture of &#946;-subunits is due to functional rather than structural constraints.</p>
            <p>In animals and yeast it is clear that RGGTase is composed of independent &#945; and &#946;-subunits. In contrast GGTase I and FTase have distinct &#946;-subunits, but share a &#945;-subunit. Thus the GGTase I and FTase are related by a single gene duplication. In contrast, RGGTase is separated from the other prenyltrasnsferases by two gene duplications. Although the number of duplication steps that separates the different enzymes is clear, the order of duplication is not. In other words, we don't know which subunits emerged first and which resulted from these by duplication. We find that all organisms with a complete genome sequence that we investigated display a &#946;-subunit of RGGTase (Figure <figr fid="F2">2</figr>). Considering we also always find a RGGTase &#945;-subunit in the same organisms, it follows that the ancestor of all eukaryotes already had a distinct and fully separated RGGTase.</p>
         </sec>
         <sec>
            <st>
               <p>REP</p>
            </st>
            <p>Rab escort proteins (REP) belong to the same protein family as Rab GDP dissociation inhibitors (RabGDI). They are both classified in the SCOP hierarchy <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> as FAD/NAD(P)-binding domain fold and superfamily, which suggests a common ancestry. Their structure comprises two domains: domain I include the Rab binding platform, whereas domain II in REP mediates binding to the alpha-subunit of RGGTase <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B45">45</abbr></abbrgrp>. REP and RabGDI share conserved regions, termed SCRs (sequence conserved regions), which are highlighted in Figure <figr fid="F4">4</figr> as brown boxes. Multiple sequence alignments of REPs and RabGDI reveal that mammalian REPs display an insertion between domain I and domain II, absent in RabGDI, which maps roughly to a sequence region delimited by the conserved regions SCR1B and SCR2 <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B45">45</abbr></abbrgrp>. This insert appears to be larger in vertebrate sequences than in S. cerevisae <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. The function of this insert is unclear, particularly at the light of the recent structure of REP1 in complex with RGGT, which shows that it is not involved in contacts with the RGGTase subunits nor with Rab substrates <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p><b>(A) </b>scheme illustrating the location of the sequence conserved regions (SCRs) in RabGDI and REP sequences, as well as the variable size of the region between SCR1A and SCR2</p>
               </caption>
               <text>
                  <p><b>(A) </b>scheme illustrating the location of the sequence conserved regions (SCRs) in RabGDI and REP sequences, as well as the variable size of the region between SCR1A and SCR2. The broken yellow and blue boxes indicate the "gap" in the alignment of these sequences with mammalian REP. Adapted from [13] <b>(B) </b>Neighbor-Joining tree of selected sequences of REP and RabGDI, illustrating that classification of these sequences into one of the two subfamilies is clear. The same tree with bootstrap vaues is provided as supplementary Figure S6. Species codes are Aaeg &#8211; Aedes aegypti; Afum &#8211; Aspergillus fumigatus; Agam &#8211; Anopheles gambiae; Atha &#8211; Arabidopsis thaliana; Cbri &#8211; Caenorhabditis briggsae; Cele-Caenorhabditis elegans; Cint &#8211; Ciona intestinalis; Cmer &#8211; Cyanidioschyzon merolae; Cneo &#8211; Cryptococcus neoformans; Cpar &#8211; Cryptosporodium parvum; Crei-Chlamydomonas reinhardtii; Ddis &#8211; Dictyostelium discoideum; Dmel &#8211; Drosophila melanogaster; Dpse &#8211; Drosophila pseudoobscura; Drer &#8211; Danio rerio; Gint-Giardis intestinalis; Hsap &#8211; Homo sapiens; Mbre &#8211; Monosiga brevicollis; Mmus-Mus musculus; Ngru &#8211; Naegleria gruberi; Nvec &#8211; Nematostella vectensis; Osat-Oryza sativa; Otau &#8211; Ostreococcus tauri; Pber &#8211; Plasmodium berghei; Pfal &#8211; Plasmodium falciparum; Ppyg-Pongo pygmaeus; Pram &#8211; Phytophthora ramorum; Psoj &#8211; Phytophthora sojae; Ptri-Populus trichocarpa; Pyoe &#8211; Plamodium yoelii; Rnor &#8211; Rattus norvegicus; Scer-Saccharomyces cerevisae; Sjap &#8211; Schistosoma japonicum; Spom-Schizosaccharomyces pombe; Tbru &#8211; Trypanosoma brucei; Tcas &#8211; Tribolium castaneum; Tcru &#8211; Trypanosoma cruzi; Tpse-Thalassiosira pseudonana; Tnig &#8211; Tetraodon nigroviridis; Trub &#8211; Takifugu rubripes; Tthe &#8211; Tetrahymena thermophila; Xlae &#8211; Xenopus laevis; Xtro &#8211; Xenopus tropicalis; Ylip &#8211; Yarrowia lipolytica <b>(C) </b>Plot of the size of the region between SCR1A and SCR2 in the different species, measured in number of amino acids. The bars are coded according to the colors used in Figure 2 and same color denotes identifiable sequence similarity. The blue and pink dotted perpendicular lines are for reference and indicate the number of amino acids between SCR1B and SCR2 for RabGDI and MRS6, respectively. A red D indicates that the insert contains a predicted disordered region [51], whereas a cross means that such regions are not predicted in the insert.</p>
               </text>
               <graphic file="1471-2148-7-140-4"/>
            </fig>
            <p>We identified REPs in all species studied here by searching GeneBank and the other genomic databases (see methods) with known REP sequences. We used phylogenetic reconstruction to classify the obtained sequences into the REP and RabGDI subfamilies (Figure <figr fid="F4">4B</figr>). We identified at least one distinct REP sequence in all species for which a complete genome sequence was available. Mammals have two paralogous REPS (REP-1 or Choroideremia and REP-2 or CHM-like). <it>X. laevis </it>also displays two paralogous REPs, but our phylogenetic analysis suggests that this is an independent and species-specific duplication (Figure <figr fid="F4">4b</figr>). It is interesting to note that in <it>A. thaliana</it>, the duplication of both enzyme subunits was not accompanied by REP duplication.</p>
            <p>The most striking observation is that few branches of the tree of life are characterized by the presence of a larger insert region between SCR1B and SCR2 in REPs. Their size indicates that few independent taxa display inserts that are larger than those of yeast MRS6 (yeast REP) and RabGDIs (Figure <figr fid="F4">4C</figr>). At the sequence level there is no real conservation &#8211; the inserts are similar within taxa but very different across taxa. Systematic database searches using solely the insert regions from several specie can only find closely related REP sequences. For example, a BLAST search using the insert region of rat or human REP1 will find only deuterostomes REP inserts, but not plant inserts, and <it>vice versa</it>. They are also not similar to any other protein other than REPs, which means that we cannot use this approach to define hypothetical functions for this region.</p>
            <p>In the crystal structure of rat REP1, this sequence insert corresponds to a region in the crystal with no clear electron density <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Such regions are typically labeled natively unfolded <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>. Natively unfolded proteins are involved in a variety of cellular functions, namely transcriptional and translational regulation, signaling and regulation of the self-assembly of large multi-subunit complexes such as the ribosome and the bacterial flagellum <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>. They are also expected to be involved in a variety of human diseases <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. Although they can perform their function in the unfolded state, the majority of unfolded regions undergo a process termed induced-folding, in which upon binding to their physiological partners they undergo a transition to a structured form <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. We tested whether the different inserts were also natively unfolded, using the predictive algorithm Globplot2 <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. We observed that angiosperm insert sequences are predicted to include a disordered, region, so are all vertebrate sequences and nematodes. In contrast, insects never show a predicted disordered region within the insert region (Figure <figr fid="F4">4C</figr> and S5). Thus, the presence of disordered regions is not restricted to the larger inserts of vertebrates, nor does it seem to correlate with insert size.</p>
         </sec>
         <sec>
            <st>
               <p>Duplications</p>
            </st>
            <p>In most species studied, we find a single copy of the &#945;- and the &#946;-subunit. In contrast we found two RGGTase &#945;- and &#946;-subunits in <it>A. thaliana</it>. We have previously observed that duplication of whole protein complexes is frequent <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. We argued for the prevalence of stepwise duplications leading to complete duplication of subunits of complexes, like what is observed with the adaptin tetrameric complexes <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. The duplication of the two subunits of RGGTase in <it>A. thaliana </it>however appears to have occurred simultaneously, as a result of a whole genome duplication (WGD) estimated to have happened around 38 million years ago <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. The paralogous pairs localize to distinct chromosomes, to duplicated segments that were mapped to that WGD (&#945;-subunit: At4g2424490 (chr.4) and At5g41820 (chr.5); &#946;-subunit: At3g12070 (chr.3) and At5g12210 (chr.5)).</p>
            <p>The functional relevance of maintaining these two copies of RGGTase is unclear. It is clear from all the phylogenetic trees that the subunits are very closely related (e.g. Figure S1), suggesting some selective pressure. <it>A. thaliana </it>may require high levels of this enzyme and concerted gene duplication is one way of boosting the levels of a given gene product <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. Alternatively, the large number and diversity of Rab GTPases in <it>A. thaliana </it><abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp> may require specific regulation of Rab prenylation. This could be achieved by having multiple copies of the enzyme subunits under differential regulation or displaying distinct substrate specificities.</p>
            <p>Animals also display an expansion of the Rab family. They don't have multiple copies of the enzyme subunits; they have instead two paralogous REPs that appear to form complexes with distinct substrates and have distinct specificities to the RGGTase <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. This lends some support to the second hypothesis. It would be thus interesting to investigate whether there is specificity in the pairing of the <it>A. thaliana </it>subunits, and whether the different versions of the enzyme have distinct substrate specificities. In contrast, some animals display two or more copies of the &#946;-subunit for a single &#945;-subunit (Figure <figr fid="F2">2</figr>). Since substrate-binding specificity is not defined by the &#946;-subunit in RabGGTase, it seems plausible that expression levels are at the root of these duplications. Amoebozoa like <it>D. discoideum</it>, which represents another example of independent expansions of the Rab family, have in excess of 50 Rab proteins (unpublished observations). <it>D. discoideum </it>only has one copy of each component (Figure <figr fid="F2">2</figr>), indicating that large Rab families are compatible with a single copy of each RabGGTase component.</p>
            <p>The duplication of components of RabGGTase appear thus to be restricted to specific branches of the tree of life (Figure <figr fid="F2">2</figr>). Our analysis also indicates that it is also restricted to RabGGTase. We did not observe duplications of FTase nor GGTase I subunits. This is despite the fact that their substrates show expansions akin to those of the Rab family. For example, the Ras and Rho families expanded from 3 and 6 members respectively in <it>S. cerevisiae</it>, to 22 and 34, respectively, in <it>H. sapiens </it><abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. It is possible that dosage balance may place a barrier to duplications of subunits <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B59">59</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>In summary, we identified the orthologues of the components of the Rab prenylation complex in 43 species &#8211; the &#945;&#946; heterodimer RabGGTase, and the accessory proteins REP. We characterized their domain architectures and found that it varies considerably in evolution. The &#945;-subunit can have distinct inserts in two positions, between the tetratricopeptide repeats 5 and 6 and at the C-terminal. These domain insertions are specific to RabGGTase, as the paralogous &#945;-subunit of Farnesyl and Geranylgeranyl (type I) Transferases displays conserved domain architectures. The &#946; subunit has a conserved domain architecture but the REP proteins also have variable inserts between SCR1B and SCR2.</p>
         <p>We found the full constellation of protein prenyltransferases in the organisms we analyzed as found in model organisms <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. This suggests that the Last Common Eukaryotic Ancestor (L.E.C.A.) had the same constellation of enzymes and subunits. The three enzymes are related by gene duplication events, which suggests that there was an ancestral heterodimeric protein prenyl transferase at the base of the eukaryotic tree. The presence of structural homologues of &#945; and &#946;-subunits in all the branches of the tree of life indicates that the eukaryotic innovation required solely novel interactions, rather than novel folds. The precise order of duplication events is however unclear. FTase and GGTase I are separated by a single gene duplication event, whereas RGGTase is separated by two duplication events from either of the other enzymes. The simpler architecture of FTase and GGTase I, and their independence from further components for proper activity suggests that they preceded RGGTase. However, substrates of all three enzymes are ubiquitous in the eukaryotic tree, which further emphasizes that the L.E.C.A. is likely to have already the three enzymes.</p>
         <p>In conclusion, the components of the Rab prenylation complex display varied domain architectures in evolution, which are more consistent with multiple independent events in the first insertion of the &#945;-subunit, multiple losses in the second insert, and independent expansions in REP. There is no known function for the variable domains. There is no obvious correlation between Rab family size, and the presence or size of any of the RGGTase inserts, which suggests that these insertions are independent of Rab family expansion. Furthermore, the inserted domains do not parallel each other within or across subunits. This  indicates that it is unlikely that they are involved in direct physical contacts, and that they are likely to have independent functions. The few inserted sequences that are similar to known sequences (C2-like and LRR) suggest that the role of these domains is regulatory, possibly involving lipid and protein binding. It is tempting to speculate that they are involved in taxon-specific regulatory interactions not yet described. One future avenue of research will be the identification of putative binding partners, for example by bioinformatic analysis, searching for genes with similar phylogenetic profiles as those of the inserted domains.</p>
         <p>Finally, this case study suggests that domain gains and losses may be an important force driving the evolution and diversification of protein complexes. A future avenue of research is the quantification of this contribution.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <p>The analyses were performed on sequenced eukaryotic genomes that were downloaded from NCBI and other eukaryotic genomes databases. A full listing of the organisms investigated is provided in additional file <supplr sid="S1">1</supplr>. It aims for maximum coverage of eukaryotic diversity within the species with a complete genome sequence. Sequence searches combined BLAST <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> searches using known RGGTase and REP sequences as query, as well as protein families defined in the Pfam <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> and superfamily <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp> databases. Multiple sequence alignments were performed with ClustalW 1.83 <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>, pairwise alignments used the Smith-Waterman algorithm <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> (Water in EMBOSS <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>), in both cases with the a Blosum 62 matrix <abbrgrp><abbr bid="B63">63</abbr></abbrgrp> and default GAP and extension penalties. Sequence manipulation was done with Jalview 2.1.1 <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. Domain assignments were done using Superfamily <abbrgrp><abbr bid="B65">65</abbr></abbrgrp> and Pfam <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. Secondary structure predictions was performed using Jpred <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>. Phylogenetic reconstruction was done using the Neighbor-Joining clustering algorithm as implemented in ClustalW 1.83 <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>, using 1000 bootstraps, as well as the cladistic methods Maximum Likelihood in the Phylip 3.61 package (ProML) <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> (Jones-Taylor-Thorthon (JTT) matrix; 100 boostraps) and Bayesian method implemented in MrBayes v. 3.1.2 <abbrgrp><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr></abbrgrp> using Blosum62 as a fixed rate aa model, ran until average standard deviation of split frequencies was lower than 0.01. Trees were drawn using in FigTree v. 1.0 <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>.</p>
         <p>All sequences used in this study, as well as alignment and tree files can be found in the supplementary website <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>JBPL conceived the study and wrote the manuscript. RR and JBPL performed the analysis.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank members of the Computational Genomics and Cell Cycle Regulation laboratories at the Instituto Gulbenkian de Ciencia for helpful discussions.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Protein prenyltransferases</p>
            </title>
            <aug>
               <au>
                  <snm>Maurer-Stroh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Washietl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Eisenhaber</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>4</issue>
            <fpage>212</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">154572</pubid>
                  <pubid idtype="pmpid" link="fulltext">12702202</pubid>
                  <pubid idtype="doi">10.1186/gb-2003-4-4-212</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Protein prenyltransferases</p>
            </title>
            <aug>
               <au>
                  <snm>Casey</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Seabra</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1996</pubdate>
            <volume>271</volume>
            <issue>10</issue>
            <fpage>5289</fpage>
            <lpage>5292</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.271.10.5289</pubid>
                  <pubid idtype="pmpid" link="fulltext">8621375</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Thematic review series: lipid posttranslational modifications. Structural biology of protein farnesyltransferase and geranylgeranyltransferase type I</p>
            </title>
            <aug>
               <au>
                  <snm>Lane</snm>
                  <fnm>KT</fnm>
               </au>
               <au>
                  <snm>Beese</snm>
                  <fnm>LS</fnm>
               </au>
            </aug>
            <source>J Lipid Res</source>
            <pubdate>2006</pubdate>
            <volume>47</volume>
            <issue>4</issue>
            <fpage>681</fpage>
            <lpage>699</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1194/jlr.R600002-JLR200</pubid>
                  <pubid idtype="pmpid" link="fulltext">16477080</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Thematic review series: lipid posttranslational modifications. geranylgeranylation of Rab GTPases</p>
            </title>
            <aug>
               <au>
                  <snm>Leung</snm>
                  <fnm>KF</fnm>
               </au>
               <au>
                  <snm>Baron</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Seabra</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>J Lipid Res</source>
            <pubdate>2006</pubdate>
            <volume>47</volume>
            <issue>3</issue>
            <fpage>467</fpage>
            <lpage>475</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1194/jlr.R500017-JLR200</pubid>
                  <pubid idtype="pmpid" link="fulltext">16401880</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Isoprenylated proteins</p>
            </title>
            <aug>
               <au>
                  <snm>McTaggart</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>Cell Mol Life Sci</source>
            <pubdate>2006</pubdate>
            <volume>63</volume>
            <issue>3</issue>
            <fpage>255</fpage>
            <lpage>267</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00018-005-5298-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">16378247</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Prenylation of Rab GTPases: molecular mechanisms and involvement in genetic disease</p>
            </title>
            <aug>
               <au>
                  <snm>Pereira-Leal</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Hume</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Seabra</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>FEBS Lett</source>
            <pubdate>2001</pubdate>
            <volume>498</volume>
            <issue>2-3</issue>
            <fpage>197</fpage>
            <lpage>200</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0014-5793(01)02483-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">11412856</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Bet2p and Mad2p are components of a prenyltransferase that adds geranylgeranyl onto Ypt1p and Sec4p</p>
            </title>
            <aug>
               <au>
                  <snm>Jiang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Rossi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ferro-Novick</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1993</pubdate>
            <volume>366</volume>
            <issue>6450</issue>
            <fpage>84</fpage>
            <lpage>86</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/366084a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8232542</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Identification of yeast component A: reconstitution of the geranylgeranyltransferase that modifies Ypt1p and Sec4p</p>
            </title>
            <aug>
               <au>
                  <snm>Jiang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ferro-Novick</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1994</pubdate>
            <volume>91</volume>
            <issue>10</issue>
            <fpage>4377</fpage>
            <lpage>4381</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">43788</pubid>
                  <pubid idtype="pmpid" link="fulltext">8183917</pubid>
                  <pubid idtype="doi">10.1073/pnas.91.10.4377</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>RAM2, an essential gene of yeast, and RAM1 encode the two polypeptide components of the farnesyltransferase that prenylates a-factor and Ras proteins</p>
            </title>
            <aug>
               <au>
                  <snm>He</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Vancura</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Michaelis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Powers</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1991</pubdate>
            <volume>88</volume>
            <issue>24</issue>
            <fpage>11373</fpage>
            <lpage>11377</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">53137</pubid>
                  <pubid idtype="pmpid" link="fulltext">1763050</pubid>
                  <pubid idtype="doi">10.1073/pnas.88.24.11373</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Structure of mammalian protein geranylgeranyltransferase type-I</p>
            </title>
            <aug>
               <au>
                  <snm>Taylor</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Reid</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Terry</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Casey</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Beese</snm>
                  <fnm>LS</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>2003</pubdate>
            <volume>22</volume>
            <issue>22</issue>
            <fpage>5963</fpage>
            <lpage>5974</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">275430</pubid>
                  <pubid idtype="pmpid" link="fulltext">14609943</pubid>
                  <pubid idtype="doi">10.1093/emboj/cdg571</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Crystal structure of protein farnesyltransferase at 2.25 angstrom resolution</p>
            </title>
            <aug>
               <au>
                  <snm>Park</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Boduluri</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Moomaw</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Casey</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Beese</snm>
                  <fnm>LS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1997</pubdate>
            <volume>275</volume>
            <issue>5307</issue>
            <fpage>1800</fpage>
            <lpage>1804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.275.5307.1800</pubid>
                  <pubid idtype="pmpid" link="fulltext">9065406</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Crystal structure of Rab geranylgeranyltransferase at 2.0 A resolution</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Seabra</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Deisenhofer</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Structure</source>
            <pubdate>2000</pubdate>
            <volume>8</volume>
            <issue>3</issue>
            <fpage>241</fpage>
            <lpage>251</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0969-2126(00)00102-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">10745007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Organization of the Rab-GDI/CHM superfamily: the functional basis for choroideremia disease</p>
            </title>
            <aug>
               <au>
                  <snm>Alory</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Balch</snm>
                  <fnm>WE</fnm>
               </au>
            </aug>
            <source>Traffic</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <issue>8</issue>
            <fpage>532</fpage>
            <lpage>543</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1034/j.1600-0854.2001.20803.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11489211</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Structural insights into the function of the Rab GDI superfamily</p>
            </title>
            <aug>
               <au>
                  <snm>Wu</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Balch</snm>
                  <fnm>WE</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1996</pubdate>
            <volume>21</volume>
            <issue>12</issue>
            <fpage>472</fpage>
            <lpage>476</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(96)10062-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">9009830</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>The alpha-subunit of protein prenyltransferases is a member of the tetratricopeptide repeat family</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>1999</pubdate>
            <volume>8</volume>
            <issue>8</issue>
            <fpage>1658</fpage>
            <lpage>1667</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10452610</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Novel specificities emerge by stepwise duplication of functional modules</p>
            </title>
            <aug>
               <au>
                  <snm>Pereira-Leal</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Teichmann</snm>
                  <fnm>SA</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <issue>4</issue>
            <fpage>552</fpage>
            <lpage>559</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1074369</pubid>
                  <pubid idtype="pmpid" link="fulltext">15805495</pubid>
                  <pubid idtype="doi">10.1101/gr.3102105</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Evolution of protein complexes by duplication of homomeric interactions</p>
            </title>
            <aug>
               <au>
                  <snm>Pereira-Leal</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Levy</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>Kamp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Teichmann</snm>
                  <fnm>SA</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <issue>4</issue>
            <fpage>R51</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1895999</pubid>
                  <pubid idtype="pmpid" link="fulltext">17411433</pubid>
                  <pubid idtype="doi">10.1186/gb-2007-8-4-r51</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Dosage sensitivity and the evolution of gene families in yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Papp</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Pal</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>424</volume>
            <issue>6945</issue>
            <fpage>194</fpage>
            <lpage>197</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01771</pubid>
                  <pubid idtype="pmpid" link="fulltext">12853957</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Detection of parallel functional modules by comparative analysis of genome sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Pellegrini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2005</pubdate>
            <volume>23</volume>
            <issue>2</issue>
            <fpage>253</fpage>
            <lpage>260</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1065</pubid>
                  <pubid idtype="pmpid" link="fulltext">15696156</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>The relationship between domain duplication and recombination</p>
            </title>
            <aug>
               <au>
                  <snm>Vogel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Teichmann</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Pereira-Leal</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>346</volume>
            <issue>1</issue>
            <fpage>355</fpage>
            <lpage>365</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2004.11.050</pubid>
                  <pubid idtype="pmpid" link="fulltext">15663950</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>SCOP: a structural classification of proteins database for the investigation of sequences and structures</p>
            </title>
            <aug>
               <au>
                  <snm>Murzin</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Chothia</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1995</pubdate>
            <volume>247</volume>
            <issue>4</issue>
            <fpage>536</fpage>
            <lpage>540</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1995.0159</pubid>
                  <pubid idtype="pmpid" link="fulltext">7723011</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Relative rates of gene fusion and fission in multi-domain proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Kummerfeld</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Teichmann</snm>
                  <fnm>SA</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>1</issue>
            <fpage>25</fpage>
            <lpage>30</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2004.11.007</pubid>
                  <pubid idtype="pmpid" link="fulltext">15680510</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Domain insertion</p>
            </title>
            <aug>
               <au>
                  <snm>Russell</snm>
                  <fnm>RB</fnm>
               </au>
            </aug>
            <source>Protein Eng</source>
            <pubdate>1994</pubdate>
            <volume>7</volume>
            <issue>12</issue>
            <fpage>1407</fpage>
            <lpage>1410</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/protein/7.12.1407</pubid>
                  <pubid idtype="pmpid" link="fulltext">7716150</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Domain insertions in protein structures</p>
            </title>
            <aug>
               <au>
                  <snm>Aroul-Selvam</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sasidharan</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>338</volume>
            <issue>4</issue>
            <fpage>633</fpage>
            <lpage>641</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2004.03.039</pubid>
                  <pubid idtype="pmpid" link="fulltext">15099733</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Domain deletions and substitutions in the modular protein evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Weiner</snm>
                  <fnm>J</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Beaussart</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Bornberg-Bauer</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Febs J</source>
            <pubdate>2006</pubdate>
            <volume>273</volume>
            <issue>9</issue>
            <fpage>2037</fpage>
            <lpage>2047</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1742-4658.2006.05220.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16640566</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Evolution of circular permutations in multidomain proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Weiner</snm>
                  <fnm>J</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Bornberg-Bauer</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2006</pubdate>
            <volume>23</volume>
            <issue>4</issue>
            <fpage>734</fpage>
            <lpage>743</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msj091</pubid>
                  <pubid idtype="pmpid" link="fulltext">16431849</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Sequence permutations in the molecular evolution of DNA methyltransferases</p>
            </title>
            <aug>
               <au>
                  <snm>Bujnicki</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2002</pubdate>
            <volume>2</volume>
            <fpage>3</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102321</pubid>
                  <pubid idtype="pmpid" link="fulltext">11914127</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-2-3</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>The deep roots of eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Baldauf</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>300</volume>
            <issue>5626</issue>
            <fpage>1703</fpage>
            <lpage>1706</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1085544</pubid>
                  <pubid idtype="pmpid" link="fulltext">12805537</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Structure of Rab escort protein-1 in complex with Rab geranylgeranyltransferase</p>
            </title>
            <aug>
               <au>
                  <snm>Pylypenko</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Rak</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Reents</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Niculae</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sidorovitch</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Cioaca</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Bessolitsyna</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Thoma</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Waldmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Schlichting</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Goody</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Alexandrov</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2003</pubdate>
            <volume>11</volume>
            <issue>2</issue>
            <fpage>483</fpage>
            <lpage>494</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1097-2765(03)00044-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">12620235</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Pearl</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Todd</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sillitoe</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Dibley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Redfern</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bennett</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marsden</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Grant</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Akpor</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Maibaum</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Harrison</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dallman</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Reeves</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Diboun</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Addou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lise</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sillero</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Thornton</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Orengo</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>Database issue</issue>
            <fpage>D247</fpage>
            <lpage>51</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">539978</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608188</pubid>
                  <pubid idtype="doi">10.1093/nar/gki024</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>SCOP database in 2004: refinements integrate structure and sequence family data</p>
            </title>
            <aug>
               <au>
                  <snm>Andreeva</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Howorth</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Chothia</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Murzin</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database issue</issue>
            <fpage>D226</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308773</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681400</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh039</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>The C2 domain calcium-binding motif: structural and functional diversity</p>
            </title>
            <aug>
               <au>
                  <snm>Nalefski</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Falke</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>1996</pubdate>
            <volume>5</volume>
            <issue>12</issue>
            <fpage>2375</fpage>
            <lpage>2390</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8976547</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Evolutionarily conserved multiple C2 domain proteins with two transmembrane regions (MCTPs) and unusual Ca2+ binding properties</p>
            </title>
            <aug>
               <au>
                  <snm>Shin</snm>
                  <fnm>OH</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sudhof</snm>
                  <fnm>TC</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2005</pubdate>
            <volume>280</volume>
            <issue>2</issue>
            <fpage>1641</fpage>
            <lpage>1651</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M407305200</pubid>
                  <pubid idtype="pmpid" link="fulltext">15528213</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>The C2 domain of PKCdelta is a phosphotyrosine binding domain</p>
            </title>
            <aug>
               <au>
                  <snm>Benes</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Elia</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Dharia</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cantley</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Soltoff</snm>
                  <fnm>SP</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2005</pubdate>
            <volume>121</volume>
            <issue>2</issue>
            <fpage>271</fpage>
            <lpage>280</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2005.02.019</pubid>
                  <pubid idtype="pmpid" link="fulltext">15851033</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>17</issue>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>The Pfam protein families database</p>
            </title>
            <aug>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Coin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Finn</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Hollich</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Khanna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Moxon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Studholme</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Yeats</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database issue</issue>
            <fpage>D138</fpage>
            <lpage>41</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308855</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681378</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The SUPERFAMILY database in 2007: families and functions</p>
            </title>
            <aug>
               <au>
                  <snm>Wilson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Madera</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vogel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Chothia</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gough</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <issue>Database issue</issue>
            <fpage>D308</fpage>
            <lpage>13</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1669749</pubid>
                  <pubid idtype="pmpid" link="fulltext">17098927</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl910</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>SUPERFAMILY: HMMs representing all proteins of known structure. SCOP sequence searches, alignments and genome assignments</p>
            </title>
            <aug>
               <au>
                  <snm>Gough</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chothia</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>1</issue>
            <fpage>268</fpage>
            <lpage>272</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99153</pubid>
                  <pubid idtype="pmpid" link="fulltext">11752312</pubid>
                  <pubid idtype="doi">10.1093/nar/30.1.268</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>JPred: a consensus secondary structure prediction server</p>
            </title>
            <aug>
               <au>
                  <snm>Cuff</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Siddiqui</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Finlay</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Barton</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <issue>10</issue>
            <fpage>892</fpage>
            <lpage>893</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.10.892</pubid>
                  <pubid idtype="pmpid" link="fulltext">9927721</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Application of multiple sequence alignment profiles to improve protein secondary structure prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Cuff</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Barton</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2000</pubdate>
            <volume>40</volume>
            <issue>3</issue>
            <fpage>502</fpage>
            <lpage>511</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/1097-0134(20000815)40:3&lt;502::AID-PROT170>3.0.CO;2-Q</pubid>
                  <pubid idtype="pmpid" link="fulltext">10861942</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Protein repeats: structures, functions, and evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Andrade</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Perez-Iratxeta</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
            </aug>
            <source>J Struct Biol</source>
            <pubdate>2001</pubdate>
            <volume>134</volume>
            <issue>2-3</issue>
            <fpage>117</fpage>
            <lpage>131</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jsbi.2001.4392</pubid>
                  <pubid idtype="pmpid" link="fulltext">11551174</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>The leucine-rich repeat as a protein recognition motif</p>
            </title>
            <aug>
               <au>
                  <snm>Kobe</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kajava</snm>
                  <fnm>AV</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <issue>6</issue>
            <fpage>725</fpage>
            <lpage>732</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-440X(01)00266-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">11751054</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Convergent evolution of domain architectures (is rare)</p>
            </title>
            <aug>
               <au>
                  <snm>Gough</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>8</issue>
            <fpage>1464</fpage>
            <lpage>1471</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti204</pubid>
                  <pubid idtype="pmpid" link="fulltext">15585523</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Structure and function of a squalene cyclase</p>
            </title>
            <aug>
               <au>
                  <snm>Wendt</snm>
                  <fnm>KU</fnm>
               </au>
               <au>
                  <snm>Poralla</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Schulz</snm>
                  <fnm>GE</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1997</pubdate>
            <volume>277</volume>
            <issue>5333</issue>
            <fpage>1811</fpage>
            <lpage>1815</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.277.5333.1811</pubid>
                  <pubid idtype="pmpid" link="fulltext">9295270</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Molecular basis for Rab prenylation</p>
            </title>
            <aug>
               <au>
                  <snm>Alory</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Balch</snm>
                  <fnm>WE</fnm>
               </au>
            </aug>
            <source>J Cell Biol</source>
            <pubdate>2000</pubdate>
            <volume>150</volume>
            <issue>1</issue>
            <fpage>89</fpage>
            <lpage>103</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1083/jcb.150.1.89</pubid>
                  <pubid idtype="pmpid" link="fulltext">10893259</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Natively unfolded proteins: a point where biology waits for physics</p>
            </title>
            <aug>
               <au>
                  <snm>Uversky</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>2002</pubdate>
            <volume>11</volume>
            <issue>4</issue>
            <fpage>739</fpage>
            <lpage>756</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1110/ps.4210102</pubid>
                  <pubid idtype="pmpid" link="fulltext">11910019</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Intrinsically unstructured proteins and their functions</p>
            </title>
            <aug>
               <au>
                  <snm>Dyson</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Wright</snm>
                  <fnm>PE</fnm>
               </au>
            </aug>
            <source>Nat Rev Mol Cell Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>3</issue>
            <fpage>197</fpage>
            <lpage>208</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrm1589</pubid>
                  <pubid idtype="pmpid" link="fulltext">15738986</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions</p>
            </title>
            <aug>
               <au>
                  <snm>Xie</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Vucetic</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Iakoucheva</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Oldfield</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Dunker</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Obradovic</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Uversky</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Journal of Proteome Reseach</source>
            <pubdate>2007</pubdate>
            <inpress/>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Functional anthology of intrinsic disorder. 3. Ligands, Post-Translational modifications,and diseases associated with intrinsically disordered proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Xie</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Vucetic</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Iakoucheva</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Oldfield</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Dunker</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Obradovic</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Uversky</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Journal of Proteome Research</source>
            <pubdate>2007</pubdate>
            <inpress/>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Assessing protein disorder and induced folding</p>
            </title>
            <aug>
               <au>
                  <snm>Receveur-Brechot</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Bourhis</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Uversky</snm>
                  <fnm>VN</fnm>
               </au>
               <au>
                  <snm>Canard</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Longhi</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2006</pubdate>
            <volume>62</volume>
            <issue>1</issue>
            <fpage>24</fpage>
            <lpage>45</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.20750</pubid>
                  <pubid idtype="pmpid" link="fulltext">16287116</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>GlobPlot: Exploring protein sequences for globularity and disorder</p>
            </title>
            <aug>
               <au>
                  <snm>Linding</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Russell</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Neduva</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>13</issue>
            <fpage>3701</fpage>
            <lpage>3708</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">169197</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824398</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg519</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Adaptins: the final recount</p>
            </title>
            <aug>
               <au>
                  <snm>Boehm</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bonifacino</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Mol Biol Cell</source>
            <pubdate>2001</pubdate>
            <volume>12</volume>
            <issue>10</issue>
            <fpage>2907</fpage>
            <lpage>2920</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">60144</pubid>
                  <pubid idtype="pmpid" link="fulltext">11598180</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>The age of the Arabidopsis thaliana genome duplication</p>
            </title>
            <aug>
               <au>
                  <snm>Ermolaeva</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>2003</pubdate>
            <volume>51</volume>
            <issue>6</issue>
            <fpage>859</fpage>
            <lpage>866</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1023001130337</pubid>
                  <pubid idtype="pmpid" link="fulltext">12777046</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Selection for more of the same product as a force to enhance concerted evolution of duplicated genes</p>
            </title>
            <aug>
               <au>
                  <snm>Sugino</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Innan</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>12</issue>
            <fpage>642</fpage>
            <lpage>644</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2006.09.014</pubid>
                  <pubid idtype="pmpid" link="fulltext">17045359</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Evolution of the Rab family of small GTP-binding proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Pereira-Leal</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Seabra</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>313</volume>
            <issue>4</issue>
            <fpage>889</fpage>
            <lpage>901</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2001.5072</pubid>
                  <pubid idtype="pmpid" link="fulltext">11697911</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>The Arabidopsis Rab GTPase family: another enigma variation</p>
            </title>
            <aug>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Curr Opin Plant Biol</source>
            <pubdate>2002</pubdate>
            <volume>5</volume>
            <issue>6</issue>
            <fpage>518</fpage>
            <lpage>528</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1369-5266(02)00307-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">12393015</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Multiple factors contribute to inefficient prenylation of Rab27a in Rab prenylation diseases</p>
            </title>
            <aug>
               <au>
                  <snm>Larijani</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hume</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Tarafder</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Seabra</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2003</pubdate>
            <volume>278</volume>
            <issue>47</issue>
            <fpage>46798</fpage>
            <lpage>46804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M307799200</pubid>
                  <pubid idtype="pmpid" link="fulltext">12941939</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Comparative and evolutionary analysis of genes encoding small GTPases and their activating proteins in eukaryotic genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Jiang</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Ramachandran</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Physiol Genomics</source>
            <pubdate>2006</pubdate>
            <volume>24</volume>
            <issue>3</issue>
            <fpage>235</fpage>
            <lpage>251</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16332933</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Nonlinear effects in macromolecular assembly and dosage sensitivity</p>
            </title>
            <aug>
               <au>
                  <snm>Veitia</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>J Theor Biol</source>
            <pubdate>2003</pubdate>
            <volume>220</volume>
            <issue>1</issue>
            <fpage>19</fpage>
            <lpage>25</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jtbi.2003.3105</pubid>
                  <pubid idtype="pmpid" link="fulltext">12453447</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <issue>22</issue>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308517</pubid>
                  <pubid idtype="pmpid" link="fulltext">7984417</pubid>
                  <pubid idtype="doi">10.1093/nar/22.22.4673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Identification of common molecular subsequences</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>TF</fnm>
               </au>
               <au>
                  <snm>Waterman</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1981</pubdate>
            <volume>147</volume>
            <issue>1</issue>
            <fpage>195</fpage>
            <lpage>197</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0022-2836(81)90087-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">7265238</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>EMBOSS: the European Molecular Biology Open Software Suite</p>
            </title>
            <aug>
               <au>
                  <snm>Rice</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Longden</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bleasby</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>6</issue>
            <fpage>276</fpage>
            <lpage>277</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(00)02024-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">10827456</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Amino acid substitution matrices from protein blocks</p>
            </title>
            <aug>
               <au>
                  <snm>Henikoff</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Henikoff</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1992</pubdate>
            <volume>89</volume>
            <issue>22</issue>
            <fpage>10915</fpage>
            <lpage>10919</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">50453</pubid>
                  <pubid idtype="pmpid" link="fulltext">1438297</pubid>
                  <pubid idtype="doi">10.1073/pnas.89.22.10915</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>The Jalview Java alignment editor</p>
            </title>
            <aug>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cuff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Searle</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Barton</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>3</issue>
            <fpage>426</fpage>
            <lpage>427</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg430</pubid>
                  <pubid idtype="pmpid" link="fulltext">14960472</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>The SUPERFAMILY database in 2004: additions and improvements</p>
            </title>
            <aug>
               <au>
                  <snm>Madera</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vogel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kummerfeld</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Chothia</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gough</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database issue</issue>
            <fpage>D235</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308851</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681402</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh117</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>iPfam: visualization of protein-protein interactions in PDB at domain and amino acid resolutions</p>
            </title>
            <aug>
               <au>
                  <snm>Finn</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>3</issue>
            <fpage>410</fpage>
            <lpage>412</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti011</pubid>
                  <pubid idtype="pmpid" link="fulltext">15353450</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>PHYLIP - Phylogeny Inference Package (Version 3.2)</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Cladistics</source>
            <pubdate>1989</pubdate>
            <volume>5</volume>
            <fpage>164</fpage>
            <lpage>166</lpage>
         </bibl>
         <bibl id="B68">
            <title>
               <p>MRBAYES: Bayesian inference of phylogenetic trees</p>
            </title>
            <aug>
               <au>
                  <snm>Huelsenbeck</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Ronquist</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <issue>8</issue>
            <fpage>754</fpage>
            <lpage>755</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/17.8.754</pubid>
                  <pubid idtype="pmpid" link="fulltext">11524383</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>MrBayes 3: Bayesian phylogenetic inference under mixed models</p>
            </title>
            <aug>
               <au>
                  <snm>Ronquist</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Huelsenbeck</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>12</issue>
            <fpage>1572</fpage>
            <lpage>1574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg180</pubid>
                  <pubid idtype="pmpid" link="fulltext">12912839</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>FigTree</p>
            </title>
            <url>http://evolve.zoo.ox.ac.uk/software.html?id=figtree</url>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Computational Genomics Laboratory </p>
            </title>
            <url>http://eao.igc.gulbenkian.pt/CGL/</url>
         </bibl>
         <bibl id="B72">
            <title>
               <p>Structure and mutational analysis of Rab GDP-dissociation inhibitor</p>
            </title>
            <aug>
               <au>
                  <snm>Schalk</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Stura</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Matteson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tandon</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Balch</snm>
                  <fnm>WE</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1996</pubdate>
            <volume>381</volume>
            <issue>6577</issue>
            <fpage>42</fpage>
            <lpage>48</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/381042a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8609986</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Database resources of the National Center for Biotechnology Information</p>
            </title>
            <aug>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Chappey</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lash</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Leipe</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schuler</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Rapp</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <issue>1</issue>
            <fpage>10</fpage>
            <lpage>14</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102437</pubid>
                  <pubid idtype="pmpid" link="fulltext">10592169</pubid>
                  <pubid idtype="doi">10.1093/nar/28.1.10</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
