<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1745-6150-2-4</ui>
   <ji>1745-6150</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>A novel superfamily containing the &#946;-grasp fold involved in binding diverse soluble ligands</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Burroughs</snm>
               <fnm>A Maxwell</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>burrough@ncbi.nlm.nih.gov</email>
            </au>
            <au id="A2">
               <snm>Balaji</snm>
               <fnm>S</fnm>
               <insr iid="I1"/>
               <email>sbalaji@ncbi.nlm.nih.gov</email>
            </au>
            <au id="A3">
               <snm>Iyer</snm>
               <mi>M</mi>
               <fnm>Lakshminarayan</fnm>
               <insr iid="I1"/>
               <email>lakshmin@ncbi.nlm.nih.gov</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Aravind</snm>
               <fnm>L</fnm>
               <insr iid="I1"/>
               <email>aravind@ncbi.nlm.nih.gov</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA</p>
            </ins>
            <ins id="I2">
               <p>Bioinformatics Program, Boston University, Boston, MA 02215, USA</p>
            </ins>
         </insg>
         <source>Biology Direct</source>
         <issn>1745-6150</issn>
         <pubdate>2007</pubdate>
         <volume>2</volume>
         <issue>1</issue>
         <fpage>4</fpage>
         <url>http://www.biology-direct.com/content/2/1/4</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17250770</pubid>
               <pubid idtype="doi">10.1186/1745-6150-2-4</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>05</day>
               <month>1</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>24</day>
               <month>1</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>24</day>
               <month>1</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Burroughs et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Domains containing the &#946;-grasp fold are utilized in a great diversity of physiological functions but their role, if any, in soluble or small molecule ligand recognition is poorly studied.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Using sensitive sequence and structure similarity searches we identify a novel superfamily containing the &#946;-grasp fold. They are found in a diverse set of proteins that include the animal vitamin B12 uptake proteins transcobalamin and intrinsic factor, the bacterial polysaccharide export proteins, the competence DNA receptor ComEA, the cob(I)alamin generating enzyme PduS and the Nqo1 subunit of the respiratory electron transport chain. We present evidence that members of this superfamily are likely to bind a range of soluble ligands, including B12. There are two major clades within this superfamily, namely the transcobalamin-like clade and the Nqo1-like clade. The former clade is typified by an insert of a &#946;-hairpin after the helix of the &#946;-grasp fold, whereas the latter clade is characterized by an insert between strands 4 and 5 of the core fold.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Members of both clades within this superfamily are predicted to interact with ligands in a similar spatial location, with their specific inserts playing a role in the process. Both clades are widely represented in bacteria suggesting that this superfamily was derived early in bacterial evolution. The animal lineage appears to have acquired the transcobalamin-like proteins from low GC Gram-positive bacteria, and this might be correlated with the emergence of the ability to utilize B12 produced by gut bacteria.</p>
            </sec>
            <sec>
               <st>
                  <p>Reviewers</p>
               </st>
               <p>This article was reviewed by Andrei Osterman, Igor Zhulin, and Arcady Mushegian.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The &#946;-grasp fold (&#946;-GF) was first recognized in ubiquitin and the immunoglobulin-binding (IG-binding) domains of Gram-positive cocci <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Since then it has come to be known as a widespread fold, utilized in proteins performing a great diversity of cellular functions. These include regulation of protein stability and signal transduction through the ubiquitin-conjugation system <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, RNA-protein interactions as seen in the TGS domain of tRNA synthetases <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, and adaptor functions involving protein-protein interactions as seen in the FERM module <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Additionally, standalone &#946;-GF domain proteins ThiS/MoaD function as sulfur carriers in molybdopterin and thiamine biosynthesis <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> and the fold also provides an effective scaffold for binding iron-sulfur clusters in the case of the 2Fe-2S ferrodoxins involved in electron transport (see SCOP database <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>).</p>
         <p>As part of our larger effort to understand the evolutionary and structural basis for the functional versatility of this widespread fold (for example see reference <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>) we were keen to determine if there were as yet uncharacterized representatives that might widen the functional horizon of the &#946;-GF. In particular, we were interested in exploring the possibility of versions of the &#946;-GF domains binding soluble ligands. Such a function was of interest because the presence of 2Fe-2S ferredoxins suggested that the &#946;-GF domains could potentially provide a scaffold for binding a wider range of small molecules or other prosthetic groups. We accordingly investigated this further by applying a combination of sensitive structural comparisons and sequence profile analysis on members of the &#946;-GF. As a result, we identify a novel domain superfamily with the &#946;-GF fold and provide support that its members might be involved in binding different soluble ligands. We also study their genomic contexts, domain architectures and phyletic patterns to present evidence for their role in diverse metabolic networks, including those related to vitamin B12.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Detection of sequence and structure relationships</p>
            </st>
            <p>To identify potential novel versions of the &#946;-GF that bind soluble ligands, we initiated comprehensive structural comparisons with various previously characterized members of the fold (see &#946;-grasp fold in SCOP database) using the DALI program. Several of these searches retrieved the C-terminal domain of the transcobalamin protein with significant Z-scores. For example, searches initiated with MoaD proteins (PDB: 1v8c, 1vjk) retrieved the C-terminal domain of transcobalamin (PDB: 2bbc) with Z-scores of ~7. Transcobalamin is an animal-specific protein that binds cobalamin (vitamin B12), and is involved in its uptake by animal cells <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Transcobalamin contains an N-terminal &#945;/&#945; toroid domain, and a C-terminal &#945;/&#946; domain <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> that corresponded to the &#946;-GF domain recovered in the above searches. Further, DALI searches initiated with the C-terminal domain of transcobalamin recovered a diverse set of previously known &#946;-GF domains such as MoaD (PDB:1vjk), YukD (PDB: 2bps) 2Fe-2S ferredoxin (PDB:1feh) and the middle domain of the Nqo1 subunit of the bacterial and mitochondrial NADPH-quinone oxidoreductase I complex (PDB: 2fug. S chain) with Z-scores in the range of 5&#8211;7. The structural alignments generated by these searches showed that the transcobalamin C-terminal domain aligns completely with all core structural elements of the &#946;-GF, including a &#946;-sheet of 5 strands and a helix between strands 2 and 3. However, the transcobalamin C-terminal domains are distinguished by the presence of a unique &#946;-hairpin after the conserved helix of the &#946;-GF (Figure <figr fid="F1">1A</figr>). The N-terminal &#945;/&#945; toroid domain and the C-terminal &#946;-GF domain cooperate in ligand-binding by sandwiching a single B12 molecule between them <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Systematic searches for contacts between the B12 ligand and the C-terminal &#946;-GF domain in transcobalamin showed that the unique insert plays a prominent role in binding the ligand by contributing several direct or solvent-mediated interactions <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Additional contacts with the ligand are also made by residues from the core &#946;-GF such as those from strand 3, the end of strand 4 and the "ascending connector" between strand 4 and 5 (Figure <figr fid="F1">1A</figr>). These observations suggest that the C-terminal domain of transcobalamin represents a novel adaptation of the &#946;-GF for small-molecule ligand interactions.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Topology diagram of SLBB domain and multiple alignment of the SLBB superfamily</p>
               </caption>
               <text>
                  <p><b>Topology diagram of SLBB domain and multiple alignment of the SLBB superfamily</b>. (A) The five-stranded core (characteristic of all members of the &#946;G-F) is shown with the helical face at the near side. &#946;-strands are depicted as blue arrows, with the arrowhead at the C-terminus while the &#945;-helix is shown in red. The two-strand insertion in the Transcobalamin-like clade is colored in yellow and enclosed in a dotted box. The insertion point for the Nqo1-like clade is marked by a red box. The approximate soluble ligand-binding spatial region is marked by a green oval. Residues known to contribute to cobalamin binding as derived from the crystal structure of Transcobalamin are shown as small circles. Orange circles indicate sidechain-mediated interactions while greenish blue circles indicate backbone or backbone and sidechain-mediated interactions. The conservation of an aromatic residue in Transcobalamin proteins is represented by a phenylalanine residue, rendered as a line drawing and colored purple. (B) Proteins are denoted by their gene names, species abbreviations, and gi numbers; demarcated by underscores. Amino acid residues are colored according to sidechain properties and degree of conservation within the alignment, set at 80% consensus. Consensus abbreviations are shown below the alignment. The secondary structure shown above the alignment is derived from the crystal structures of Transcobalamin and Nqo1 and secondary structure prediction programs. E and H denote &#946;-strand and &#945;-helix, respectively. Secondary structure elements conserved across the SLBB superfamily are labeled in the top line of the alignment. "Insert #1" refers to the Transcobalamin-like clade insert while "Insert #2" refers to the Nqo1-like clade insert. "asc" refers to the ascending connector between strands 4 and 5 often observed in the &#946;-grasp fold. The consensus abbreviations and coloring scheme are as follows: h, hydrophobic residues (ACFILMVWY) shaded yellow; s, small residues (AGSVCDN) colored blue; p, polar residues (STEDKRNQHC) colored purple; and b, big residues (LIYERFQKMW) shaded gray. The conserved glycine residues characteristic of this superfamily are shaded light green and colored white. Species abbreviations are as follows: Aae: <it>Aquifex aeolicus</it>; Amel: <it>Apis mellifera</it>; Ana: <it>Nostoc </it>sp.; Bcer: <it>Bacillus cereus</it>; Bmar: <it>Blastopirellula marina</it>; Bthu: <it>Bacillus thuringiensis</it>; Cglu: <it>Corynebacterium glutamicum</it>; Ctet: <it>Clostridium tetani</it>; Dhaf: <it>Desulfitobacterium hafniense</it>; Dmel: <it>Drosophila melanogaster</it>; Ecol: <it>Escherichia coli</it>; Hinf: <it>Haemophilus influenzae</it>; Hsap: <it>Homo sapiens</it>; Krad: <it>Kineococcus radiotolerans</it>; Lint: <it>Leptospira interrogans</it>; Lmon: <it>Listeria monocytogenes</it>; Lreu: <it>Lactobacillus reuteri</it>; Lsak: <it>Lactobacillus sakei</it>; Mace: <it>Methanosarcina acetivorans</it>; Mbur: <it>Methanococcoides burtonii</it>; Moth: <it>Moorella thermoacetica</it>; Nham: <it>Nitrobacter hamburgensis</it>; Oihe: <it>Oceanobacillus iheyensis</it>; Ppro: <it>Photobacterium profundum</it>; Psyr: <it>Pseudomonas syringae</it>; Rbal: <it>Rhodopirellula baltica</it>; Sent: <it>Salmonella enterica</it>; Sepi: <it>Staphylococcus epidermidis</it>; Sfum: <it>Syntrophobacter fumaroxidans</it>; Spyo: <it>Streptococcus pyogenes</it>; Sthe: <it>Streptococcus thermophilus</it>; Styp: <it>Salmonella typhimurium</it>; Susi: <it>Solibacter usitatus</it>; Syn: <it>Synechococcus </it>sp.; Tmar: <it>Thermotoga maritima</it>; Tnig: <it>Tetraodon nigroviridis</it>; and Tthe: <it>Thermus thermophilus</it>.</p>
               </text>
               <graphic file="1745-6150-2-4-1"/>
            </fig>
            <p>To better understand the diversity of this class of ligand-binding &#946;-GF domains and their phyletic spread we initiated sequence profile and hidden Markov model (HMM) searches for homologs using PSI-BLAST and the HMMER package respectively. In addition to orthologs of transcobalamin, intrinsic factor and solo C-terminal domains from fishes, these searches retrieved numerous prokaryotic proteins, which were either present as stand-alone &#946;-GF domains or in large multidomain proteins. For example, a search initiated with the &#946;-GF domain of puffer fish transcobalamin (<it>Tetraodon nigroviridus</it>, gi: 47226456, region: 325&#8211;425) recovered closely related eukaryotic orthologs and paralogs fused to N-terminal &#945;/&#945; toroid modules (iteration 1), solo transcobalamin C-terminal domains with predicted signal peptides (e.g. XP_689937, <it>Danio rerio</it>, iteration 2, e-value: 3 &#215; 10-12), and several prokaryotic proteins (e.g. BAC13773, <it>Oceanobacterium iheyensis</it>, iteration 3, e-value: 2 &#215; 10-3). In order to exhaustively recover all divergent homologs, we conducted transitive searches with all above-detected members and also evaluated all hits below the threshold of PSI-BLAST searches for the presence of potentially homologous domains. We also prepared HMMs and PSSMs from the alignment of this region of all proteins recovered with significant expect-values (e &lt; .01 with statistical correction for compositional bias) and used these to search all completely sequenced genomes. These searches consistently retrieved hits to multiple sequence repeats in a group of bacterial cell-surface/secreted sugar-binding proteins involved in polysaccharide export with significant e-values (e.g. <it>Hahella </it>periplasmic protein, HCH_02380 residues 852&#8211;990). Inclusion of polysaccharide export proteins in profiles for further searches additionally recovered the N-terminal region of the ComEA family of DNA uptake receptors of Gram positive bacteria (e.g. <it>Clostridium </it>ComEA, gi: 67874543, iteration 2, e = 10-6), PduS-like cobalamin reductases (e.g. <it>E. coli </it>cobalamin reductase iteration 7, e = 10-3), the middle domain of the 51 kD subunit (F chain) of the NADPH-quinone oxidoreductase complex I (Nqo1, E: 10-4, iteration 11) and the RnfC subunit of the oxidoreductases encoded by the bacterial Rnf operons <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>(<it>Rhodobacter </it>RnfC, E: 10-6; iteration 14). This latter set of proteins was more similar to the homologous region recovered in the polysaccharide export proteins than to the transcobalamin C-terminal domain (Figure <figr fid="F1">1B</figr>). However, recovery of the middle domain of Nqo1 in sequence searches clearly confirms their relationship with transcobalamin C-terminal domains, because the former are also known, via structural analysis, to contain a similar &#946;-GF domain <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> (See above and <supplr sid="S1">Additional file 1</supplr>). This was additionally supported by separate secondary structure prediction for individual sub-groups with potentially homologous regions such as the ComEA N-terminal regions and the polysaccharide export proteins (Figure <figr fid="F1">1B</figr>).</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>Cartoon representations of SLBB domains</b>. File 1 contains cartoon representations of the transcobalamin and the Nqo1 middle domain showing the structural similarity and innovations associated with the potential binding site for soluble ligands.</p>
               </text>
               <file name="1745-6150-2-4-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>We hereinafter refer to the homologous &#946;-GF domains found in all these proteins as the <ul>S</ul>oluble-<ul>L</ul>igand-<ul>B</ul>inding <ul>&#946;</ul>-grasp (SLBB superfamily) as many members of this superfamily are known or predicted to bind soluble ligands (See below for further details).</p>
         </sec>
         <sec>
            <st>
               <p>Sequence and structure features of the SLBB superfamily</p>
            </st>
            <p>A comprehensive multiple alignment for the SLBB superfamily (Figure <figr fid="F1">1B</figr>) was prepared by combining alignments for individual groups constructed using the T-Coffee program, based on the structural superposition of transcobalamin C-terminal domain (2bbc) and Nqo1 middle domain (2fug; chain S). Much of the conservation seen across the entire superfamily is in the form of hydrophobic residues forming the stabilizing core of the fold. However, there was a notable sequence feature in the form of two strongly conserved glycine residues, one in the turn leading into the horizontal flange preceding the third &#946;-strand (Figure <figr fid="F1">1A</figr>) of the &#946;-GF and the other immediately downstream of the second conserved &#946;-strand (Figure <figr fid="F1">1</figr>). This conservation pattern is a unique feature of the SLBB superfamily that distinguishes them from all other previously characterized &#946;-GF domains, supporting a common ancestry for this set of domains within the &#946;-GF.</p>
            <p>The alignment also helped us to classify the SLBB superfamily into several distinct families. The Transcobalamin C-terminal domain clade is unified by the presence of the above-described &#946;-hairpin insert within the &#946;-GF that plays an important role in contacting the ligand (Fig <figr fid="F1">1A</figr>, see <supplr sid="S1">Additional file 1</supplr>). This &#946;-hairpin contains a conserved hydrophobic position that makes a stacking interaction with the aromatic ring of the base in cobalamin. However, the rest of the sequence in this region is poorly conserved as most other interactions occur through backbone oxygen or nitrogen atoms <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> (Figure <figr fid="F1">1B</figr>). Within animals, insects and most vertebrates have a single ortholog of the B12 binding protein, whereas the mammals have three distinct versions, transcobalamin I, transcobalamin II and the intrinsic factor. Besides animals, members of this clade are widely represented in Low GC Gram-positive bacteria and planctomycetes and less frequently in the euryarchaea.</p>
            <p>The Nqo1-like clade includes at least two distinct families: 1) the first includes the NADPH-quinone oxidoreductase complex I subunit Nqo1 (51 kD/F chain), the RnfC oxidoreductase subunit, and the PduS-like cobalamin reductases. 2) The second family contains polysaccharide export proteins and the DNA receptor ComEA. This clade is unified by the presence of a small, often &#945;-helical insert, in the "connector arm" between the fourth and fifth strands of the domain (Figure <figr fid="F1">1</figr>, see <supplr sid="S1">Additional file 1</supplr>). In some cases, such as the ComEA protein, the helical segment is followed by a low complexity region; suggesting the presence of a disordered, extended loop. These proteins are also characterized by an sGG motif (where 's' is any small residue) around the second conserved glycine of the superfamily (Figure <figr fid="F1">1B</figr>). The Nqo1 subunit of the classical NADPH-quinone oxidoreductase complex I is present in all major bacterial lineages with well-developed electron-transport chains, in most mitochondriate eukaryotic lineages, and very rarely in euryarchaea. The RnfC proteins are strongly conserved in &#947;-proteobacteria, but are also found in some representatives of Low GC Gram positive bacteria and the Bacteroidetes/Chlorobi assemblage. The PduS protein is restricted to the Low GC Gram-positive bacteria and certain &#947; and &#948; proteobacteria. The ComEA proteins and polysaccharide export proteins show a nearly mutually exclusive complementary distribution. The ComEA family is chiefly present in Low GC Gram-positive bacteria and actinobacteria, whereas the polysaccharide export family is more widespread and widely present in proteobacteria, cyanobacteria, acidobacteria, planctomycetes, bacteroidetes/chlorobi, and more sporadically in a few other groups.</p>
            <p>While the interaction between B12 and the transcobalamin-like SLBB domain involves the unique &#946;-hairpin insert, key contacts are also contributed by the core fold (See above, Figure <figr fid="F1">1A</figr>), and in general the position of the bound ligand is comparable to that of the bound metal-sulfur cluster in the &#946;-GF ferredoxins. The Nqo1-like clade shows its distinctive innovation in the region between strands 4 and 5, which also corresponds to the same general spatial location where the ligands are bound in the transcobalamin-like clade and &#946;-GF ferredoxins (Figure <figr fid="F1">1B</figr>). This indicates that the structural innovation specific to the Nqo1-like clade might also be involved in binding a ligand at a similar position. This spatial location might thus represent a common site for soluble ligand interactions in the &#946;-GF that is distinct from the C-terminal tail and the opposite protein surface that is key to the functional interaction of sulfur carriers like ThiS and MoaD and the ubiquitin-like proteins <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Contextual associations and inferences of possible functions for the SLBB</p>
            </st>
            <p>To investigate the functional diversification of the SLBB fold we used contextual analysis, which often provides insights into biochemical functions of poorly characterized protein domains or genes. Contextual analysis utilizes the information gleaned from the association of uncharacterized domains with other domains of known function and the tendency of genes whose products functionally interact to associate in conserved gene neighborhoods or predicted operons. <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>.</p>
            <p>Most members of the transcobalamin C-terminal domain clade of the SLBB superfamily contain signal peptides, and several also contain the C-terminal Gram-positive anchor motif <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, suggesting that they are secreted or cell-surface proteins. A common domain architecture in this clade encountered in both eukaryotes and bacteria is the fusion of the SLBB to an &#945;/&#945; toroid domain. In bacteria the toroid may be present either N-terminal (e.g. <it>Desulfotomaculum</it>, gi: 88945170) or C-terminal (e.g. <it>Bacillus</it>, gi: 42782379) to the SLBB (Figure <figr fid="F2">2A</figr>). As the central cavity formed by the &#945;/&#945; toroid in transcobalamin plays a major role in binding B12 <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, it is likely that the two domains cooperate in binding B12 in all these proteins. Additional architectures include fusions to domains typically found in extracellular proteins, such as one or more immunoglobulin-fold domains (e.g. <it>Archaeoglobus</it>; gi: 11498993 and <it>Moorella</it>; gi: 83590303), the FIVAR (Pfam entry: PF07554) sugar-binding domain (<it>Clostridium</it>, gi: 28210467), the fasciclin domain (<it>Methanosarcina</it>; gi: 21228740) and a &#946;-propeller domain (<it>Clostridium</it>, gi: 28210494). Given that many of these domains are often involved in interactions with polysaccharides, they might play a role in tethering these proteins to the cell surface by binding peptidoglycan or capsular polysaccharides <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. Often these multi-domain SLBB proteins occur in conserved operons that might additionally code a second paralogous extracellular SLBB protein (Figure <figr fid="F2">2</figr>). This might imply that different extracellular SLBB proteins interact together to form protein complexes on the cell surface. Interestingly, an analysis of the B12 biosynthesis pathways of all the bacteria that possess proteins with such SLBB domains showed they usually lacked key biosynthetic enzymes for B12. Furthermore, these SLBB proteins are generally encoded by predicted operons that also contain genes for CbiO-like ABC ATPase and the CbiQ-like integral membrane protein implicated in cobalt transport <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. These observations suggest that the primary role of this clade of SLBB proteins might be to scavenge B12 or its precursors from the environment. As the archaea which contain these SLBB proteins often possess an anaerobic B12 synthesis pathway, it is possible that these might instead be involved in scavenging a distinct metabolite. In <it>Syntrophomonas </it>the SLBB domain is found in putative extracellular enzymes fused to sulfite oxidase-like molybdopterin cofactor binding domain (e.g. gi: 71491441) <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. It is likely that in these proteins the SLBB provides a B12 cofactor that might be required by these enzymes.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Domain architectures and conserved gene neighborhoods of the SLBB superfamily</p>
               </caption>
               <text>
                  <p><b>Domain architectures and conserved gene neighborhoods of the SLBB superfamily</b>. (A) Representative architectures are grouped according to family/clade type and labeled by gene name, organism abbreviation, and gi number; demarcated by underscores. Individual domains in architectures are depicted as colored polygons. (B) A sampling of conserved gene neighborhoods found in association with the SLBB domain, with genes depicted as boxed arrows. The SLBB members of the depicted gene neighborhoods are labeled below by gene name, organism abbreviation, and gi number; demarcated by underscores. The family/clade name of the SLBB domain is given to the left in each architecture. Names are given at the top of genes in neighborhoods, where appropriate. The large PduS operon is broken into commonly-observed gene clusters; each boxed arrow enclosed by a dotted line represents such a cluster. ComEA proteins are always in the vicinity of the two-domain ComEC protein that has an integral membrane domain predicted to form a pore through which the DNA is transported into the cell and a metallo-&#946;-lactamase-like domain that may serve as a DNA nuclease during the uptake DNA from the outer cell wall. Additional organism abbreviations not given in Figure 1: <it>Mhun</it>, <it>Methanospirillum hungatei</it>; <it>Mmaz</it>, <it>Methanosarcina mazei</it>; <it>Dred</it>, <it>Desulfotomaculum reducens</it>; <it>Swol</it>, <it>Syntrophomonas wolfei</it>; <it>Valg</it>, <it>Vibrio alginolyticus</it>; <it>Rleg</it>, <it>Rhizobium leguminosarum</it>; <it>and Msp</it>, <it>Mycobacterium sp. MCS</it>. Additional abbreviations: SO, Sulfite Oxidoreductase; &#946;-P, &#946;-propeller; Fer4, 4Fe-S ferrodoxin; HTH, Helix-Turn-Helix; HhH, Helix-hairpin-Helix; RFC&#948;, RFC clamp loader subunit; LMW, low-molecular weight; and O-Ag, O-antigen.</p>
               </text>
               <graphic file="1745-6150-2-4-2"/>
            </fig>
            <p>Intracellular versions of the transcobalamin-like clade show fusions of the SLBB domain with two distinct winged HTH domains, namely those of the ArsR-like (e.g. gi:72395507, <it>Methanosarcina</it>) and AraC-like families (E.g. gi: 86604362, <it>Lactobacillus</it>) (Figure <figr fid="F2">2A</figr>). These proteins probably function as one-component transcription factors that respond to concentration of B12, its precursors or some other as yet unknown soluble ligands.</p>
            <p>In the Nqo1-like clade, polysaccharide export proteins are predicted to be secreted or periplasmic proteins and contain an absolutely conserved N-terminal &#946;-strand-rich domain followed by 1&#8211;8 repeats of the SLBB domain (Figure <figr fid="F2">2A</figr>). They appear to be part of a larger complex that is involved in transport of polysaccharides to the cell surface and are believe to associate with the outer membrane and periplasmic space in proteobacteria <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Conserved gene-neighborhoods that encode these proteins are populated with proteins involved in the biosynthesis and modification of sugars or polysaccharides, which is consistent with their role in polysaccharide export (Figure <figr fid="F2">2</figr>). The related ComEA proteins of Gram-positive bacteria also contain a signal peptide followed by an N-terminal SLBB domain that is always fused to a pair of DNA-binding Helix-hairpin-Helix domains at their C-terminus. This is consistent with the role of the ComEA protein as a non-specific DNA receptor in the transformation competence mechanism of Gram-positive bacteria <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. Prior studies suggest that this DNA receptor may be linked to the cell surface via the N-terminal region spanning the SLBB domain <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Taken together these observations suggest that the SLBB domain in these proteins is likely to be critical for interaction with cell polysaccharides and/or sugars of the peptidoglycan. The complementary phyletic distribution of ComEA and polysaccharide export proteins is strongly correlated with the presence or the absence of the specialized Gram-positive cell wall (See above). This suggests that they probably diverged from a common ancestral polysaccharide/sugar-binding domain that was originally involved in uptake or extrusion of large molecules at the cell surface.</p>
            <p>The remaining three groups of proteins, namely NqoI, RnfC and PduS, within the Nqo1-like clade of the SLBB superfamily share a common architectural core consisting of a fusion between an N-terminal Rossmannoid domain and an SLBB domain. Unlike classical Rossmann fold domains of oxidoreductases with 5&#8211;7 strands, the Rossmannoid domain of these proteins has a 4-stranded core in a 3214 order <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> coupled to a N-terminal two-stranded hairpin contributed by a module similar to the BBMs of RNA polymerases <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. The SLBB and this Rossmannoid domain are additionally combined to a variety of other domains in NqoI, RnfC and PduS proteins. The most common fusions seen in all three groups of proteins are those to 4Fe-S ferredoxin domains that flank the above two-domain core. The NqoI family also might contain a C-terminal tetrahelical bundle with an up-and-down topology that coordinates an Fe-S cluster via conserved cysteine residues, which in addition to the 4Fe-S ferredoxin is likely to provide an additional redox center for electron transport <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. A biotin/lipoate carrier-like Sandwich Barrel Hybrid Motif (SBHM) domain <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> is found respectively at the C- and N-termini of members of PduS and RnfC families.</p>
            <p>The roles of these versions of the SLBB remain enigmatic; however there is evidence that the PduS protein might bind soluble ligands. The PduS gene typically belongs to a large mobile operon coding proteins required for the biogenesis of carboxysome/polyhedral bodies, which contains enzymes involved in propanediol degradation. The PduS has been shown to strongly bind cob(I)alamin and was characterized as a bifunctional cob(II)alamin and hydroxycobalamin (cob(III)alamin) reductase catalyzing the formation of cob(I)alamin. Cob(I)alamin is the immediate precursor of Ado-cobalamin, which serves as an essential coenzyme for the diol dehydratase in degradation of 1,2-propanediol <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. It is likely that the SLBB domain in PduS, like that in transcobalamin, binds cob(I)alamin or HO-cobalamin, while the N-terminal Rossmanoid domain binds the flavin nucleotide cofactor for the redox reaction. Such a function is also supported by the observation that cob(I)alamin is highly reactive and needs to be shielded from the environment <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. The role of the fused SBHM domain seen in PduS proteins is less clear. However, given that the SBHM domain carries covalently associated ligands such as biotin/lipoate <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>, it might similarly carry cofactor ligands or intermediates in propanediol degradation such as 1,2-propanediol-1-yl radical <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. There is currently no evidence for a soluble ligand interacting with the related SLBB domain in the RnfC and NqoI. Nevertheless, crystal structures indicate an exposed location for the SLBB domains in these proteins, allowing the possibility that they might be allosterically regulated by hitherto unknown ligands interacting with this domain.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Evolutionary history of the SLBB domain and general conclusions</p>
         </st>
         <p>The phyletic and domain-architecture distributions show that the SLBB superfamily is well-represented and has diversified across the entire bacteria superkingdom. Their sporadic presence in archaea and the stronger affinity of the different eukaryotic versions to their bacterial counterparts suggests that the SLBB superfamily was derived early in evolution of bacteria, most probably from one of the many ancient &#946;-grasp fold domains. The two major families in the NqoI-like clade appear to have a pan-bacterial distribution suggesting that this clade had already differentiated into versions associated with cell wall related functions (ComEA-Polysaccharide export protein family) and intracellular oxido-reductase related functions (Nqo1, RnfC and PduS). Of the latter group the NqoI protein of the respiratory complex-I is seen across bacteria and was transferred to the eukaryotic lineage during the primary endosymbiotic event that generated the eukaryotic cell with mitochondria. RnfC and PduS proteins have more restricted phyletic distributions and are likely to be late derivatives of the more ancient Nqo1 lineage. The transcobalamin C-terminal clade is very divergent in sequence and appears to be a lineage-specific innovation in Gram positive bacteria that was recruited specifically for transporting extracellular B12-like cofactors or their precursors. Subsequently, a specific version that combined the &#945;/&#945; toroid domain with the SLBB domain appears to have been laterally transferred to the animal lineage early in its evolution. This event probably conferred on animals the ability to directly absorb cobalamin synthesized by bacteria in the gut. In parallel, there appear to have been sporadic transfers of large extracellular multidomain versions as well as intracellular versions fused to DNA-binding HTH domains to certain euryarchaeal lineages.</p>
         <p>In this context, it is of interest to note that the discovery of soluble ligand binding versions of the &#946;-GF points to a noteworthy structure-function analogy with the RNA-recognition motif (RRM)-like fold. In functional terms, the RRM-like fold has long been known to bind a range of soluble ligands such as amino acids, sugars and co-factors. Notable examples of these include the ACT domain superfamily and the amino acid-binding domain of the LRP-like transcription factors <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. Both folds also provide scaffolds for iron-sulfur clusters, (4Fe-4S ferrodoxins in the case of the RRM-like fold <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>) and are also involved in RNA-binding, as well as adaptor functions related to protein-protein interactions <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. These functional analogies in turn might be related to certain general organizational similarities seen in the two domains: like the &#946;-GF domain, the RRM-like fold domain is also a relatively small domain with an asymmetric two-layered structure. One surface of the core sheets is partially obscured by helical segments in both these folds, whereas the other is largely left exposed (see SCOP database <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>). Further study of these functional analogies might throw light on whether there exist certain general structural principles that have affected the recruitment of certain small ancient domains in similar contexts.</p>
         <p>In conclusion, we show that the &#946;-GF domains found in transcobalamin, polysaccharide export proteins, ComEA, PduS, and RnfC and Nqo1-like oxidoreductases define a novel superfamily, several of which might interact with different soluble ligands. This investigation provides the possible evolutionary scenario for the origin of the vitamin B12 uptake in animals via transcobalamin and intrinsic factor. It also provides leads for new investigations into B12 metabolism in bacteria and other aspects of protein-ligand interaction in competence, cell-surface biochemistry, and respiratory electron transfer.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <p>Searches of the PDB database with query structures were conducted using the DALI program <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Structural visualization and manipulations were performed using the Swiss-PDB viewer program <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Sensitive profile searches were conducted using the PSI-BLAST <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> and HMMER programs <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. PSI-BLAST searches were performed against the nonredundant (NR) database of protein sequences (National Center for Biotechnology Information [NCBI], NIH, Bethesda, MA, USA), with either a single sequence or an alignment used as the query, with a default profile inclusion expectation (e) value threshold of 0.01 (unless specified otherwise), and was iterated until convergence. All sequences collected in these searches are made available in <supplr sid="S2">Additional file 2</supplr>. The library of profiles for various domains was prepared by extracting all alignments from the PFAM database <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> and updating them by adding new members from the NR database. These updated alignments were then used to make HMMs with the HMMER package or PSSMs with PSI-BLAST. For all searches involving membrane-spanning domains we used a statistical correction for compositional bias to reduce false positives due to the general hydrophobicity of these proteins <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Signal peptide and transmembrane helices were predicted using the SignalP <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> and TMHMM programs <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Multiple alignments were constructed using the T_Coffee <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> and MUSCLE programs <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> followed by manual adjustments based on PSI-BLAST results. Protein secondary structure was predicted using a multiple alignment as the input for the JPRED program <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>, with information extracted from a PSSM, HMM, and the seed alignment itself. Similarity-based clustering of proteins was carried out using the BLASTCLUST program <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. Gene neighborhoods were determined using a custom script that uses completely sequenced genomes or whole genome shotgun sequences to derive a table of gene neighbors for a query gene. The BLASTCLUST program was then used to cluster the proteins sequences in the neighborhoods and establish conserved co-occurring genes. The KEGG database was used to identify key components of the B12 synthesis pathway <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Automation of all large-scale sequence analysis procedures were carried out using the in-house TASS package (Anantharaman V, Balaji S, Aravind L; unpublished), which operates similar to the previously published SEALS package <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>.</p>
         <suppl id="S2">
            <title>
               <p>Additional file 2</p>
            </title>
            <text>
               <p><b>Comprehensive list of proteins of the SLBB superfamily, their gene neighborhoods, and domain architectures</b>. Alignments of the SLBB domain will be submitted to the Pfam domain database <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>.</p>
            </text>
            <file name="1745-6150-2-4-S2.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
      <sec>
         <st>
            <p>Reviewers' comments</p>
         </st>
         <sec>
            <st>
               <p>Reviewer's report 1</p>
            </st>
            <sec>
               <st>
                  <p>Andrei Osterman, Burnham Institute for Medical Research</p>
               </st>
               <p>The manuscript "A novel superfamily with the &#946;-grasp fold involved in binding diverse soluble ligands" by A. M. Burroughs is an excellent and insightful comparative genomics study of a very interesting domain family broadly conserved across all kingdoms of Life. The authors took on a very challenging task of classifying multiple representatives of this superfamily on the edge of detectable homology. Such a task is particularly daunting for non-enzymatic proteins, where the divergence of sequence driven by adaptation to new biological tasks and contexts is much more pronounced than for enzymes. This study tracks down a complex evolution of the family and reveals an apparent functional theme, binding of small molecule ligands, such as B12 and its analogs, that "runs in the family" from bacteria to mammals. In addition to sensitive and sophisticated homology-based methods, the authors broadly used genome context analysis, which was one of the key success factors in pursuing this challenge. Of particular interest is the observation that in course of a fascinating evolutionary reshuffling, not only the ligand binding specificity, but the actual functional context of this domain could have changed on a number of occasions. One of the most remarkable events is a proposed recruitment of this domain into the context of transcriptional regulation in some bacteria. The latter theme of recruiting former enzymes and, as we see now, other types of proteins with natural affinity to certain ligands, is being recognized as one of the most important strategies of "natural engineering" of effector-binding domains in transcriptional regulators. Although this bioinformatics analysis alone does not allow to precisely identifying specific functions for all SLBB subfamilies, it certainly provides a perfect starting point for many case studies driven by specific research interests of various experimental groups. This constitutes a broad impact of this paper, which goes beyond the innovative bioinformatics methodology and obvious implications in the field of domain classification and evolution. The paper is written very clearly, with sufficient details of methods and key results and with helpful illustrations. The Supplementary material providing the entire list of SLBB domain superfamily including the information about their genome context, is highly valuable. This reviewer strongly supports the publication of this paper "as is" in Biology Direct, a perfect home for this wonderful study.</p>
            </sec>
            <sec>
               <st>
                  <p>Author response</p>
               </st>
               <p><it>We appreciate the comments, and particularly would like to emphasize that we feel the findings in this paper can be used as starting points for experimental work that will contribute significantly to the study of several important biological pathways</it>.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Reviewer's report 2</p>
            </st>
            <sec>
               <st>
                  <p>Igor Zhulin, University of Tennessee</p>
               </st>
               <p>In this study, the authors report a novel domain superfamily within the &#946;-grasp fold. The predicted distinct property of this domain is binding diverse soluble ligands, thus it is termed Soluble-Ligand-Binding &#946;-grasp or SLBB. Motivation for this work came from expectation (but lack of factual knowledge) that some members of the ubiquitous &#946;-grasp fold are involved in binding small ligands.</p>
               <p>First, the authors searched the PDB database with known structures to retrieve related structures and then used an array of sensitive sequence-based searches (hidden-Markov-model- or position-specific-scoring-matrix-based) to identify remote homologs. Technically, it is very well done and described in sufficient detail. The authors took advantage of available structures to guide editing of the multiple alignment, which in turn allowed them to associate conserved positions with the structural features.</p>
               <p>Biological function prediction came primarily from the contextual analysis, which was quite thorough. The most interesting prediction is that the primary role of one of the two SLBB clades is binding vitamin B12 and its precursors.</p>
               <p>Overall, this is an interesting, well-executed study and defining novel domain families and assigning potential biological functions is very important. I would like to add that it is also important to deposit the newly described domain to leading domain databases, such as Pfam. I have not seen plans for doing so in this study. This reviewer is guilty of not always doing so either, but still it should be a rule rather than exception. Perhaps, two version of SLBB could be produced capturing characteristics of the two clades.</p>
            </sec>
            <sec>
               <st>
                  <p>Author response</p>
               </st>
               <p><it>We could not agree more regarding the importance of domain databases, and we will be submitting two versions of the SLBB to Pfam</it>.</p>
               <sec>
                  <st>
                     <p>Major concern</p>
                  </st>
                  <p>Essentially, there is only one major concern with respect to generalization of biological function prediction. I think the members of the domain clade are divergent enough to worry about B12 and its precursors being the only or even the main ligand. The statement in the abstract seems to address this issue &#8211; the authors rightly indicate that "members of this superfamily are likely to bind a range of soluble ligands, including B12"; however in the text it becomes more stringent. For instance, on page 3, when describing SLBB fusions with HTH domains in transcription factors, it is suggested that these proteins "respond to B12 or its precursors". I would suggest toning down the claim that all members of the transcobalamin clade bind exclusively B12 and its precursors. Perhaps, the authors did not intend to make it sounds like that, but it does.</p>
               </sec>
            </sec>
            <sec>
               <st>
                  <p>Author response</p>
               </st>
               <p><it>We do admit that there might be a greater diversity of bound ligands, rather than B12 alone. Only in the cases where there was contextual evidence for involvement in B12 metabolism do we suggest it as a most likely ligand. Following the suggestion of the reviewer, we have altered the wording in the text to clarify the possibility of greater diversity in terms of bound ligands, especially with the HTH-fused versions</it>.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Reviewer's report 3</p>
            </st>
            <sec>
               <st>
                  <p>Arcady Mushegian, Stowers Institute, Kansas City, USA</p>
               </st>
               <p>I have no concerns about the main observation of the extended beta-grasp SLBB superfamily, or about functional inferences from the contextual analysis of genes that contain various versions of this domain. I think, however, that the analogy with the RRM fold early in the manuscript is a distraction at that point of the authors' reasoning. Generally, I suppose that this type of domain/fold recognition work may benefit from being more logical and less chronological: that is, even if analogy with RRM played a role in authors' own thinking about the SLBB superfamily, I want to first focus on what has been actually observed. On the other hand, I would like to see a discussion of RRM and other analogous folds at the end of the manuscript, when it could serve a more useful purpose of establishing some trends in evolution (do we see lots of small ligand-binding domains evolving from nucleic-acid binding domains? How about the other way around? And so on).</p>
            </sec>
            <sec>
               <st>
                  <p>Author's response</p>
               </st>
               <p><it>We admit this might provide a better flow for the article. Accordingly, we have now shifted the discussion of the parallels between the &#946;-grasp and RRM folds to the Results and Discussion section</it>.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>LA and SB made the initial discovery reported here. The complete sequence and structure analysis and preparation of figures and supplementary material was performed by AMB, LMI and SB. The paper was written by AMB and LA.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors gratefully acknowledge the Intramural research program of the National Library of Medicine, National Institutes of Health, USA, for funding their research.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Similarity of protein G and ubiquitin</p>
            </title>
            <aug>
               <au>
                  <snm>Kraulis</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1991</pubdate>
            <volume>254</volume>
            <issue>5031</issue>
            <fpage>581</fpage>
            <lpage>582</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1658931</pubid>
                  <pubid idtype="pmpid">1658931</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Familiar strangers</p>
            </title>
            <aug>
               <au>
                  <snm>Murzin</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1992</pubdate>
            <volume>360</volume>
            <issue>6405</issue>
            <fpage>635</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/360635a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">1465124</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The ubiquitin system</p>
            </title>
            <aug>
               <au>
                  <snm>Hershko</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ciechanover</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Annu Rev Biochem</source>
            <pubdate>1998</pubdate>
            <volume>67</volume>
            <fpage>425</fpage>
            <lpage>479</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.biochem.67.1.425</pubid>
                  <pubid idtype="pmpid" link="fulltext">9759494</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Evolution of aminoacyl-tRNA synthetases--analysis of unique domain architectures and phylogenetic trees reveals a complex history of horizontal gene transfer events</p>
            </title>
            <aug>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <issue>8</issue>
            <fpage>689</fpage>
            <lpage>710</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10447505</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The FERM domain: a unique module involved in the linkage of cytoplasmic proteins to the membrane</p>
            </title>
            <aug>
               <au>
                  <snm>Chishti</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Marfatia</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Lutchman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hanspal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jindal</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Low</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Rouleau</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Mohandas</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Chasis</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Conboy</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Gascard</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Takakuwa</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Benz</snm>
                  <fnm>EJ</fnm>
                  <suf>Jr.</suf>
               </au>
               <au>
                  <snm>Bretscher</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fehon</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Gusella</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Ramesh</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Solomon</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Marchesi</snm>
                  <fnm>VT</fnm>
               </au>
               <au>
                  <snm>Tsukita</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tsukita</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hoover</snm>
                  <fnm>KB</fnm>
               </au>
               <etal/>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1998</pubdate>
            <volume>23</volume>
            <issue>8</issue>
            <fpage>281</fpage>
            <lpage>282</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(98)01237-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">9757824</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Crystal structure of molybdopterin synthase and its evolutionary relationship to ubiquitin activation</p>
            </title>
            <aug>
               <au>
                  <snm>Rudolph</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Wuebbens</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Rajagopalan</snm>
                  <fnm>KV</fnm>
               </au>
               <au>
                  <snm>Schindelin</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nat Struct Biol</source>
            <pubdate>2001</pubdate>
            <volume>8</volume>
            <issue>1</issue>
            <fpage>42</fpage>
            <lpage>46</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/83034</pubid>
                  <pubid idtype="pmpid" link="fulltext">11135669</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>SCOP database</p>
            </title>
            <fpage>[http://scop.mrc</fpage>
            <lpage>lmb.cam.ac.uk/scop/]</lpage>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The prokaryotic antecedents of the ubiquitin-signaling system and the early evolution of ubiquitin-like beta-grasp domains</p>
            </title>
            <aug>
               <au>
                  <snm>Iyer</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Burroughs</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>7</issue>
            <fpage>R60</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1779556</pubid>
                  <pubid idtype="pmpid" link="fulltext">16859499</pubid>
                  <pubid idtype="doi">10.1186/gb-2006-7-7-r60</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>New insights into carrier binding and epithelial uptake of the erythropoietic nutrients cobalamin and folate</p>
            </title>
            <aug>
               <au>
                  <snm>Moestrup</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Curr Opin Hematol</source>
            <pubdate>2006</pubdate>
            <volume>13</volume>
            <issue>3</issue>
            <fpage>119</fpage>
            <lpage>123</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/01.moh.0000219654.65538.5b</pubid>
                  <pubid idtype="pmpid" link="fulltext">16567952</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Structural basis for mammalian vitamin B12 transport by transcobalamin</p>
            </title>
            <aug>
               <au>
                  <snm>Wuerges</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Garau</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Geremia</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fedosov</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Petersen</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Randaccio</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <issue>12</issue>
            <fpage>4386</fpage>
            <lpage>4391</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1450181</pubid>
                  <pubid idtype="pmpid" link="fulltext">16537422</pubid>
                  <pubid idtype="doi">10.1073/pnas.0509099103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Identification of a new class of nitrogen fixation genes in Rhodobacter capsulatus: a putative membrane complex involved in electron transport to nitrogenase</p>
            </title>
            <aug>
               <au>
                  <snm>Schmehl</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jahn</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Meyer zu Vilsendorf</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hennecke</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Masepohl</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Schuppler</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Marxer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Oelze</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Klipp</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1993</pubdate>
            <volume>241</volume>
            <issue>5-6</issue>
            <fpage>602</fpage>
            <lpage>615</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00279903</pubid>
                  <pubid idtype="pmpid">8264535</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Structure of the hydrophilic domain of respiratory complex I from Thermus thermophilus</p>
            </title>
            <aug>
               <au>
                  <snm>Sazanov</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Hinchliffe</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>311</volume>
            <issue>5766</issue>
            <fpage>1430</fpage>
            <lpage>1436</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1123809</pubid>
                  <pubid idtype="pmpid" link="fulltext">16469879</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Ulp1-SUMO crystal structure and genetic analysis reveal conserved interactions and a regulatory element essential for cell growth in yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Mossessova</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lima</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2000</pubdate>
            <volume>5</volume>
            <issue>5</issue>
            <fpage>865</fpage>
            <lpage>876</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1097-2765(00)80326-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">10882122</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The use of gene clusters to infer functional coupling</p>
            </title>
            <aug>
               <au>
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fonstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>D'Souza</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pusch</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Maltsev</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <issue>6</issue>
            <fpage>2896</fpage>
            <lpage>2901</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">15866</pubid>
                  <pubid idtype="pmpid" link="fulltext">10077608</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.6.2896</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Predicting protein function by genomic context: quantitative evaluation and qualitative inferences</p>
            </title>
            <aug>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lathe</snm>
                  <fnm>W</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <issue>8</issue>
            <fpage>1204</fpage>
            <lpage>1210</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">310926</pubid>
                  <pubid idtype="pmpid" link="fulltext">10958638</pubid>
                  <pubid idtype="doi">10.1101/gr.10.8.1204</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Genome alignment, evolution of prokaryotic genome organization and prediction of gene function using genomic context</p>
            </title>
            <aug>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Kondrashov</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>356</fpage>
            <lpage>372</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.GR-1619R</pubid>
                  <pubid idtype="pmpid" link="fulltext">11230160</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Conservation of a hexapeptide sequence in the anchor region of surface proteins from gram-positive cocci</p>
            </title>
            <aug>
               <au>
                  <snm>Fischetti</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Pancholi</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Schneewind</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1990</pubdate>
            <volume>4</volume>
            <issue>9</issue>
            <fpage>1603</fpage>
            <lpage>1605</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.1990.tb02072.x</pubid>
                  <pubid idtype="pmpid">2287281</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Identification of a fibronectin-binding protein from Staphylococcus epidermidis</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Henderson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sharp</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Nair</snm>
                  <fnm>SP</fnm>
               </au>
            </aug>
            <source>Infect Immun</source>
            <pubdate>2002</pubdate>
            <volume>70</volume>
            <issue>12</issue>
            <fpage>6805</fpage>
            <lpage>6810</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">133053</pubid>
                  <pubid idtype="pmpid" link="fulltext">12438356</pubid>
                  <pubid idtype="doi">10.1128/IAI.70.12.6805-6810.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Evolutionary connections between bacterial and eukaryotic signaling systems: a genomic perspective</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Anantharaman</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>LM</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>6</volume>
            <issue>5</issue>
            <fpage>490</fpage>
            <lpage>497</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.mib.2003.09.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">14572542</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Gleaning non-trivial structural, functional and evolutionary information about proteins by iterative database searches</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1999</pubdate>
            <volume>287</volume>
            <issue>5</issue>
            <fpage>1023</fpage>
            <lpage>1040</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1999.2653</pubid>
                  <pubid idtype="pmpid" link="fulltext">10222208</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Three-dimensional structure of cyclodextrin glycosyltransferase from Bacillus circulans at 3.4 A resolution</p>
            </title>
            <aug>
               <au>
                  <snm>Hofmann</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Bender</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Schulz</snm>
                  <fnm>GE</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1989</pubdate>
            <volume>209</volume>
            <issue>4</issue>
            <fpage>793</fpage>
            <lpage>800</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0022-2836(89)90607-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">2531228</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Relationship of secretion pattern and MPB70 homology with osteoblast-specific factor 2 to osteitis following Mycobacterium bovis BCG vaccination</p>
            </title>
            <aug>
               <au>
                  <snm>Ulstrup</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Jeansson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wiker</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Harboe</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Infect Immun</source>
            <pubdate>1995</pubdate>
            <volume>63</volume>
            <issue>2</issue>
            <fpage>672</fpage>
            <lpage>675</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">173047</pubid>
                  <pubid idtype="pmpid" link="fulltext">7822037</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Novel fold revealed by the structure of a FAS1 domain pair from the insect cell adhesion molecule fasciclin I</p>
            </title>
            <aug>
               <au>
                  <snm>Clout</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Tisi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hohenester</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Structure</source>
            <pubdate>2003</pubdate>
            <volume>11</volume>
            <issue>2</issue>
            <fpage>197</fpage>
            <lpage>203</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0969-2126(03)00002-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">12575939</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Comparative and functional genomic analysis of prokaryotic nickel and cobalt uptake transporters: evidence for a novel group of ATP-binding cassette transporters</p>
            </title>
            <aug>
               <au>
                  <snm>Rodionov</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Hebbeln</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gelfand</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Eitinger</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2006</pubdate>
            <volume>188</volume>
            <issue>1</issue>
            <fpage>317</fpage>
            <lpage>327</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1317602</pubid>
                  <pubid idtype="pmpid" link="fulltext">16352848</pubid>
                  <pubid idtype="doi">10.1128/JB.188.1.317-327.2006</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Molecular basis of sulfite oxidase deficiency from the structure of sulfite oxidase</p>
            </title>
            <aug>
               <au>
                  <snm>Kisker</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schindelin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Pacheco</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wehbi</snm>
                  <fnm>WA</fnm>
               </au>
               <au>
                  <snm>Garrett</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Rajagopalan</snm>
                  <fnm>KV</fnm>
               </au>
               <au>
                  <snm>Enemark</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Rees</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1997</pubdate>
            <volume>91</volume>
            <issue>7</issue>
            <fpage>973</fpage>
            <lpage>983</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(00)80488-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">9428520</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The cell surface expression of group 2 capsular polysaccharides in Escherichia coli: the role of KpsD, RhsA and a multi-protein complex at the pole of the cell</p>
            </title>
            <aug>
               <au>
                  <snm>McNulty</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lord</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>IS</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2006</pubdate>
            <volume>59</volume>
            <issue>3</issue>
            <fpage>907</fpage>
            <lpage>922</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.2005.05010.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16420360</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>ComEA, a Bacillus subtilis integral membrane protein required for genetic transformation, is needed for both DNA binding and transport</p>
            </title>
            <aug>
               <au>
                  <snm>Inamine</snm>
                  <fnm>GS</fnm>
               </au>
               <au>
                  <snm>Dubnau</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1995</pubdate>
            <volume>177</volume>
            <issue>11</issue>
            <fpage>3045</fpage>
            <lpage>3051</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">176991</pubid>
                  <pubid idtype="pmpid" link="fulltext">7768800</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>ComEA is a DNA receptor for transformation of competent Bacillus subtilis</p>
            </title>
            <aug>
               <au>
                  <snm>Provvedi</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Dubnau</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>271</fpage>
            <lpage>280</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1999.01170.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">9987128</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Evolutionary connection between the catalytic subunits of DNA-dependent RNA polymerases and eukaryotic RNA-dependent RNA polymerases and the origin of RNA polymerases</p>
            </title>
            <aug>
               <au>
                  <snm>Iyer</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>BMC Struct Biol</source>
            <pubdate>2003</pubdate>
            <volume>3</volume>
            <fpage>1</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">151600</pubid>
                  <pubid idtype="pmpid" link="fulltext">12553882</pubid>
                  <pubid idtype="doi">10.1186/1472-6807-3-1</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Biochemical evidence that the pduS gene encodes a bifunctional cobalamin reductase</p>
            </title>
            <aug>
               <au>
                  <snm>Sampson</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Bobik</snm>
                  <fnm>TA</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2005</pubdate>
            <volume>151</volume>
            <issue>Pt 4</issue>
            <fpage>1169</fpage>
            <lpage>1177</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/mic.0.27755-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">15817784</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>The propanediol utilization (pdu) operon of Salmonella enterica serovar Typhimurium LT2 includes genes necessary for formation of polyhedral organelles involved in coenzyme B(12)-dependent 1, 2-propanediol degradation</p>
            </title>
            <aug>
               <au>
                  <snm>Bobik</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Havemann</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Busch</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Aldrich</snm>
                  <fnm>HC</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1999</pubdate>
            <volume>181</volume>
            <issue>19</issue>
            <fpage>5967</fpage>
            <lpage>5975</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">103623</pubid>
                  <pubid idtype="pmpid" link="fulltext">10498708</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Swinging arms and swinging domains in multifunctional enzymes: catalytic machines for multistep reactions</p>
            </title>
            <aug>
               <au>
                  <snm>Perham</snm>
                  <fnm>RN</fnm>
               </au>
            </aug>
            <source>Annu Rev Biochem</source>
            <pubdate>2000</pubdate>
            <volume>69</volume>
            <fpage>961</fpage>
            <lpage>1004</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.biochem.69.1.961</pubid>
                  <pubid idtype="pmpid" link="fulltext">10966480</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Regulatory potential, phyletic distribution and evolution of ancient, intracellular small-molecule-binding domains</p>
            </title>
            <aug>
               <au>
                  <snm>Anantharaman</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>307</volume>
            <issue>5</issue>
            <fpage>1271</fpage>
            <lpage>1292</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2001.4508</pubid>
                  <pubid idtype="pmpid" link="fulltext">11292341</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Identification of the 1,2-propanediol-1-yl radical as an intermediate in adenosylcobalamin-dependent diol dehydratase reaction</p>
            </title>
            <aug>
               <au>
                  <snm>Yamanishi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ide</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Murakami</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Toraya</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>2005</pubdate>
            <volume>44</volume>
            <issue>6</issue>
            <fpage>2113</fpage>
            <lpage>2118</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/bi0481850</pubid>
                  <pubid idtype="pmpid" link="fulltext">15697237</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Crystal structure of the Lrp-like transcriptional regulator from the archaeon Pyrococcus furiosus</p>
            </title>
            <aug>
               <au>
                  <snm>Leonard</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Smits</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Sedelnikova</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Brinkman</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>de Vos</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>van der Oost</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rice</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Rafferty</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>2001</pubdate>
            <volume>20</volume>
            <issue>5</issue>
            <fpage>990</fpage>
            <lpage>997</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">145483</pubid>
                  <pubid idtype="pmpid" link="fulltext">11230123</pubid>
                  <pubid idtype="doi">10.1093/emboj/20.5.990</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>The ACT domain family</p>
            </title>
            <aug>
               <au>
                  <snm>Chipman</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Shaanan</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <issue>6</issue>
            <fpage>694</fpage>
            <lpage>700</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-440X(01)00272-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">11751050</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The structure of iron-sulfur proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Sticht</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Rosch</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Prog Biophys Mol Biol</source>
            <pubdate>1998</pubdate>
            <volume>70</volume>
            <issue>2</issue>
            <fpage>95</fpage>
            <lpage>136</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0079-6107(98)00027-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">9785959</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Dali: a network tool for protein structure comparison</p>
            </title>
            <aug>
               <au>
                  <snm>Holm</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1995</pubdate>
            <volume>20</volume>
            <issue>11</issue>
            <fpage>478</fpage>
            <lpage>480</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(00)89105-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">8578593</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling</p>
            </title>
            <aug>
               <au>
                  <snm>Guex</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Peitsch</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>Electrophoresis</source>
            <pubdate>1997</pubdate>
            <volume>18</volume>
            <issue>15</issue>
            <fpage>2714</fpage>
            <lpage>2723</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/elps.1150181505</pubid>
                  <pubid idtype="pmpid">9504803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>17</issue>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Profile hidden Markov models</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <issue>9</issue>
            <fpage>755</fpage>
            <lpage>763</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.9.755</pubid>
                  <pubid idtype="pmpid" link="fulltext">9918945</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Pfam: clans, web tools and services</p>
            </title>
            <aug>
               <au>
                  <snm>Finn</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Mistry</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schuster-Bockler</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hollich</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lassmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Moxon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Khanna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>Database issue</issue>
            <fpage>D247</fpage>
            <lpage>51</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347511</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381856</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj149</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements</p>
            </title>
            <aug>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Shavirin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Spouge</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <issue>14</issue>
            <fpage>2994</fpage>
            <lpage>3005</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">55814</pubid>
                  <pubid idtype="pmpid" link="fulltext">11452024</pubid>
                  <pubid idtype="doi">10.1093/nar/29.14.2994</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Improved prediction of signal peptides: SignalP 3.0</p>
            </title>
            <aug>
               <au>
                  <snm>Bendtsen</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>340</volume>
            <issue>4</issue>
            <fpage>783</fpage>
            <lpage>795</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2004.05.028</pubid>
                  <pubid idtype="pmpid" link="fulltext">15223320</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Krogh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Larsson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>305</volume>
            <issue>3</issue>
            <fpage>567</fpage>
            <lpage>580</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.4315</pubid>
                  <pubid idtype="pmpid" link="fulltext">11152613</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>T-Coffee: A novel method for fast and accurate multiple sequence alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Notredame</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Heringa</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>302</volume>
            <issue>1</issue>
            <fpage>205</fpage>
            <lpage>217</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.4042</pubid>
                  <pubid idtype="pmpid" link="fulltext">10964570</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>MUSCLE: a multiple sequence alignment method with reduced time and space complexity</p>
            </title>
            <aug>
               <au>
                  <snm>Edgar</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>113</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">517706</pubid>
                  <pubid idtype="pmpid" link="fulltext">15318951</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-113</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>JPred: a consensus secondary structure prediction server</p>
            </title>
            <aug>
               <au>
                  <snm>Cuff</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Siddiqui</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Finlay</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Barton</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <issue>10</issue>
            <fpage>892</fpage>
            <lpage>893</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.10.892</pubid>
                  <pubid idtype="pmpid" link="fulltext">9927721</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>BLASTCLUST program </p>
            </title>
            <fpage>[ftp://ftp.ncbi.nih.gov/blast/documents/README.bcl]</fpage>
         </bibl>
         <bibl id="B50">
            <title>
               <p>From genomics to chemical genomics: new developments in KEGG</p>
            </title>
            <aug>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Aoki-Kinoshita</snm>
                  <fnm>KF</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Katayama</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Araki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hirakawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>Database issue</issue>
            <fpage>D354</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347464</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381885</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>SEALS: a system for easy analysis of lots of sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Walker</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Proc Int Conf Intell Syst Mol Biol</source>
            <pubdate>1997</pubdate>
            <volume>5</volume>
            <fpage>333</fpage>
            <lpage>339</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9322058</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
