<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-8-162</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Re-annotation and re-analysis of the <it>Campylobacter jejuni </it>NCTC11168 genome sequence</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Gundogdu</snm>
               <fnm>Ozan</fnm>
               <insr iid="I1"/>
               <email>Ozan.gundogdu@lshtm.ac.uk</email>
            </au>
            <au id="A2">
               <snm>Bentley</snm>
               <mi>D</mi>
               <fnm>Stephen</fnm>
               <insr iid="I2"/>
               <email>sdb@sanger.ac.uk</email>
            </au>
            <au id="A3">
               <snm>Holden</snm>
               <mi>T</mi>
               <fnm>Matt</fnm>
               <insr iid="I2"/>
               <email>mh3@sanger.ac.uk</email>
            </au>
            <au id="A4">
               <snm>Parkhill</snm>
               <fnm>Julian</fnm>
               <insr iid="I2"/>
               <email>parkhill@sanger.ac.uk</email>
            </au>
            <au id="A5">
               <snm>Dorrell</snm>
               <fnm>Nick</fnm>
               <insr iid="I1"/>
               <email>Nick.dorrell@lshtm.ac.uk</email>
            </au>
            <au id="A6" ca="yes">
               <snm>Wren</snm>
               <mi>W</mi>
               <fnm>Brendan</fnm>
               <insr iid="I1"/>
               <email>Brendan.wren@lshtm.ac.uk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Pathogen Molecular Department, London School of Hygiene &amp; Tropical Medicine, Keppel Street, UK</p>
            </ins>
            <ins id="I2">
               <p>Pathogen Sequencing Unit, Sanger Institute, UK</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>162</fpage>
         <url>http://www.biomedcentral.com/1471-2164/8/162</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17565669</pubid>
               <pubid idtype="doi">10.1186/1471-2164-8-162</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>16</day>
               <month>1</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>12</day>
               <month>6</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>12</day>
               <month>6</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Gundogdu et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p><it>Campylobacter jejuni </it>is the leading bacterial cause of human gastroenteritis in the developed world. To improve our understanding of this important human pathogen, the <it>C. jejuni </it>NCTC11168 genome was sequenced and published in 2000. The original annotation was a milestone in <it>Campylobacter </it>research, but is outdated. We now describe the complete re-annotation and re-analysis of the <it>C. jejuni </it>NCTC11168 genome using current database information, novel tools and annotation techniques not used during the original annotation.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Re-annotation was carried out using sequence database searches such as FASTA, along with programs such as TMHMM for additional support. The re-annotation also utilises sequence data from additional <it>Campylobacter </it>strains and species not available during the original annotation. Re-annotation was accompanied by a full literature search that was incorporated into the updated EMBL file [EMBL: <ext-link ext-link-type="embl" ext-link-id="AL111168">AL111168</ext-link>]. The <it>C. jejuni </it>NCTC11168 re-annotation reduced the total number of coding sequences from 1654 to 1643, of which 90.0% have additional information regarding the identification of new motifs and/or relevant literature. Re-annotation has led to 18.2% of coding sequence product functions being revised.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>Major updates were made to genes involved in the biosynthesis of important surface structures such as lipooligosaccharide, capsule and both <it>O</it>- and <it>N</it>-linked glycosylation. This re-annotation will be a key resource for <it>Campylobacter </it>research and will also provide a prototype for the re-annotation and re-interpretation of other bacterial genomes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p><it>Campylobacter jejuni </it>is the leading bacterial cause of human gastroenteritis in the developed world <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. <it>C. jejuni </it>infection has also been associated with post-infection sequelae including septicaemia and neuropathies such as Guillain-Barr&#233; Syndrome (GBS) <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Infection has largely been linked with the consumption of contaminated poultry or meat products. Given the socioeconomic importance of this pathogen, it is surprising that the ecology, the epidemiology and, in particular, the pathogenesis are still so poorly understood <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. The lack of information on this problematic pathogen was one of the main driving forces for the original <it>C. jejuni </it>NCTC11168 genome project published in 2000 <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, and equally is why a re-annotation and re-analysis of the genome is required.</p>
         <p>Since the publication of the <it>C. jejuni </it>NCTC11168 genome sequence in 2000, there has been a spectacular increase in research on this important human pathogen. One result of this has been significant revisions of the genetic loci that code for important surface structures on <it>C. jejuni </it>strains. The surface polysaccharide region has since been identified as a capsule locus (<it>Cj1413c </it>&#8211; <it>Cj1448c</it>) <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. The flagellar modification locus has been identified as an <it>O</it>-linked glycosylation pathway (<it>Cj1293 </it>&#8211; <it>Cj1342</it>) <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Progress has also been made in our understanding of the lipooligosaccharide (LOS) locus. In addition, the <it>N</it>-linked glycosylation pathway has been identified in <it>C. jejuni </it>(<it>Cj1119 </it>&#8211; <it>Cj1130</it>) <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. This <it>N</it>-linked general glycosylation system was initially thought to only be present in eukaryotes. To date, up to 30 proteins modified with the same heptasaccharide glycan structure have been identified. Research over the last 7 years on <it>C. jejuni</it>, coupled with the publication of a further 2 <it>C. jejuni </it>genome sequences <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp> and another 3 <it>Campylobacter </it>species <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, has heightened the need for re-analysis of the original NCTC11168 genome sequence.</p>
         <p>Re-annotation is defined as the process of annotating a previously annotated genome <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Examples of re-annotated genomes are unfortunately rare compared to the number of sequenced genomes <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. Clearly the ever-increasing number of new genome sequences requires prioritisation from annotators. Automated methods can save time and resources, but will not incorporate the maximum information available from expert curators, leading to incomplete or even false designations. By contrast, manual annotation is costly and time consuming. However, manual re-annotation of genomes can significantly reduce the perpetuation of errors and thus reduce the time spent on flawed research. Outdated annotations can lead to significant gaps in our knowledge. Hence, there is a need for a research community-wide review and regular update of genome interpretations. Here we have shown the importance of genome re-annotation in terms of maintaining and increasing the usefulness of this resource, a number of years after the original genome sequencing project was completed.</p>
         <p>In this study, we describe the re-annotation and re-analysis of the <it>C. jejuni </it>NCTC11168 genome. Manual re-annotation of all coding sequences (CDSs) was carried out using current annotation techniques. Literature searches, updates to genome structure and additional unique genome searches were carried out to produce the most comprehensive annotation of any <it>Campylobacter </it>genome to date. The re-annotation of the <it>C. jejuni </it>NCTC11168 genome also represents a useful model for the re-evaluation of other bacterial genomes.</p>
      </sec>
      <sec>
         <st>
            <p>Results &amp; Discussion</p>
         </st>
         <sec>
            <st>
               <p>Gene number adjustment</p>
            </st>
            <p>A complete re-annotation of the <it>C. jejuni </it>NCTC11168 genome was performed resulting in the reduction of the total number of CDSs from 1654 to 1643. This reduction was due to the merging of adjacent CDSs or the removal of CDSs. Three CDSs originally designated as pseudogenes were removed as a result of merging with adjacent pseudogenes. CDSs designated as pseudogenes were also updated to reflect the complete amino acid sequence for the encoded protein regardless of expression. Phase-variable CDSs that contained an intersecting homopolymeric region between adjacent CDSs on separate frames were merged. This allowed the complete amino acid sequence for appropriate genes to be obtained regardless of phase. Re-interpretation of phase-variable CDSs resulted in removal of seven CDSs. CDS (<it>Cj1520</it>) was removed because of the recently discovered CRISPR structural moieties <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> (See Structural modifications section in Results &amp; Discussion). In total, 11 CDSs were removed from the re-annotated sequence (Table <tblr tid="T1">1</tblr>). The accurate identification of all CDSs within the genome has implications for downstream applications, such as mutagenesis, microarray design and proteome analysis.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>CDSs removed or merged from <it>C. jejuni </it>NCTC11168 re-annotation.</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="center">
                        <p>
                           <b>Gene Number</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Type/Description</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Cj0290c/Cj0291c/Cj0292c</p>
                     </c>
                     <c ca="center">
                        <p>Pseudogenes</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Cj0968/Cj0969</p>
                     </c>
                     <c ca="center">
                        <p>Pseudogenes</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Cj0031/Cj0032</p>
                     </c>
                     <c ca="center">
                        <p>Phase-variable</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Cj0170/Cj0171</p>
                     </c>
                     <c ca="center">
                        <p>Phase-variable</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Cj0628/Cj0629</p>
                     </c>
                     <c ca="center">
                        <p>Phase-variable</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Cj1144c/Cj1145c</p>
                     </c>
                     <c ca="center">
                        <p>Phase-variable</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Cj1325/Cj1326</p>
                     </c>
                     <c ca="center">
                        <p>Phase-variable</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Cj1335/Cj1336</p>
                     </c>
                     <c ca="center">
                        <p>Phase-variable</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Cj1677/Cj1678</p>
                     </c>
                     <c ca="center">
                        <p>Phase-variable</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Cj1520</p>
                     </c>
                     <c ca="center">
                        <p>CRISPR region identified</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Functional annotation update</p>
            </st>
            <p>A systematic re-annotation of all CDSs was performed. For the purpose of this re-annotation, all CDSs with additional information have had an 'updated' note qualifier attached. This qualifier contains consistent free-hand descriptions on recently identified motifs, relevant similarity search results and any characterisation work carried out within <it>Campylobacter </it>species/strains or any orthologs in similar microorganisms. Additionally, the 'updated' note qualifier also contains reasoning for including 'putative' or not within the product function. Putative designations infer an accepted product function without definitive evidence. For each CDS, a full literature search was performed. In total, 64.5% of CDSs have had one or more literature qualifier added. Interestingly, from all the literature added (2092), 50.5% have been published after the year 2000. Considering there was no literature qualifier in the original annotation, this data illustrates the depth of research that has been carried out since 2000 and further supports the need to make use of this information in a re-annotation. Detailed statistics on genome modifications are given in Table <tblr tid="T2">2</tblr>. 18.2% of CDSs have had their product functions updated. 60.5% of CDSs with new product function have been designated with a different functional classification. Additional file <supplr sid="S1">1</supplr> gives the outline of functional classification used in this annotation. This description was adopted from the Sanger Institute. A different functional classification may still be within the same field as the previous function, or may be in a completely new area. 97.8% of these CDSs with a new product function and a different functional classification were given a completely new type of functional classification. Additional file's <supplr sid="S2">2</supplr>, <supplr sid="S3">3</supplr> and <supplr sid="S4">4</supplr> give in-depth data on the change and distribution of CDSs within these functional categories. Importantly, the number of CDSs in the 'Unknown and other' category has been reduced by 122. Also, the number of CDSs in the 'Miscellaneous' category has risen by 77. This is attributed to the fact that a number of CDSs have new information relating to a product function from uncharacterised motifs and thus the CDSs were not placed into a specific category as yet.</p>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p><it>C. jejuni </it>functional classification (created at Sanger Institute).</p>
               </text>
               <file name="1471-2164-8-162-S1.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S2">
               <title>
                  <p>Additional File 2</p>
               </title>
               <text>
                  <p>Distribution of functional classification before/after re-annotation.</p>
               </text>
               <file name="1471-2164-8-162-S2.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional File 3</p>
               </title>
               <text>
                  <p>Changes to functional classification categories before and after the re-annotation.</p>
               </text>
               <file name="1471-2164-8-162-S3.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S4">
               <title>
                  <p>Additional File 4</p>
               </title>
               <text>
                  <p>Changes to CDS functions and functional classifications.</p>
               </text>
               <file name="1471-2164-8-162-S4.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Genome wide statistics from <it>C. jejuni </it>NCTC11168 re-annotation.</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="center">
                        <p>CDSs with new motifs identified</p>
                     </c>
                     <c ca="center">
                        <p>+44.9%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>CDSs with new literature qualifier</p>
                     </c>
                     <c ca="center">
                        <p>+64.5%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>CDSs with new updated note qualifier</p>
                     </c>
                     <c ca="center">
                        <p>+90.0%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>CDSs with hypothetical designations in product function</p>
                     </c>
                     <c ca="center">
                        <p>-7.5%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>CDSs with conserved hypothetical in product function</p>
                     </c>
                     <c ca="center">
                        <p>+3.7%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>CDSs with putative designation in product function</p>
                     </c>
                     <c ca="center">
                        <p>+5.9%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>CDSs with new gene qualifier</p>
                     </c>
                     <c ca="center">
                        <p>+6.3%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>CDSs with new product function</p>
                     </c>
                     <c ca="center">
                        <p>+18.2%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>CDSs with new product function and with a new functional classification</p>
                     </c>
                     <c ca="center">
                        <p>+60.5%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>CDS with new product function and a new functional classification with a different type of function</p>
                     </c>
                     <c ca="center">
                        <p>+97.8%</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Since the original annotation, significant new information has been derived on the genetic loci encoding the four main carbohydrate surface structures. The <it>C. jejuni N</it>-linked glycosylation pathway (not described in the original annotation), has been fully characterised <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. This re-annotation includes the nomenclature for the <it>pglA-K </it>(<it>p</it>rotein <it>gl</it>ycosylation) genes and has updated all product functions for genes <it>Cj1119c </it>&#8211; <it>Cj1130c</it>. The LOS locus (<it>Cj1131c </it>&#8211; <it>Cj1152c</it>) described in the original annotation was updated to include recent product functions and gene names including <it>neuA</it>1, <it>B</it>1, <it>C</it>1 and <it>hldDE </it><abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. The <it>O</it>-linked glycosylation loci (<it>Cj1293 </it>&#8211; <it>Cj1342</it>) involved in flagellar glycosylation, has been updated to include <it>neu</it>, <it>pse </it>and <it>maf </it>genes <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Finally, the capsule locus (<it>Cj1413c </it>&#8211; <it>Cj1448c</it>), has now been updated to include <it>kps </it>and <it>hdd </it>genes <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>.</p>
            <p>Additional genome-wide updates were also carried out, of which a large proportion entailed adding specificity to existing product function. For example, the identification of a new PFAM or PROSITE motif has allowed the product function to become further specified e.g. putative transport protein modified to putative MFS (Major Facilitator System) transport protein. A complete list of changes throughout the <it>C. jejuni </it>NCTC11168 genome is provided in Additional File <supplr sid="S5">5</supplr>.</p>
            <suppl id="S5">
               <title>
                  <p>Additional File 5</p>
               </title>
               <text>
                  <p>CDSs modified in <it>C. jejuni </it>NCTC11168 re-annotation.</p>
               </text>
               <file name="1471-2164-8-162-S5.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Pseudogene &amp; phase-variable modification</p>
            </st>
            <p>Pseudogene identification is a challenging process where discrepancies exist between pseudogene assignment techniques <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Identifiers include detection of Open Reading Frames (ORF) belonging to a single CDS on multiple frames, the presence of one or more stop codon within a CDS, and extra information from the biology of the microorganism. More recently, comparative genomics has been used as a technique for pseudogene assignment <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. The number of pseudogenes identified in the original annotation of the <it>C. jejuni </it>NCTC11168 genome was 20. We carried out a re-analysis on all pseudogenes in the NCTC11168 genome. The majority of revisions we carried out incorporated multiple features created from different coordinates on more than one frame. This process is often complicated with support needed from FASTA and TBLASTX search results. Completion of this re-analysis resulted in modification of 19 out of 20 pseudogenes (Table <tblr tid="T3">3</tblr>). The final pseudogene number was 19 due to the merging of two adjacent CDSs designated as pseudogenes (<it>Cj0968/Cj0969</it>).</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Pseudogenes in <it>C. jejuni </it>NCTC11168 with modification.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Gene Number</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Product</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Modification</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0046</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (putative sodium:sulfate transmembrane transport protein)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0072c</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (putative iron-binding protein)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0223</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (putative IgA protease family protein)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0292c</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (putative glycerol-3-phosphate transporter)</p>
                     </c>
                     <c ca="left">
                        <p>Merging of multiple CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0444</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (putative TonB-denpendent outer membrane receptor)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0501</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (ammonium transporter)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0565</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (conserved hypothetical protein)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0654c</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (putative transmembrane transport protein)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0676</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (potassium-transporting ATPase A chain)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0678</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (potassium-transporting ATPase C chain)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0742</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (putative outer membrane protein)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0752</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (IS element transposase)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0866</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (arylsulfatase)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0969</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (putative periplasmic protein)</p>
                     </c>
                     <c ca="left">
                        <p>Merging of multiple CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj1064</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (nitroreductase)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj1389</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (putative C4-dicarboxylate anaerobic carrier)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj1395</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (putative MmgE/PrpD family protein)</p>
                     </c>
                     <c ca="left">
                        <p>Unmodified</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj1470c</p>
                     </c>
                     <c ca="left">
                        <p>pesudogene (type II protein secretion system F protein)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj1528</p>
                     </c>
                     <c ca="left">
                        <p>pseudogene (putative C4-dicarboxylate anaerobic carrier)</p>
                     </c>
                     <c ca="left">
                        <p>Features introduced within CDS</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>An example of the difficulty and complexity associated with pseudogene designation is observed when viewing the CDSs <it>Cj0522</it>, <it>Cj0523 </it>and <it>Cj0524 </it>within <it>C. jejuni </it>NCTC11168. These three CDSs are represented as one whole CDS on a single frame within <it>C. jejuni </it>RM1221 (<it>Cje0628</it>). The three CDSs are large enough to be represented as individual CDSs and in <it>C. jejuni </it>NCTC11168 have been represented on more than one frame. The question can be asked as to whether these CDSs (which are intact in <it>C. jejuni </it>RM1221), represent a pseudogene in <it>C. jejuni </it>NCTC11168. Given the fact that in <it>C. jejuni </it>RM1221 these three CDSs do actually code for a product (Na/Pi-cotransporter, putative), it is more likely that they represent a pseudogene in <it>C. jejuni </it>NCTC11168. In this re-annotation, our intention was to carry out a full mark up of existing pseudogenes, however, the potential for a pseudogene has been noted.</p>
            <p>The frequency and importance of pseudogene formation in microorganisms has attained added significance in recent years with the emergence of genome reduction theories and enhanced virulence through pathoadaptive mutations <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. Recent studies have suggested that ever increasing non-functional genes are being identified within microorganisms and in particular are more common in genomes of recently evolved pathogens, than in their benign or free-living relatives <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. The number and type of predicted pseudogenes within <it>C. jejuni </it>NCTC11168 and <it>C. jejuni </it>RM1221 are compared in Additional File <supplr sid="S6">6</supplr>. Observing CDS location rather than CDS function was carried out for this comparison. This was to ensure variation in product function naming does not exclude identical pseudogenes, which are represented on the same row. Currently, the <it>C. jejuni </it>81&#8211;176 genome has not been fully annotated so could not be used in this comparison. This is also the case for <it>C. coli </it>RM2228, <it>C. lari </it>RM2100 and <it>C. upsaliensis </it>3195 which only have an estimation of pseudogene numbers based on a subset of genes <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. In <it>C. jejuni </it>NCTC11168, 63% (12/19) of the pseudogenes are shared with <it>C. jejuni </it>RM1221. In contrast to 19 pseudogenes in <it>C. jejuni </it>NCTC11168, <it>C. jejuni </it>RM1221 contains 47 pseudogenes. Assuming these are genuine pseudogenes this would imply <it>C. jejuni </it>NCTC11168 (1980 human isolate, UK) and <it>C. jejuni </it>RM1221 (2000 chicken isolate, US), share a core set of ancestral pseudogenes. Even with the variation of isolation dates, source and geographical location, there is substantial conservation of pseudogene type. It is speculative to suggest when and how the additional pseudogenes in <it>C. jejuni </it>RM1221 arose, or when and how the <it>C. jejuni </it>NCTC11168 genome lost CDSs as pseudogenes since divergence occurred.</p>
            <suppl id="S6">
               <title>
                  <p>Additional File 6</p>
               </title>
               <text>
                  <p>Pseudogene comparison between <it>C. jejuni </it>NCTC11168 and <it>C. jejuni </it>RM1221.</p>
               </text>
               <file name="1471-2164-8-162-S6.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The significance of pseudogenes in early genome annotations were frequently ignored, as these were considered as sequencing artefacts. However, given the recent realisation of the importance of pseudogenes in pathoadaptive mutations, a greater significance is placed on their identification <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B29">29</abbr></abbrgrp>. An example of this is the re-analysis of <it>Escherichia coli </it>K-12, which has predicted an additional 161 from the original single pseudogene identified <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The same study also indicated pseudogenes are continually generated, with existing pseudogenes being eliminated over a period of time <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Pseudogenes can accumulate in the genomes of some bacterial species, especially those undergoing processes like niche adaptation, host specialisation or weak selection strength <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Analysis of further <it>Campylobacter </it>strains and species along with additional epsilon proteobacteria species will aid our understanding on this emerging area of interest. Also, greater understanding of pseudogene dynamics and in particular innovative pseudogene identification techniques will yield more information about the actual number and purpose of these entities within microorganisms.</p>
            <p>Phase-variable CDSs containing hypervariable regions were also analysed. The initial annotation gave a number of hypervariable sequences found within the <it>C. jejuni </it>genomic shotgun sequence <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. These hypervariable sequences are scattered throughout the genome, however, there is a large cluster within both the <it>O</it>-linked glycosylation and capsule loci. Further research on these loci have illustrated the impact of phase-variation on microorganism pathogenicity <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. Table <tblr tid="T4">4</tblr> shows phase-variable CDSs that have been modified.</p>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Merged phase-variable CDSs in <it>C. jejuni </it>NCTC11168 re-annotation.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Gene Number</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Product</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Effect</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0031/Cj0032</p>
                     </c>
                     <c ca="left">
                        <p>putative type IIS restriction/modification enzyme</p>
                     </c>
                     <c ca="left">
                        <p>Fusion/separation</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0170/Cj0171</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein Cj0170</p>
                     </c>
                     <c ca="left">
                        <p>Fusion/separation</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj0628/Cj0629</p>
                     </c>
                     <c ca="left">
                        <p>putative lipoprotein</p>
                     </c>
                     <c ca="left">
                        <p>Fusion/separation</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj1144/Cj1145</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein Cj1144c</p>
                     </c>
                     <c ca="left">
                        <p>Fusion/separation</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj1325/Cj1326</p>
                     </c>
                     <c ca="left">
                        <p>putative methyltransferase</p>
                     </c>
                     <c ca="left">
                        <p>Fusion/separation</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj1335/Cj1336</p>
                     </c>
                     <c ca="left">
                        <p>motility accessory factor (function unknown)</p>
                     </c>
                     <c ca="left">
                        <p>Fusion/separation</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cj1677/Cj1678</p>
                     </c>
                     <c ca="left">
                        <p>putative lipoprotein</p>
                     </c>
                     <c ca="left">
                        <p>Fusion/separation</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Structural modifications</p>
            </st>
            <p>As well as CDS updates, novel features were also added to the re-annotation. For example, the incorporation of the recently identified Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) regions within <it>Campylobacter </it><abbrgrp><abbr bid="B20">20</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. CRISPR regions are thought to be mobile elements. In conjunction with this, the three CDSs upstream of the CRISPR repeats were identified as CRISPR associated proteins and this concurs with existing CRISPR structures. To date, there has only been one identified CRISPR region within <it>C. jejuni </it>and this has now been incorporated within the genome. As a result, one CDS (<it>Cj1520</it>) has been removed. This CDS was previously annotated as having five repeat regions. Thus, the genome now contains a CRISPR repeat region in place of the removed CDS.</p>
            <p>Additional genome searches included RFAM database search to discover any non-coding RNAs. This search identified two new non-coding RNA structures. RFAM RF00169, a bacterial signal recognition particle (SRP) RNA, was identified upstream of <it>Cj0046</it>. The SRP is a universally conserved ribonucleoprotein involved in the co-translational targeting of proteins to membranes <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. Also, RFAM RF00059 a thiamin pyrophosphate (TPP) riboswitch (THI element) was identified upstream of <it>Cj0453 </it>(thiamin biosynthesis protein ThiC). The RFAM motif is a conserved structure (THI element), involved in thiamin-regulation <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>.</p>
            <p>The final step of the re-annotation process was the incorporation of Gene Ontology (GO) annotation. GO annotation attempts to link three structured, controlled vocabularies (ontologies) that describe gene products in terms of their associated biological processes, cellular components and molecular functions in a species-independent manner <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. This re-annotation aims to incorporate GO annotation using two automated methods. Upon submission to EBI, the EMBL file will carry a GOA link that lists the GO annotation identified automatically by EBI. A second version of GO was generated by performing a reciprocal FASTA search using RM1221. This was created and submitted to GeneDB. GO annotation is a valuable feature in current annotation techniques that can expedite systems biology approaches to genome analysis.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>In summary, the re-annotation and re-analysis of the <it>C. jejuni </it>NCTC11168 genome sequence has led to substantial updates across the entire genome, incorporating a vast amount of research information performed since the original annotation in 2000 and also integrating data from additional <it>Campylobacter </it>species and strains. Major updates include noteworthy modifications to the 4 main surface structure loci in the genome, 18.2% of genome product functions being updated and 90.0% of all CDSs now having additional information. The inclusion of literature searches and a GO annotation alongside genome wide structural modifications has resulted in <it>C. jejuni </it>NCTC11168 being the most comprehensively annotated <it>Campylobacter </it>genome to date.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Sequence searches</p>
            </st>
            <p>Manual re-annotation of all previously annotated <it>C. jejuni </it>NCTC11168 CDSs <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> was carried out based on results from BLASTP <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> and FASTA <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> sequence comparisons using non-redundant databases. Re-annotation was based, wherever possible, on characterised proteins or genes <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Additional functional data was provided by using the PFAM <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> and PROSITE <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> motif databases. New searches carried out in this re-annotation included running RFAM <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> database and also the programs TMHMM <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> and SIGNALP <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Literature &amp; additional searches</p>
            </st>
            <p>This re-annotation included a complete literature search of all CDS numbers and gene names using PubMed <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, HighWire Press <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, Scirus <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> and Google Scholar <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. Artemis software release 8 <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> was used during re-annotation. The re-annotated sequence was submitted to the EMBL public database and also to GeneDB <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> and CampyDB <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. The EMBL file included an 'original' and 'updated' note qualifier, a 'product' qualifier and each CDS represented with a unique 'locus_tag' qualifier. Appropriate 'gene' qualifiers were also present. The GeneDB submission included all the above and extra qualifiers 'colour' and 'literature'. This re-annotation also included for the first time a Gene Ontology (GO) annotation of the NCTC11168 genome sequence. This was created automatically on submission to EMBL and can be accessed via the GOA link. A separate GO annotation was created within GeneDB by carrying out a reciprocal FASTA comparison with <it>C. jejuni </it>RM1221 and adopting the GO annotation of orthologous CDSs.</p>
         </sec>
         <sec>
            <st>
               <p>Re-designation of pseudogenes</p>
            </st>
            <p>Advances in genome annotation techniques that were unavailable during the original annotation have led to updated interpretation of pseudogenes and phase-variable CDSs. Using guidance from TBLASTX search results, we carried out a full re-analysis of all pseudogenes. CDSs designated as pseudogenes have been updated to reflect the complete amino acid sequence for the encoded protein regardless of expression. This has caused differences from the amino acid sequence of the previous annotation. Some pseudogene modifications entailed merging two or more adjacent, in frame CDSs (previously annotated as separate pseudogene CDSs), to create a single pseudogene containing internal stop codons. In other cases, pseudogene features were created with multiple coordinates representing one or more frameshift in the CDS &#8211; these had previously only detailed the start and stop coordinates so did not reflect the true position of the non-mutated CDS. In both cases the assignment of coordinates was based on matches to homologues determined through FASTA searches.</p>
         </sec>
         <sec>
            <st>
               <p>Re-designation of CDSs with an intersecting homopolymeric tract</p>
            </st>
            <p>CDSs containing an intersecting homopolymeric tract were merged to reflect the complete amino acid sequence for appropriate genes regardless of phase. This is analogous to the scenario described above for frameshifted pseudogenes. This modification was carried out for two CDSs with an intersecting homopolymeric tract. The joining of such CDSs was not undertaken in the original annotation.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>OG carried out the re-annotation process and drafted the manuscript. SDB assisted with the re-annotation process. MTH assisted with running additional programs used in the re-annotation. JP, ND and BWW participated in the conception and supervised the design of the study. All authors submitted comments on drafts and read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>Funding was provided by the BBSRC and the Wellcome Trust. We would like to thank C. Parry for technical support. We would like to thank M. Sebaihia and NR. Thomson for stimulating conversations. We would also like to thank TJ. Carver for computational support.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Diverse roles for HspR in Campylobacter jejuni revealed by the proteome, transcriptome and phenotypic characterization of an hspR mutant</p>
            </title>
            <aug>
               <au>
                  <snm>Andersen</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Brondsted</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Pearson</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Mulholland</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wells</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Ingmer</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2005</pubdate>
            <volume>151</volume>
            <issue>Pt 3</issue>
            <fpage>905</fpage>
            <lpage>915</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/mic.0.27513-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">15758235</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Campylobacter species and Guillain-Barre syndrome</p>
            </title>
            <aug>
               <au>
                  <snm>Nachamkin</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Allos</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Ho</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Clin Microbiol Rev</source>
            <pubdate>1998</pubdate>
            <volume>11</volume>
            <issue>3</issue>
            <fpage>555</fpage>
            <lpage>567</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">88896</pubid>
                  <pubid idtype="pmpid" link="fulltext">9665983</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Whole genome comparison of Campylobacter jejuni human isolates using a low-cost microarray reveals extensive genetic diversity</p>
            </title>
            <aug>
               <au>
                  <snm>Dorrell</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mangan</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Laing</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>Hinds</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Linton</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Al-Ghusein</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>BG</fnm>
               </au>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stoker</snm>
                  <fnm>NG</fnm>
               </au>
               <au>
                  <snm>Karlyshev</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Butcher</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Wren</snm>
                  <fnm>BW</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <issue>10</issue>
            <fpage>1706</fpage>
            <lpage>1715</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311159</pubid>
                  <pubid idtype="pmpid" link="fulltext">11591647</pubid>
                  <pubid idtype="doi">10.1101/gr.185801</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The genome sequence of the food-borne pathogen Campylobacter jejuni reveals hypervariable sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wren</snm>
                  <fnm>BW</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ketley</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Basham</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chillingworth</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Davies</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Feltwell</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Holroyd</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jagels</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Karlyshev</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Moule</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pallen</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Penn</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Quail</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Rajandream</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>van Vliet</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Whitehead</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>BG</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <issue>6770</issue>
            <fpage>665</fpage>
            <lpage>668</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35001088</pubid>
                  <pubid idtype="pmpid" link="fulltext">10688204</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Genetic and biochemical evidence of a Campylobacter jejuni capsular polysaccharide that accounts for Penner serotype specificity</p>
            </title>
            <aug>
               <au>
                  <snm>Karlyshev</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Linton</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gregson</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Lastovica</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Wren</snm>
                  <fnm>BW</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2000</pubdate>
            <volume>35</volume>
            <issue>3</issue>
            <fpage>529</fpage>
            <lpage>541</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.2000.01717.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10672176</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Analysis of Campylobacter jejuni capsular loci reveals multiple mechanisms for the generation of structural diversity and the ability to form complex heptoses</p>
            </title>
            <aug>
               <au>
                  <snm>Karlyshev</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Champion</snm>
                  <fnm>OL</fnm>
               </au>
               <au>
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Brisson</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Jarrell</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brochu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>St Michael</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wakarchuk</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Goodhead</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Sanders</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stevens</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wren</snm>
                  <fnm>BW</fnm>
               </au>
               <au>
                  <snm>Szymanski</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>55</volume>
            <issue>1</issue>
            <fpage>90</fpage>
            <lpage>103</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.2004.04374.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15612919</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Demonstration of polysaccharide capsule in Campylobacter jejuni using electron microscopy</p>
            </title>
            <aug>
               <au>
                  <snm>Karlyshev</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>McCrossan</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Wren</snm>
                  <fnm>BW</fnm>
               </au>
            </aug>
            <source>Infect Immun</source>
            <pubdate>2001</pubdate>
            <volume>69</volume>
            <issue>9</issue>
            <fpage>5921</fpage>
            <lpage>5924</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">98714</pubid>
                  <pubid idtype="pmpid" link="fulltext">11500474</pubid>
                  <pubid idtype="doi">10.1128/IAI.69.9.5921-5924.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Identification of the carbohydrate moieties and glycosylation motifs in Campylobacter jejuni flagellin</p>
            </title>
            <aug>
               <au>
                  <snm>Thibault</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Logan</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Kelly</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Brisson</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Ewing</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Trust</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Guerry</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2001</pubdate>
            <volume>276</volume>
            <issue>37</issue>
            <fpage>34862</fpage>
            <lpage>34870</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M104529200</pubid>
                  <pubid idtype="pmpid" link="fulltext">11461915</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Campylobacter--a tale of two protein glycosylation systems</p>
            </title>
            <aug>
               <au>
                  <snm>Szymanski</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Logan</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Linton</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wren</snm>
                  <fnm>BW</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>11</volume>
            <issue>5</issue>
            <fpage>233</fpage>
            <lpage>238</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12781527</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>PseG of Pseudaminic Acid Biosynthesis: A UDP-SUGAR HYDROLASE AS A MASKED GLYCOSYLTRANSFERASE</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Tanner</snm>
                  <fnm>ME</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2006</pubdate>
            <volume>281</volume>
            <issue>30</issue>
            <fpage>20902</fpage>
            <lpage>20909</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M602972200</pubid>
                  <pubid idtype="pmpid" link="fulltext">16728396</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A novel paralogous gene family involved in phase-variable flagella-mediated motility in Campylobacter jejuni</p>
            </title>
            <aug>
               <au>
                  <snm>Karlyshev</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Linton</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gregson</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Wren</snm>
                  <fnm>BW</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2002</pubdate>
            <volume>148</volume>
            <issue>Pt 2</issue>
            <fpage>473</fpage>
            <lpage>480</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11832511</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Functional analysis of the Campylobacter jejuni N-linked protein glycosylation pathway</p>
            </title>
            <aug>
               <au>
                  <snm>Linton</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Dorrell</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hitchen</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Amber</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Karlyshev</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>HR</fnm>
               </au>
               <au>
                  <snm>Dell</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Valvano</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Aebi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wren</snm>
                  <fnm>BW</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>55</volume>
            <issue>6</issue>
            <fpage>1695</fpage>
            <lpage>1703</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.2005.04519.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15752194</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>In vitro assembly of the undecaprenylpyrophosphate-linked heptasaccharide for prokaryotic N-linked glycosylation</p>
            </title>
            <aug>
               <au>
                  <snm>Glover</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Weerapana</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Imperiali</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>40</issue>
            <fpage>14255</fpage>
            <lpage>14259</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1242339</pubid>
                  <pubid idtype="pmpid" link="fulltext">16186480</pubid>
                  <pubid idtype="doi">10.1073/pnas.0507311102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Biosynthesis of the N-linked glycan in Campylobacter jejuni and addition onto protein through block transfer</p>
            </title>
            <aug>
               <au>
                  <snm>Kelly</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jarrell</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Millar</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tessier</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fiori</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Lau</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Allan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Szymanski</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2006</pubdate>
            <volume>188</volume>
            <issue>7</issue>
            <fpage>2427</fpage>
            <lpage>2434</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1428418</pubid>
                  <pubid idtype="pmpid" link="fulltext">16547029</pubid>
                  <pubid idtype="doi">10.1128/JB.188.7.2427-2434.2006</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Major structural differences and novel potential virulence mechanisms from the genomes of multiple campylobacter species</p>
            </title>
            <aug>
               <au>
                  <snm>Fouts</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Mongodin</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Mandrell</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>WG</fnm>
               </au>
               <au>
                  <snm>Rasko</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Ravel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Brinkac</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>DeBoy</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Daugherty</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Durkin</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Madupu</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Shetty</snm>
                  <fnm>JU</fnm>
               </au>
               <au>
                  <snm>Ayodeji</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Shvartsbeyn</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schatz</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Badger</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <issue>1</issue>
            <fpage>e15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">539331</pubid>
                  <pubid idtype="pmpid" link="fulltext">15660156</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030015</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Unique features of a highly pathogenic Campylobacter jejuni strain</p>
            </title>
            <aug>
               <au>
                  <snm>Hofreuter</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Watson</snm>
                  <fnm>RO</fnm>
               </au>
               <au>
                  <snm>Novik</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Altman</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Benitez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Perbost</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jarvie</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Du</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Galan</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>Infect Immun</source>
            <pubdate>2006</pubdate>
            <volume>74</volume>
            <issue>8</issue>
            <fpage>4694</fpage>
            <lpage>4707</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1539605</pubid>
                  <pubid idtype="pmpid" link="fulltext">16861657</pubid>
                  <pubid idtype="doi">10.1128/IAI.00210-06</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>The past, present and future of genome-wide re-annotation</p>
            </title>
            <aug>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Karp</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <issue>2</issue>
            <fpage>COMMENT2001</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">139008</pubid>
                  <pubid idtype="pmpid" link="fulltext">11864365</pubid>
                  <pubid idtype="doi">10.1186/gb-2002-3-2-comment2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Re-annotation of the genome sequence of <it>Mycobacterium tuberculosis</it> H37Rv</p>
            </title>
            <aug>
               <au>
                  <snm>Camus</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Pryor</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Medigue</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Cole</snm>
                  <fnm>ST</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2002</pubdate>
            <volume>148</volume>
            <issue>Pt 10</issue>
            <fpage>2967</fpage>
            <lpage>2973</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12368430</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Re-annotating the <it>Mycoplasma pneumoniae</it> genome sequence: adding value, function and reading frames</p>
            </title>
            <aug>
               <au>
                  <snm>Dandekar</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Regula</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Ueberle</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zimmermann</snm>
                  <fnm>CU</fnm>
               </au>
               <au>
                  <snm>Andrade</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sanchez-Pulido</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Suyama</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yuan</snm>
                  <fnm>YP</fnm>
               </au>
               <au>
                  <snm>Herrmann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <issue>17</issue>
            <fpage>3278</fpage>
            <lpage>3288</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">110705</pubid>
                  <pubid idtype="pmpid" link="fulltext">10954595</pubid>
                  <pubid idtype="doi">10.1093/nar/28.17.3278</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Comparative genotyping of Campylobacter jejuni by amplified fragment length polymorphism, multilocus sequence typing, and short repeat sequencing: strain diversity, host range, and recombination</p>
            </title>
            <aug>
               <au>
                  <snm>Schouls</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Reulen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Duim</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wagenaar</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Willems</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Dingle</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Colles</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Van Embden</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>J Clin Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>41</volume>
            <issue>1</issue>
            <fpage>15</fpage>
            <lpage>26</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">149617</pubid>
                  <pubid idtype="pmpid" link="fulltext">12517820</pubid>
                  <pubid idtype="doi">10.1128/JCM.41.1.15-26.2003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Sialylation of lipooligosaccharide cores affects immunogenicity and serum resistance of Campylobacter jejuni</p>
            </title>
            <aug>
               <au>
                  <snm>Guerry</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ewing</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Prendergast</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>AP</fnm>
               </au>
            </aug>
            <source>Infect Immun</source>
            <pubdate>2000</pubdate>
            <volume>68</volume>
            <issue>12</issue>
            <fpage>6656</fpage>
            <lpage>6662</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">97763</pubid>
                  <pubid idtype="pmpid" link="fulltext">11083778</pubid>
                  <pubid idtype="doi">10.1128/IAI.68.12.6656-6662.2000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The genetic bases for the variation in the lipo-oligosaccharide of the mucosal pathogen, Campylobacter jejuni. Biosynthesis of sialylated ganglioside mimics in the core oligosaccharide</p>
            </title>
            <aug>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Karwaski</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Bernatchez</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Taboada</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Michniewicz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cunningham</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Wakarchuk</snm>
                  <fnm>WW</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2002</pubdate>
            <volume>277</volume>
            <issue>1</issue>
            <fpage>327</fpage>
            <lpage>337</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M108452200</pubid>
                  <pubid idtype="pmpid" link="fulltext">11689567</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Multiple N-acetyl neuraminic acid synthetase (neuB) genes in Campylobacter jejuni: identification and characterization of the gene involved in sialylation of lipo-oligosaccharide</p>
            </title>
            <aug>
               <au>
                  <snm>Linton</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Karlyshev</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Hitchen</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>HR</fnm>
               </au>
               <au>
                  <snm>Dell</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gregson</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Wren</snm>
                  <fnm>BW</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2000</pubdate>
            <volume>35</volume>
            <issue>5</issue>
            <fpage>1120</fpage>
            <lpage>1134</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.2000.01780.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10712693</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Novel pathways for biosynthesis of nucleotide-activated glycero-manno-heptose precursors of bacterial glycoproteins and cell surface polysaccharides</p>
            </title>
            <aug>
               <au>
                  <snm>sValvano</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Messner</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kosma</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2002</pubdate>
            <volume>148</volume>
            <issue>Pt 7</issue>
            <fpage>1979</fpage>
            <lpage>1989</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12101286</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Recognizing the pseudogenes in bacterial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Lerat</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>10</issue>
            <fpage>3125</fpage>
            <lpage>3132</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1142405</pubid>
                  <pubid idtype="pmpid" link="fulltext">15933207</pubid>
                  <pubid idtype="doi">10.1093/nar/gki631</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Psi-Phi: exploring the outer limits of bacterial pseudogenes</p>
            </title>
            <aug>
               <au>
                  <snm>Lerat</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>11</issue>
            <fpage>2273</fpage>
            <lpage>2278</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">525686</pubid>
                  <pubid idtype="pmpid" link="fulltext">15479949</pubid>
                  <pubid idtype="doi">10.1101/gr.2925604</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Studying genomes through the aeons: protein families, pseudogenes and proteome evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Harrison</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2002</pubdate>
            <volume>318</volume>
            <issue>5</issue>
            <fpage>1155</fpage>
            <lpage>1174</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0022-2836(02)00109-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">12083509</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>The nature and dynamics of bacterial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Davalos</snm>
                  <fnm>LM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>311</volume>
            <issue>5768</issue>
            <fpage>1730</fpage>
            <lpage>1733</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1119966</pubid>
                  <pubid idtype="pmpid" link="fulltext">16556833</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>A systematic investigation identifies a significant number of probable pseudogenes in the <it>Escherichia coli</it> genome</p>
            </title>
            <aug>
               <au>
                  <snm>Homma</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Fukuchi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kawabata</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ota</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nishikawa</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2002</pubdate>
            <volume>294</volume>
            <issue>1-2</issue>
            <fpage>25</fpage>
            <lpage>33</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(02)00794-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">12234664</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The silencing of pseudogenes</p>
            </title>
            <aug>
               <au>
                  <snm>Mira</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pushker</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <issue>11</issue>
            <fpage>2135</fpage>
            <lpage>2138</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi209</pubid>
                  <pubid idtype="pmpid" link="fulltext">16014873</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Deciphering Campylobacter jejuni cell surface interactions from the genome sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Linton</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Karlyshev</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Wren</snm>
                  <fnm>BW</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>4</volume>
            <issue>1</issue>
            <fpage>35</fpage>
            <lpage>40</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1369-5274(00)00161-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">11173031</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Detection of conserved N-linked glycans and phase-variable lipooligosaccharides and capsules from campylobacter cells by mass spectrometry and high resolution magic angle spinning NMR spectroscopy</p>
            </title>
            <aug>
               <au>
                  <snm>Szymanski</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Michael</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Jarrell</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Larocque</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Vinogradov</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Brisson</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2003</pubdate>
            <volume>278</volume>
            <issue>27</issue>
            <fpage>24509</fpage>
            <lpage>24520</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M301273200</pubid>
                  <pubid idtype="pmpid" link="fulltext">12716884</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The repetitive DNA elements called CRISPRs and their associated genes: evidence of horizontal transfer among prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Godde</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Bickerton</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2006</pubdate>
            <volume>62</volume>
            <issue>6</issue>
            <fpage>718</fpage>
            <lpage>729</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00239-005-0223-z</pubid>
                  <pubid idtype="pmpid" link="fulltext">16612537</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Identification of a novel family of sequence repeats among prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Jansen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>van Embden</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Gaastra</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Schouls</snm>
                  <fnm>LM</fnm>
               </au>
            </aug>
            <source>Omics</source>
            <pubdate>2002</pubdate>
            <volume>6</volume>
            <issue>1</issue>
            <fpage>23</fpage>
            <lpage>33</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/15362310252780816</pubid>
                  <pubid idtype="pmpid">11883425</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>SRPDB: Signal Recognition Particle Database</p>
            </title>
            <aug>
               <au>
                  <snm>Rosenblad</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Gorodkin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Knudsen</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zwieb</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Samuelsson</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>363</fpage>
            <lpage>364</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165554</pubid>
                  <pubid idtype="pmpid" link="fulltext">12520023</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg107</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Prediction of signal recognition particle RNA genes</p>
            </title>
            <aug>
               <au>
                  <snm>Regalia</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rosenblad</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Samuelsson</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>15</issue>
            <fpage>3368</fpage>
            <lpage>3377</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">137091</pubid>
                  <pubid idtype="pmpid" link="fulltext">12140321</pubid>
                  <pubid idtype="doi">10.1093/nar/gkf468</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Comparative genomics of thiamin biosynthesis in procaryotes. New genes and regulatory mechanisms</p>
            </title>
            <aug>
               <au>
                  <snm>Rodionov</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Vitreschak</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Mironov</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Gelfand</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2002</pubdate>
            <volume>277</volume>
            <issue>50</issue>
            <fpage>48949</fpage>
            <lpage>48959</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M208965200</pubid>
                  <pubid idtype="pmpid" link="fulltext">12376536</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium</p>
            </title>
            <aug>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Issel-Tarver</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kasarskis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matese</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Ringwald</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <issue>1</issue>
            <fpage>25</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/75556</pubid>
                  <pubid idtype="pmpid" link="fulltext">10802651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Basic local alignment search tool</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1990</pubdate>
            <volume>215</volume>
            <issue>3</issue>
            <fpage>403</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2231712</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Improved tools for biological sequence comparison</p>
            </title>
            <aug>
               <au>
                  <snm>Pearson</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1988</pubdate>
            <volume>85</volume>
            <issue>8</issue>
            <fpage>2444</fpage>
            <lpage>2448</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">280013</pubid>
                  <pubid idtype="pmpid" link="fulltext">3162770</pubid>
                  <pubid idtype="doi">10.1073/pnas.85.8.2444</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Pfam: a comprehensive database of protein domain families based on seed alignments</p>
            </title>
            <aug>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>1997</pubdate>
            <volume>28</volume>
            <issue>3</issue>
            <fpage>405</fpage>
            <lpage>420</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/(SICI)1097-0134(199707)28:3&lt;405::AID-PROT10>3.0.CO;2-L</pubid>
                  <pubid idtype="pmpid" link="fulltext">9223186</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>The PROSITE database, its status in 2002</p>
            </title>
            <aug>
               <au>
                  <snm>Falquet</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Pagni</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bucher</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hulo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sigrist</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Hofmann</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>1</issue>
            <fpage>235</fpage>
            <lpage>238</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99105</pubid>
                  <pubid idtype="pmpid" link="fulltext">11752303</pubid>
                  <pubid idtype="doi">10.1093/nar/30.1.235</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Rfam: an RNA family database</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Khanna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>439</fpage>
            <lpage>441</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165453</pubid>
                  <pubid idtype="pmpid" link="fulltext">12520045</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg006</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>A hidden Markov model for predicting transmembrane helices in protein sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Krogh</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Int Conf Intell Syst Mol Biol</source>
            <pubdate>1998</pubdate>
            <volume>6</volume>
            <fpage>175</fpage>
            <lpage>182</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9783223</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites</p>
            </title>
            <aug>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Engelbrecht</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Protein Eng</source>
            <pubdate>1997</pubdate>
            <volume>10</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/protein/10.1.1</pubid>
                  <pubid idtype="pmpid" link="fulltext">9051728</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p/>
            </title>
            <aug>
               <au>
                  <snm>Entrez Pubmed</snm>
                  <fnm>NIH</fnm>
                  <suf>USA</suf>
               </au>
            </aug>
            <url>http://www.ncbi.nlm.nih.gov/entrez/query.fcgi</url>
         </bibl>
         <bibl id="B47">
            <title>
               <p/>
            </title>
            <aug>
               <au>
                  <snm>HighWire Press</snm>
                  <fnm>SU</fnm>
               </au>
            </aug>
            <url>http://highwire.stanford.edu/</url>
         </bibl>
         <bibl id="B48">
            <title>
               <p/>
            </title>
            <aug>
               <au>
                  <snm>Scirus - for scientific information only</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <url>http://www.scirus.com/srsapp/</url>
         </bibl>
         <bibl id="B49">
            <title>
               <p/>
            </title>
            <aug>
               <au>
                  <snm>Scholar</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <url>http://scholar.google.com/</url>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Artemis: sequence visualization and annotation</p>
            </title>
            <aug>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Crook</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Horsnell</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Rice</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rajandream</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>10</issue>
            <fpage>944</fpage>
            <lpage>945</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.10.944</pubid>
                  <pubid idtype="pmpid" link="fulltext">11120685</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>GeneDB: a resource for prokaryotic and eukaryotic organisms</p>
            </title>
            <aug>
               <au>
                  <snm>Hertz-Fowler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Peacock</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Aslett</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kerhornou</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mooney</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Tivey</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Berriman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ivens</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Rajandream</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database issue</issue>
            <fpage>D339</fpage>
            <lpage>43</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308742</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681429</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>coliBASE: an online database for Escherichia coli, Shigella and Salmonella comparative genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Chaudhuri</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Khan</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Pallen</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database issue</issue>
            <fpage>D296</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308765</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681417</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh031</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
