<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
<ui>1471-2164-13-89</ui>
<ji>1471-2164</ji>
<fm>
<dochead>Research article</dochead>
<bibl>
<title><p>Genome sequencing of ovine isolates of <it>Mycobacterium avium </it>subspecies <it>paratuberculosis </it>offers insights into host association</p></title>
<aug>
<au id="A1" ca="yes"><snm>Bannantine</snm><mi>P</mi><fnm>John</fnm><insr iid="I1"/><email>John.Bannantine@ARS.USDA.GOV</email></au>
<au id="A2"><snm>Wu</snm><fnm>Chia-wei</fnm><insr iid="I2"/><email>microwu3388@gmail.com</email></au>
<au id="A3"><snm>Hsu</snm><fnm>Chungyi</fnm><insr iid="I2"/><email>cyhsu@svm.vetmed.wisc.edu</email></au>
<au id="A4"><snm>Zhou</snm><fnm>Shiguo</fnm><insr iid="I3"/><email>szhou@wisc.edu</email></au>
<au id="A5"><snm>Schwartz</snm><mi>C</mi><fnm>David</fnm><insr iid="I3"/><email>dcschwartz@wisc.edu</email></au>
<au id="A6"><snm>Bayles</snm><mi>O</mi><fnm>Darrell</fnm><insr iid="I1"/><email>Darrell.Bayles@ARS.USDA.GOV</email></au>
<au id="A7"><snm>Paustian</snm><mi>L</mi><fnm>Michael</fnm><insr iid="I1"/><email>mike.paustian@gmail.com</email></au>
<au id="A8"><snm>Alt</snm><mi>P</mi><fnm>David</fnm><insr iid="I1"/><email>David.Alt@ARS.USDA.GOV</email></au>
<au id="A9"><snm>Sreevatsan</snm><fnm>Srinand</fnm><insr iid="I4"/><email>sreev001@umn.edu</email></au>
<au id="A10"><snm>Kapur</snm><fnm>Vivek</fnm><insr iid="I5"/><email>vkapur@psu.edu</email></au>
<au id="A11" ca="yes"><snm>Talaat</snm><mi>M</mi><fnm>Adel</fnm><insr iid="I2"/><insr iid="I6"/><email>atalaat@wisc.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>National Animal Disease Center, USDA-Agricultural Research Service, Ames, Iowa, USA</p></ins>
<ins id="I2"><p>The Laboratory of Bacterial Genomics, Department of Pathobiological Sciences, University of Wisconsin-Madison, Madison, Wisconsin, USA</p></ins>
<ins id="I3"><p>Laboratory for Molecular and Computational Genomics, Department of Chemistry and Laboratory of Genetics, UW Biotechnology Center, University of Wisconsin-Madison, Madison, Wisconsin, USA</p></ins>
<ins id="I4"><p>Department of Veterinary Population Medicine and Department of Veterinary and Biomedical Sciences, University of Minnesota, St. Paul, Minnesota, USA</p></ins>
<ins id="I5"><p>Department of Veterinary and Biomedical Sciences and Huck Institutes of the Life Sciences, Penn State University, University Park, Pennsylvania, USA</p></ins>
<ins id="I6"><p>Department of Food Hygiene, Cairo University, Cairo, Egypt</p></ins>
</insg>
<source>BMC Genomics</source>
<section><title><p>Eukaryote Microbial Genomics </p></title></section><issn>1471-2164</issn>
<pubdate>2012</pubdate>
<volume>13</volume>
<issue>1</issue>
<fpage>89</fpage>
<url>http://www.biomedcentral.com/1471-2164/13/89</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-13-89</pubid><pubid idtype="pmpid">22409516</pubid></pubidlist></xrefbib></bibl>
<history><rec><date><day>7</day><month>10</month><year>2011</year></date></rec><acc><date><day>12</day><month>3</month><year>2012</year></date></acc><pub><date><day>12</day><month>3</month><year>2012</year></date></pub></history><cpyrt><year>2012</year><collab>Bannantine et al; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<kwdg><kwd><it>M. paratuberculosis</it></kwd><kwd>Evolution</kwd><kwd>Johne's disease</kwd><kwd>Genome</kwd><kwd>Optical mapping</kwd></kwdg>
<abs>
<sec><st><p>Abstract</p></st>
<sec><st><p>Background</p></st>
<p>The genome of <it>Mycobacterium avium </it>subspecies <it>paratuberculosis </it>(<it>MAP</it>) is remarkably homogeneous among the genomes of bovine, human and wildlife isolates. However, previous work in our laboratories with the bovine K-10 strain has revealed substantial differences compared to sheep isolates. To systematically characterize all genomic differences that may be associated with the specific hosts, we sequenced the genomes of three U.S. sheep isolates and also obtained an optical map.</p>
</sec>
<sec><st><p>Results</p></st>
<p>Our analysis of one of the isolates, <it>MAP </it>S397, revealed a genome 4.8 Mb in size with 4,700 open reading frames (ORFs). Comparative analysis of the <it>MAP </it>S397 isolate showed it acquired approximately 10 large sequence regions that are shared with the human <it>M. avium </it>subsp. <it>hominissuis </it>strain 104 and lost 2 large regions that are present in the bovine strain. In addition, optical mapping defined the presence of 7 large inversions between the bovine and ovine genomes (~ 2.36 Mb). Whole-genome sequencing of 2 additional sheep strains of <it>MAP </it>(JTC1074 and JTC7565) further confirmed genomic homogeneity of the sheep isolates despite the presence of polymorphisms on the nucleotide level.</p>
</sec>
<sec><st><p>Conclusions</p></st>
<p>Comparative sequence analysis employed here provided a better understanding of the host association, evolution of members of the <it>M. avium </it>complex and could help in deciphering the phenotypic differences observed among sheep and cattle strains of <it>MAP</it>. A similar approach based on whole-genome sequencing combined with optical mapping could be employed to examine closely related pathogens. We propose an evolutionary scenario for <it>M. avium </it>complex strains based on these genome sequences.</p>
</sec>
</sec>
</abs>
</fm>
<bdy>
<sec><st><p>Background</p></st>
<p><it>Mycobacterium avium </it>subspecies <it>paratuberculosis </it>(<it>MAP</it>) causes Johne's disease in sheep, cattle, goats and other ruminant animals. This disease is chronic in nature with multiple years separating the initial infection from clinical stages of disease <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. The details of the pathogenic mechanisms occurring during this long incubation period still need further study, but it has been demonstrated that MAP colonizes the small intestine through invasion of both M cells and epithelial cells <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. The disease is of considerable economic significance to livestock industries, particularly the dairy industry.</p>
<p>Generally, <it>MAP </it>is a genetically homogenous subspecies, especially among bovine, human and wildlife isolates <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. However, three lineages of <it>MAP </it>have emerged following extensive molecular strain typing and comparative genomic studies-type I and type III strains (ovine) and type II (bovine) strains. The type III strains were originally called intermediate strains and are highly similar genetically, and thus, difficult to distinguish from type I strains. Early on, the type I (<it>MAP</it>-S) and type II (<it>MAP</it>-C) strains were distinguished based on their molecular fingerprints using IS<it>1311 </it>polymorphism <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, representational difference analysis <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, MLSSR typing <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp> and <it>hsp65 </it>sequencing <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. On the other hand, type III (a sub-lineage of the <it>MAP</it>-S strains) was genotyped based on <it>gyrA </it>and <it>gyrB </it>genes <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
<p>In addition to these recently published genotypic distinctions between "S" and "C" strains of <it>MAP</it>, phenotypic differences have been noted since the middle of the last century <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. More recently, Motiwala et al. <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> have shown transcriptional changes in human macrophages infected with <it>MAP</it>-C, human and bison isolates induce an anti-inflammatory gene expression pattern, while the <it>MAP</it>-S isolates showed expression of pro-inflammatory cytokines. Furthermore, some of the ovine strains are pigmented <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The ovine and bovine strains likewise are distinct in their growth characteristics. The <it>MAP</it>-S strains are more fastidious and slower in their growth rate than the <it>MAP</it>-C counterpart. In contrast to <it>MAP</it>-C strains, the <it>MAP</it>-S strains do not grow readily on Herold's egg yolk media or Middlebrook 7H9 media that is not supplemented with egg yolk <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Nutrient limitation will kill <it>MAP</it>-S strains but it is only bacteriostatic for <it>MAP</it>-C strains <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. On the transcriptional level, RNA extracted in low iron and heat stressed environments is divergent between <it>MAP</it>-S and <it>MAP</it>-C strains <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Recently, iron storage in low iron conditions was only observed in the <it>MAP</it>-C strains but not <it>MAP</it>-S strains <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Because of these well-documented phenotypic differences, we hypothesized that sequencing of the genomes of ovine isolates and comparing them to other genomes in the MAC group could provide some clues for these host-specific variations.</p>
<p>The <it>MAP</it>-C strain K-10 was sequenced in 2005 to obtain a complete genome 4.8 Mb in size <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. It was subsequently found to possess an inversion due to misalignment that was resolved by optical mapping <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Very recently, draft sequences of ten <it>MAP </it>isolates have been reported with the presence of two large duplications, especially among human isolates <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Finally, another <it>M. avium </it>subspecies (strain 104) has also been sequenced but not published as yet. This genome of subspecies <it>hominissuis </it>is 5.4 Mb in size and greater than 95% homologous to the <it>MAP </it>K-10 genome <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B5">5</abbr><abbr bid="B22">22</abbr></abbrgrp>. Both of these genomes have served as reference genomes in the current project to assist in assembly, open reading frame (ORF) predictions, and annotation. With the help of next-generation sequencing and optical mapping, we were able to assemble a draft of the standard sheep strain of <it>MAP </it>S397 and compare its sequence to other clinical isolates from sheep or the K-10 strain. Interestingly, several inversion regions and single nucleotide polymorphisms distinguished the <it>MAP</it>-S strains from their <it>MAP</it>-C counterpart. Insights into the evolution of <it>MAP </it>strains have been gained through this analysis.</p>
</sec>
<sec><st><p>Results</p></st>
<sec><st><p>Genome general features</p></st>
<p>Pyrosequencing indicated that the <it>MAP </it>strain S397 has a circular chromosome with at least 4,814,922 bp, a G + C content of 69.31% and contains 4,700 predicted open reading frames (ORFs). The majority of these genes (44.5%) were predicted <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> to encode cytoplasmic proteins (Additional file <supplr sid="S1">1</supplr>: Table S1) involved in various cellular functions and a minority of extracellular proteins (&lt; 1%). The number of annotated genes in S397 was more than the bovine K-10 strain (Table <tblr tid="T1">1</tblr>) due to the different annotation methods used on each genome <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. However, like <it>MAP </it>K-10, the S397 genome contains one rRNA operon and 46 tRNA genes representing all 20 amino acids. A detailed comparison between <it>MAP </it>strains K-10 and S397 as well as the human, <it>MAH </it>104 is shown in Table <tblr tid="T1">1</tblr>. The <it>de novo </it>assembly of the compiled S397 genome had an average sequencing depth of 24 &#215; in 184 scaffolds (Additional file <supplr sid="S2">2</supplr>: Table S2). When aligned to the K-10 sequence, over 110 of these scaffolds are separated by a sequence gap of less than 500 bp suggesting the small size of most gaps. Furthermore, when gaps of 3.5 kb or less were ignored, we were able to assemble the whole genome into 3 scaffolds. The two largest sequence gaps are between contig00150c and contig00149c, which is estimated at 30.19 kb and the contig00082-contig00041c gap, which is estimated at 18.87 kb. Additional file <supplr sid="S3">3</supplr>: Table S3 gives an overview of the ordered scaffolds.</p>
<suppl id="S1">
<title><p>Additional file 1</p></title>
<text><p><b>Table S1</b>. PSORT analysis of MAP S397 genes to determine their localization.</p></text>
<file name="1471-2164-13-89-S1.XLSX">
   <p>Click here for file</p>
</file>
</suppl>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>A summary of the genomic features of <it>M. avium </it>subspecies isolates from different hosts</p></caption><tblbdy cols="4">
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <b>MAP K-10</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>MAP S397</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>MAH 104</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Origin</p>
         </c>
         <c ca="center">
            <p>Bovine</p>
         </c>
         <c ca="center">
            <p>Ovine</p>
         </c>
         <c ca="center">
            <p>Human</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Genome size (bp)</p>
         </c>
         <c ca="center">
            <p>4,829,781</p>
         </c>
         <c ca="center">
            <p>4,814,922</p>
         </c>
         <c ca="center">
            <p>5,475,491</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>DNA scaffolds</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>184</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>G + C (%)</p>
         </c>
         <c ca="center">
            <p>69.30</p>
         </c>
         <c ca="center">
            <p>69.31</p>
         </c>
         <c ca="center">
            <p>68.99</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Protein coding (%)</p>
         </c>
         <c ca="center">
            <p>91.56</p>
         </c>
         <c ca="center">
            <p>91.31</p>
         </c>
         <c ca="center">
            <p>88.66</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Total genes</p>
         </c>
         <c ca="center">
            <p>4,415</p>
         </c>
         <c ca="center">
            <p>4,700</p>
         </c>
         <c ca="center">
            <p>5,305</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Total protein coding genes (PCG)</p>
         </c>
         <c ca="center">
            <p>4,350</p>
         </c>
         <c ca="center">
            <p>4,642</p>
         </c>
         <c ca="center">
            <p>5,240</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>PCG without function prediction</p>
         </c>
         <c ca="center">
            <p>3,014</p>
         </c>
         <c ca="center">
            <p>1,085</p>
         </c>
         <c ca="center">
            <p>1,732</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>PCG connected to KEGG pathways</p>
         </c>
         <c ca="center">
            <p>1,215</p>
         </c>
         <c ca="center">
            <p>1,249</p>
         </c>
         <c ca="center">
            <p>1,303</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>tRNAs</p>
         </c>
         <c ca="center">
            <p>46</p>
         </c>
         <c ca="center">
            <p>46</p>
         </c>
         <c ca="center">
            <p>46</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>rRNA operon</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
      </r>
   </tblbdy></tbl>
<suppl id="S2">
<title><p>Additional file 2</p></title>
<text><p><b>Table S2</b>. A list of 184 genomic DNA scaffolds of <it>M</it>AP S397 genome.</p></text>
<file name="1471-2164-13-89-S2.XLSX">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S3">
<title><p>Additional file 3</p></title>
<text><p><b>Table S3</b>. Synteny of annotated genes between <it>MAP </it>K-10 and <it>MAP </it>S397 genomes.</p></text>
<file name="1471-2164-13-89-S3.XLSX">
   <p>Click here for file</p>
</file>
</suppl>
<p>Analysis of the two additional genomes sequenced in this study (JTC1074 and JTC7565) revealed more than 99% identity to the S397 genome sequence (Table <tblr tid="T2">2</tblr>). A de novo assembly of these genomes sequenced using Illumina platform produced an average sequence depth of 60 &#215;. As expected, no significant differences were found between the common features of the 3 sequenced sheep isolate genomes. In fact, there were no gene differences; hence all three genomes were identically annotated. Similar to other sequenced mycobacterial genomes, <it>dnaA </it>was assigned the first locus tag (MAPs_00010).</p>
<tbl id="T2"><title><p>Table 2</p></title><caption><p>Reference genome assembly of clinical ovine isolates using simulated <it>MAP </it>S397 genome</p></caption><tblbdy cols="3">
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <b>JTC1074</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>JTC7565</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Reference organism</p>
         </c>
         <c ca="left">
            <p>MAP S397</p>
         </c>
         <c ca="left">
            <p>MAP S397</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Reference length</p>
         </c>
         <c ca="left">
            <p>4,766,015</p>
         </c>
         <c ca="left">
            <p>4,766,015</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Consensus length</p>
         </c>
         <c ca="left">
            <p>4,753,502</p>
         </c>
         <c ca="left">
            <p>4,742,737</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>%Homology<sup>a </sup>to S397</p>
         </c>
         <c ca="left">
            <p>99.74</p>
         </c>
         <c ca="left">
            <p>99.51</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>%Homology to K-10</p>
         </c>
         <c ca="left">
            <p>98.79</p>
         </c>
         <c ca="left">
            <p>98.56</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Average Coverage<sup>b</sup></p>
         </c>
         <c ca="left">
            <p>59.89</p>
         </c>
         <c ca="left">
            <p>62.77</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Standard deviation</p>
         </c>
         <c ca="left">
            <p>32.36</p>
         </c>
         <c ca="left">
            <p>34.65</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Non-specific matches read count<sup>c</sup></p>
         </c>
         <c ca="left">
            <p>1387</p>
         </c>
         <c ca="left">
            <p>1461</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Paired read distance distribution</p>
         </c>
         <c ca="left">
            <p>140-360</p>
         </c>
         <c ca="left">
            <p>120-380</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>No. of SNPs</p>
         </c>
         <c ca="left">
            <p>70</p>
         </c>
         <c ca="left">
            <p>53</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><sup>a</sup>Homology% was calculated as: consensus length divided by reference length and then multiplied by 100</p>
      <p><sup>b</sup>Average coverage is the average of all the reads in each area in the consensus sequence</p>
      <p><sup>c</sup>Non-specific match read counts are those reads that can be matched more than one place in the reference genome and such reads were randomly placed in one of the matched spots</p>
   </tblfn></tbl>
<p>The IS elements usually play a role in the genomic diversity among strains of mycobacteria <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> and could act as a good target for molecular diagnostics <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Similar to K-10, the S397 genome has all the well-studied insertion sequences (e.g. IS<it>900</it>, IS<it>1311 </it>and IS<it>_map02</it>). IS<it>900 </it>is generally considered a <it>MAP </it>specific element that was originally discovered in 1989 <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. A total of 17 copies of IS<it>900 </it>were found in the S397 genome, which is identical to the K-10 strain. Another element, IS<it>_map02</it>, is a <it>MAP </it>specific insertion sequence that was discovered by sequencing the K-10 genome. A total of 6 copies of IS<it>_map02 </it>are present in both S397 and K-10. Likewise, IS<it>1311 </it>is present 7 times in each genome. No IS elements were found to be unique to one or the other genome.</p>
</sec>
<sec><st><p>Organization of the <it>MAP </it>S397 genome</p></st>
<p>Sequence analysis alone was not sufficient to decipher the synteny of the genome. Previously, we used an optical mapping protocol to confirm the organization of the <it>MAP </it>K-10 genome <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. A similar strategy was used to analyze the genome of S397. The raw optical map dataset comprised 2,950 single molecule maps with a total mass of 784.5 Mb, and an average molecule size of 333.6 Kb (Figure <figr fid="F1">1</figr>). After assembly, the compiled optical map contained 905 single molecule optical maps (301.9 Mb; total mass), which covers the genome 58 &#215;. After a G + C content adjustment by a factor of 0.95, the estimated size of <it>MAP </it>S397 optical map is 4.95 Mb, which is slightly higher than the sequence data suggested. However, if the estimated sequence gaps are added in, the estimated sizes are very similar.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Optical map of the <it>MAP </it>S397 genome</p></caption><text>
   <p><b>Optical map of the <it>MAP </it>S397 genome</b>. A total of 905 optical contigs were assembled into one circular consensus map, which has a 58-fold genome coverage and totaled 4.95 Mb. Optical contigs are represented by arcs of various lengths. Each arc is intersected by radiating lines that represent <it>Bsi</it>WI cutting sites, and arbitrary colors represent homologous overlapping fragments.</p>
</text><graphic file="1471-2164-13-89-1" hint_layout="double"/></fig>
<p>To our surprise, there were 7 inversions that are larger than 22 kb when the S397 genome was compared to the sequenced genome of K-10 compiled by Wynne <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B28">28</abbr></abbrgrp>. The total size of these inversions spanned 2.4 Mb of the S397 genome. Individual sizes of those inversions range from 22 to 1,174 kb. As shown in Figure <figr fid="F2">2B</figr>, homologous segments between <it>MAP </it>K-10 and S397 are represented by color boxes and to each segment a number was assigned. Detail information of each segment is shown in Table <tblr tid="T3">3</tblr>. Thirteen out of the 14 segments have at least one IS element on the flanking regions (Figure <figr fid="F2">2</figr>).</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Comparative genome analysis of K-10 and S397 <it>MAP </it>strains</p></caption><text>
   <p><b>Comparative genome analysis of K-10 and S397 <it>MAP </it>strains</b>. <b>(A) </b>Comparison of the <it>Bsi</it>WI restriction maps between K-10 (inner circle) and S397 (outer circle). Each box represents a restriction fragment. Green boxes are regions in the same direction and red boxes are regions that are inverted between the two genomes. White boxes are fragments that are not aligned. The red thin line at 12 o'clock is the locus of the gene <it>dnaA</it>. <b>(B) </b>Mauve alignment of all 184 scaffolds of S397 (bottom) with the complete genome of K-10 (top). The colored boxes represent homologous regions present in each genome, which are also connected by lines. Blocks below the centerline of the S397 genome indicate regions with inverse orientation. Regions outside the blocks lack homology between the genomes. Within each block there is a similarity profile of the DNA sequences and the white areas indicate sequences specific to a genome. The scale is in base pairs.</p>
</text><graphic file="1471-2164-13-89-2" hint_layout="double"/></fig>
<tbl id="T3"><title><p>Table 3</p></title><caption><p>Boundaries and flanking ORFs of aligned segments between <it>MAP </it>K-10 and <it>MAP </it>S397 genomes</p></caption><tblbdy cols="9">
      <r>
         <c>
            <p/>
         </c>
         <c cspan="2" ca="left">
            <p>
               <b>K-10 coordinates by Wynne </b>
               <abbrgrp>
                  <abbr bid="B28">
                     <b>28</b>
                  </abbr>
               </abbrgrp>
            </p>
         </c>
         <c cspan="2" ca="left">
            <p>
               <b>K-10 coordinates by Li </b>
               <abbrgrp>
                  <abbr bid="B19">
                     <b>19</b>
                  </abbr>
               </abbrgrp>
            </p>
         </c>
         <c cspan="2" ca="left">
            <p>
               <b>Flanking ORF</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Approx. size (Kb)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Alignment between K-10 and S397</b>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>ID</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Left end</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Right end</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Left end</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Right end</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Left end</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Right end</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c cspan="9">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1</p>
         </c>
         <c ca="left">
            <p>4751793</p>
         </c>
         <c ca="left">
            <p>11164</p>
         </c>
         <c ca="left">
            <p>4748984</p>
         </c>
         <c ca="left">
            <p>11164</p>
         </c>
         <c ca="left">
            <p>IS<it>900</it></p>
         </c>
         <c ca="left">
            <p>IS_MAP03 and IS<it>1311</it></p>
         </c>
         <c ca="left">
            <p>92</p>
         </c>
         <c ca="left">
            <p>Forward</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>2</p>
         </c>
         <c ca="left">
            <p>13889</p>
         </c>
         <c ca="left">
            <p>325818</p>
         </c>
         <c ca="left">
            <p>4197078</p>
         </c>
         <c ca="left">
            <p>3885146</p>
         </c>
         <c ca="left">
            <p>IS_MAP03 and IS<it>1311</it></p>
         </c>
         <c ca="left">
            <p>No known IS element</p>
         </c>
         <c ca="left">
            <p>312</p>
         </c>
         <c ca="left">
            <p>Forward</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3</p>
         </c>
         <c ca="left">
            <p>357239</p>
         </c>
         <c ca="left">
            <p>1424396</p>
         </c>
         <c ca="left">
            <p>3853725</p>
         </c>
         <c ca="left">
            <p>2786561</p>
         </c>
         <c ca="left">
            <p>IS_MAP02</p>
         </c>
         <c ca="left">
            <p>No known IS element</p>
         </c>
         <c ca="left">
            <p>1067</p>
         </c>
         <c ca="left">
            <p>Forward</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>4</p>
         </c>
         <c ca="left">
            <p>1424397</p>
         </c>
         <c ca="left">
            <p>1627687</p>
         </c>
         <c ca="left">
            <p>2786560</p>
         </c>
         <c ca="left">
            <p>2583268</p>
         </c>
         <c ca="left">
            <p>No known IS element</p>
         </c>
         <c ca="left">
            <p>ISmav2</p>
         </c>
         <c ca="left">
            <p>203</p>
         </c>
         <c ca="left">
            <p>Inverted</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>5</p>
         </c>
         <c ca="left">
            <p>1629051</p>
         </c>
         <c ca="left">
            <p>1828408</p>
         </c>
         <c ca="left">
            <p>2581904</p>
         </c>
         <c ca="left">
            <p>2382587</p>
         </c>
         <c ca="left">
            <p>ISmav2</p>
         </c>
         <c ca="left">
            <p>IS<it>1311</it></p>
         </c>
         <c ca="left">
            <p>199</p>
         </c>
         <c ca="left">
            <p>Forward</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>6</p>
         </c>
         <c ca="left">
            <p>1829822</p>
         </c>
         <c ca="left">
            <p>3004096</p>
         </c>
         <c ca="left">
            <p>2381173</p>
         </c>
         <c ca="left">
            <p>1206891</p>
         </c>
         <c ca="left">
            <p>IS<it>1311</it></p>
         </c>
         <c ca="left">
            <p>IS<it>1311</it></p>
         </c>
         <c ca="left">
            <p>1174</p>
         </c>
         <c ca="left">
            <p>Inverted</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>7</p>
         </c>
         <c ca="left">
            <p>3005512</p>
         </c>
         <c ca="left">
            <p>3524059</p>
         </c>
         <c ca="left">
            <p>1205475</p>
         </c>
         <c ca="left">
            <p>686927</p>
         </c>
         <c ca="left">
            <p>IS<it>1311</it></p>
         </c>
         <c ca="left">
            <p>ISmav2</p>
         </c>
         <c ca="left">
            <p>519</p>
         </c>
         <c ca="left">
            <p>Forward</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>8</p>
         </c>
         <c ca="left">
            <p>3525433</p>
         </c>
         <c ca="left">
            <p>3737882</p>
         </c>
         <c ca="left">
            <p>685553</p>
         </c>
         <c ca="left">
            <p>473103</p>
         </c>
         <c ca="left">
            <p>ISmav2</p>
         </c>
         <c ca="left">
            <p>No known IS element</p>
         </c>
         <c ca="left">
            <p>212</p>
         </c>
         <c ca="left">
            <p>Inverted</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>9</p>
         </c>
         <c ca="left">
            <p>3737883</p>
         </c>
         <c ca="left">
            <p>3833853</p>
         </c>
         <c ca="left">
            <p>473102</p>
         </c>
         <c ca="left">
            <p>377132</p>
         </c>
         <c ca="left">
            <p>No known IS element</p>
         </c>
         <c ca="left">
            <p>No known IS element</p>
         </c>
         <c ca="left">
            <p>96</p>
         </c>
         <c ca="left">
            <p>Forward</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>10</p>
         </c>
         <c ca="left">
            <p>3849713</p>
         </c>
         <c ca="left">
            <p>3890501</p>
         </c>
         <c ca="left">
            <p>361272</p>
         </c>
         <c ca="left">
            <p>320484</p>
         </c>
         <c ca="left">
            <p>IS_MAP02</p>
         </c>
         <c ca="left">
            <p>No known IS element</p>
         </c>
         <c ca="left">
            <p>41</p>
         </c>
         <c ca="left">
            <p>Inverted</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>11</p>
         </c>
         <c ca="left">
            <p>3890502</p>
         </c>
         <c ca="left">
            <p>4098807</p>
         </c>
         <c ca="left">
            <p>320483</p>
         </c>
         <c ca="left">
            <p>112177</p>
         </c>
         <c ca="left">
            <p>No known IS element</p>
         </c>
         <c ca="left">
            <p>IS<it>1311</it></p>
         </c>
         <c ca="left">
            <p>208</p>
         </c>
         <c ca="left">
            <p>Inverted</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>12</p>
         </c>
         <c ca="left">
            <p>4100223</p>
         </c>
         <c ca="left">
            <p>4176993</p>
         </c>
         <c ca="left">
            <p>110761</p>
         </c>
         <c ca="left">
            <p>33991</p>
         </c>
         <c ca="left">
            <p>IS<it>1311</it></p>
         </c>
         <c ca="left">
            <p>IS<it>1311</it></p>
         </c>
         <c ca="left">
            <p>77</p>
         </c>
         <c ca="left">
            <p>Forward</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>13</p>
         </c>
         <c ca="left">
            <p>4177670</p>
         </c>
         <c ca="left">
            <p>4199835</p>
         </c>
         <c ca="left">
            <p>33287</p>
         </c>
         <c ca="left">
            <p>11158</p>
         </c>
         <c ca="left">
            <p>IS<it>1311</it></p>
         </c>
         <c ca="left">
            <p>IS<it>1311</it></p>
         </c>
         <c ca="left">
            <p>22</p>
         </c>
         <c ca="left">
            <p>Inverted</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>14</p>
         </c>
         <c ca="left">
            <p>4201310</p>
         </c>
         <c ca="left">
            <p>unknown<sup>1</sup></p>
         </c>
         <c ca="left">
            <p>4198493</p>
         </c>
         <c ca="left">
            <p>unknown<sup>1</sup></p>
         </c>
         <c ca="left">
            <p>IS<it>1311</it></p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>562</p>
         </c>
         <c ca="left">
            <p>Inverted</p>
         </c>
      </r>
   </tblbdy></tbl>
<p>Similar to our analysis of inversions discovered in the K-10 strain, we used a PCR-based approach to examine two of the inversion breakpoints in the S397 genome (Figure <figr fid="F3">3</figr>), which are the right end of segment ID #1 and the left end of segment ID#2 (Table <tblr tid="T3">3</tblr>). As expected, our PCR analysis confirmed the inversion predicted in the genome of K-10 and S397 strains. Because these inversions were readily identified from the optical map and sequence alignment data, we did not attempt to confirm all of the inverted fragments by PCR. Despite these inversions, there is strong synteny between these genomes, underscoring their close relatedness. Both genomes share a number of large-scale clusters of homology where gene order is highly conserved (Additional file <supplr sid="S4">4</supplr>: Table S4).</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>PCR analysis of a 648-kb inverted region <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> between genomes of <it>MAP </it>bovine type strain (K-10), and four ovine strains (S397, JTC1074, 1294 and 7565)</p></caption><text>
   <p><b>PCR analysis of a 648-kb inverted region </b><abbrgrp><abbr bid="B20"><b>20</b></abbr></abbrgrp><b>between genomes of <it>MAP </it>bovine type strain (K-10), and four ovine strains (S397, JTC1074, 1294 and 7565)</b>. <b>(A) </b>A diagram showing the inverted region (gray-to-black gradient box) and location of primers used in the PCR analysis. All primers were designed according to the published <it>MAP </it>K-10 genome sequence <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. <b>(B) </b>PCR results on an ethidium bromide stained agarose gel. Lanes loaded with PCR products amplified with original primer pairs F1 + R1 (lane 8-13) or F2 + R2 (lane 14-19) show no PCR products from the cattle strain (lane 9 and 22) but a 2.1-kb and 3.6-kb band from the sheep strains (lane 10-13 and 23-26), respectively. Lanes with products amplified with switched primer pairs F1 + F2 (lane 14-19) or R1 + R2 (lane 27-32) show a 3.6-kb and 2.3-kb fragment from the cattle strain (lane 15 and 28) but no product from the sheep strains (lane 16-19 and 29-32), respectively. The opposite PCR amplification pattern between the cattle and sheep strains confirmed that this segment is inverted between these 2 genomes.</p>
</text><graphic file="1471-2164-13-89-3" hint_layout="double"/></fig>
<suppl id="S4">
<title><p>Additional file 4</p></title>
<text><p><b>Table S4</b>. <it>MAP </it>S397 genes that are absent in <it>MAP </it>K-10.</p></text>
<file name="1471-2164-13-89-S4.XLSX">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec><st><p>Genomic insertions</p></st>
<p>Further comparative sequence analysis identified several regions that are present in <it>MAP </it>S397 and <it>MAH </it>104, but not in <it>MAP </it>K-10 (Additional file <supplr sid="S5">5</supplr>: Table S5). The largest of these is a 9-kb gene cluster encompassing 13 ORFs (MAPs_15940-MAPs_16060). This region was partially identified by representation difference analysis and termed PIG-RDA20 for pigmented strain representational difference analysis-20, as detailed before <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. It was also mapped to the <it>MAH </it>104 genome by Dohmann and coworkers <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> and was subsequently described by Semret and coworkers as large sequence polymorphism (LSP), LSP<sup>A</sup>4-II <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. This region contains a copy of the IS<it>1311 </it>insertion sequence and within the <it>MAH </it>104 genome is flanked by an additional copy of IS<it>1311</it>. Another previously described LSP included 9 ORFs (MAPs_46190-MAPs_46270) and totals 6.6 kb. This region was partially identified as the PIG-RDA10 sequence and was mapped to a 16 kb segment of the <it>MAH </it>104 genome <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The full sequence was later identified as LSP<sup>A</sup>18 <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, which is equivalent to MAV island 24 <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. An interesting feature of LSP<sup>A</sup>18 is that it begins and ends with a transcriptional regulator. Eight other LSPs containing 4 or more ORFs not present in K-10 were also observed (Table <tblr tid="T4">4</tblr>). Overall, a total of 70 ORFs were present in <it>MAP </it>S397 but absent in the <it>MAP </it>K-10 genome (Additional file <supplr sid="S5">5</supplr>: Table S5).</p>
<suppl id="S5">
<title><p>Additional file 5</p></title>
<text><p><b>Table S5</b>. <it>MAP </it>K-10 genes that are absent in <it>MAP </it>S397. All supplemental tables are in Excel format.</p></text>
<file name="1471-2164-13-89-S5.XLS">
   <p>Click here for file</p>
</file>
</suppl>
<tbl id="T4"><title><p>Table 4</p></title><caption><p>Large sequences present in the three sheep strain genomes but absent in <it>MAP </it>K-10.</p></caption><tblbdy cols="7">
      <r>
         <c cspan="2" ca="center">
            <p>
               <b>LSP</b>
            </p>
         </c>
         <c cspan="4" ca="center">
            <p>
               <b>Location</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Gene Content</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="7">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Name</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Synonym</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Start</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Stop</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Size</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Contig</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>ORFs</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="7">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LSP<sup>s</sup>1</p>
         </c>
         <c ca="left">
            <p>LSP<sup>A</sup>4-II (RD20)</p>
         </c>
         <c ca="center">
            <p>34,308</p>
         </c>
         <c ca="center">
            <p>43,318</p>
         </c>
         <c ca="center">
            <p>9.01</p>
         </c>
         <c ca="center">
            <p>00036c</p>
         </c>
         <c ca="center">
            <p>MAPs_15940-16060</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LSP<sup>s</sup>2</p>
         </c>
         <c ca="left">
            <p>LSP<sup>A</sup>18 (RD10)</p>
         </c>
         <c ca="center">
            <p>116,723</p>
         </c>
         <c ca="center">
            <p>123,369</p>
         </c>
         <c ca="center">
            <p>6.64</p>
         </c>
         <c ca="center">
            <p>00127c</p>
         </c>
         <c ca="center">
            <p>MAPs_46190-46270</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LSP<sup>s</sup>3</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>15,581</p>
         </c>
         <c ca="center">
            <p>19,356</p>
         </c>
         <c ca="center">
            <p>3.78</p>
         </c>
         <c ca="center">
            <p>00035c</p>
         </c>
         <c ca="center">
            <p>MAPs_14620-14660</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LSP<sup>s</sup>4</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>124,533</p>
         </c>
         <c ca="center">
            <p>128,162</p>
         </c>
         <c ca="center">
            <p>3.63</p>
         </c>
         <c ca="center">
            <p>00127c</p>
         </c>
         <c ca="center">
            <p>MAPs_46290-46320</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LSP<sup>s</sup>5</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>87,461</p>
         </c>
         <c ca="center">
            <p>90,928</p>
         </c>
         <c ca="center">
            <p>3.47</p>
         </c>
         <c ca="center">
            <p>00126c</p>
         </c>
         <c ca="center">
            <p>MAPs_17580-17610</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LSP<sup>s</sup>6</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>4,736</p>
         </c>
         <c ca="center">
            <p>7,731</p>
         </c>
         <c ca="center">
            <p>2.99</p>
         </c>
         <c ca="center">
            <p>00161</p>
         </c>
         <c ca="center">
            <p>MAPs_40470-40500</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LSP<sup>s</sup>7</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>2,889</p>
         </c>
         <c ca="center">
            <p>2.88</p>
         </c>
         <c ca="center">
            <p>00133c</p>
         </c>
         <c ca="center">
            <p>MAPs_17640-17670</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LSP<sup>s</sup>8</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>47,323</p>
         </c>
         <c ca="center">
            <p>49,712</p>
         </c>
         <c ca="center">
            <p>2.39</p>
         </c>
         <c ca="center">
            <p>00199c</p>
         </c>
         <c ca="center">
            <p>MAPs_02730-02760</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LSP<sup>s</sup>9</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>70,659</p>
         </c>
         <c ca="center">
            <p>72,500</p>
         </c>
         <c ca="center">
            <p>1.84</p>
         </c>
         <c ca="center">
            <p>00033</p>
         </c>
         <c ca="center">
            <p>MAPs_23120-23150</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LSP<sup>s</sup>10</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>5,879</p>
         </c>
         <c ca="center">
            <p>7,457</p>
         </c>
         <c ca="center">
            <p>1.58</p>
         </c>
         <c ca="center">
            <p>00100</p>
         </c>
         <c ca="center">
            <p>MAPs_42460-42490</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><it>LSP </it>Large sequence polymorphism as identified before <abbrgrp><abbr bid="B29">29</abbr></abbrgrp></p>
      <p>Size is in kilobases</p>
   </tblfn></tbl>
<p>Several new or only partially described LSPs common to <it>MAP </it>S397 and <it>MAH </it>104 strains were also identified. A good example here is the novel LSP found in <it>MAP </it>sheep and <it>MAH </it>104 genomes is comprised of 14 ORFs (MAPs_17580 - MAPs_17710), predicted to encode proteins involved in the biosynthesis of glycopeptidolipids <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. This region in <it>MAP </it>S397 revealed the presence of four additional ORFs (<it>hyp, hlpA, dhgA </it>and <it>mtfC</it>) with homology to glycopeptidolipid biosynthesis genes immediately downstream. The additional 4 ORFs were also not present in the <it>MAH </it>104 sequence. Finally, a putative transcriptional regular labeled as MAPs_44910 is present in <it>MAP </it>S397. The protein encoded by this ORF has homology to the GntR-family of transcriptional regulators, which are widely distributed across bacterial species and regulate a variety of cellular processes <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Genomic deletions</p></st>
<p>A second subset of sequence polymorphism was represented by 32 ORFs that were present in the <it>MAP </it>K-10 genome but absent from the genome of <it>MAP </it>S397 (Additional file <supplr sid="S6">6</supplr>: Figure S1). Several of these deletions have already been described earlier. The deletion encompassing MAP1485c-MAP1491 was previously identified by Marsh and coworkers as S strain deletion #1 in an Australian <it>MAP </it>sheep isolate <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> and by Semret and coworkers as LSP<sup>A</sup>20 <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. An additional larger deletion in the <it>MAP </it>S397 included the cluster of ORFs between MAP1728c and MAP1744. This deletion was partially identified by Marsh and coworkers as RDA3 <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, and later fully described as S deletion #2 <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>.</p>
<suppl id="S6">
<title><p>Additional file 6</p></title>
<text><p><b>Figure S1</b>. An overview alignment of <it>MAP </it>S397 scaffold assembly and <it>Bsi</it>WI restriction fragments.</p></text>
<file name="1471-2164-13-89-S6.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<p>A novel deletion comprising the ORFs MAP1432-MAP1438c (partial) was identified in the current study as absent from <it>MAP </it>S397. This deletion, termed s&#916;-1, was originally discovered by comparative genomic analysis and subsequently confirmed by PCR analysis. This gene cluster is predicted to encode four energy metabolism enzymes as well as a lipase (MAP1438c). MAP1432 encodes a hypothetical protein with homology to the REP13E12, a family of repetitive elements that were originally described in <it>M. tuberculosis </it>and have been shown to be targets of phage integration <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. There is a homolog to MAP1434 that is present in S397 (MAPs_13210). The region around MAPs_13210 is not near the end of a contig and is nearly identical to an inverted stretch in K-10, thus leading to the conclusion that MAPs_13210 is only a homolog of MAP1434, but that the gene itself is not present in the S397 genome.</p>
<p>Interestingly, MAP2656 was initially identified as absent via microarray analysis <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> but sequencing of <it>MAP </it>S397 identified a homologue with 100% identity (MAPs_10401 &amp; MAPs_10402). Likewise, MAP2325 was identified as being absent from Australian sheep isolates of <it>MAP </it><abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. This ORF was not identified as missing from <it>MAP </it>S397 as sequencing confirmed the presence of an ORF (MAPs_34380) with 100% identity to MAP2325. These discrepancies may represent a geographic difference between <it>MAP </it>isolates recovered from sheep in Australia and the United States or it may be an error from the microarray experiment. These were the only observed differences between the microarray and sequence data. Overall, genomic alignments indicated the presence of a significant number of insertions and deletions between ovine and bovine strains of <it>MAP </it>that are suggested to be associated with their respective host.</p>
</sec>
<sec><st><p>Evolutionary analysis of the <it>MAP </it>S397 genome</p></st>
<p>Genomic insertions and deletions have been previously used to determine evolutionary relationships among MAC strains <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. With the genome sequence of these ovine isolates of <it>MAP</it>, we can now add comprehensive SNP and inversion data to strengthen evolutionary hypotheses. Earlier genotyping of the <it>MAP S397 </it>utilizing SNP of <it>recF, gyrA </it>and <it>gyrB </it>genes indicated that this strain belong to the <it>MAP </it>type III, a sublineage of the <it>MAP-S </it>cluster of isolates <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. To examine the evolutionary history of <it>MAP</it>, we analyzed the genome sequence of S397 compared to other clinical isolates circulating in sheep as well as the standard cattle strain, K-10. Our first level of analysis included the alignment of the S397 genome to that of the JTC1074 and JTC7565. This alignment resulted in identical genome organization of all three ovine isolates, as expected. Additionally, we examined the relationship among S397 (ovine origin) with both K-10 (bovine origin) and <it>MAH </it>104 (human origin). Such analysis identified several events of inversions and potential insertions/deletions between genomes belonging to the ovine isolates and other isolates of bovine and human origins (Figure <figr fid="F4">4</figr>). The optical map of S397 confirmed these inversions as well. Moreover, when the draft genome sequence of <it>M. intracellulare </it>was added to the comparison, the whole contig00148 (accession number GenBank: <ext-link ext-link-id="ABIN01000141" ext-link-type="gen">ABIN01000141</ext-link>) aligns to the region spanning the right breakpoint (Figure <figr fid="F4">4</figr>) of <it>MAH </it>104 and <it>MAP </it>ovine strains, an indication of a conserved genome synteny among <it>M. intracellulare, MAH </it>and <it>MAP </it>sheep strains, but distinct from <it>MAP </it>bovine strains.</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Genomic alignment of inversion breakpoints among members of the <it>M. avium </it>complex</p></caption><text>
   <p><b>Genomic alignment of inversion breakpoints among members of the <it>M. avium </it>complex</b>. Regions spanning the right breakpoint are depicted. The junction between the red and green boxes shown in the <it>MAP </it>sheep panel represents the breakpoint. Note that the breakpoint is within contig00148 of <it>M. intracellulare</it>. The alignment shows that the genome synteny among <it>MAP </it>sheep (S397), <it>MAH </it>104 and <it>M. intracellulare </it>is conserved.</p>
</text><graphic file="1471-2164-13-89-4" hint_layout="single"/></fig>
<p>In the second level of analysis on the nucleotide level, a core of 42 single nucleotide polymorphisms (SNPs) were present in both JTC isolates compared to S397. In addition, a very small number of unique SNPs in JTC1074 (N = 22) and JTC7565 (N = 11) were not present in any other genome in this study. Collectively, this small level of polymorphism indicates the clonal nature of ovine isolates, which contrasts sharply with the 4,438 SNPs between the ovine S397 and the bovine K-10 strains (Figure <figr fid="F5">5A</figr>). Additionally, when analyzing genome-wide SNPs, it appears that <it>MAP </it>S397 and K-10 split off recently from the <it>hominissuis </it>progenitor strain (Figure <figr fid="F5">5B</figr>). A similar result is obtained when SNPs are restricted to coding sequences (Figure <figr fid="F5">5C</figr>).</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Polymorphisms among <it>M. avium </it>complex (MAC) members</p></caption><text>
   <p><b>Polymorphisms among <it>M. avium </it>complex (MAC) members</b>. <b>(A) </b>Table of the total single nucleotide polymorphisms (SNPs) present in each genome. <b>(B) </b>Phylogenetic relationship among MAC strains using all SNPs or those restricted to coding regions <b>(C)</b>. The trees were constructed using the Neighbor-Joining method <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Each tree is drawn to scale, with branch lengths (indicated below the branches) in the same units as those of the evolutionary distances used to infer the phylogenetic tree. The evolutionary distances were computed using the LogDet (Tamura-Kumar) method <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> and are in the units of the number of base substitutions per site. There were a total of 50,924 SNPs in the dataset for (B) and 38,546 SNPs in (C). Evolutionary analyses were conducted in MEGA5 <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>.</p>
</text><graphic file="1471-2164-13-89-5" hint_layout="single"/></fig>
</sec>
</sec>
<sec><st><p>Discussion</p></st>
<p>Comparative genomic hybridizations using DNA microarrays have revealed large sequence polymorphisms (LSPs) between <it>MAP-S </it>and <it>MAP-C </it>strains <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B41">41</abbr></abbrgrp>. Two large deletions of an Australian sheep isolate were found by genomic hybridization to the <it>MAP </it>K-10 array <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. One deletion encompassed 8 ORFs (MAP1485c-MAP1491) and a second deletion encompassed 17 ORFs extending from MAP1728c to MAP1744. These deletions relative to the bovine strains were later observed in U.S. ovine <it>MAP </it>isolates <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B13">13</abbr></abbrgrp>. Construction of a <it>MAP </it>array containing <it>MAH </it>sequences revealed LSPs in the ovine strains that were missing in the bovine K-10 strain <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B42">42</abbr></abbrgrp>. These documented differences formed the basis for whole-genome sequencing of a sheep isolate to enable comprehensive description of all genetic differences from <it>MAP-S </it>and <it>MAP-C </it>strains. We took advantage of next-generation sequencing technology combined with optical mapping <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> to decipher the complete genome of <it>MAP </it>isolates from sheep flocks raised in the USA. Our analysis confirmed earlier polymorphisms among <it>MAP</it>-S and <it>MAP</it>-C strains and revealed novel regions of difference. Surprisingly, both genome sequencing and optical mapping showed remarkable differences between <it>MAP</it>-S and <it>MAP</it>-C strains despite the overall similarity in the clinical signs of Johne's disease in sheep and cattle. Recently, a study using a large number of <it>MAP </it>isolates provided an example of such a genomic polymorphism including 2 large regions of duplication, termed vGI-17 (containing 63 ORFs) and vGI-18 (containing 109 ORFs), observed in most <it>MAP</it>-C strains but not <it>MAP</it>-S isolates <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Both of these duplications were also missing in our sequenced <it>MAP</it>-S genomes as determined by PCR amplification using outward facing primers reported by Wynne et al. (data not shown).</p>
<p>There are 70 genes present in all three ovine isolates that are absent from the K-10 strain, an indication for <it>MAP </it>adaptation to specific hosts (in this case sheep). Analysis of additional ovine and bovine isolates is needed to strengthen any linkage between these genes with host association. Within this subset, we identified a surprising number of genes annotated as hypothetical proteins (N = 30). Six transcriptional regulators were also present among these genes with the remaining genes showing weak homology to sequences in the GenBank database. We hypothesize that these genes could be responsible for the observed phenotypic differences between ovine and bovine strains and warrant future studies to address this hypothesis.</p>
<p>Based on extensive genomic rearrangements between <it>MAP </it>bovine and ovine strains, we were able to provide a possible evolutionary scenario for members of the MAC group. A genomic region spanning the inversion of <it>MAP </it>bovine strains, <it>MAP </it>ovine strains and <it>MAH </it>104 are shown in Figure <figr fid="F4">4</figr>. To diverge into these three subspecies, the common ancestor appears to have undergone two independent genomic inversion events (Figure <figr fid="F6">6A</figr>). Specifically, it would take one inversion event to diverge between <it>MAH </it>104 and <it>MAP </it>sheep strains followed by a second inversion event between <it>MAP </it>sheep strains and the <it>MAP </it>cattle strain (Figure <figr fid="F6">6A</figr>). Therefore, assuming that one strain diverges into another strain by taking the shortest evolutionary path, it would be least likely that <it>MAH </it>directly evolved from <it>MAP </it>cattle strains or vice versa. This strongly suggests that <it>MAP </it>sheep strains are the intermediate taxon of the three. Data from Behr and coworkers suggest <it>MAH </it>104 is the ancestor strain <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Moreover, when the genome of <it>M. intracellulare </it>is added to the comparison, the genome synteny was conserved among <it>M. intracellulare, MAH </it>and <it>MAP </it>sheep strains, but not in <it>MAP </it>cattle strains. Thus, it is possible that the common ancestor of the MAC must resemble either <it>MAH </it>104 or <it>M. intracellulare</it>, and <it>MAP </it>bovine strains are the latest diverged strains among them with <it>MAP </it>S397 as an intermediary strain (Figure <figr fid="F6">6B</figr>). This model partially agrees with a hypothesis that suggests <it>MAH </it>differentiated into two lineages, <it>MAP </it>ovine and bovine strains, by delineating chronological genomic insertion/deletion events without considering other genomic rearrangement events <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Of the 70 genes in S397 that are absent in K-10, 57 are present in <it>MAH </it>104 and only 13 are absent from <it>MAH </it>104. Further genotyping of the S397 clustered this isolate with the group of <it>MAP</it>-S type III <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, a sub-lineage of the sheep strains. However, we prefer to maintain the <it>MAP</it>-S designation since the type III genotype was based on 3 SNPs present in a subgroup of sheep isolates with no distinctive clinical or pathological features. Finally, a recent study analyzing the sequence polymorphisms of IS<it>1311 </it>among the MAC also supports the hypothesis that <it>MAP </it>ovine strains are the intermediary taxa between <it>MAH </it>and <it>MAP </it>bovine strains <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>.</p>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>An evolutionary scenario for members of the <it>M. avium </it>complex</p></caption><text>
   <p><b>An evolutionary scenario for members of the <it>M. avium </it>complex</b>. <b>(A) </b>Depicted is a two-step inversion process as one possible scenario explaining how <it>MAH </it>evolved into <it>MAP </it>K-10 through <it>MAP </it>S397. To examine evolutionary relationship among the MAC, genome alignment around the inversion segment is depicted with Mauve version 2.3.1 <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Divergence between <it>MAH </it>104 and <it>MAP </it>sheep strains or between <it>MAP </it>sheep strains and cattle strains would take only one inversion, whereas divergence between <it>MAH </it>104 and <it>MAP </it>cattle strains would need two independent inversion events. <b>(B) </b>Our proposed model for evolution of the <it>M. avium </it>complex.</p>
</text><graphic file="1471-2164-13-89-6" hint_layout="single"/></fig>
</sec>
<sec><st><p>Conclusions</p></st>
<p>Genome sequencing of <it>MAP-S </it>strains have revealed extensive genome inversions and previously characterized deletions when compared to the K-10 strain. Furthermore, there appears to be a high degree of homology within US <it>MAP-S </it>strains as suggested by the remarkably low number of SNPs present in the three isolates sequenced. Evolutionary analysis based on whole genome sequencing suggests MAH is the progenitor strain, followed by MAP-S, followed by MAP-C strains.</p>
<p>Overall, Next-generation sequencing combined with optical mapping provided us with a high resolution tool to decipher the evolution of important pathogenic mycobacteria. Comparative sequence analysis of the <it>MAP </it>isolates from sheep has improved our understanding of the evolutionary history of members of MAC and provided the foundation for novel insights into the pathogenesis of this important pathogen. Similar approaches can be used to examine other closely related pathogens.</p>
</sec>
<sec><st><p>Methods</p></st>
<sec><st><p><it>MAP </it>ovine isolates</p></st>
<p>Isolates were cultured in Middlebrook 7H9 broth (BD Biosciences, San Jose, CA) media supplemented with 10% OADC (2% glucose, 5% bovine serum albumin factor V, and 0.85% NaCl), 0.05% Tween 80 and 2 &#956;g/ml of Mycobactin J at 37&#176;C <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. The <it>MAP </it>ovine S397 strain was obtained from a Suffolk breed in Iowa. It was isolated from the distal ileum at necropsy in 2004. The other 2 sheep isolates of <it>MAP </it>(JTC1074 and JTC7565) were isolated from the intestine of infected sheep in Texas and obtained from the Johne's Testing Center at the University of Wisconsin-Madison. All isolates were genotyped using the IS<it>1311 </it>restriction endonuclease, which yielded the 2-band pattern typical of ovine strains <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Genome sequencing</p></st>
<p>Genomic DNA was extracted as described in detail previously <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B46">46</abbr></abbrgrp>. For the S397 strain, the DNA (1-5 &#956;g) was sequenced using Roche 454 pyrosequencing (GS20 and FLX) at the National Animal Disease Center. A whole-genomic shotgun sequencing library was prepared according to Roche protocols. The library was used with the appropriate emulsion based PCR kits to produce sufficient beads for sequencing using the Roche Standard Chemistry GS-LR 70 sequencing kit. For the JTC1074 and JTC7565, the purified genomic DNA (~5 &#956;g) of each strain was sent to Genomic Resource Center at the University of Maryland for Illumina whole genome sequencing (Multiplexing Sample Preparation oligonucleotide Kit) as outline before <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. The adapters and indexing oligonucleotides were purchased from Illumina (5 Paired End Cluster Generation Kits-v4). The CLC Genomic Workbench software (version 4.0.3) was used to perform reference and de novo assembly on all sequenced genomes.</p>
</sec>
<sec><st><p>Genome annotation</p></st>
<p>The S397 sequence was annotated using the Integrated Microbial Genomes Expert Review (IMG-ER) pipeline <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. The sequences of the JTC isolates were annotated based on S397. Genes were each designated by the locus tag "MAPs" to distinguish it as a <it>MAP </it>sheep strain gene. This locus tag is followed by a five digit unique identifier, which incrementally increases by ten (i.e. MAPs_45660... MAPs_45670... MAPs_45680...). With this numbering configuration, additional genes can easily be added as they are discovered or when remaining gaps are closed.</p>
</sec>
<sec><st><p>Genome comparison</p></st>
<p>The genome data for <it>MAP </it>K-10 (accession no. GenBank: <ext-link ext-link-id="NC_002944.2" ext-link-type="gen">NC_002944.2</ext-link>) and <it>M. avium </it>subsp. <it>hominissuis </it>(<it>MAH</it>) strain 104 (GenBank: <ext-link ext-link-id="NC_008595.1" ext-link-type="gen">NC_008595.1</ext-link>) were used in alignments in the Artemis and Artemis Comparison Tool (ACT) programs or Mauve 2.3.1 <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. BLASTP analysis was used for similarity searches and protein sequence analysis. In addition, Mauve algorithm was used to align two or more genomes <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. For detecting single nucleotide polymorphisms (SNPs) among sheep isolates, the CLC Genomic Workbench was used. The coverage range setting for each strain was at 10-55 reads, and the frequency of the mutation was at least in 50% of the reads.</p>
</sec>
<sec><st><p>Optical mapping</p></st>
<p>Shotgun optical mapping, as previously described <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr></abbrgrp>, was used to construct a physical restriction map for the S397 genome. Genomic DNA, in agarose inserts <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>, was electroeluted into a solution containing a lambda DNA sizing standard (30 pg/&#956;l), and then were mounted on cleaned, derivitized glass surfaces using a microfluidic device <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> followed by polymerization of a thin layer of polyacrylamide (3.3% containing 0.02% Triton X-100). Mounted DNA was digested with 20 units of <it>Bsi</it>WI (NEB, Ipswich, MA) for 1 to 2 hrs at 37&#176;C. Fluorochrome-stained DNA fragments were imaged by fluorescence microscopy with a 63 &#215; objective lens (Carl Zeiss, Thornwood, NY) and a high-resolution digital camera (Princeton Instruments, Trenton, NJ). Images were acquired and processed using "ChannelCollect" and "Pathfinder" -custom software <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> that converts captured images into map data sets. Bayesian inference and an efficient dynamic programming algorithm were also being used to fine-tune the parameters including standard deviation, digestion rate, false cut, and false match probability etc. <abbrgrp><abbr bid="B54">54</abbr><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr></abbrgrp>. The final circular optical map contig was built using an iterative assembly process <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> including rounds of pair-wise alignment (single molecule maps vs. seed maps; provisional assemblies) and assembly <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B54">54</abbr></abbrgrp>. Due to the high G + C content of <it>MAP</it>, which skews fragment sizing by integrated fluorescence intensity measurement, the final maps were globally scaled (0.95) to correct this problem <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B61">61</abbr></abbrgrp>. A laboratory software implementation of an optical map alignment algorithm <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> was used to align between optical fragments generated from <it>MAP </it>S397 and the <it>in silico </it>restriction maps of <it>MAP </it>K-10, which provided a whole-genome rearrangement comparison between the two genomes. This restriction framework was used to generate a temporary rearranged genome as the reference sequence to guide the assembly of <it>MAP </it>S397 <it>de novo </it>contigs with the function "move contigs" in Mauve 2.3.1 <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>PCR amplification of inversion breakpoints and deletions</p></st>
<p>PCR reactions were performed in 25-&#956;l reaction mixture containing 1 M betaine (Sigma-Aldrich, St. Louis, MO), 50 mM potassium glutamate (Sigma-Aldrich), 10 mM Tris-HCl pH 8.8, 0.1% Triton X-100, 2 mM magnesium chloride, 0.2 mM dNTPs, 0.5 &#956;M each primer, 0.5 U of GoTaq<sup>&#174; </sup>Flexi DNA Polymerase (Promega, Madison, WI) and 25 ng of genomic DNA. The amplification thermocycle started with an initial step of 94&#176;C for 5 minutes followed by 5 cycles of 94&#176;C for 30 s, 62&#176;C for 30 s with 1&#176;C decrease for each cycle and 72&#176;C for 3.5 min, and followed by 30 cycles of 94&#176;C for 30 s, 57&#176;C for 30 s and 72&#176;C for 3.5 min. PCR primers used for examining the breakpoints included control F: AAGCATCACCTGCATGAGC, control R: CGGGAATTTATCCGTTTCAG, F1: GGGATCGATCTTGACCACAT, R1: GTGCCTGGACTCGATTTTGT, F2: AAGAGGTCGGAGGTTCGAGT and R2: CGGTGAGAGATTTCGTCACA. Primers used to demonstrate the S397 s&#916;-1 deletion included F18: CGTCTTCCCCGTCGTCGTTC, B24: CGATGAGAGTCCGTGCGTGG, F15: CGGCGGGCGGTCAGGGTTTG, B17: GCAGGTTGGGGTTCGGCTTG, F7: GGTGGTCGGCGTCCTCGTAG, B9: CGTCGTCACAGCGAAAACGG, F3: CCACCCGCCTCACACCACTC, B4: AGGACGCCGACCACCAAACG. Conditions for the amplifications are essentially as described immediately above except that Advantage GC Genomic LA PCR Polymerase kit (Clontech) was used for each reaction.</p>
</sec>
<sec><st><p>Nucleotide sequence accession number</p></st>
<p>This Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession AFIF00000000.</p>
</sec>
</sec>
<sec><st><p>Competing interests</p></st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec><st><p>Authors' contributions</p></st>
<p>JPB, MLP and DPA sequenced the S397 genome. JPB and DOB annotated the S397 genome. JPB and CW integrated and analyzed data from the S397 contigs and optical mapping, performed genomic comparison, PCR confirmation as well as SNP, inversion and evolutionary analyses. CH, CW and AMT sequenced and analyzed genomes of isolates JTC1074 and JTC7565. SZ and DCS collected, assembled and aligned S397 optical map. JPB, CW and AMT wrote the manuscript. JPB, SS, VK and AMT assisted with the analysis. All authors read and approved the final manuscript.</p>
</sec>
</bdy>
<bm>
<ack>
<sec><st><p>Acknowledgements</p></st>
<p>The authors would like to thank members of the Genomic Resource Center at the University of Maryland-Baltimore for Illumina sequencing and Janis K. Hansen (USDA-ARS) for technical assistance. This work was supported by the USDA-Agricultural Research Service (JPB, MLP, DPA and DOB), NRI 2007-35204-18400 and JDIP -Q6286224301 grants from the USDA and US-Egypt Joint Scientific Baord#1937 to AMT.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Invasion and persistence of <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>during early stages of Johne's disease in calves</p></title><aug><au><snm>Wu</snm><fnm>CW</fnm></au><au><snm>Livesey</snm><fnm>M</fnm></au><au><snm>Schmoller</snm><fnm>SK</fnm></au><au><snm>Manning</snm><fnm>EJ</fnm></au><au><snm>Steinberg</snm><fnm>H</fnm></au><au><snm>Davis</snm><fnm>WC</fnm></au><etal/></aug><source>Infect Immun</source><pubdate>2007</pubdate><volume>75</volume><fpage>2110</fpage><lpage>2119</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/IAI.01739-06</pubid><pubid idtype="pmcid">1865790</pubid><pubid idtype="pmpid" link="fulltext">17296749</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Peyer's patch-deficient mice demonstrate that <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>translocates across the mucosal barrier via both M cells and enterocytes but has inefficient dissemination</p></title><aug><au><snm>Bermudez</snm><fnm>LE</fnm></au><au><snm>Petrofsky</snm><fnm>M</fnm></au><au><snm>Sommer</snm><fnm>S</fnm></au><au><snm>Barletta</snm><fnm>RG</fnm></au></aug><source>Infect Immun</source><pubdate>2010</pubdate><volume>78</volume><fpage>3570</fpage><lpage>3577</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/IAI.01411-09</pubid><pubid idtype="pmcid">2916292</pubid><pubid idtype="pmpid" link="fulltext">20498259</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Whole-genome plasticity among <it>Mycobacterium avium </it>subspecies: insights from comparative genomic hybridizations</p></title><aug><au><snm>Wu</snm><fnm>CW</fnm></au><au><snm>Glasner</snm><fnm>J</fnm></au><au><snm>Collins</snm><fnm>M</fnm></au><au><snm>Naser</snm><fnm>S</fnm></au><au><snm>Talaat</snm><fnm>AM</fnm></au></aug><source>J Bacteriol</source><pubdate>2006</pubdate><volume>188</volume><fpage>711</fpage><lpage>723</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JB.188.2.711-723.2006</pubid><pubid idtype="pmcid">1347307</pubid><pubid idtype="pmpid" link="fulltext">16385061</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Varieties of <it>Mycobacterium johnei </it>isolated from sheep</p></title><aug><au><snm>Taylor</snm><fnm>AW</fnm></au></aug><source>J Pathol Bacteriol</source><pubdate>1951</pubdate><volume>63</volume><fpage>333</fpage><lpage>336</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/path.1700630217</pubid><pubid idtype="pmpid">14851174</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Comparative genomic analysis of <it>Mycobacterium avium </it>subspecies obtained from multiple host species</p></title><aug><au><snm>Paustian</snm><fnm>ML</fnm></au><au><snm>Zhu</snm><fnm>X</fnm></au><au><snm>Sreevatsan</snm><fnm>S</fnm></au><au><snm>Robbe-Austerman</snm><fnm>S</fnm></au><au><snm>Kapur</snm><fnm>V</fnm></au><au><snm>Bannantine</snm><fnm>JP</fnm></au></aug><source>BMC Genomics</source><pubdate>2008</pubdate><volume>9</volume><fpage>135</fpage><lpage>149</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-9-135</pubid><pubid idtype="pmcid">2323391</pubid><pubid idtype="pmpid" link="fulltext">18366709</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>PCR-restriction endonuclease analysis for identification and strain typing of Mycobacterium avium subsp. paratuberculosis and Mycobacterium avium subsp. avium based on polymorphisms in IS1311</p></title><aug><au><snm>Marsh</snm><fnm>I</fnm></au><au><snm>Whittington</snm><fnm>R</fnm></au><au><snm>Cousins</snm><fnm>D</fnm></au></aug><source>Mol Cell Probes</source><pubdate>1999</pubdate><volume>13</volume><fpage>115</fpage><lpage>126</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/mcpr.1999.0227</pubid><pubid idtype="pmpid" link="fulltext">10208802</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Characterization of genetic differences between <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>type I and type II isolates</p></title><aug><au><snm>Dohmann</snm><fnm>K</fnm></au><au><snm>Strommenger</snm><fnm>B</fnm></au><au><snm>Stevenson</snm><fnm>K</fnm></au><au><snm>de</snm><fnm>JL</fnm></au><au><snm>Stratmann</snm><fnm>J</fnm></au><au><snm>Kapur</snm><fnm>V</fnm></au><etal/></aug><source>J Clin Microbiol</source><pubdate>2003</pubdate><volume>41</volume><fpage>5215</fpage><lpage>5223</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JCM.41.11.5215-5223.2003</pubid><pubid idtype="pmcid">262536</pubid><pubid idtype="pmpid" link="fulltext">14605167</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Multilocus short sequence repeat sequencing approach for differentiating among <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>strains</p></title><aug><au><snm>Amonsin</snm><fnm>A</fnm></au><au><snm>Li</snm><fnm>LL</fnm></au><au><snm>Zhang</snm><fnm>Q</fnm></au><au><snm>Bannantine</snm><fnm>JP</fnm></au><au><snm>Motiwala</snm><fnm>AS</fnm></au><au><snm>Sreevatsan</snm><fnm>S</fnm></au><etal/></aug><source>J Clin Microbiol</source><pubdate>2004</pubdate><volume>42</volume><fpage>1694</fpage><lpage>1702</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JCM.42.4.1694-1702.2004</pubid><pubid idtype="pmcid">387571</pubid><pubid idtype="pmpid" link="fulltext">15071027</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Comparative analysis of <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>isolates from cattle, sheep and goats by short sequence repeat and pulsed-field gel electrophoresis typing</p></title><aug><au><snm>Sevilla</snm><fnm>I</fnm></au><au><snm>Li</snm><fnm>L</fnm></au><au><snm>Amonsin</snm><fnm>A</fnm></au><au><snm>Garrido</snm><fnm>JM</fnm></au><au><snm>Geijo</snm><fnm>MV</fnm></au><au><snm>Kapur</snm><fnm>V</fnm></au><etal/></aug><source>BMC Microbiol</source><pubdate>2008</pubdate><volume>8</volume><fpage>204</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2180-8-204</pubid><pubid idtype="pmcid">2605457</pubid><pubid idtype="pmpid" link="fulltext">19032737</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Combined multilocus short sequence repeat and mycobacterial interspersed repetitive unit- variable-number tandem repeat typing of <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>isolates</p></title><aug><au><snm>Thibault</snm><fnm>VC</fnm></au><au><snm>Grayon</snm><fnm>M</fnm></au><au><snm>Boschiroli</snm><fnm>ML</fnm></au><au><snm>Willery</snm><fnm>E</fnm></au><au><snm>Allix-Beguec</snm><fnm>C</fnm></au><au><snm>Stevenson</snm><fnm>K</fnm></au><etal/></aug><source>J Clin Microbiol</source><pubdate>2008</pubdate><volume>46</volume><fpage>4091</fpage><lpage>4094</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JCM.01349-08</pubid><pubid idtype="pmcid">2593263</pubid><pubid idtype="pmpid" link="fulltext">18923016</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Sequencing of <it>hsp6 </it>distinguishes among subsets of the <it>Mycobacterium aviu </it>complex</p></title><aug><au><snm>Turenne</snm><fnm>CY</fnm></au><au><snm>Semret</snm><fnm>M</fnm></au><au><snm>Cousins</snm><fnm>DV</fnm></au><au><snm>Collins</snm><fnm>DM</fnm></au><au><snm>Behr</snm><fnm>MA</fnm></au></aug><source>J Clin Microbiol</source><pubdate>2006</pubdate><volume>44</volume><fpage>433</fpage><lpage>440</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JCM.44.2.433-440.2006</pubid><pubid idtype="pmcid">1392684</pubid><pubid idtype="pmpid" link="fulltext">16455896</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Progress in molecular typing of Mycobacterium avium subspecies paratuberculosis</p></title><aug><au><snm>Castellanos</snm><fnm>E</fnm></au><au><snm>Juan</snm><fnm>Ld</fnm></au><au><snm>Dom&#237;nguez</snm><fnm>L</fnm></au><au><snm>Aranaz</snm><fnm>A</fnm></au></aug><source>Res Vet Sci</source><pubdate>2011</pubdate><note>doi:10.1016/j.rvsc.2011.05.017</note></bibl><bibl id="B13"><title><p>Comparative transcriptional analysis of human macrophages exposed to animal and human isolates of <it>Mycobacterium avius </it>subspecies <it>paratuberculosis </it>with diverse genotypes</p></title><aug><au><snm>Motiwala</snm><fnm>AS</fnm></au><au><snm>Janagama</snm><fnm>HK</fnm></au><au><snm>Paustian</snm><fnm>ML</fnm></au><au><snm>Zhu</snm><fnm>X</fnm></au><au><snm>Bannantine</snm><fnm>JP</fnm></au><au><snm>Kapur</snm><fnm>V</fnm></au><etal/></aug><source>Infect Immun</source><pubdate>2006</pubdate><volume>74</volume><fpage>6046</fpage><lpage>6056</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/IAI.00326-06</pubid><pubid idtype="pmcid">1695517</pubid><pubid idtype="pmpid" link="fulltext">17057086</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Molecular characterization of pigmented and nonpigmented isolates of Mycobacterium avium subsp. paratuberculosis</p></title><aug><au><snm>Stevenson</snm><fnm>K</fnm></au><au><snm>Hughes</snm><fnm>VM</fnm></au><au><snm>de</snm><fnm>JL</fnm></au><au><snm>Inglis</snm><fnm>NF</fnm></au><au><snm>Wright</snm><fnm>F</fnm></au><au><snm>Sharp</snm><fnm>JM</fnm></au></aug><source>J Clin Microbiol</source><pubdate>2002</pubdate><volume>40</volume><fpage>1798</fpage><lpage>1804</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JCM.40.5.1798-1804.2002</pubid><pubid idtype="pmcid">130953</pubid><pubid idtype="pmpid" link="fulltext">11980962</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Culture Phenotypes of Genomically and Geographically Diverse <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>Isolates from Different Hosts</p></title><aug><au><snm>Whittington</snm><fnm>RJ</fnm></au><au><snm>Marsh</snm><fnm>IB</fnm></au><au><snm>Saunders</snm><fnm>V</fnm></au><au><snm>Grant</snm><fnm>IR</fnm></au><au><snm>Juste</snm><fnm>R</fnm></au><au><snm>Sevilla</snm><fnm>IA</fnm></au><etal/></aug><source>J Clin Microbiol</source><pubdate>2011</pubdate><volume>49</volume><fpage>1822</fpage><lpage>1830</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JCM.00210-11</pubid><pubid idtype="pmcid">3122651</pubid><pubid idtype="pmpid" link="fulltext">21430104</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Growth pattern and partial proteome of Mycobacterium avium subsp. paratuberculosis during the stress response to hypoxia and nutrient starvation</p></title><aug><au><snm>Gumber</snm><fnm>S</fnm></au><au><snm>Taylor</snm><fnm>DL</fnm></au><au><snm>Marsh</snm><fnm>IB</fnm></au><au><snm>Whittington</snm><fnm>RJ</fnm></au></aug><source>Vet Microbiol</source><pubdate>2009</pubdate><volume>133</volume><fpage>344</fpage><lpage>357</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.vetmic.2008.07.021</pubid><pubid idtype="pmpid" link="fulltext">18786786</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Analysis of the growth pattern, survival and proteome of Mycobacterium avium subsp. paratuberculosis following exposure to heat</p></title><aug><au><snm>Gumber</snm><fnm>S</fnm></au><au><snm>Whittington</snm><fnm>RJ</fnm></au></aug><source>Vet Microbiol</source><pubdate>2009</pubdate><volume>136</volume><fpage>82</fpage><lpage>90</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.vetmic.2008.10.003</pubid><pubid idtype="pmpid" link="fulltext">19019575</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Identification and functional characterization of the iron-dependent regulator (<it>IdeR</it>) of <it>Mycobacterium avium </it>subsp. <it>paratuberculosis</it></p></title><aug><au><snm>Janagama</snm><fnm>HK</fnm></au><au><snm>Senthilkumar</snm><fnm>TM</fnm></au><au><snm>Bannantine</snm><fnm>JP</fnm></au><au><snm>Rodriguez</snm><fnm>GM</fnm></au><au><snm>Smith</snm><fnm>I</fnm></au><au><snm>Paustian</snm><fnm>ML</fnm></au><etal/></aug><source>Microbiology</source><pubdate>2009</pubdate><volume>155</volume><fpage>3683</fpage><lpage>3690</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1099/mic.0.031948-0</pubid><pubid idtype="pmcid">2888127</pubid><pubid idtype="pmpid" link="fulltext">19684064</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>The complete genome sequence of <it>Mycobacterium avium </it>subspecies <it>paratuberculosis</it></p></title><aug><au><snm>Li</snm><fnm>L</fnm></au><au><snm>Bannantine</snm><fnm>JP</fnm></au><au><snm>Zhang</snm><fnm>Q</fnm></au><au><snm>Amonsin</snm><fnm>A</fnm></au><au><snm>May</snm><fnm>BJ</fnm></au><au><snm>Alt</snm><fnm>D</fnm></au><etal/></aug><source>Proc Natl Acad Sci USA</source><pubdate>2005</pubdate><volume>102</volume><fpage>12344</fpage><lpage>12349</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0505662102</pubid><pubid idtype="pmcid">1194940</pubid><pubid idtype="pmpid" link="fulltext">16116077</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Optical mapping of the <it>Mycobacterium avium </it>subspecies <it>paratuberculosis </it>genome</p></title><aug><au><snm>Wu</snm><fnm>CW</fnm></au><au><snm>Schramm</snm><fnm>TM</fnm></au><au><snm>Zhou</snm><fnm>S</fnm></au><au><snm>Schwartz</snm><fnm>DC</fnm></au><au><snm>Talaat</snm><fnm>AM</fnm></au></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>10</volume><fpage>25</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-10-25</pubid><pubid idtype="pmcid">2633350</pubid><pubid idtype="pmpid" link="fulltext">19146697</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Exploring the zoonotic potential of <it>Mycobacterium avium </it>subspecies <it>paratuberculosis </it>through comparative genomics</p></title><aug><au><snm>Wynne</snm><fnm>JW</fnm></au><au><snm>Bull</snm><fnm>TJ</fnm></au><au><snm>Seemann</snm><fnm>T</fnm></au><au><snm>Bulach</snm><fnm>DM</fnm></au><au><snm>Wagner</snm><fnm>J</fnm></au><au><snm>Kirkwood</snm><fnm>CD</fnm></au><etal/></aug><source>PLoS One</source><pubdate>2011</pubdate><volume>6</volume><fpage>e22171</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0022171</pubid><pubid idtype="pmcid">3142125</pubid><pubid idtype="pmpid" link="fulltext">21799786</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Genomic homogeneity between Mycobacterium avium subsp. avium and Mycobacterium avium subsp. paratuberculosis belies their divergent growth rates</p></title><aug><au><snm>Bannantine</snm><fnm>JP</fnm></au><au><snm>Zhang</snm><fnm>Q</fnm></au><au><snm>Li</snm><fnm>LL</fnm></au><au><snm>Kapur</snm><fnm>V</fnm></au></aug><source>BMC Microbiol</source><pubdate>2003</pubdate><volume>3</volume><fpage>10</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2180-3-10</pubid><pubid idtype="pmcid">156888</pubid><pubid idtype="pmpid" link="fulltext">12740027</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes</p></title><aug><au><snm>Yu</snm><fnm>NY</fnm></au><au><snm>Wagner</snm><fnm>JR</fnm></au><au><snm>Laird</snm><fnm>MR</fnm></au><au><snm>Melli</snm><fnm>G</fnm></au><au><snm>Rey</snm><fnm>S</fnm></au><au><snm>Lo</snm><fnm>R</fnm></au><etal/></aug><source>Bioinformatics</source><pubdate>2010</pubdate><volume>26</volume><fpage>1608</fpage><lpage>1615</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btq249</pubid><pubid idtype="pmcid">2887053</pubid><pubid idtype="pmpid" link="fulltext">20472543</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Restricted structural gene polymorphism in the <it>Mycobacterium tuberculosis </it>complex indicates evolutionarily recent global dissemination</p></title><aug><au><snm>Sreevatsan</snm><fnm>S</fnm></au><au><snm>Pan</snm><fnm>X</fnm></au><au><snm>Stockbauer</snm><fnm>KE</fnm></au><au><snm>Connell</snm><fnm>ND</fnm></au><au><snm>Kreiswirth</snm><fnm>BN</fnm></au><au><snm>Whittam</snm><fnm>TS</fnm></au><etal/></aug><source>Proc Natl Acad Sci USA</source><pubdate>1997</pubdate><volume>94</volume><fpage>9869</fpage><lpage>9874</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.94.18.9869</pubid><pubid idtype="pmcid">23284</pubid><pubid idtype="pmpid" link="fulltext">9275218</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Recent nosocomial transmission and genotypes of multidrug-resistant <it>Mycobacterium tuberculosis</it></p></title><aug><au><snm>Nodieva</snm><fnm>A</fnm></au><au><snm>Jansone</snm><fnm>I</fnm></au><au><snm>Broka</snm><fnm>L</fnm></au><au><snm>Pole</snm><fnm>I</fnm></au><au><snm>Skenders</snm><fnm>G</fnm></au><au><snm>Baumanis</snm><fnm>V</fnm></au></aug><source>Int J Tuberc Lung Dis</source><pubdate>2010</pubdate><volume>14</volume><fpage>427</fpage><lpage>433</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">20202300</pubid></xrefbib></bibl><bibl id="B26"><title><p>Identification of a repetitive DNA sequence specific to <it>Mycobacterium paratuberculosis</it></p></title><aug><au><snm>Collins</snm><fnm>DM</fnm></au><au><snm>Gabric</snm><fnm>DM</fnm></au><au><snm>de Lisle</snm><fnm>GW</fnm></au></aug><source>FEMS Microbiol Lett</source><pubdate>1989</pubdate><volume>60</volume><fpage>175</fpage><lpage>178</lpage><xrefbib><pubid idtype="doi">10.1111/j.1574-6968.1989.tb03440.x</pubid></xrefbib></bibl><bibl id="B27"><title><p>Sequence and characteristics of IS<it>900</it>, an insertion element identified in a human Crohn's disease isolate of <it>Mycobacterium paratuberculosis</it></p></title><aug><au><snm>Green</snm><fnm>EP</fnm></au><au><snm>Tizard</snm><fnm>ML</fnm></au><au><snm>Moss</snm><fnm>MT</fnm></au><au><snm>Thompson</snm><fnm>J</fnm></au><au><snm>Winterbourne</snm><fnm>DJ</fnm></au><au><snm>McFadden</snm><fnm>JJ</fnm></au><etal/></aug><source>Nucleic Acids Res</source><pubdate>1989</pubdate><volume>17</volume><fpage>9063</fpage><lpage>9073</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/17.22.9063</pubid><pubid idtype="pmcid">335114</pubid><pubid idtype="pmpid" link="fulltext">2555783</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Re-sequencing the <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>K10 genome: improved annotation and revised genome sequence</p></title><aug><au><snm>Wynne</snm><fnm>JW</fnm></au><au><snm>Seemann</snm><fnm>T</fnm></au><au><snm>Bulach</snm><fnm>D</fnm></au><au><snm>Coutts</snm><fnm>SA</fnm></au><au><snm>Talaat</snm><fnm>AM</fnm></au><au><snm>Michalski</snm><fnm>WP</fnm></au></aug><source>J Bacteriol</source><pubdate>2010</pubdate><volume>192</volume><fpage>6319</fpage><lpage>6320</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JB.00972-10</pubid><pubid idtype="pmcid">2981200</pubid><pubid idtype="pmpid" link="fulltext">20870759</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Differentiating host-associated variants of <it>Mycobacterium avium </it>by PCR for detection of large sequence polymorphisms</p></title><aug><au><snm>Semret</snm><fnm>M</fnm></au><au><snm>Turenne</snm><fnm>CY</fnm></au><au><snm>de</snm><fnm>HP</fnm></au><au><snm>Collins</snm><fnm>DM</fnm></au><au><snm>Behr</snm><fnm>MA</fnm></au></aug><source>J Clin Microbiol</source><pubdate>2006</pubdate><volume>44</volume><fpage>881</fpage><lpage>887</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JCM.44.3.881-887.2006</pubid><pubid idtype="pmcid">1393138</pubid><pubid idtype="pmpid" link="fulltext">16517871</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Proposed pathway for the biosynthesis of serovar-specific glycopeptidolipids in <it>Mycobacterium avium </it>serovar 2</p></title><aug><au><snm>Eckstein</snm><fnm>TM</fnm></au><au><snm>Belisle</snm><fnm>JT</fnm></au><au><snm>Inamine</snm><fnm>JM</fnm></au></aug><source>Microbiology</source><pubdate>2003</pubdate><volume>149</volume><fpage>2797</fpage><lpage>2807</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1099/mic.0.26528-0</pubid><pubid idtype="pmpid" link="fulltext">14523113</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>A new family of bacterial regulatory proteins</p></title><aug><au><snm>Haydon</snm><fnm>DJ</fnm></au><au><snm>Guest</snm><fnm>JR</fnm></au></aug><source>FEMS Microbiol Lett</source><pubdate>1991</pubdate><volume>63</volume><fpage>291</fpage><lpage>295</lpage><xrefbib><pubid idtype="pmpid">2060763</pubid></xrefbib></bibl><bibl id="B32"><title><p>GntR family of regulators in <it>Mycobacterium smegmatis</it>: a sequence and structure based characterization</p></title><aug><au><snm>Vindal</snm><fnm>V</fnm></au><au><snm>Suma</snm><fnm>K</fnm></au><au><snm>Ranjan</snm><fnm>A</fnm></au></aug><source>BMC Genomics</source><pubdate>2007</pubdate><volume>8</volume><fpage>289</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-8-289</pubid><pubid idtype="pmcid">2018728</pubid><pubid idtype="pmpid" link="fulltext">17714599</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Genomic comparison of <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>sheep and cattle strains by microarray hybridization</p></title><aug><au><snm>Marsh</snm><fnm>IB</fnm></au><au><snm>Bannantine</snm><fnm>JP</fnm></au><au><snm>Paustian</snm><fnm>ML</fnm></au><au><snm>Tizard</snm><fnm>ML</fnm></au><au><snm>Kapur</snm><fnm>V</fnm></au><au><snm>Whittington</snm><fnm>RJ</fnm></au></aug><source>J Bacteriol</source><pubdate>2006</pubdate><volume>188</volume><fpage>2290</fpage><lpage>2293</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JB.188.6.2290-2293.2006</pubid><pubid idtype="pmcid">1428150</pubid><pubid idtype="pmpid" link="fulltext">16513760</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Deletion of an <it>mmp </it>gene and multiple associated genes from the genome of the S strain of <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>identified by representational difference analysis and <it>in silic </it>analysis</p></title><aug><au><snm>Marsh</snm><fnm>IB</fnm></au><au><snm>Whittington</snm><fnm>RJ</fnm></au></aug><source>Mol Cell Probes</source><pubdate>2005</pubdate><volume>19</volume><fpage>371</fpage><lpage>384</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.mcp.2005.06.005</pubid><pubid idtype="pmpid" link="fulltext">16226868</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Identification of variable regions in the genomes of tubercle bacilli using bacterial artificial chromosome arrays</p></title><aug><au><snm>Gordon</snm><fnm>SV</fnm></au><au><snm>Brosch</snm><fnm>R</fnm></au><au><snm>Billault</snm><fnm>A</fnm></au><au><snm>Garnier</snm><fnm>T</fnm></au><au><snm>Eiglmeier</snm><fnm>K</fnm></au><au><snm>Cole</snm><fnm>ST</fnm></au></aug><source>Mol Microbiol</source><pubdate>1999</pubdate><volume>32</volume><fpage>643</fpage><lpage>655</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1365-2958.1999.01383.x</pubid><pubid idtype="pmpid" link="fulltext">10320585</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Insertion and deletion events that define the pathogen <it>Mycobacterium avium </it>subsp. <it>paratuberculosis</it></p></title><aug><au><snm>Alexander</snm><fnm>DC</fnm></au><au><snm>Turenne</snm><fnm>CY</fnm></au><au><snm>Behr</snm><fnm>MA</fnm></au></aug><source>J Bacteriol</source><pubdate>2009</pubdate><volume>191</volume><fpage>1018</fpage><lpage>1025</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JB.01340-08</pubid><pubid idtype="pmcid">2632067</pubid><pubid idtype="pmpid" link="fulltext">19028885</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Genome-wide Analysis of the Emerging Infection with <it>Mycobacterium avium </it>subspecies <it>paratuberculosis </it>in the Arabian Camels (<it>Camelus dromedarius</it>)</p></title><aug><au><snm>Ghosh</snm><fnm>P</fnm></au><au><snm>Hsu</snm><fnm>C-Y</fnm></au><au><snm>Alyamani</snm><fnm>E</fnm></au><au><snm>Shehata</snm><fnm>MM</fnm></au><au><snm>Al-Dubaib</snm><fnm>MA</fnm></au><au><snm>Al-Naeem</snm><fnm>A</fnm></au><etal/></aug><source>PLoS ONE</source><pubdate>2012</pubdate><inpress/></bibl><bibl id="B38"><title><p>The neighbor-joining method: a new method for reconstructing phylogenetic trees</p></title><aug><au><snm>Saitou</snm><fnm>N</fnm></au><au><snm>Nei</snm><fnm>M</fnm></au></aug><source>Mol Biol Evol</source><pubdate>1987</pubdate><volume>4</volume><fpage>406</fpage><lpage>425</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">3447015</pubid></xrefbib></bibl><bibl id="B39"><title><p>Evolutionary distance estimation under heterogeneous substitution pattern among lineages</p></title><aug><au><snm>Tamura</snm><fnm>K</fnm></au><au><snm>Kumar</snm><fnm>S</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2002</pubdate><volume>19</volume><fpage>1727</fpage><lpage>1736</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/oxfordjournals.molbev.a003995</pubid><pubid idtype="pmpid" link="fulltext">12270899</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>MEGA5: Molecular Evolutionary Genetics Analysis using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods</p></title><aug><au><snm>Tamura</snm><fnm>K</fnm></au><au><snm>Peterson</snm><fnm>D</fnm></au><au><snm>Peterson</snm><fnm>N</fnm></au><au><snm>Stecher</snm><fnm>G</fnm></au><au><snm>Nei</snm><fnm>M</fnm></au><au><snm>Kumar</snm><fnm>S</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2011</pubdate><volume>28</volume><fpage>2731</fpage><lpage>2739</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msr121</pubid><pubid idtype="pmpid" link="fulltext">21546353</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Genomic polymorphisms for <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>diagnostics</p></title><aug><au><snm>Semret</snm><fnm>M</fnm></au><au><snm>Alexander</snm><fnm>DC</fnm></au><au><snm>Turenne</snm><fnm>CY</fnm></au><au><snm>de</snm><fnm>HP</fnm></au><au><snm>Overduin</snm><fnm>P</fnm></au><au><snm>van Soolingen</snm><fnm>D</fnm></au><etal/></aug><source>J Clin Microbiol</source><pubdate>2005</pubdate><volume>43</volume><fpage>3704</fpage><lpage>3712</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JCM.43.8.3704-3712.2005</pubid><pubid idtype="pmcid">1234005</pubid><pubid idtype="pmpid" link="fulltext">16081899</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Discovery of Stable and Variable Differences in the <it>Mycobacterium avium </it>subsp <it>paratuberculosi </it>Type I, II, and III Genomes by Pan-Genome Microarray Analysis</p></title><aug><au><snm>Castellanos</snm><fnm>E</fnm></au><au><snm>Aranaz</snm><fnm>A</fnm></au><au><snm>Gould</snm><fnm>KA</fnm></au><au><snm>Linedale</snm><fnm>R</fnm></au><au><snm>Stevenson</snm><fnm>K</fnm></au><au><snm>Alvarez</snm><fnm>J</fnm></au><etal/></aug><source>Appl Environ Microbiol</source><pubdate>2009</pubdate><volume>75</volume><fpage>676</fpage><lpage>686</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/AEM.01683-08</pubid><pubid idtype="pmcid">2632155</pubid><pubid idtype="pmpid" link="fulltext">19047395</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>On the evolution of 'Indian Bison type' strains of <it>Mycobacterium avium </it>subspecies <it>paratuberculosis</it></p></title><aug><au><snm>Sohal</snm><fnm>JS</fnm></au><au><snm>Singh</snm><fnm>SV</fnm></au><au><snm>Singh</snm><fnm>PK</fnm></au><au><snm>Singh</snm><fnm>AV</fnm></au></aug><source>Microbiol Res</source><pubdate>2010</pubdate><volume>165</volume><fpage>163</fpage><lpage>171</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.micres.2009.03.007</pubid><pubid idtype="pmpid" link="fulltext">19442500</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Mauve: multiple alignment of conserved genomic sequence with rearrangements</p></title><aug><au><snm>Darling</snm><fnm>AC</fnm></au><au><snm>Mau</snm><fnm>B</fnm></au><au><snm>Blattner</snm><fnm>FR</fnm></au><au><snm>Perna</snm><fnm>NT</fnm></au></aug><source>Genome Res</source><pubdate>2004</pubdate><volume>14</volume><fpage>1394</fpage><lpage>1403</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.2289704</pubid><pubid idtype="pmcid">442156</pubid><pubid idtype="pmpid" link="fulltext">15231754</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>Defining the stressome of <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>in vitro and in naturally infected cows</p></title><aug><au><snm>Wu</snm><fnm>CW</fnm></au><au><snm>Schmoller</snm><fnm>SK</fnm></au><au><snm>Shin</snm><fnm>SJ</fnm></au><au><snm>Talaat</snm><fnm>AM</fnm></au></aug><source>J Bacteriol</source><pubdate>2007</pubdate><volume>189</volume><fpage>7877</fpage><lpage>7886</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JB.00780-07</pubid><pubid idtype="pmcid">2168719</pubid><pubid idtype="pmpid" link="fulltext">17693514</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>Genome scale comparison of <it>Mycobacterium avium </it>subsp. <it>paratuberculosis </it>with <it>Mycobacterium avium </it>subsp. <it>avium </it>reveals potential diagnostic sequences</p></title><aug><au><snm>Bannantine</snm><fnm>JP</fnm></au><au><snm>Baechler</snm><fnm>E</fnm></au><au><snm>Zhang</snm><fnm>Q</fnm></au><au><snm>Li</snm><fnm>L</fnm></au><au><snm>Kapur</snm><fnm>V</fnm></au></aug><source>J Clin Microbiol</source><pubdate>2002</pubdate><volume>40</volume><fpage>1303</fpage><lpage>1310</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JCM.40.4.1303-1310.2002</pubid><pubid idtype="pmcid">140397</pubid><pubid idtype="pmpid" link="fulltext">11923349</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>Deep sequencing of the zebrafish transcriptome response to mycobacterium infection</p></title><aug><au><snm>Hegedus</snm><fnm>Z</fnm></au><au><snm>Zakrzewska</snm><fnm>A</fnm></au><au><snm>Agoston</snm><fnm>VC</fnm></au><au><snm>Ordas</snm><fnm>A</fnm></au><au><snm>Racz</snm><fnm>P</fnm></au><au><snm>Mink</snm><fnm>M</fnm></au><etal/></aug><source>Mol Immunol</source><pubdate>2009</pubdate><volume>46</volume><fpage>2918</fpage><lpage>2930</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molimm.2009.07.002</pubid><pubid idtype="pmpid" link="fulltext">19631987</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>IMG ER: a system for microbial genome annotation expert review and curation</p></title><aug><au><snm>Markowitz</snm><fnm>VM</fnm></au><au><snm>Mavromatis</snm><fnm>K</fnm></au><au><snm>Ivanova</snm><fnm>NN</fnm></au><au><snm>Chen</snm><fnm>IM</fnm></au><au><snm>Chu</snm><fnm>K</fnm></au><au><snm>Kyrpides</snm><fnm>NC</fnm></au></aug><source>Bioinformatics</source><pubdate>2009</pubdate><volume>25</volume><fpage>2271</fpage><lpage>2278</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btp393</pubid><pubid idtype="pmpid" link="fulltext">19561336</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement</p></title><aug><au><snm>Darling</snm><fnm>AE</fnm></au><au><snm>Mau</snm><fnm>B</fnm></au><au><snm>Perna</snm><fnm>NT</fnm></au></aug><source>PLoS One</source><pubdate>2010</pubdate><volume>5</volume><fpage>e11147</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0011147</pubid><pubid idtype="pmcid">2892488</pubid><pubid idtype="pmpid" link="fulltext">20593022</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Molecular evolution of a pathogenicity island from enterohemorrhagic <it>Escherichia coli </it>O157:H7</p></title><aug><au><snm>Perna</snm><fnm>NT</fnm></au><au><snm>Mayhew</snm><fnm>GF</fnm></au><au><snm>Posfai</snm><fnm>G</fnm></au><au><snm>Elliott</snm><fnm>S</fnm></au><au><snm>Donnenberg</snm><fnm>MS</fnm></au><au><snm>Kaper</snm><fnm>JB</fnm></au><etal/></aug><source>Infect Immun</source><pubdate>1998</pubdate><volume>66</volume><fpage>3810</fpage><lpage>3817</lpage><xrefbib><pubidlist><pubid idtype="pmcid">108423</pubid><pubid idtype="pmpid" link="fulltext">9673266</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Whole-genome shotgun optical mapping of <it>Deinococcus radiodurans</it></p></title><aug><au><snm>Lin</snm><fnm>J</fnm></au><au><snm>Qi</snm><fnm>R</fnm></au><au><snm>Aston</snm><fnm>C</fnm></au><au><snm>Jing</snm><fnm>J</fnm></au><au><snm>Anantharaman</snm><fnm>TS</fnm></au><au><snm>Mishra</snm><fnm>B</fnm></au><etal/></aug><source>Science</source><pubdate>1999</pubdate><volume>285</volume><fpage>1558</fpage><lpage>1562</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.285.5433.1558</pubid><pubid idtype="pmpid">10477518</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Validation of rice genome sequence by optical mapping</p></title><aug><au><snm>Zhou</snm><fnm>S</fnm></au><au><snm>Bechner</snm><fnm>MC</fnm></au><au><snm>Place</snm><fnm>M</fnm></au><au><snm>Churas</snm><fnm>CP</fnm></au><au><snm>Pape</snm><fnm>L</fnm></au><au><snm>Leong</snm><fnm>SA</fnm></au><etal/></aug><source>BMC Genomics</source><pubdate>2007</pubdate><volume>8</volume><fpage>278</fpage><lpage>295</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-8-278</pubid><pubid idtype="pmcid">2048515</pubid><pubid idtype="pmpid" link="fulltext">17697381</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>A whole-genome shotgun optical map of <it>Yersinia pestis </it>strain KIM</p></title><aug><au><snm>Zhou</snm><fnm>S</fnm></au><au><snm>Deng</snm><fnm>W</fnm></au><au><snm>Anantharaman</snm><fnm>TS</fnm></au><au><snm>Lim</snm><fnm>A</fnm></au><au><snm>Dimalanta</snm><fnm>ET</fnm></au><au><snm>Wang</snm><fnm>J</fnm></au><etal/></aug><source>Appl Environ Microbiol</source><pubdate>2002</pubdate><volume>68</volume><fpage>6321</fpage><lpage>6331</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/AEM.68.12.6321-6331.2002</pubid><pubid idtype="pmcid">134435</pubid><pubid idtype="pmpid" link="fulltext">12450857</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>Shotgun optical mapping of the entire <it>Leishmania major </it>Friedlin genome</p></title><aug><au><snm>Zhou</snm><fnm>S</fnm></au><au><snm>Kile</snm><fnm>A</fnm></au><au><snm>Kvikstad</snm><fnm>E</fnm></au><au><snm>Bechner</snm><fnm>M</fnm></au><au><snm>Severin</snm><fnm>J</fnm></au><au><snm>Forrest</snm><fnm>D</fnm></au><etal/></aug><source>Mol Biochem Parasitol</source><pubdate>2004</pubdate><volume>138</volume><fpage>97</fpage><lpage>106</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molbiopara.2004.08.002</pubid><pubid idtype="pmpid" link="fulltext">15500921</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>A single molecule scaffold for the maize genome</p></title><aug><au><snm>Zhou</snm><fnm>S</fnm></au><au><snm>Wei</snm><fnm>F</fnm></au><au><snm>Nguyen</snm><fnm>J</fnm></au><au><snm>Bechner</snm><fnm>M</fnm></au><au><snm>Potamousis</snm><fnm>K</fnm></au><au><snm>Goldstein</snm><fnm>S</fnm></au><etal/></aug><source>PLoS Genet</source><pubdate>2009</pubdate><volume>5</volume><fpage>e1000711</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.1000711</pubid><pubid idtype="pmcid">2774507</pubid><pubid idtype="pmpid" link="fulltext">19936062</pubid></pubidlist></xrefbib></bibl><bibl id="B56"><title><p>Separation of yeast chromosome-sized DNAs by pulsed field gradient gel electrophoresis</p></title><aug><au><snm>Schwartz</snm><fnm>DC</fnm></au><au><snm>Cantor</snm><fnm>CR</fnm></au></aug><source>Cell</source><pubdate>1984</pubdate><volume>37</volume><fpage>67</fpage><lpage>75</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0092-8674(84)90301-5</pubid><pubid idtype="pmpid" link="fulltext">6373014</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>A microfluidic system for large DNA molecule arrays</p></title><aug><au><snm>Dimalanta</snm><fnm>ET</fnm></au><au><snm>Lim</snm><fnm>A</fnm></au><au><snm>Runnheim</snm><fnm>R</fnm></au><au><snm>Lamers</snm><fnm>C</fnm></au><au><snm>Churas</snm><fnm>C</fnm></au><au><snm>Forrest</snm><fnm>DK</fnm></au><etal/></aug><source>Anal Chem</source><pubdate>2004</pubdate><volume>76</volume><fpage>5293</fpage><lpage>5301</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/ac0496401</pubid><pubid idtype="pmpid" link="fulltext">15362885</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>Shotgun optical maps of the whole <it>Escherichia coli </it>O157:H7 genome</p></title><aug><au><snm>Lim</snm><fnm>A</fnm></au><au><snm>Dimalanta</snm><fnm>ET</fnm></au><au><snm>Potamousis</snm><fnm>KD</fnm></au><au><snm>Yen</snm><fnm>G</fnm></au><au><snm>Apodoca</snm><fnm>J</fnm></au><au><snm>Tao</snm><fnm>C</fnm></au><etal/></aug><source>Genome Res</source><pubdate>2001</pubdate><volume>11</volume><fpage>1584</fpage><lpage>1593</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.172101</pubid><pubid idtype="pmcid">311123</pubid><pubid idtype="pmpid" link="fulltext">11544203</pubid></pubidlist></xrefbib></bibl><bibl id="B59"><title><p>Whole-genome shotgun optical mapping of <it>Rhodobacter sphaeroides </it>strain 2.4.1 and its use for whole-genome shotgun sequence assembly</p></title><aug><au><snm>Zhou</snm><fnm>S</fnm></au><au><snm>Kvikstad</snm><fnm>E</fnm></au><au><snm>Kile</snm><fnm>A</fnm></au><au><snm>Severin</snm><fnm>J</fnm></au><au><snm>Forrest</snm><fnm>D</fnm></au><au><snm>Runnheim</snm><fnm>R</fnm></au><etal/></aug><source>Genome Res</source><pubdate>2003</pubdate><volume>13</volume><fpage>2142</fpage><lpage>2151</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.1128803</pubid><pubid idtype="pmcid">403714</pubid><pubid idtype="pmpid" link="fulltext">12952882</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><title><p>High-resolution genome structure by single molecule analysis</p></title><aug><au><snm>Teague</snm><fnm>B</fnm></au><au><snm>Waterman</snm><fnm>MS</fnm></au><au><snm>Goldstein</snm><fnm>S</fnm></au><au><snm>Potamousis</snm><fnm>K</fnm></au><au><snm>Zhou</snm><fnm>S</fnm></au><au><snm>Reslewic</snm><fnm>S</fnm></au><etal/></aug><source>Proc Natl Acad Sci USA</source><pubdate>2010</pubdate><volume>107</volume><fpage>10848</fpage><lpage>10853</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0914638107</pubid><pubid idtype="pmcid">2890719</pubid><pubid idtype="pmpid" link="fulltext">20534489</pubid></pubidlist></xrefbib></bibl><bibl id="B61"><title><p>A shotgun optical map of the entire <it>Plasmodium falciparum </it>genome</p></title><aug><au><snm>Lai</snm><fnm>Z</fnm></au><au><snm>Jing</snm><fnm>J</fnm></au><au><snm>Aston</snm><fnm>C</fnm></au><au><snm>Clarke</snm><fnm>V</fnm></au><au><snm>Apodaca</snm><fnm>J</fnm></au><au><snm>Dimalanta</snm><fnm>ET</fnm></au><etal/></aug><source>Nat Genet</source><pubdate>1999</pubdate><volume>23</volume><fpage>309</fpage><lpage>313</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/15484</pubid><pubid idtype="pmpid" link="fulltext">10610179</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>An algorithm for assembly of ordered restriction maps from single DNA molecules</p></title><aug><au><snm>Valouev</snm><fnm>A</fnm></au><au><snm>Schwartz</snm><fnm>DC</fnm></au><au><snm>Zhou</snm><fnm>S</fnm></au><au><snm>Waterman</snm><fnm>MS</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2006</pubdate><volume>103</volume><fpage>15770</fpage><lpage>15775</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0604040103</pubid><pubid idtype="pmcid">1635078</pubid><pubid idtype="pmpid" link="fulltext">17043225</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm>
</art>