<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2002-3-2-research0010</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>A database for the provisional identification of species using only genotypes: web-based genome profiling</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Watanabe</snm>
               <fnm>Takehiro</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A2">
               <snm>Saito</snm>
               <fnm>Ayumu</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A3">
               <snm>Takeuchi</snm>
               <fnm>Yusuke</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A4">
               <snm>Naimuddin</snm>
               <fnm>Mohammed</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A5" ca="yes">
               <snm>Nishigaki</snm>
               <fnm>Koichi</fnm>
               <insr iid="I1"/>
               <email>koichi@fms.saitama-u.ac.jp</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Functional Materials Science, Saitama University, 255 Shimo-Okubo, Saitama, Saitama 338-8570, Japan</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2002</pubdate>
         <volume>3</volume>
         <issue>2</issue>
         <fpage>research0010.1</fpage>
         <lpage>research0010.8</lpage>
         <url>http://genomebiology.com/2002/3/2/research/0010</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2002-3-2-research0010</pubid>
               <pubid idtype="pmpid">11864372</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>28</day>
               <month>8</month>
               <year>2001</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>22</day>
               <month>10</month>
               <year>2001</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>5</day>
               <month>12</month>
               <year>2001</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>28</day>
               <month>1</month>
               <year>2002</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2002</year>
         <collab>Watanabe et al., licensee BioMed Central Ltd</collab>
      </cpyrt>
      <shortabs>
         <p>An approach that will allow rapid and accurate phylogenetic comparison of any unknown microbial strain to all known type strains has been developed, enabling tentative assignments of strains to species. The approach is based on two main technologies: genome profiling and Internet-based databases.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>For a long time one could not imagine being able to identify species on the basis of genotype only as there were no technological means to do so. But conventional phenotype-based identification requires much effort and a high level of skill, making it almost impossible to analyze a huge number of organisms, as, for example, in microbe-related biological disciplines. Comparative analysis of 16S rRNA has been changing the situation, however. We report here an approach that will allow rapid and accurate phylogenetic comparison of any unknown strain to all known type strains, enabling tentative assignments of strains to species. The approach is based on two main technologies: genome profiling and Internet-based databases.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>A complete procedure for provisional identification of species using only their genomes is presented, using random polymerase chain reaction, temperature-gradient gel electrophoresis, image processing to generate 'species-identification dots' (spiddos) and data processing. A database website for this purpose was also constructed and operated successfully. The protocol was standardized to make the system reproducible and reliable. The overall methodology thus established has remarkable aspects in that it enables non-experts to obtain an initial species identification without a lot of effort and is self-developing; that is, species can be determined more definitively as the database is used more and accumulates more genome profiles.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>We have devised a methodology that enables provisional identification of species on the basis of their genotypes only. It is most useful for microbe-related disciplines as they face the most serious difficulties in species identification.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010013">Methods</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>A biological species is usually defined in principle as a set of actually or potentially interbreeding organisms, but as interbreeding is very difficult to measure, species have in practice been identified by their phenotypic traits. Until recently, progress in most microbe-related disciplines has been hampered by the enormous effort needed to identify less prominent traits. We are now in an age when we can identify species based on the genome (genotype) [<abbr bid="B1">1</abbr>], although this does not change the principle that taxonomy is defined by phenotypes [<abbr bid="B2">2</abbr>]: according to the generally accepted rules of taxonomy, a strain belongs to a species if it falls within the range of phenotypes that define that species. This situation has been brought about by the success of the ribosomal RNA approach to phylogenetics [<abbr bid="B3">3</abbr>,<abbr bid="B4">4</abbr>,<abbr bid="B5">5</abbr>]. Well-conserved molecules, such as 16S rRNA in particular, have been used to give a species a molecular identifier and to draw phylogenetic relationships. The 16S rRNA-based approach has been widely accepted and has proved successful in phylogenetic tree-making and even in identifying species. In this context, the Ribosomal Database Project has been established [<abbr bid="B6">6</abbr>]. There are other similar approaches, such as one based on the gyrase gene [<abbr bid="B7">7</abbr>] and multilocus sequence typing [<abbr bid="B8">8</abbr>]. Nonetheless, it has been impossible in practice to analyze all the constituents of a microbial population, not only because of the huge size of such populations (more than 10<sup>8</sup> cells per ml) but also because of lack of suitable methodology. Although there are methods other than gene and genome sequencing for analyzing genomes, such as restriction-fragment length polymorphism (RFLP), amplified fragment-length polymorphism (AFLP), Octamer-based genome scanning (OBGS), random polymerase chain reaction (PCR) and others [<abbr bid="B9">9</abbr>,<abbr bid="B10">10</abbr>,<abbr bid="B11">11</abbr>,<abbr bid="B12">12</abbr>], most cannot be used to identify species without a knowledge of phenotypic traits. In reality, there is no general methodology that enables us to identify species by genotype only, although many approaches use genotypic information (DNA sequences) to complement phenotypic information.</p>
         <p>We have recently demonstrated the possibility of species identification by genotype using genome profiling [<abbr bid="B13">13</abbr>], which is a temperature-gradient gel electrophoresis (TGGE) analysis of random PCR products [<abbr bid="B14">14</abbr>]. In particular, the use of 'species-identification dots' (spiddos), which are feature points in genome profiles, is very useful for objective and reproducible data processing [<abbr bid="B15">15</abbr>,<abbr bid="B16">16</abbr>]. We present here a universal method for provisional genotype-based species identification based on these technological advances and using the Internet environment, which enables us to identify species in general. This paper also presents the important concepts of genome distance and genome sequence space, which are essential for species identification based on genotype.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>Figure <figr fid="F1">1</figr> shows one of the results obtained using the protocols described in the Materials and methods. For the query species, the closest species as judged by spiddos and a list of genome profiles within the tolerance (&#964;), together with the annotation attached to it, is given (Figure <figr fid="F1">1</figr>). If there is a genome profile among the list annotated with species, then it means that the query species is identified with the confidence defined by the pattern similarity score (PaSS; see Materials and methods). If the value of PaSS is very high (that is, close to unity), then it is highly probable that it is indeed an exact match. In contrast, if the value is not sufficiently close to unity, then it may be only a related species (not the exact species), belonging to the same genus or family or any of the higher taxonomical categories, depending on the value of PaSS. Although we do not yet have enough data to determine the PaSS value at which it is safe to identify a species, we have a preliminary idea, based on experience, that 0.95 (<it>Z</it> score &#8776; 4) may be a critical value [<abbr bid="B15">15</abbr>]. The important challenge of how to reconcile the difference between identification of species by phenotype, which conventional taxonomy has adopted, with that based on genotype, is discussed later. An important aspect of this system is that one does not need to be a specialist in the relevant biological field to obtain an initial identification of an unknown organism. All that is required is to register the genome profile of the unknown species on the database. Therefore, an incomplete set of phenotypic data, which do not reach the criteria for species identification (say, peculiar behaviors or unusual properties), can also be registered and later used without having to undertake further laborious phenotypic identification (Figure <figr fid="F2">2</figr>). All the information regarding a given species (in other words, all the entries within a certain PaSS value) will be connected automatically, generating a volume of data on a particular species. Scientists can work cooperatively to identify species and collect their phenotypic traits (Figure <figr fid="F2">2</figr>). In conventional approaches to identification, most of which have been phenotype-based, those data that failed to meet the required criteria for identification were left unconnected, and could not be used later because there was no convenient way of correlating them with a given species without knowing the species name (Figure <figr fid="F2">2</figr>). Thus, our approach of genotype-based species identification, utilizing genome profile and the Internet, will be of great help to the field of taxonomy.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Result for a trial of on-web genome profiling</p>
            </caption>
            <text>
               <p>Result for a trial of on-web genome profiling. After uploading a genome-profile image and assigning spiddos and then subjecting it to a database search, a result will be displayed as shown, with the values of PaSS and genome distance to the closest species in the database. Note that a PaSS value close to unity infers that the query species is close to (or even the same as) the one retrieved from the database. The information on the selected species (right) already registered in the database can be viewed by clicking the button.</p>
            </text>
            <graphic file="gb-2002-3-2-research0010-1"/>
         </fig>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>How to assign species in phenotype-based and genotype-based approaches</p>
            </caption>
            <text>
               <p>How to assign species in phenotype-based and genotype-based approaches. Phenotype-based approaches (indicated by <it>p1</it>, <it>p2</it> and <it>p3</it>) are heavily dependent on the traits (phenotypic or behavioral, appearing as different shapes) to identify species. In order to clarify such traits, sophisticated instruments and expert skills are often required. <it>p2</it> represents a successful identification attempt, where all the required traits for identifying the species have been obtained, whereas <it>p1</it> and <it>p3</it> are not successful because of insufficient information. Identity confirmed by genome profile in the genotype-based approaches makes it easy to compare and link unknown species (<it>g</it><it><sub>1</sub></it>-<it>g</it><it><sub>3</sub></it>) to known ones (<it>g</it><it><sub>0</sub></it>) without the requirement for extensive knowledge of phenotypic traits. Thus, the traits of each organism can be attributed to a particular species.</p>
            </text>
            <graphic file="gb-2002-3-2-research0010-2"/>
         </fig>
         <sec>
            <st>
               <p>Key concepts of the on-web genome profiling</p>
            </st>
            <p>In evaluating the effectiveness of this methodology, the nature of PaSS must first be considered, as it plays the most important part in the method. As PaSS is calculated on the basis of the coordinates of spiddos (see Equation 1 in Materials and methods), the nature of spiddos must be thoroughly investigated. If two genomic DNAs contain common sequence regions that can be amplified by random PCR using the same primer, the resultant DNAs will usually generate similar spiddos (by definition, the spiddos obtained by TGGE represent the crucial points of a genome profile, points at which the temperature corresponds to the beginning of a prominent structural transition in DNA [<abbr bid="B15">15</abbr>]). As shown schematically in Figure <figr fid="F3">3</figr>, two corresponding spiddos derived from two closely related species can be connected by a displacement vector, which consists of two independent elements of mobility (&#956;) and temperature (&#952;). The differences in each element (&#916;&#956; and &#916;&#952;) can be related to the differences between two sequences as shown in Figure <figr fid="F3">3</figr>. The displacement in the ordinate is caused by the difference in length between the two DNAs and is caused by deletion or insertion, whereas that in the abscissa is mainly caused by point mutation (although insertion/deletion can also contribute).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Causes of displacement in spiddos</p>
               </caption>
               <text>
                  <p>Causes of displacement in spiddos. The displacement between two spiddos (P<sub>i</sub> and P<sub>i</sub>') from two genome profiles can be decomposed into two elements, &#916;&#956; and &#916;&#952;. &#916;&#952;, which results from the shift in melting temperature, must have been caused mainly by point mutation and sometimes by deletion/insertion. On the other hand, &#916;&#956;, which is a measure of length, must be a result of insertion/deletion events occurring in the DNAs.</p>
               </text>
               <graphic file="gb-2002-3-2-research0010-3"/>
            </fig>
            <p>As the extent of these changes is roughly proportional to the evolutionary time since the species diverged, we can expect that the summation of the displacement of each spiddo is approximately proportional to the time since divergence, and thus to the genome-to-genome distance. By using a sufficient number of spiddos, we can obtain statistically reliable results. Empirically, we know that 8-10 spiddos, which can be obtained from a single genome profile, can be significant. However, since the more spiddos the better the result, we tentatively made it a rule to adopt four genome profiles (&#8776; 32-40 spiddos) - that is, four random-PCR products - as a current standard of initial species identification. Therefore, PaSS has the theoretical and empirical basis to be used as a measure of similarity between genomes, although the extent of its effectiveness remains to be shown experimentally as data accumulates. We have introduced a measure of distance, <it>d'</it>, obtained from PaSS as formulated in Equation 2 (see Materials and methods), for the sake of convenience [<abbr bid="B15">15</abbr>].</p>
            <p>We call <it>d'</it> a genome sub-distance because it is based not on the whole but a part of the genome sequence. Thus, we introduce (true) genome distance, <it>d</it>, as in Equation 3 (see Materials and methods). Genome distance must have a close relationship with genetic distance, as defined by Nei and others [<abbr bid="B17">17</abbr>,<abbr bid="B18">18</abbr>,<abbr bid="B19">19</abbr>], although there is a difference in the definition. The genetic distance based on sequences is basically the Hamming distance (the number of different letters at each corresponding position of two sequences of letters that are optimally aligned) between two nucleotide (or amino-acid) sequences. In aligning sequences arbitrariness is introduced, depending on the algorithm and parameters used [<abbr bid="B20">20</abbr>]. Another constraint on genetic distance is that it is usually obtained from a limited number of genes, although that is also the case for genome distance. As genome distance is easier to obtain in practice using our method, it should be easier to obtain a lot of data on it compared with genetic distance. On the basis of <it>d</it> (in practice <it>d'</it>), we can construct phylogenetic trees and genome sequence space (an imaginary spherical space in which all the genomes (individuals) can be uniquely located in a finite manner based on the distance between genomes, providing clusters of species (K.N., unpublished observations)).</p>
            <p>Although the applicability and effectiveness of genome distance for such purposes needs to be further investigated, it is obvious that an organism that has near-zero genome distance from a certain standard species, as an average over four or more genome sub-distances obtained from as many genome profiles, can be easily assigned to that same species with a high level of confidence. We are not claiming, however, to be able to give the correct taxonomical name to any species using this method. The greater the number of strains registered in the database, the more easily will a species be assigned. Basically, no special efforts, except expanding the database and using sophisticated algorithms, are necessary to raise the proportion of correct assignments. This is the self-developing nature of the database. Therefore, this methodology has two potential great advantages for tentative species identification: first, expertise is not always necessary; and second, database building can be carried out in a self-developing manner (that is, by acquiring more and more accurate data on species) with no waste of information.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <p>The principle that a species can be identified on the basis of its similarity to a standard species remains unchanged in the shift from phenotype-based to genotype-based methodology. Therefore, the essence of our methodology resides in finding a sufficiently closely related species by way of a measure of similarity - a pattern similarity score (PaSS). Note that this genotype-based methodology cannot define species under the current taxonomy regime, in which phenotype is used as the defining characteristic of species [<abbr bid="B2">2</abbr>].</p>
         <sec>
            <st>
               <p>General protocol for on-web genome profiling</p>
            </st>
            <p>Although genome profiling is the basic technology for our current purpose, provisional species identification based on genotype can be fulfilled only by using computer-aided database technology, which is most effectively constructed in the Internet environment. As this methodology is based on use by a large number of scientists, the protocol must be designed to be reproducible and easy to carry out. The processes have been deliberately designed with this in view, and are presented on our website [<abbr bid="B21">21</abbr>].</p>
            <p>Genome profiling consists of two basic technologies: random PCR and TGGE, which have been well established [<abbr bid="B22">22</abbr>,<abbr bid="B23">23</abbr>,<abbr bid="B24">24</abbr>]. However, if it is to be used for the purpose of general and universal applications, well-defined standardization is absolutely required to obtain significant results. We have carried out such standardization for genome profiling. The main topics included in the protocol are: preparation of genome DNAs; the set of primers used for random PCR and the internal reference DNAs used for TGGE; experimental conditions for random PCR; and the experimental conditions for TGGE. The protocol also includes the related procedures (extraction of spiddos, calculation of PaSS, and others).</p>
            <sec>
               <st>
                  <p>Preparation of genomic DNA</p>
               </st>
               <p>Briefly, the alkaline extraction method was selected for simplicity as follows: 10 mg of cells or tissue are placed in an eppendorf tube and heated for 1 min at 100&#176;C. The cells are mixed with 10 &#956;g 0.5 M NaOH and stirred for 1 min (or 5 min or so for stiffer cells such as yeast) using a microhomogenizer, if necessary, with added quartz sand. Immediately, a 5 &#956;l aliquot of the lysate is mixed with 495 &#956;l 100 mM Tris-HCl (pH 8.0). Usually, a 3 &#956;l aliquot of the mixture thus obtained is used as a template for 100-&#956;l-scale PCR. In some cases, such as <it>Escherichia coli</it>, which does not have a strong cell envelope, these cell-breakdown processes can even be omitted and the cells can be directly used in PCR. In other cases, such as fungi, thorough mechanical treatment (grinding with quartz sand) is needed. Thus, minimal and common procedures are preferred as much as possible for simplicity and generality in so far as they are consistent with the purity and integrity of the DNA samples. DNA samples thus prepared were shown to be identical with those DNAs prepared by the more elaborate conventional method of Thomas [<abbr bid="B25">25</abbr>] as a PCR template [<abbr bid="B26">26</abbr>]. This seems quite natural, as PCR can be carried out successfully in the presence of contaminating proteins or polysaccharides, irrespective of the DNA cleavages introduced, unless the regions of DNA to be amplified are completely cleaved. Nonspecific binding of proteins, which gives footprint effects, will change the yield but not the molecular ratio of random PCR products as long as the binding is totally stochastic. We also adopt a universal, convenient definition for genome DNA - that it is composed of all DNAs thus prepared, including dynamic elements such as satellite and organelle DNAs, and is irrespective of haploid or diploid status of the cells. Therefore, the DNA samples for genome profiling can be prepared in a common, technically well-defined method for all organisms.</p>
            </sec>
            <sec>
               <st>
                  <p>Set of primers for random PCR</p>
               </st>
               <p>Technically important restrictions are introduced by selecting a standard set of primers for random PCR (T.W., A.S., M.N. and K.N., unpublished observations). It is important to carry out random PCR with all kinds of organisms using the same primers so that all species can be compared on the same platform. We have initially selected four oligonucleotides (pfM12: dAGAACGCGCCTG; pfM19: dCAGGGCGCGTAC; d(TGC)<sub>3</sub>; d(T<sub>3</sub>G<sub>3</sub>)<sub>2</sub>) as a standard set of random PCR primers. The primers pfM12 and pfM19 were selected on the basis of the abundant experimental background on them, whereas d(TGC)<sub>3</sub> and d(T<sub>3</sub>G<sub>3</sub>)<sub>2</sub> were rather theoretically favored (K.N. and A.S., unpublished observations). 'Oligonucleotide-stickiness analysis', which monitors oligonucleotide-binding sites along the template DNA (K.N. and A.S., unpublished observations), was exploited to determine the universal primers and moderately sticky oligonucleotides were selected. These four primers can be fluorescently labeled for convenience. More primers can be used to obtain more detailed information or to supplement insufficient information provided by the four primers about particular pairs of organisms. The information provided by such extra primers can explore in a more detailed manner the local landscape in genome sequence space. In contrast, the standard primers give us rough relationship between any pair of organisms.</p>
            </sec>
            <sec>
               <st>
                  <p>Internal reference DNAs</p>
               </st>
               <p>Internal reference bands, which are provided by DNAs of a known melting pattern, are used to calibrate each genome profile, giving highly reproducible results [<abbr bid="B22">22</abbr>].</p>
            </sec>
            <sec>
               <st>
                  <p>Conditions for random PCR</p>
               </st>
               <p>Random PCR is usually carried out under standard conditions: 10 ng template DNA, 50 pmol primer DNA, 250 &#956;M of each dNTP, 50 mM Tris-HCl (pH 8.8), 15 mM (NH<sub>4</sub>)<sub>2</sub>SO<sub>4</sub>, 10 mM MgCl<sub>2</sub>, 0.45% Triton X-100, 200 &#956;g/ml bovine serum albumin and 2 units of <it>Taq</it> DNA polymerase (Biotech International). PCR was carried out in 30 cycles of 30 sec at 94&#176;C, 2 min at 28&#176;C and 2 min at 47&#176;C, using a thermal cycler PTC-100TM (MJ Research, MA). Annealing temperature can be attenuated depending on the size of the template DNA (in general, the larger the template, the greater the number of DNA fragments generated by random PCR).</p>
            </sec>
            <sec>
               <st>
                  <p>Experimental conditions for TGGE</p>
               </st>
               <p>TGGE analysis of random PCR products is carried out with co-migrating internal reference DNAs. TGGE can be either the conventional type or a micronized type [<abbr bid="B16">16</abbr>]. At least two feature points are extracted from the band pattern of the internal reference DNA(s), and then used for calibration of genome profiles or species identification dots (spiddos) [<abbr bid="B15">15</abbr>] as described below. After calibration, sufficiently high reproducibility of the pattern of spiddos is guaranteed [<abbr bid="B16">16</abbr>].</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Extraction of spiddos</p>
            </st>
            <p>Although the genome profile is a kind of reduction of information contained in the whole genome sequence, it is still too complicated to deal with as it is. Thus, a second reduction is carried out by extracting feature points (spiddos) from the genome profiles. Double-stranded DNAs are known to melt in an intrinsically determined manner, depending on their sequence, when heated gradually [<abbr bid="B27">27</abbr>]. All the intermediate states of DNA have their own structure and mobility in gel. Spiddos correspond to the structural transition points appearing in band patterns (Figure <figr fid="F4">4</figr>). Currently, there are four kinds of spiddos: initial melting point (P<sub>ini</sub>); minimum mobility point (P<sub>min</sub>); isomobility point (P<sub>iso</sub>); and the end melting point (P<sub>end</sub>). Empirically, P<sub>ini</sub> is the most reproducible. Therefore, P<sub>ini</sub> is recommended for working spiddos wherever possible. Further details are given in the standard protocol on our website [<abbr bid="B21">21</abbr>].</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Spiddo assignment</p>
               </caption>
               <text>
                  <p>Spiddo assignment. <b>(a)</b> A genome profile before processing. The temperature gradient is set from left (low) to right (high) and the direction of migration is top to bottom; IR, internal reference band used for normalization. <b>(b)</b> The spiddos of the genome profile are marked with red filled circles; those of the IR are indicated with red open circles. All the spiddos except for the rightmost one are at the first transition of DNA melting (P<sub>ini</sub>). Although there are four kinds of spiddos (dots), as described in Materials and methods, P<sub>ini</sub> is used for simplicity as these points are clearly visible.</p>
               </text>
               <graphic file="gb-2002-3-2-research0010-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Calculation of PaSS and genome distance</p>
            </st>
            <p>A set of spiddos (around ten), assigned to a genome profile on a computer display, is processed to calculate the normalized mobility and temperature of each point. A measure of similarity of two genomes - the PaSS - is introduced as follows.</p>
            <p>
               <graphic file="gb-2002-3-2-research0010-i1.gif"/>
            </p>
            <p><graphic file="gb-2002-3-2-research0010-i3.gif"/> of each spiddo (1 to n)is its position vector and is a function of temperature and mobility (that is, |<graphic file="gb-2002-3-2-research0010-i3.gif"/>| = <it>P</it> (<it>T</it>, <it>m</it>)). The superscripts 1 and 2 in parentheses in Equation (1) represent genomes 1 and 2, respectively. PaSS will be unity for a complete match in two sets of spiddos. In general, 0 &#8804; PaSS &#8804; 1. Genome distance and genome sub-distance (<it>d'</it>) are derived from PaSS as follows:</p>
            <p><it>d'</it> = (1 - PaSS)/PaSS  (2)</p>
            <p>
               <graphic file="gb-2002-3-2-research0010-i2.gif"/>
            </p>
            <p>Where <it>d'</it>(<it>i</it>) is the <it>i</it>th genome sub-distance obtained with the <it>i</it>th primer used for random PCR.</p>
         </sec>
         <sec>
            <st>
               <p>Computer-aided data acquisition</p>
            </st>
            <p>The overall process of obtaining an on-web genome profile is shown in Figure <figr fid="F5">5</figr>. There are two steps in this methodology: the local phase and the database phase. In the local phase, genome profiling is carried out for the organism of interest, following the standard protocol presented on our website [<abbr bid="B21">21</abbr>] and outlined in the previous sections. After obtaining a genome profile, the database is accessed and the database phase is begun as a client. The database site requires the client to input an image of the genome profile, to assign spiddos on the genome profile (Figure <figr fid="F4">4</figr>), and to fill in relevant data on the online form. The site will search the database for species with the most similar pattern of spiddos by calculating the PaSS [<abbr bid="B15">15</abbr>].</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>On-web genome profiling</p>
               </caption>
               <text>
                  <p>On-web genome profiling. The overall procedures to tentatively identify species by genotype only (genome profiling) are shown. Genome profiles are prepared by TGGE of random PCR products obtained from the genome DNA of a particular organism at the client site (the local phase). After accessing the database (represented by the red cylinder), a client (red circle) has spiddos assigned to each genome profile, which are used to calculate the measure of similarity, PaSS, and will finally get an output of the nearest species registered in the database (this phase of the process is called the database phase). Genome sequence space, with the location of the genomes A and B, is shown in green above the database of genomes.</p>
               </text>
               <graphic file="gb-2002-3-2-research0010-5"/>
            </fig>
         </sec>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This study was supported in part by a Grant-in-Aid (09272203) from the Ministry of Education, Science, Sports and Culture of Japan. M.N. was supported by the Japan Society for Promotion of Science (13001147).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Towards a natural system of organisms: Proposal for the domains Archaea, Bacteria, and Eukarya.</p>
            </title>
            <aug>
               <au>
                  <snm>Woese</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Kandler</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Wheelis</snm>
                  <fnm>ML</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1990</pubdate>
            <volume>87</volume>
            <fpage>4576</fpage>
            <lpage>4579</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">54159</pubid>
                  <pubid idtype="pmpid" link="fulltext">2112744 </pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Report of the ad hoc committee on reconciliation of approaches to bacterial systematics.</p>
            </title>
            <aug>
               <au>
                  <snm>Wayne</snm>
                  <fnm>LG</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Colwell</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Grimont</snm>
                  <fnm>PAD</fnm>
               </au>
               <au>
                  <snm>Kandler</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Krichevsky</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>LH</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>WEC</fnm>
               </au>
               <au>
                  <snm>Murray</snm>
                  <fnm>RGE</fnm>
               </au>
               <au>
                  <snm>Stackebrandt</snm>
                  <fnm>E</fnm>
               </au>
               <etal/>
            </aug>
            <source>Int J Syst Bacteriol</source>
            <pubdate>1987</pubdate>
            <volume>37</volume>
            <fpage>463</fpage>
            <lpage>464</lpage>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The Ribosomal Database Project.</p>
            </title>
            <aug>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Larsen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Marsh</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>McCaughey</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Maciukenas</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Kuan</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Macke</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Xing</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Woese</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1992</pubdate>
            <volume>Suppl 20</volume>
            <fpage>199</fpage>
            <lpage>200</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1598241</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Ciliate evolution: the ribosomal phylogenies of the tetrahymenine ciliates.</p>
            </title>
            <aug>
               <au>
                  <snm>Preparata</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>EB</fnm>
               </au>
               <au>
                  <snm>Preparata</snm>
                  <fnm>FP</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Vossbrinck</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Nanney</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1989</pubdate>
            <volume>28</volume>
            <fpage>427</fpage>
            <lpage>441</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2501504</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>RISSC: a novel database for ribosomal 16S-23S RNA genes spacer regions.</p>
            </title>
            <aug>
               <au>
                  <snm>Martinez</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Bescos</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Sala</snm>
                  <fnm>JJR</fnm>
               </au>
               <au>
                  <snm>Valera</snm>
                  <fnm>FR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>178</fpage>
            <lpage>180</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29764</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125084</pubid>
                  <pubid idtype="doi">10.1093/nar/29.1.178</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>The RDP-II (Ribosomal Database Project).</p>
            </title>
            <aug>
               <au>
                  <snm>Maidak</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Cole</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Lilburn</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>CT</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Saxman</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Farris</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Garrity</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Tiedje</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>173</fpage>
            <lpage>174</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29785</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125082</pubid>
                  <pubid idtype="doi">10.1093/nar/29.1.173</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>PCR amplification and direct sequencing of <it>gyrB</it> genes with universal primers and their application to the detection and taxonomic analysis of <it>Pseudomonas putida</it> strains.</p>
            </title>
            <aug>
               <au>
                  <snm>Yamamoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Harayama</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>1995</pubdate>
            <volume>61</volume>
            <fpage>1104</fpage>
            <lpage>1109</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">167365</pubid>
                  <pubid idtype="pmpid" link="fulltext">7793912</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms.</p>
            </title>
            <aug>
               <au>
                  <snm>Maiden</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Bygraves</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Feil</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Morelli</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Russell</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Urwin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zurth</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Caugant</snm>
                  <fnm>DA</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>3140</fpage>
            <lpage>3145</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">19708</pubid>
                  <pubid idtype="pmpid" link="fulltext">9501229</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.6.3140</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Restriction fragment length polymorphisms.</p>
            </title>
            <aug>
               <au>
                  <snm>Tsipouras</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Methods Enzymol</source>
            <pubdate>1987</pubdate>
            <volume>145</volume>
            <fpage>205</fpage>
            <lpage>213</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2885721</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>AFLP: a new technique for DNA fingerprinting.</p>
            </title>
            <aug>
               <au>
                  <snm>Vos</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hogers</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bleeker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Reijans</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>van de Lee</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hornes</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fri-jters</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pot</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Peleman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kuiper</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zabeau</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1995</pubdate>
            <volume>23</volume>
            <fpage>4407</fpage>
            <lpage>4414</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7501463</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Octamer-based genome scanning distinguishes a unique subpopulation of <it>Escherichia coli</it> O157:H7 strains in cattle.</p>
            </title>
            <aug>
               <au>
                  <snm>Kim</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nietfeldt</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Benson</snm>
                  <fnm>AK</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>13288</fpage>
            <lpage>13293</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">23940</pubid>
                  <pubid idtype="pmpid" link="fulltext">10557313</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.23.13288</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>DNA polymorphisms amplified by arbitrary primers are useful as genetic markers.</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Kubelik</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Livak</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Rafalski</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Tingey</snm>
                  <fnm>SV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1990</pubdate>
            <volume>18</volume>
            <fpage>6531</fpage>
            <lpage>6535</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1979162</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Genome profiling: a realistic solution for genotype-based identification of species.</p>
            </title>
            <aug>
               <au>
                  <snm>Nishigaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Naimuddin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hamano</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>J Biochem</source>
            <pubdate>2000</pubdate>
            <volume>128</volume>
            <fpage>107</fpage>
            <lpage>112</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10876164</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>DNA profiling: an approach of systematic characterization, classification, and comparison of genomic DNAs.</p>
            </title>
            <aug>
               <au>
                  <snm>Nishigaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Amano</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Takasawa</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Chem Lett</source>
            <pubdate>1991</pubdate>
            <volume>1991</volume>
            <fpage>1097</fpage>
            <lpage>1100</lpage>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Species-identification dots: a potent tool for developing genome microbiology.</p>
            </title>
            <aug>
               <au>
                  <snm>Naimuddin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kurazono</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yamaguchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nishigaki</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2000</pubdate>
            <volume>261</volume>
            <fpage>243</fpage>
            <lpage>250</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(00)00502-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">11167011</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Hundredfold productivity of genome analysis by introduction of microtemperature-gradient gel electrophoresis.</p>
            </title>
            <aug>
               <au>
                  <snm>Biyani</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nishigaki</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Electrophoresis</source>
            <pubdate>2001</pubdate>
            <volume>22</volume>
            <fpage>23</fpage>
            <lpage>28</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/1522-2683(200101)22:1&lt;23::AID-ELPS23>3.0.CO;2-Z</pubid>
                  <pubid idtype="pmpid" link="fulltext">11197172 </pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Descent of mammalian alpha globin chain sequences investigated by the maximum parsimony method.</p>
            </title>
            <aug>
               <au>
                  <snm>Barnabas</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Goodman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>GW</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1972</pubdate>
            <volume>69</volume>
            <fpage>249</fpage>
            <lpage>278</lpage>
            <xrefbib>
               <pubid idtype="pmpid">4627161</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Genetic distance and electrophoretic identity of proteins between taxa.</p>
            </title>
            <aug>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chakraborty</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1973</pubdate>
            <volume>2</volume>
            <fpage>323</fpage>
            <lpage>328</lpage>
            <xrefbib>
               <pubid idtype="pmpid">4807198</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Accuracy of estimated phylogenetic trees from molecular data. I. Distantly related species.</p>
            </title>
            <aug>
               <au>
                  <snm>Tateno</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tajima</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1982</pubdate>
            <volume>18</volume>
            <fpage>387</fpage>
            <lpage>404</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7175956</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Sensitive methods for determining the relatedness of proteins with limited sequence homology.</p>
            </title>
            <aug>
               <au>
                  <snm>Argos</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Curr Opin Biotechnol</source>
            <pubdate>1994</pubdate>
            <volume>5</volume>
            <fpage>361</fpage>
            <lpage>371</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7765168</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>On-Web GP</p>
            </title>
            <url>http://gp.fms.saitama-u.ac.jp</url>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Structural analysis of nucleic acids by precise denaturing gradient gel electrophoresis: I. Methodology.</p>
            </title>
            <aug>
               <au>
                  <snm>Nishigaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tsubota</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Miura</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Chonan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Husimi</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>J Biochem</source>
            <pubdate>1992</pubdate>
            <volume>111</volume>
            <fpage>144</fpage>
            <lpage>150</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1569038</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Temperature gradient gel electrophoresis (TGGE) for the detection of polymorphic DNA and RNA.</p>
            </title>
            <aug>
               <au>
                  <snm>Henco</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Harders</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wiese</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Riesner</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Methods Mol Biol</source>
            <pubdate>1994</pubdate>
            <volume>31</volume>
            <fpage>211</fpage>
            <lpage>228</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1385/0-89603-258-2:211</pubid>
                  <pubid idtype="pmpid">7522829</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Detecting single base substitutions, mismatches and bulges in DNA by temperature gradient gel electrophoresis and related methods.</p>
            </title>
            <aug>
               <au>
                  <snm>Wartell</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Hosseini</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Powell</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Chromatogr A</source>
            <pubdate>1998</pubdate>
            <volume>806</volume>
            <fpage>169</fpage>
            <lpage>185</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0021-9673(98)00149-6</pubid>
                  <pubid idtype="pmpid">9639888</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Isolation of higher molecular weight DNA from <it>Hemophilus influenzae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Berns</snm>
                  <fnm>KI</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>CA</fnm>
                  <suf>Jr</suf>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1965</pubdate>
            <volume>11</volume>
            <fpage>476</fpage>
            <lpage>490</lpage>
            <xrefbib>
               <pubid idtype="pmpid">14267270</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Genome profiling- establishment and practical evaluation of its methodology.</p>
            </title>
            <aug>
               <au>
                  <snm>Hamano</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Takasawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kurazono</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Okuyama</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nishigaki</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Nikkashi</source>
            <pubdate>1996</pubdate>
            <volume>1996</volume>
            <fpage>54</fpage>
            <lpage>61</lpage>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Fine structure in the thermal denaturation of DNA: high temperature-resolution spectrophotometric studies.</p>
            </title>
            <aug>
               <au>
                  <snm>Wada</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yabuki</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Husimi</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>CRC Crit Rev Biochem</source>
            <pubdate>1980</pubdate>
            <volume>9</volume>
            <fpage>87</fpage>
            <lpage>144</lpage>
            <xrefbib>
               <pubid idtype="pmpid">6777116</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
