<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-9-121</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>SciDBMaker: new software for computer-aided design of specialized biological databases</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Hammami</snm>
               <fnm>Riadh</fnm>
               <insr iid="I1"/>
               <email>Riadh.hammami@fst.rnu.tn</email>
            </au>
            <au id="A2">
               <snm>Zouhir</snm>
               <fnm>Abdelmajid</fnm>
               <insr iid="I1"/>
               <email>azouheirb10@yahoo.fr</email>
            </au>
            <au id="A3">
               <snm>Naghmouchi</snm>
               <fnm>Karim</fnm>
               <insr iid="I2"/>
               <email>naghmouchik@agr.gc.ca</email>
            </au>
            <au id="A4">
               <snm>Ben Hamida</snm>
               <fnm>Jeannette</fnm>
               <insr iid="I1"/>
               <email>benhamida_jeannette@yahoo.fr</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Fliss</snm>
               <fnm>Ismail</fnm>
               <insr iid="I3"/>
               <email>ismail.fliss@aln.ulaval.ca</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Unit&#233; de Prot&#233;omie Fonctionnelle &amp; Biopr&#233;servation Alimentaire, Institut Sup&#233;rieur des Sciences Biologiques Appliqu&#233;es de Tunis, Universit&#233; El Manar, Tunisie</p>
            </ins>
            <ins id="I2">
               <p>Agriculture and Agri-Food Canada, Lethbridge Research Centre, Lethbridge, Alberta, T1J 4B1 Canada</p>
            </ins>
            <ins id="I3">
               <p>Institut des Nutraceutiques et des Aliments Fonctionnels (INAF), Universit&#233; Laval, Qu&#233;bec, Canada</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>121</fpage>
         <url>http://www.biomedcentral.com/1471-2105/9/121</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18298861</pubid>
               <pubid idtype="doi">10.1186/1471-2105-9-121</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>06</day>
               <month>12</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>25</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>25</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Hammami et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The exponential growth of research in molecular biology has brought concomitant proliferation of databases for stocking its findings. A variety of protein sequence databases exist. While all of these strive for completeness, the range of user interests is often beyond their scope. Large databases covering a broad range of domains tend to offer less detailed information than smaller, more specialized resources, often creating a need to combine data from many sources in order to obtain a complete picture. Scientific researchers are continually developing new specific databases to enhance their understanding of biological processes.</p>
            </sec>
            <sec>
               <st>
                  <p>Description</p>
               </st>
               <p>In this article, we present the implementation of a new tool for protein data analysis. With its easy-to-use user interface, this software provides the opportunity to build more specialized protein databases from a universal protein sequence database such as Swiss-Prot. A family of proteins known as bacteriocins is analyzed as 'proof of concept'.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>SciDBMaker is stand-alone software that allows the extraction of protein data from the Swiss-Prot database, sequence analysis comprising physicochemical profile calculations, homologous sequences search, multiple sequence alignments and the building of new and more specialized databases. It compiles information with relative ease, updates and compares various data relevant to a given protein family and could solve the problem of dispersed biological search results.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The exponential growth of molecular biology research in recent decades has brought concomitant growth in the number and size of databases used to interpret experimental findings. For example, UniProtKB/Swiss-Prot release 53.2, dated 26-06-07, contains 272,212 sequence entries comprising 99,940,143 amino acids, abstracted from 157,086 references <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. A variety of protein sequence databases exist, ranging from simple sequence repositories to expertly curated universal databases that cover all species and in which the original sequence data are enhanced by manual addition of further information in each sequence record <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. While all of these strive for completeness, the range of user interests is often beyond their scope. This may reflect the user's wish to combine different types of information or the inability of a single resource to contain the complete details of every relevant experiment. In addition, large databases with broad domains tend to offer less detailed information than smaller, more specialized, resources, with the result that data from many resources may need to be combined to provide a complete picture. There is a clear need to gather, filter and critically evaluate this mass of information so that it can be used with greater efficiency. Since scientists are continually developing new specific databases to enhance their understanding of biological processes, we created SciDBMaker to provide a tool for easy building of new specialized protein knowledge bases. This paper describes the development of new stand-alone software, <b><it>Sci</it></b><it>entific </it><b><it>D</it></b><it>ata</it><b><it>B</it></b><it>ase </it><b><it>M</it></b><it>aker</it>, for protein data analysis with online and/or off-line access. The software interface allows successive steps for sequence manipulation, starting from user sequence search and homologous sequence retrieval from the SwissProt databank, followed by physicochemical profile calculations, multiple sequence alignments, phylogenic tree visualization and culminating in database export/building. All steps are performed in an interactive manner. Physical and chemical parameters, rarely found in public databases, provide a helpful tool for the analysis of a set of proteins and their calculation is achieved in a direct and interactive manner, with off-line access. SciDBMaker also processes a great number of sequences simultaneously.</p>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <sec>
            <st>
               <p>Swiss-Prot format</p>
            </st>
            <p>The Swiss-Prot format has been described previously in reference <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Physicochemical profiles</p>
            </st>
            <p>Protein families may be analyzed with the help of physicochemical profiles such as amino acid composition (acidic, basic, hydrophobic, polar, absent and common amino acids), atomic composition, molecular weight <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, theoretical pI <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>, extinction coefficient <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, absorbance at 280 nm, estimated half-life in mammalian cells, yeast and <it>E. coli </it><abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>, instability index <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, aliphatic index <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, grand average of hydropathicity (GRAVY) <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> and protein-binding potential (Boman index) <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Integrated tools</p>
            </st>
            <p>The European Bioinformatics Institute provides the Dbfetch tool for easy retrieval of entries from various databases <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Entries may be imported online into SciDBMaker from the SwissProt database using Dbfetch. To find similar sequences, the containers can be queried with either proteins from the SwissProt database or user-imported sequences, using the BLAST algorithm <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Multiple sequence alignments (MSA) are an essential tool for predicting protein structure and function prediction, phylogenic inference and other common tasks in sequence analysis. To date, CLUSTALW is still the most popular alignment tool. Since it is the method of choice for biologists, CLUSTALW <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> was included in SciDBMaker for multiple sequence alignments. Generated trees may be easily viewed using phylogenic tree visualization software such as TREEVIEW <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Hardware and software specifications</p>
            </st>
            <p>The executable version of the SciDBMaker software can be installed and run on a standard PC platform with a Windows operating system. The software development was done using Windows XP and tested with success on all platforms, including Win 98, Win XP and Win vista. The source code was written in Microsoft Visual Basic .NET (2005). The environment is based upon the .NET Framework library v2.0.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results &amp; discussion</p>
         </st>
         <sec>
            <st>
               <p>Program description</p>
            </st>
            <p>The workflow diagram shown in Figure <figr fid="F1">1</figr> and the following discussion illustrate how the tool works. Figure <figr fid="F2">2</figr> illustrates a typical user interface of the program. Users may open files in Fasta or Swiss-Prot format, or import sequence entries from the Swiss-Prot database. Users may also use their own sequences, search for homologous sequences entries in Swiss-Prot database using BLAST algorithm and load selected entries into SciDBMaker (Fig. <figr fid="F3">3</figr>). The program will automatically extract available information in Swiss-Prot entries and calculate physicochemical profiles for loaded proteins. Users may also choose the information to be extracted and the properties to be calculated, as shown in Figure <figr fid="F4">4</figr>. The interface allows users to filter, search, add, remove and update data rows as required. An intuitive interface allows BLAST selection of all user sequences. Similarly, sequences may be aligned using the multiple alignment program ClustalW. Resulting trees may be shown using the phylogenic tree visualization software TREEVIEW, as proposed by SciDBMaker. As a final step, data may be printed or saved in various file formats. Sequences may be extracted into a Fasta format file. The resulting data grid may be saved as an MS Excel data sheet, as well as database files (XML, MS Access, MySQL).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Workflow diagram</p>
               </caption>
               <text>
                  <p>
                     <b>Workflow diagram.</b>
                  </p>
               </text>
               <graphic file="1471-2105-9-121-1"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>User interface</p>
               </caption>
               <text>
                  <p>
                     <b>User interface.</b>
                  </p>
               </text>
               <graphic file="1471-2105-9-121-2"/>
            </fig>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>A typical Blast result window</p>
               </caption>
               <text>
                  <p>
                     <b>A typical Blast result window.</b>
                  </p>
               </text>
               <graphic file="1471-2105-9-121-3"/>
            </fig>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Preference display columns</p>
               </caption>
               <text>
                  <p>
                     <b>Preference display columns.</b>
                  </p>
               </text>
               <graphic file="1471-2105-9-121-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Program runs</p>
            </st>
            <p>A database was developed using SciDBMaker. Named BACTIBASE, this database has been previously described in reference <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The stand-alone software SciDBMaker allows the extraction of protein data from the Swiss-Prot database, sequence analysis comprising physicochemical profile calculations, homologous sequence searches, multiple sequence alignments and the building of new and more specialized databases. Programs of this type compile information with relative ease, update and compare various data relevant to a given protein family and could solve the problem of dispersed biological search results. Collection of a multitude of information regarding a given protein family allows the development of more coherent and focused approaches to structure-function relationships, thereby enhancing the development of theoretical concepts in biological sciences.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>The program runs on the PC platform with a Windows operating system. The graphical environment needs the .NET Framework library v2.0. This complement is available for free download at the Microsoft website and comes pre-installed in the majority of recent computers. An installation package for SciDBMaker may be obtained from the authors free of charge upon request. The SciDBMaker website is hosted by the Centre de Calcul El Khawarizmi CCK (Tunisia) and is available at <url>http://scidbmaker.pfba-lab.org</url>. The SciDBMaker software is provided 'as is' with no guarantee or warranty of any kind and is available for all non-commercial use. Any other use of the software requires special permission from the primary author.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>RH programmed the software interface, performed the implementation of physicochemical parameters and drafted the manuscript. AZ participated in the design of the study, interacted with RH to carry out the physicochemical data calculation and tested the program. KN tested the program and contributed to the manuscript. JBH oversaw the project and helped define user requirements. IF conceived the study, participated in its design and coordination and helped to draft the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors thank Dr. Stephen Davids for proofreading the manuscript. This research was supported by Ministry of Higher Education, Scientific Research and Technology, Republic of Tunisia.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The Universal Protein Resource (UniProt)</p>
            </title>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D193</fpage>
            <lpage>D197</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1669721</pubid>
                  <pubid idtype="pmpid" link="fulltext">17142230</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl929</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Protein sequence databases</p>
            </title>
            <aug>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>HC</fnm>
               </au>
            </aug>
            <source>Curr Opin Chem Biol</source>
            <pubdate>2004</pubdate>
            <volume>8</volume>
            <fpage>76</fpage>
            <lpage>80</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cbpa.2003.12.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">15036160</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003</p>
            </title>
            <aug>
               <au>
                  <snm>Boeckmann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Blatter</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Estreicher</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gasteiger</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Michoud</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>O'Donovan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Phan</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pilbout</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>365</fpage>
            <lpage>370</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165542</pubid>
                  <pubid idtype="pmpid" link="fulltext">12520024</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg095</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Polypeptide amino acid composition and isoelectric point</p>
            </title>
            <aug>
               <au>
                  <snm>Patrickios</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Yamasaki</snm>
                  <fnm>EN</fnm>
               </au>
            </aug>
            <source>Anal Biochem</source>
            <pubdate>1995</pubdate>
            <volume>231</volume>
            <fpage>82</fpage>
            <lpage>91</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/abio.1995.1506</pubid>
                  <pubid idtype="pmpid" link="fulltext">8678324</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Reference points for comparisons of two-dimensional maps of proteins from different human cell types defined in a pH scale where isoelectric points correlate with polypeptide compositions</p>
            </title>
            <aug>
               <au>
                  <snm>Bjellqvist</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Basse</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Celis</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>Electrophoresis</source>
            <pubdate>1994</pubdate>
            <volume>15</volume>
            <fpage>529</fpage>
            <lpage>539</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/elps.1150150171</pubid>
                  <pubid idtype="pmpid">8055880</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Statistical determination of the average values of the extinction coefficients of tryptophan and tyrosine in native proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Henryk</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Russell</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Randolph</snm>
                  <fnm>VL</fnm>
               </au>
            </aug>
            <source>Anal Biochem</source>
            <pubdate>1992</pubdate>
            <volume>200</volume>
            <fpage>74</fpage>
            <lpage>80</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0003-2697(92)90279-G</pubid>
                  <pubid idtype="pmpid" link="fulltext">1595904</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>In vivo half-life of a protein is a function of its amino-terminal residue</p>
            </title>
            <aug>
               <au>
                  <snm>Bachmair</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Finley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Varshavsky</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1986</pubdate>
            <volume>234</volume>
            <fpage>179</fpage>
            <lpage>186</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.3018930</pubid>
                  <pubid idtype="pmpid" link="fulltext">3018930</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Universality and structure of the N-end rule</p>
            </title>
            <aug>
               <au>
                  <snm>Gonda</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Bachmair</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wunning</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Tobias</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Lane</snm>
                  <fnm>WS</fnm>
               </au>
               <au>
                  <snm>Varshavsky</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1989</pubdate>
            <volume>264</volume>
            <fpage>16700</fpage>
            <lpage>16712</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2506181</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Correlation between stability of a protein and its dipeptide composition: a novel approach for predicting in vivo stability of a protein from its primary sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Guruprasad</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Reddy</snm>
                  <fnm>BVB</fnm>
               </au>
               <au>
                  <snm>Pandit</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Protein Eng</source>
            <pubdate>1990</pubdate>
            <volume>4</volume>
            <fpage>155</fpage>
            <lpage>161</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/protein/4.2.155</pubid>
                  <pubid idtype="pmpid" link="fulltext">2075190</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Thermostability and aliphatic index of globular proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Ikai</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>J Biochem</source>
            <pubdate>1980</pubdate>
            <volume>88</volume>
            <fpage>1895</fpage>
            <lpage>1898</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7462208</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A simple method for displaying the hydropathic character of a protein</p>
            </title>
            <aug>
               <au>
                  <snm>Jack</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Russell</snm>
                  <fnm>FD</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1982</pubdate>
            <volume>157</volume>
            <fpage>105</fpage>
            <lpage>132</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0022-2836(82)90515-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">7108955</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Comparing the polarities of amino acids: side-chain distribution coefficients between vapor phase, cyclohexane, 1-octanol and neutral aqueous solution</p>
            </title>
            <aug>
               <au>
                  <snm>Radzeka</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wolfenden</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>1988</pubdate>
            <volume>27</volume>
            <fpage>1664</fpage>
            <lpage>1670</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/bi00405a042</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Web Services at the European Bioinformatics Institute</p>
            </title>
            <aug>
               <au>
                  <snm>Labarga</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Valentin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Andersson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <issue>35 Web Server</issue>
            <fpage>W6</fpage>
            <lpage>11</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1933145</pubid>
                  <pubid idtype="pmpid" link="fulltext">17576686</pubid>
                  <pubid idtype="doi">10.1093/nar/gkm291</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>ClustalW and ClustalX version 2.0</p>
            </title>
            <aug>
               <au>
                  <snm>Larkin</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Blackshields</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>NP</fnm>
               </au>
               <au>
                  <snm>Chenna</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>McGettigan</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>McWilliam</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Valentin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Wallace</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>Wilm</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <issue>21</issue>
            <fpage>2947</fpage>
            <lpage>2948</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm404</pubid>
                  <pubid idtype="pmpid" link="fulltext">17846036</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>TREEVIEW: An application to display phylogenetic trees on personal computers</p>
            </title>
            <aug>
               <au>
                  <snm>Page</snm>
                  <fnm>RDM</fnm>
               </au>
            </aug>
            <source>CABIOS</source>
            <pubdate>1996</pubdate>
            <volume>12</volume>
            <fpage>357</fpage>
            <lpage>358</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8902363</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>BACTIBASE: a new web-accessible database for bacteriocin characterization</p>
            </title>
            <aug>
               <au>
                  <snm>Hammami</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Zouhir</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ben Hamida</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fliss</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>BMC Microbiol</source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <fpage>89</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2211298</pubid>
                  <pubid idtype="pmpid" link="fulltext">17941971</pubid>
                  <pubid idtype="doi">10.1186/1471-2180-7-89</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
