<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2229-8-45</ui>
   <ji>1471-2229</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Unexpected complexity of the Aquaporin gene family in the moss <it>Physcomitrella patens</it></p>
         </title>
         <aug>
            <au id="A1">
               <snm>Danielson</snm>
               <mi>&#197;H</mi>
               <fnm>Jonas</fnm>
               <insr iid="I1"/>
               <email>jonas.danielson@biochemistry.lu.se</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Johanson</snm>
               <fnm>Urban</fnm>
               <insr iid="I1"/>
               <email>urban.johanson@biochemistry.lu.se</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Biochemistry, Center for Molecular Protein Science, Center for Chemistry and Chemical Engineering, Lund University, PO Box 124, S-221 00 Lund, Sweden</p>
            </ins>
         </insg>
         <source>BMC Plant Biology</source>
         <issn>1471-2229</issn>
         <pubdate>2008</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>45</fpage>
         <url>http://www.biomedcentral.com/1471-2229/8/45</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18430224</pubid>
               <pubid idtype="doi">10.1186/1471-2229-8-45</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>20</day>
               <month>12</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>22</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>22</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Danielson and Johanson; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Aquaporins, also called major intrinsic proteins (MIPs), constitute an ancient superfamily of channel proteins that facilitate the transport of water and small solutes across cell membranes. MIPs are found in almost all living organisms and are particularly abundant in plants where they form a divergent group of proteins able to transport a wide selection of substrates.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Analyses of the whole genome of <it>Physcomitrella patens </it>resulted in the identification of 23 MIPs, belonging to seven different subfamilies, of which only five have been previously described. Of the newly discovered subfamilies one was only identified in <it>P. patens </it>(Hybrid Intrinsic Protein, HIP) whereas the other was found to be present in a wide variety of dicotyledonous plants and forms a major previously unrecognized MIP subfamily (X Intrinsic Proteins, XIPs). Surprisingly also some specific groups within subfamilies present in <it>Arabidopsis thaliana </it>and <it>Zea mays </it>could be identified in <it>P. patens</it>.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our results suggest an early diversification of MIPs resulting in a large number of subfamilies already in primitive terrestrial plants. During the evolution of higher plants some of these subfamilies were subsequently lost while the remaining subfamilies expanded and in some cases diversified, resulting in the formation of more specialized groups within these subfamilies.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Water transport across cell membranes is essential for life and in order to facilitate the transport of water and other small polar molecules across hydrophobic membranes, living organisms have evolved a wide array of membrane integral protein channels. These proteins, termed major intrinsic proteins (MIPs), form a large and evolutionarily conserved superfamily of channel proteins, found in all types of organisms, including eubacteria, archaea, fungi, animals and plants <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. MIPs are present in many different tissues in mammals and are likely to be of major importance for many different diseases [reviewed in <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>], either directly or indirectly through their involvement in transport and water balance regulation. This general physiological involvement of MIPs has stimulated a growing interest in the molecular mechanisms responsible for regulation and substrate specificity. In plants the functions of MIPs are more complex and their physiological roles are not as clear [reviewed in <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>]. However, the mere number of different MIPs in plants implies their importance, and it is likely that some isoforms play key roles in events such as rapid cell elongation and drought adaptation through their involvement in water transport regulation <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. In order to fully understand whole plant water relations and the transport of other small polar molecules at a molecular level it is necessary to identify the complete set of MIPs along with their substrate specificities and expression patterns.</p>
         <p>A comprehensive phylogenetic study of MIPs <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> supports the classification of two main evolutionary groups. Aquaporins (AQPs) originally thought to specifically transport water, and glycerol-uptake facilitators or aquaglyceroporins (GLPs) facilitating the transport of a variety of small neutral molecules. Although the MIPs form passive channels, the permeability of the membrane is regulated by controlling the amount of different MIPs and also in some cases by phosphorylation/dephosphorylation of the channels. Structures from x-ray and electron crystallography of MIPs <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp> show a tetrameric quaternary structure in which each monomer consists of six membrane spanning helices (H1 to H6) connected by five loops (A-E). Loop B (cytoplasmic) and loop E (extracellular) form two half-membrane spanning helices (HB and HE) and interact with each other from opposing sides through two highly conserved aspargine-proline-alanine (NPA) boxes, forming a narrow region of the pore. A constriction region about 8 &#197; from the NPA boxes toward the periplasmic side, termed the aromatic/arginine (ar/R) region, is formed by two residues from H2 and H5 and two residues from loop E. This region forms a primary selection filter and is a major checkpoint for solute permeability [<abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, and references therein].</p>
         <p>Plant MIPs form a large and divergent superfamily of proteins with more than thirty identified members encoded in each of the genomes of <it>Arabidopsis thaliana </it><abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>, <it>Zea mays </it><abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and <it>Oryza sativa </it><abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. These large numbers of MIPs likely reflect a wide diversity in substrate specificity, localisation, transcriptional and posttranslational regulation. Based on sequence similarity plant MIPs have been divided into five subfamilies; the plasma membrane intrinsic proteins (PIPs), the tonoplast intrinsic proteins (TIPs), the nodulin-26 like intrinsic proteins (NIPs), the small basic intrinsic proteins (SIPs) and the GlpF-like intrinsic protein (GIPs) <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B16">16</abbr><abbr bid="B20">20</abbr></abbrgrp>. The GIPs have so far only been identified in <it>Physcomitrella patens </it>and another closely related moss <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Each of the other subfamilies can be further divided into groups based on sequence similarity <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Even though all MIPs in higher plants phylogenetically belong to the AQP clade of MIPs <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> they are not all highly specific for water. Several studies have shown plant MIPs to be permeable also to other molecules, for example TIPs have been reported to facilitate urea and ammonia transport <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>; NIPs to transport glycerol <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, ammonia <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, lactic acid <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, boron <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> and silicon <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>; PIPs have been postulated to be able to facilitate CO<sub>2 </sub>diffusion <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp> and for the SIPs water transport has only been reported for the SIP1 subgroup <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. The difference in transport specificity is likely due to major differences in the ar/R filter of plant MIPs, as has been suggested for MIPs in <it>A. thaliana, Z. mays </it>and <it>O. sativa </it><abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>.</p>
         <p><it>P. patens </it>is a moss (bryophyte) and as such diverged from the lineage leading to higher plants approximately 443&#8211;490 million years ago, before the evolution of vascular plants <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. This makes <it>P. patens </it>a valuable source of information in evolutionary comparisons with higher plants and any common features found can be expected to be present in most terrestrial plants. In addition <it>P. patens </it>has properties that make it an attractive plant model for future functional studies, above all the possibility of homologous recombination [information about the use of <it>P. patens </it>can be found in two excellent reviews by David Cove <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>]. An assembled genome of <it>P. patens </it>(<it>circa </it>480 Mbp), based on 8.1 times coverage, has recently been released by the Joint Genome Institute <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp> and has made it possible to extend the analysis of gene family evolution back to basal land plant lineages. Such an analysis has previously been described for the expansin superfamily of proteins <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> and we now present a similar analysis of the MIP superfamily. In agreement with the expansin study, we also hypothesised that <it>P. patens </it>were to have a simpler superfamily structure due to less need of cell-specific expression, a hypothesis that was partially proven wrong by the data collected for <it>P. patens</it>. In our analysis we did not only identify the five previously defined subfamilies (PIP, TIP, NIP, SIP and GIP) but also found two previously uncategorised MIP subfamilies; the hybrid intrinsic proteins (HIPs) and the uncategorized X intrinsic proteins (XIPs), a subfamily which we found also to be present in many other plant species. This data implies that MIP subfamilies evolved early on in plants and that the existence of diverse subfamilies reflects differences in subcellular localisation, substrate specificity, transcriptional and/or posttranslational regulation already of importance in primitive plants, whereas the specificity needed only in higher plants (e.g. cell specific expression in vascular tissue and seeds) is covered by the MIP groups that evolved later within the subfamilies present in higher plants.</p>
         <p>In this study we try to address plant MIP function from an evolutionary perspective by comparing the whole set of MIPs in a primitive land plant (the moss <it>P. patens</it>) with those of two higher plants (<it>A. thaliana </it>and <it>Z. mays</it>). By annotating the whole MIP superfamily in <it>P. patens </it>we also lay the foundation for future functional studies in a plant system allowing homologous recombination and all advantages of this, such as knocking out/replacing endogenous genes.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Identification of <it>Physcomitrella patens </it>MIPs</p>
            </st>
            <p>The recent sequencing of the moss <it>P. patens </it>genome <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp> has for the first time made it possible to identify all MIP genes in a more primitive plant and hence to make conclusions on the molecular evolution of the MIP superfamily of proteins. Searches of the <it>Physcomitrella patens ssp patens v1.1 </it>database (PpDB) at JGI, using the 35 protein sequences of the complete set of <it>A. thaliana </it>MIPs (AtMIPs) <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, resulted in identification of 23 different genes encoding <it>P. patens </it>MIPs (PpMIPs) (Table <tblr tid="T1">1</tblr>). Two genes were identical at nucleotide level and therefore only one protein sequence (PpPIP2;4), representing both genes, was included in further analyses. PpGIP1;1, a <it>P. patens </it>MIP previously described in detail by Gustavsson et al <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> was also included in the PpMIP set which were then reaching a total of 23 full length MIPs. Four genes encoding partial MIP-like sequences were also identified. Of these, three were either partial or contained premature stop codons and therefore considered to be non-functional pseudogenes (pseudoPIP#1, pseudoPIP#2 and pseudoNIP#1). The fourth sequence might represent a functional MIP encoding gene, but was situated in a short contig interrupted by a large sequencing gap after the identified exon and could therefore not be included in the analysis (referred to as partialNIP#1). The JGI gene models were manually inspected and considered correct for most PpMIP genes. However, for some genes a different annotation of the coding sequence in the genomic sequence was favoured either by cDNA sequences or due to a better conservation of subfamily specific sequences and gene structure. These alternative assignations of exons, specified in Table <tblr tid="T1">1</tblr>, were used in all translations and analyses in this paper.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Proposed systematic names for all <it>Physcomitrella patens </it>MIPs</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="center">
                        <p>
                           <b>New name</b>
                           <sup>a</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Borstlap</b>
                           <sup>b</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>PpDB</b>
                           <sup>c</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>EST</b>
                           <sup>d</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>ProteinID</b>
                           <sup>e</sup>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Comments</b>
                           <sup>f</sup>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PIP1;1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>PIP1</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>62169</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PIP1;2</p>
                     </c>
                     <c ca="center">
                        <p>PIP1</p>
                     </c>
                     <c ca="center">
                        <p>PIP</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>166091</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PIP1;3</p>
                     </c>
                     <c ca="center">
                        <p>PIP1</p>
                     </c>
                     <c ca="center">
                        <p>PIP</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>171662</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PIP2;1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>PIP</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>202226</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PIP2;2</p>
                     </c>
                     <c ca="center">
                        <p>PIP2</p>
                     </c>
                     <c ca="center">
                        <p>PIP</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>209703</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PIP2;3</p>
                     </c>
                     <c ca="center">
                        <p>PIP2</p>
                     </c>
                     <c ca="center">
                        <p>PIP</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>196472</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PIP2;4</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>PIP</p>
                     </c>
                     <c ca="center">
                        <p>?</p>
                     </c>
                     <c ca="center">
                        <p>135286</p>
                     </c>
                     <c ca="left">
                        <p>Identical to 83986<sup>g</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>PIP</p>
                     </c>
                     <c ca="center">
                        <p>?</p>
                     </c>
                     <c ca="center">
                        <p>83986</p>
                     </c>
                     <c ca="left">
                        <p>Identical to 135286<sup>g</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PIP3;1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>PIP2</p>
                     </c>
                     <c ca="center">
                        <p>?</p>
                     </c>
                     <c ca="center">
                        <p>68172</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PseudoPIP#1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-<sup>h</sup></p>
                     </c>
                     <c ca="center">
                        <p>?</p>
                     </c>
                     <c ca="center">
                        <p>113412</p>
                     </c>
                     <c ca="left">
                        <p>Pseudogene, PIP-like, based on ProteinID = 113412 but encoding 123 amino acids in two exons</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PseudoPIP#2</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>?</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Pseudogene, PIP-like, encoding 83 amino acids in one exon</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>TIP6;1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>TIP</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>73809</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>TIP6;2</p>
                     </c>
                     <c ca="center">
                        <p>TIP</p>
                     </c>
                     <c ca="center">
                        <p>TIP</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>191107</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>TIP6;3</p>
                     </c>
                     <c ca="center">
                        <p>TIP</p>
                     </c>
                     <c ca="center">
                        <p>-<sup>h</sup></p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>214518</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>TIP6;4</p>
                     </c>
                     <c ca="center">
                        <p>TIP</p>
                     </c>
                     <c ca="center">
                        <p>TIP</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>219971</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NIP3;1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>NIP5</p>
                     </c>
                     <c ca="center">
                        <p>?</p>
                     </c>
                     <c ca="center">
                        <p>94322</p>
                     </c>
                     <c ca="left">
                        <p>The PpDB classification refers to ProteinID = 147365 which is a truncated version</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NIP5;1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>NIP4</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>115513</p>
                     </c>
                     <c ca="left">
                        <p>Misannotated: delete the first amino acid and add exon 1 (68 amino acids)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NIP5;2</p>
                     </c>
                     <c ca="center">
                        <p>NIP</p>
                     </c>
                     <c ca="center">
                        <p>NIP4</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>186237</p>
                     </c>
                     <c ca="left">
                        <p>Misannotated: delete first eleven amino acids and add exon 1 (68 amino acids)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NIP5;3</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>NIP4</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>179749</p>
                     </c>
                     <c ca="left">
                        <p>Misannotated: delete first seven amino acids and add exon 1 (66 amino acids)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NIP6;1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>NIP</p>
                     </c>
                     <c ca="center">
                        <p>?</p>
                     </c>
                     <c ca="center">
                        <p>16763</p>
                     </c>
                     <c ca="left">
                        <p>Misannotated: add exon 1 (65 amino acids) and extend last exon 24 amino acids</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PartialNIP#1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>Possibly an aquaporin<sup>i</sup></p>
                     </c>
                     <c ca="center">
                        <p>?</p>
                     </c>
                     <c ca="center">
                        <p>103774</p>
                     </c>
                     <c ca="left">
                        <p>Possibly a full length gene (NIP5) but the genomic sequence is only 825 bp long and interrupted by a 34 kb gap. The model which the classification refers to (ProteinID = 103774) is completely wrong, but in the opposite direction is an exon encoding 103 amino acids.</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PseudoNIP#1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>?</p>
                     </c>
                     <c ca="center">
                        <p>73549</p>
                     </c>
                     <c ca="left">
                        <p>Pseudogene, NIP-like, delete first 22 amino acids from model</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>SIP1;1</p>
                     </c>
                     <c ca="center">
                        <p>SIP</p>
                     </c>
                     <c ca="center">
                        <p>SIP</p>
                     </c>
                     <c ca="center">
                        <p>?</p>
                     </c>
                     <c ca="center">
                        <p>112053</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>SIP1;2</p>
                     </c>
                     <c ca="center">
                        <p>SIP</p>
                     </c>
                     <c ca="center">
                        <p>SIP</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>200882</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>GIP1;1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>PpGlP1-1</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>171260</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>HIP1;1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-<sup>h</sup></p>
                     </c>
                     <c ca="center">
                        <p>?</p>
                     </c>
                     <c ca="center">
                        <p>91611</p>
                     </c>
                     <c ca="left">
                        <p>Misannotated, we removed 141 aa from beginning of exon 1, 22 aa from end of exon 2 and 15 aa from beginning of exon 3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>XIP1;1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>TIP1</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>71087</p>
                     </c>
                     <c ca="left">
                        <p>The PpDB classification refers to ProteinID = 26452 which is a truncated version</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>XIP1;2</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>TIP</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>71489</p>
                     </c>
                     <c ca="left">
                        <p>Misannotated, removed 15 amino acids from exon 2 and replaced exon 1 (now 31 aa) The PpDB classification refers to ProteinID = 47381 which is a truncated version</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a </sup>Proposed new names for <it>P. patens </it>MIPs. <sup>b </sup>Classification used in Borstlap (2002). <sup>c </sup>Classification used to describe gene models by Shizong Ma in PpDB. <sup>d </sup>Matching ESTs in PpDB: Y = Yes, ? = Not found. <sup>e </sup>Protein ID number for the protein or related protein in PpDB. <sup>f </sup>Alternative exon/intron positions proposed and used in this paper and odd features of genes and/or proteins encoded. <sup>g </sup>both genes are in a region of 3023 bp of identical genomic sequence, the two genes were therefore treated as one in all analyzes. <sup>h </sup>Classified as belonging to one of the Aquaporin KOG groups (KOG0223 or KOG0224) but without further description in PpDB <sup>i </sup>the complete comment is "Possibly an aquaporin, similar to NIP1;2, with one signature peptide, "HFNPAVSV"".</p>
               </tblfn>
            </tbl>
            <p>When this study was initiated only 11 out of the 23 PpMIPs had been described in the literature <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B40">40</abbr></abbrgrp>. Since then one more of the 23 PpMIPs (PpPIP2;1) has been published <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. All 23 PpMIP sequences were categorized as belonging to an aquaporin euKaryotic Orthologous Groups (KOG) at the PpDB and most of these also had a suggested classification (Table <tblr tid="T1">1</tblr>). Based on the phylogeny of the PpMIPs together with the AtMIPs and <it>Z. mays </it>MIPs (ZmMIPs) a new and more systematic classification of the PpMIPs, that is consistent with the AtMIPs and ZmMIPs nomenclature <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B18">18</abbr></abbrgrp>, is proposed (Table <tblr tid="T1">1</tblr>).</p>
         </sec>
         <sec>
            <st>
               <p>Phylogeny and classification</p>
            </st>
            <p>Using the full length protein alignments of all PpMIPs, AtMIPs and ZmMIPs [see Additional file <supplr sid="S1">1</supplr>] the neighbour joining (NJ) method resulted in one tree (Fig. <figr fid="F1">1</figr>) which was compared to trees from the maximum parsimony (MP) method and the Bayesian (Bay) method. Bootstrap support and Bayesian posterior probabilities were used to construct a "method-consensus" cladogram summarizing the results of the three methods and used to classify the PpMIPs (Fig. <figr fid="F2">2</figr>). The classification of AtMIPs and ZmMIPs in subgroups within subfamilies is similar for all MIPs except the NIPs. We named the PpNIPs according to the nomenclature used in classification of the NIPs in <it>Z. mays </it>and <it>O. sativa </it>since these four wider subgroups allow more sequence divergence and hence are more generic than the more narrow seven subgroups defined in <it>A. thaliana</it>. <it>P. patens </it>subgroups that failed to group with the previously classified subfamily groups were given consecutive higher indices (e.g. PpPIP3, PpTIP6, PpNIP5 or PpNIP6). In total 3 PpPIP1s, 4 PpPIP2s, 1 PpPIP3, 4 PpTIP6s, 1 PpNIP3, 3 PpNIP5s, 1 PpNIP6 and 2 PpSIP1s were categorized. Four PpMIPs failed to be classified into a subfamily, since they lack orthologs among the MIPs identified in <it>A. thaliana </it>and <it>Z. mays</it>. One of these was the MIP xenolog (homolog resulting from horizontal gene transfer) PpGIP1;1 previously identified as a GlpF-like MIP and named accordingly <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. The remaining three were the PpHIP1;1 which shares similarities with both TIPs and PIPs but forms a separate distinct subfamily of its own, and the PpXIP1;1 and PpXIP1;2, two divergent MIPs that share some unique previously undescribed motifs.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Figure showing the alignment of PpMIPs, AtMIPs and ZmMIPs. Shading is indicating the degree of conservation of an amino acid at a position. The actual alignment is available as "ALIGN_001168" from the EMBL align database.</p>
               </text>
               <file name="1471-2229-8-45-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Evolutionary relationship of plant MIPs</p>
               </caption>
               <text>
                  <p><b>Evolutionary relationship of plant MIPs</b>. An unrooted neighbour-joining tree showing the phylogenetic comparison of the complete set of 23 different MIPs from <it>P. patens </it>(Pp) in bold and the 35 respectively 33 MIPs from <it>A. thaliana </it>(At) and <it>Z. mays </it>(Zm). The seven subfamilies found in <it>P. patens </it>are indicated with the same colours as in Fig. 6. Note that the XIP, HIP and GIP have not been found in <it>A. thaliana </it>or <it>Z. mays</it>. The bar indicates the mean distance of 0.1 changes per amino acid residue.</p>
               </text>
               <graphic file="1471-2229-8-45-1"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Cladogram used for categorization of PpMIPs</p>
               </caption>
               <text>
                  <p><b>Cladogram used for categorization of PpMIPs</b>. A "method consensus" cladogram, summarizing the overall robustness, as measured by bootstrapping for the neighbour joining (NJ) and maximum parsimony (MP) methods and posterior probabilities for the Bayesian (Bay) method. The tree was used for classification of the PpMIPs. The right panel shows an enlargement of the upper half of the tree. Note the low level of support (in italics) for the nodes basal to the PpHIP1;1 and the PpXIP-group, indicating the uncertainty of the placement of these groups. All nodes that have a support of less than 50 % for more than one method were collapsed. For visibility reasons, topology of clades with only <it>A. thaliana </it>and/or <it>Z. mays </it>MIPs are left out and replaced with triangles indicating the group. Support values for branches are presented as percentage, in the order NJ/Bay and underneath MP. A dash (-) indicates a support value of less than 50 %.</p>
               </text>
               <graphic file="1471-2229-8-45-2"/>
            </fig>
            <p>To find orthologs of the three uncategorized PpMIPs (PpHIP1;1, PpXIP1;1 and PpXIP1;2) searches of databases at NCBI and embl were conducted. Hits representing a wide variety of species were selected and the corresponding protein sequences were aligned with the PpPIPs, the PpTIPs and either PpHIP1;1 or PpXIP1;1 and PpXIP1;2. The alignments were used in phylogenetic analyses to evaluate if the newly acquired sequences could help in categorizing the three PpMIPs. The PpHIP1;1 hits were mainly annotated as TIPs or AQP4s in the databases and the phylogenetic analysis resulted in three clusters (PIPs, TIPs and AQP4s) but PpHIP1;1 were still basal to all of these and could therefore not be assigned to any of these subfamilies (data not shown). As for PpXIP1;1 and PpXIP1;2, hits were mostly annotated as Plant MIP, TIP or AQP0 sequences. The phylogenetic analysis resulted in four different subfamilies, TIPs, PIPs AQP0s and a fourth clade consisting of unspecified plant MIPs and the PpXIPs (data not shown), see further analyses in next paragraph.</p>
         </sec>
         <sec>
            <st>
               <p>The XIPs &#8211; an unrecognized MIP subfamily in higher plants</p>
            </st>
            <p>Sequences belonging to this fourth clade have a weak overall sequence similarity to MIPs in general (about 30 % amino acid identity, data not shown), and could neither be assigned to any of the previously identified classes of plant MIPs (PIPs, TIPs, NIPs, SIPs and GIPs) nor be associated with the PpHIP1;1 sequence. However, some conserved motifs within this new subfamily (see discussion) were identified and based on these one representative sequence (the castor bean cDNA sequence [GenBank:<ext-link ext-link-type="gen" ext-link-id="EG656577">EG656577</ext-link>]) was selected. This sequence was used in database searches in order to obtain more MIPs belonging to this novel subfamily. A handful of more sequences that all shared the same conserved motifs were identified. One of these sequences originated from <it>Populus trichocarpa </it>and therefore the <it>P. trichocarpa </it>genome at JGI were searched, identifying 4 more paralogs (Table <tblr tid="T2">2</tblr>). These sequences, together with the sequences retrieved from the castor bean cDNA and the PpXIP searches and all PpMIP sequences (except PpHIP1;1) were combined into one sequence alignment used in phylogenetic analysis. The resulting trees confirmed that the unclassified MIPs form a distinct monophyletic clade (with the PpXIPs as basal taxa), different from the other MIPs included in the analysis (Fig. <figr fid="F3">3</figr>). As shown in Table <tblr tid="T3">3</tblr> there is considerable variation both at the first NPA box and the ar/R filter among the sequences in this clade. We propose that, awaiting further characterization, MIPs in the new subfamily should be referred to as X Intrinsic Proteins (XIPs) emphasizing that currently we have very little information on the function of these proteins.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Sequences identified as belonging to the novel XIP subfamily</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="center">
                        <p>
                           <b>Number</b>
                           <sup>a</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>ID</b>
                           <sup>b</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Type</b>
                           <sup>c</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Organism</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Descr.</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Comments</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>DN837617</p>
                     </c>
                     <c ca="center">
                        <p>EST</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Selaginella moellendorffii</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>cDNA from whole plant</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>BT014197</p>
                     </c>
                     <c ca="center">
                        <p>EST</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Solanum lycopersicum</it>
                           <sup>d</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>cDNA from fruit</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>DY275505</p>
                     </c>
                     <c ca="center">
                        <p>EST</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Citrus clementina</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>cDNA from mixed tissue</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>CO092422</p>
                     </c>
                     <c ca="center">
                        <p>EST</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Gossypium raimondii</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>cDNA from whole seedlings</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>CK295158</p>
                     </c>
                     <c ca="center">
                        <p>EST</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Nicotiana benthamiana</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>cDNA from mixed tissue</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>EG656577</p>
                     </c>
                     <c ca="center">
                        <p>EST</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Ricinus communis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>cDNA from seeds</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>EG666650</p>
                     </c>
                     <c ca="center">
                        <p>EST</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Ricinus communis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>cDNA from roots</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>CK746370<sup>e </sup>DT60037<sup>e</sup></p>
                     </c>
                     <c ca="center">
                        <p>EST</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Liriodendron tulipifera</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>cDNA from flower buds</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>DR936893<sup>e </sup>DT742029<sup>e</sup></p>
                     </c>
                     <c ca="center">
                        <p>EST</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Aquilegia Formosa &#215; Aquilegia pubescens</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>cDNA from mixed tissue</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>AM455454</p>
                     </c>
                     <c ca="center">
                        <p>WGSS</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Vitis vinifera</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Exons between nucleotides 61100&#8211;61186, 61265&#8211;61354 &amp; 61465&#8211;62185</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>AM455454</p>
                     </c>
                     <c ca="center">
                        <p>WGSS</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Vitis vinifera</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Exons between nucleotides 69471&#8211;69617 &amp; 69685&#8211;70443</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>557139</p>
                     </c>
                     <c ca="center">
                        <p>Gene</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Populus trichocarpa</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>PIP</p>
                     </c>
                     <c ca="left">
                        <p>no EST support</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>829126</p>
                     </c>
                     <c ca="center">
                        <p>Gene</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Populus trichocarpa</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>PIP</p>
                     </c>
                     <c ca="left">
                        <p>EST support from cambium</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>767334</p>
                     </c>
                     <c ca="center">
                        <p>Gene</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Populus trichocarpa</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>PIP</p>
                     </c>
                     <c ca="left">
                        <p>no EST support</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>759781</p>
                     </c>
                     <c ca="center">
                        <p>Gene</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Populus trichocarpa</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>PIP</p>
                     </c>
                     <c ca="left">
                        <p>no EST support</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>821124</p>
                     </c>
                     <c ca="center">
                        <p>Gene</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Populus trichocarpa</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>PIP</p>
                     </c>
                     <c ca="left">
                        <p>EST support from petioles</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>XM_639170</p>
                     </c>
                     <c ca="center">
                        <p>Gene</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Dictyostelium discoideum AX4 </it>
                           <sup>f</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>MIP</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical protein</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>Number used for identification in Fig. 3 <sup>b </sup>GenBank ID or Protein ID <it>for Populus trichocarpa v 1.1 </it>database at JGI <sup>c </sup>EST = Expressed Sequence Tag, WGSS = Whole Genome Shotgun Sequence, Gene = Annotated gene <sup>d </sup>Tomato, previously named <it>Lycopersicon esculentum </it><sup>e </sup>Two overlapping sequences were used to construct a full length sequence <sup>f </sup>The only non-plant species and a very divergent sequence</p>
               </tblfn>
            </tbl>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Phylogenetic tree showing that the XIPs constitute a monophyletic subfamily distinct from other MIP subfamilies</p>
               </caption>
               <text>
                  <p><b>Phylogenetic tree showing that the XIPs constitute a monophyletic subfamily distinct from other MIP subfamilies</b>. The unrooted bootstrap majority-rule consensus tree was generated with the parsimony method. Bootstrap support values in percentage are presented for the branches separating the subfamilies. The taxa in the XIP group are numbered for identification in Table 2. Except for these sequences and all PpMIPs (except PpHIP1;1), AQP0 sequences of <it>Bos taurus </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="NM_173937">NM_173937</ext-link>] and <it>Ovis aries </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="AY573927">AY573927</ext-link>] and TIP sequences from <it>Picea abies </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="AJ005078">AJ005078</ext-link>], <it>Lotus japonicus </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="AF275315">AF275315</ext-link>], <it>Helianthus annus </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="EF469912">EF469912</ext-link>], <it>Oryza sativa </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="AB114829">AB114829</ext-link>] and <it>Posidonia oceanica </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="AJ314583">AJ314583</ext-link>] were used.</p>
               </text>
               <graphic file="1471-2229-8-45-3"/>
            </fig>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Aromatic/arginine filter of PpMIPs and MIPs of the XIP subfamily</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>
                           <b>NPA motifs</b>
                        </p>
                     </c>
                     <c cspan="5" ca="center">
                        <p>
                           <b>Ar/R selectivity filter</b>
                           <sup>a</sup>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>MIP protein(s)</b>
                           <sup>b</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Loop B</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Loop E</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>H2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>H5</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>LE</b>
                           <sub>1</sub>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>LE</b>
                           <sub>2</sub>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Alt. H5</b>
                           <sup>c</sup>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PpPIPs</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>F</p>
                     </c>
                     <c ca="center">
                        <p>H</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PpTIPs</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>NPG</p>
                     </c>
                     <c ca="center">
                        <p>H</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PpNIP3.1</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>NPV</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PpNIP5s</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>F</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PpNIP6.1</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>NPM</p>
                     </c>
                     <c ca="center">
                        <p>G</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PpSIPs</p>
                     </c>
                     <c ca="center">
                        <p>NPT</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>P</p>
                     </c>
                     <c ca="center">
                        <p>N</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PpGIP1.1</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>F</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>P</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PpHIP1.1</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>H</p>
                     </c>
                     <c ca="center">
                        <p>H</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PpXIP1.1</p>
                     </c>
                     <c ca="center">
                        <p>NPC</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>Q</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PpXIP1.2</p>
                     </c>
                     <c ca="center">
                        <p>NPS</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>Q</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>Q</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>DN837617</p>
                     </c>
                     <c ca="center">
                        <p>NPI</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>L</p>
                     </c>
                     <c ca="center">
                        <p>Q</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>S</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>DY275505</p>
                     </c>
                     <c ca="center">
                        <p>NPL</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>AM455454.1</p>
                     </c>
                     <c ca="center">
                        <p>NPV</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>557139</p>
                     </c>
                     <c ca="center">
                        <p>NPI</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>829126</p>
                     </c>
                     <c ca="center">
                        <p>NPI</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>759781</p>
                     </c>
                     <c ca="center">
                        <p>NPI</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>EG666650</p>
                     </c>
                     <c ca="center">
                        <p>SPT</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>DR936893 DT742029</p>
                     </c>
                     <c ca="center">
                        <p>NPT</p>
                     </c>
                     <c ca="center">
                        <p>NPS</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>S</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>CK746370 D T60037</p>
                     </c>
                     <c ca="center">
                        <p>NPI</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>G</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>767334</p>
                     </c>
                     <c ca="center">
                        <p>NPL</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>CK295158</p>
                     </c>
                     <c ca="center">
                        <p>NPV</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>BT014197</p>
                     </c>
                     <c ca="center">
                        <p>NPV</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>AM455454.2</p>
                     </c>
                     <c ca="center">
                        <p>NPI</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>821124</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>EG656577</p>
                     </c>
                     <c ca="center">
                        <p>NPV</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>CO092422</p>
                     </c>
                     <c ca="center">
                        <p>NPV</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>V</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>XM_639170</p>
                     </c>
                     <c ca="center">
                        <p>NPS</p>
                     </c>
                     <c ca="center">
                        <p>NPA</p>
                     </c>
                     <c ca="center">
                        <p>H</p>
                     </c>
                     <c ca="center">
                        <p>S</p>
                     </c>
                     <c ca="center">
                        <p>F</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a </sup>The ar/R filter is defined by four amino acid residues: one in helix 2, one in helix 5 and two in loop E <sup>b </sup>The PpMIPs are identified with their proposed names and the other MIPs are identified by their GenBank accession numbers <sup>c </sup>Alternative residue at H5 position due to alignment of conserved glycines in helix 5, however this also introduces two extra amino acids between helix 5 and the second NPA box</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Gene structure</p>
            </st>
            <p>The average <it>PpMIP </it>was found to have 2.6 introns with a size of 246.4 bp. This is about half the number of introns, but of approximately the same size as predicted for the average <it>P. patens </it>gene in a genome wide analysis <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. The exon/intron patterns of the <it>PpMIPs </it>were found to be highly conserved within each subfamily, as shown in Figure <figr fid="F4">4</figr>. Comparison with the <it>AtMIPs </it>showed the intron positions to be conserved for both <it>PIPs </it>and <it>NIPs</it>, but not for <it>TIPs </it>(in <it>P. patens </it>the intron position is 35 base pairs further to the 5'-end) and <it>SIPs </it>(completely lacking introns in <it>P. patens</it>). The exon/intron pattern also supported that the <it>PpHIP </it>and the <it>PpXIPs </it>were to be classified neither as <it>PIPs</it>, <it>TIPs</it>, <it>NIPs</it>, <it>SIPs </it>nor <it>GIPs</it>, but rather as separate subfamilies on their own.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>The conserved structure of <it>MIP </it>genes in <it>P. patens </it>is consistent with their phylogenetic classification</p>
               </caption>
               <text>
                  <p><b>The conserved structure of <it>MIP </it>genes in <it>P. patens </it>is consistent with their phylogenetic classification</b>. Horizontal bars represents exons (only coding sequence), gaps being introns. Position of transmembrane helices H1 to H6, and the two half transmembrane helices HB and HE, is indicated by vertical bars. Shading of the vertical bars shows the homologous helices in the first and second halves of the MIPs. Exons and transmembrane helices as well as position of transmembrane helices are drawn to scale, but introns are only depicted schematically, the bar indicates the length of 100 bp.</p>
               </text>
               <graphic file="1471-2229-8-45-4"/>
            </fig>
            <p>The identification of five <it>P. trichocarpa </it>XIP paralogs allowed comparison of gene structure across species. All five <it>P. trichocarpa </it>genes have the same pattern of exon-introns with two introns in the N-terminal sequence (data not shown). This is also true for the PpXIP1;2, but since the N-termini have a high degree of interspecies variation it is hard to make any conclusion on whether the intron positions are exactly conserved.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p><it>Physcomitrella patens </it>Major Intrinsic Proteins</p>
            </st>
            <p>Comparison of protein superfamilies of distantly related species can aid in our understanding of protein function and by annotating all MIPs in <it>P. patens </it>we have made such a comparison possible for the MIP superfamily of higher plants and mosses. Originally we hypothesised that mosses were to have a relatively small superfamily, due to them being simpler (for example lacking vascular tissue and therefore having a less complex water transport regulation). It was therefore much to our surprise that we found <it>P. patens </it>to have seven subfamilies containing in total 23 different MIPs, an unexpected large and divergent superfamily. One of these (PpGIP1;1) is analysed in detail by Gustavsson et al. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, and is therefore omitted from this discussion. Half of the remaining 22 PpMIPs are previously described by Borstlap <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> and Lienard et al. <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> and the remaining 11 are previously not described in the literature. The gene structure of the PpMIPs supports the phylogenetic analyses and the resulting division into seven subfamilies. Comparison with AtMIPs shows that PIPs and NIPs have conserved intron positions whereas SIPs and TIPs do not. This is consistent with the conservation of individual groups of the NIP and PIP subfamily in both <it>P. patens </it>and <it>A. thaliana </it>(discussed further below).</p>
         </sec>
         <sec>
            <st>
               <p>PIPs &#8211; the most conserved MIPs in plants</p>
            </st>
            <p>PIPs are remarkably well conserved plant MIPs that can be further classified into PIP1s and PIP2s. Both PIP1s and PIP2s are highly conserved in <it>P. patens </it>indicating that these groups must have formed early on in the evolution of land plants and are of fundamental importance in plant physiology. The physiological relevance of PIP1s and PIP2s in water relations in higher plants is well established and recently also carbon dioxide has been added to the list of possible substrates [reviewed in <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>]. The ar/R filter is strictly conserved in PIPs including PpPIPs suggesting that all PIPs, irrespectively of subgroup, have the same substrate specificity (Table <tblr tid="T3">3</tblr>). It is likely that the evolution of PIP sequences is constrained also in many other ways. For example the PIPs reside in the plasma membrane and it is essential that they are impermeable for protons in order to maintain the proton gradient. Furthermore, the water permeability of PIPs can be regulated by phosphorylations, pH and Ca<sup>2+ </sup>via an intricate gating mechanism <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. From our results presented here it is clear that the diacidic motif in the N-terminal region and the histidine in the D-loop responsible for Ca<sup>2+ </sup>binding and pH gating, respectively, are both conserved in all PpPIP1s and PpPIP2s. The phosphorylation site in loop B is also conserved in all PpPIPs whereas the PIP2 specific C-terminal phosphorylation motif is restricted to the PpPIP2s. This suggests that the gating mechanism is generic in all species and tissues where PIPs are expressed and that for instance pH gating is not limited to anaerobic conditions in roots of higher plants.</p>
            <p>In <it>P. patens </it>there is also an odd PIP (PpPIP3;1), basal to both PIP1s and PIP2s. The PpPIP3;1 has a deletion of 11 amino acids after the second NPA-box (between helix E and helix 6) and this, together with the relatively high divergence from other PIPs (e.g. lack of the Ca<sup>2+ </sup>binding site at the N terminal region and a conserved cysteine at helix 2) and the absence of ESTs, makes it questionable if this <it>MIP </it>gene is at all functional.</p>
         </sec>
         <sec>
            <st>
               <p>TIPs specialization occurred later</p>
            </st>
            <p>It has already been suggested that <it>P. patens </it>is lacking the specific isoforms of TIPs observed in higher plants <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> and now, with this complete set of PpMIPs at hand, this is confirmed. Interestingly, it has been proposed that vacuole sub-types harbor specific sets of TIP isoforms <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> and it is easy to speculate that the TIP groups in higher plants evolved due to special functional requirements of different vacuoles. The identification of conserved proteins in <it>P. patens</it>, involved in the sorting of proteins to different types of vacuoles, suggests that there are most likely more than one type of vacuole in bryophytes <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. This implies that TIPs are not conserved markers for subtypes of vacuoles as the presence of only one group of TIPs in <it>P. patens </it>indicates that either there is only one of the vacuole types in moss that has TIPs, or alternatively several different vacuoles in the moss cell all have the same type of TIPs. Both interpretations are consistent with recent experiments in higher plants that have challenged the idea of TIPs as valid markers for vacuole sub-types <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>.</p>
            <p>Rather than forming a very distant subclass of TIPs, the PpTIP6s appears as a conserved mosaic of the different motifs that are found in the different TIP groups of higher plants. For example the first few amino acid residues at the N-terminus are similar to TIP2s, whereas the C-terminal region is most similar to TIP3s. The identities of the amino acid residues at the ar/R filter (HIAR) are shared with both some TIP3s and TIP4s suggesting a similar specificity. In fact exactly these residues are the most common, comparing the frequencies in the selectivity regions of all <it>A. thaliana</it>, <it>Z. mays </it>and <it>O. sativa </it>TIPs (H<sub>0.81</sub>I<sub>0.62</sub>A<sub>0.72</sub>R<sub>0.75</sub>; based on Table 4 in <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>). This makes it likely that PpTIP6s are similar to the TIPs present in the last common ancestor of bryophytes and vascular plants and that the other motifs found at these positions are derived characters that have appeared later as different groups of TIPs evolved in vascular plants. The expansion and formation of specialized groups in the TIP subfamily of higher plants might suggest that some of these TIPs have taken over the functions of the MIPs of subfamilies that are missing in higher plants (e.g. HIPs and XIPs).</p>
         </sec>
         <sec>
            <st>
               <p>NIP groups evolved early</p>
            </st>
            <p>In higher plants NIPs form a divergent subfamily with large variation between species. This is true also for NIPs in <it>P. patens</it>, but surprisingly one of the three NIP groups identified is present also in higher plants, indicating that this group of NIPs, NIP3, was present already in a common ancestor to <it>P. patens </it>and higher plants (Fig. <figr fid="F2">2</figr>). The conserved intron positions among <it>NIPs </it>in <it>A. thaliana </it>and <it>P. patens </it>indicate that this gene structure was also present in the ancestral <it>NIP </it>gene. NIPs are different from other MIPs in that they often have unorthodox NPA boxes. In many NIP3s of higher plants the first and second NPA boxes are replaced by NPS and NPV, respectively <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. The corresponding motifs in PpNIP3;1 are NPA and NPV (Table <tblr tid="T3">3</tblr>), which is identical to AtNIP6;1 (one of the two NIP3s in <it>A. thaliana </it>according to the monocot classification), suggesting that NIP3s had these motifs before the split of bryophytes and vascular plants.</p>
            <p>The two NIP groups specific for <it>P. patens </it>(PpNIP5 and PpNIP6), have a unique combination of amino acids at the ar/R filter (Table <tblr tid="T3">3</tblr>). In contrast the ar/R region of PpNIP3;1 conforms to the residues found in other NIP3s, supporting that they are orthologs with the same conserved function. Recently a NIP3 have been shown to have a role in boron uptake in roots of <it>A. thaliana </it><abbrgrp><abbr bid="B27">27</abbr></abbrgrp> and even though mosses lack roots it cannot be ruled out that PpNIP3;1 has a role in boron transport in the moss.</p>
            <p>The N-terminal region of NIPs is relatively long compared to most other plant MIPs and is encoded on a separate exon. Due to the lack of generally conserved motifs in this region the first exon is often missing in annotations of <it>NIP </it>genes. However, within NIP3s of higher plants several motifs have been recognized in the N-terminal region <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> and some of these features are also conserved in PpNIP3;1. Similar to higher plants PpNIP3;1 has a high degree of proline and threonine residues and a sequence (AKCFP), corresponding to the conserved motif (C [KN]C [LF] [PS]) in higher plants.</p>
            <p>Many NIPs in higher plants have a conserved potential phosphorylation motif in the C-terminal region corresponding to the phosphorylation site in <it>Glycine max </it>NOD26 (GmNOD26, S262) and <it>Spinacia oleracea </it>PIP2;1 (SoPIP2;1; S274) <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B49">49</abbr></abbrgrp>. A serine at this position is also present in a similar motif in NIP3s of higher plants ([RK]XXR<b>S</b>FXR) <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> but not in PpNIP3;1 where the serine is substituted to a valine. In PpNIP5;3 and PpNIP6;1 there are serines but some of the basic residues in the motif are not conserved. In contrast a corresponding serine in the motif (KXXK<b>S</b>F [HR]R) is present in PpNIP5;1 and PpNIP5;2 suggesting that at least some NIPs in a common ancestor of bryophytes and higher plants were regulated by phosphorylation.</p>
            <p>It is interesting to see that there is no NIP2 type of MIP in <it>P. patens</it>, a NIP-group recently identified as a silicon transporter in rice <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Since bryophytes are known to accumulate silicon <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, the lack of PpNIP2s suggests that this function is carried out by a different isoform or class of proteins in <it>P. patens</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Only SIP1s are found in <it>Physcomitrella patens</it></p>
            </st>
            <p>In <it>A. thaliana </it>there are two classes of SIPs, SIP1s and SIP2s, both having the same gene structure with two introns at conserved positions <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. In <it>P. patens </it>there are two SIPs but neither of them has an intron. Surprisingly both of the PpSIPs belong to the SIP1 group whereas SIP2s of higher plants form a basal clade. This suggests that either SIP2s were present already in early land plants but were subsequently lost in <it>P. patens </it>in which the remaining SIP1s were subject to intron loss, or that SIP2s have rapidly diverged from SIP1s after the split leading to mosses and higher plants. An intron loss in <it>PpSIP1s </it>or an intron gain in a common ancestor to <it>SIP1s </it>and <it>SIP2s </it>in higher plant is equally likely in this scenario. In most SIP1s the corresponding sequence to the first NPA box is NPT, interestingly this unusual motif is conserved also in PpSIP1s, implying that this is a structurally and functionally important feature of SIP1s. In addition the ar/R filter is consistent with the phylogenetic classification, suggesting a conserved function of SIP1s among terrestrial plants.</p>
         </sec>
         <sec>
            <st>
               <p>HIP a unique MIP with similarities to both PIPs and TIPs</p>
            </st>
            <p>There are three <it>P. patens </it>MIP sequences that cannot be classified into any of the five subfamilies previously described in plants <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B20">20</abbr></abbrgrp>. One of these, the PpHIP1;1, seems to be a rather rare MIP, since we were not able to identify any orthologs. The unique gene structure indicates that this protein belongs to a separate subfamily. In phylogenetic analyses PpHIP1;1 tend to cluster with PIPs and TIPs, although the support for this is not very strong as seen in Figure <figr fid="F2">2</figr>. Upon looking at the ar/R filter (Table <tblr tid="T3">3</tblr>) one could also speculate that the HIP is related to TIPs and PIPs, since it has histidines both at the H2 position, typical for TIPs and the H5 position, typical for PIPs. What effect having two large and basic amino acid residues in the filter will have on transport properties is however unclear, and since there are no ESTs of the gene it might even be that it is not expressed. According to a subcellular localization prediction (WoLF PSORT <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, data not shown) PpHIP1;1 is slightly more likely to reside in the tonoplast than the plasma membrane. Further studies are required to explore expression, localization and substrate specificity of the PpHIP.</p>
            <p>The two other sequences belong to another group, the XIPs, further discussed in the next paragraph.</p>
         </sec>
         <sec>
            <st>
               <p>The XIP subfamily</p>
            </st>
            <p>A search for PpXIP orthologs resulted in the finding of many XIP sequences from a wide variety of species, including five paralogs from <it>P. trichocarpa (</it>probably the same five described as "putative aquaporins lacking in the <it>Arabidopsis</it>" by Tuskan et al. <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>). It is striking that no sequences are from monocots. Although most sequences were from dicots, no ortholog was found in <it>A. thaliana</it>, which may be explained by gene loss due to a relatively recent reduction of the genome size <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. Phylogenetic analyses confirmed that these sequences are from a, to our knowledge, previously unrecognized MIP subfamily, different from PIPs, TIPs, NIPs, SIPs and GIPs. The only non-plant sequence included in the analyses was a protein encoded by the [GenBank:<ext-link ext-link-type="gen" ext-link-id="XM_639170">XM_639170</ext-link>] gene from the amoeba <it>Dictyostelium discoideum AX4 </it>and it should be pointed out that although this protein is clustering with the XIPs in phylogenetic analyses, it is annotated as a hypothetical protein and lacks some of the characteristics of the XIPs. For example the amoeba protein has NPA boxes and an ar/R filter different from all other XIPs and also an overall highly divergent MIP sequence, all which makes it questionable if this protein has the same function as other XIPs. There is also a sequence from a lycophyte, the spike moss <it>Selaginella moellendorffii</it>, which together with the two PpXIPs are the three most divergent sequences albeit all three are clearly categorisable as XIPs. Although most sequences were derived from ESTs, no general conclusion could be made on expression pattern, since XIP transcripts were isolated from many different tissues ranging from roots, seedlings, flower buds to seeds and fruits (Table <tblr tid="T2">2</tblr>). Based on a subcellular localization prediction XIPs are likely to be situated in the plasma membrane (WoLF PSORT <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, data not shown).</p>
            <p>In the first NPA box of the XIPs, the alanine is replaced by a valine, leucine, isoleucine, serine or cysteine. All of these replacements, except isoleucine, have been observed in NPA boxes of other MIPs <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. The most conserved feature of the new subfamily is located after the second NPA box, where a cysteine amino acid is thoroughly conserved in the motif NPAR<b>C</b>. This cysteine is only a moderate change of the conserved serine or threonine found in many other subfamilies e.g. PIPs, TIPs, NIPs and in several mammalian AQPs. However, from the solved structure of SoPIP2;1 it is clear that residues at this position can stabilize the conformation of the C-loop by hydrogen bonds ([PDB:<ext-link ext-link-type="pdb" ext-link-id="1Z98">1Z98</ext-link>];S226 &#8211; N153, see Fig. <figr fid="F5">5</figr>) an interaction that seem to be structurally conserved and that also can be seen in BtAQP1 ([PDB:<ext-link ext-link-type="pdb" ext-link-id="1J4N">1J4N</ext-link>]; S198 &#8211; N129), BtAQP0 ([PDB:<ext-link ext-link-type="pdb" ext-link-id="1YMG">1YMG</ext-link>];S188 - N119) and, with the donor-acceptor interchanged, in EcGlpF ([PDB:<ext-link ext-link-type="pdb" ext-link-id="1FX8">1FX8</ext-link>];D207 - T137). This stabilisation is probably directly affecting the permeability of the pore since the orientation of the arginine of the ar/R filter is also stabilised by a hydrogen bond to the backbone of the C-loop (Fig. <figr fid="F5">5</figr>). Interestingly all the XIPs also have a conserved cysteine resulting in the motif LGG<b>C </b>in the C-loop at a position that can be aligned to N153 in SoPIP2;1. This suggests that a cysteine bridge may covalently fixate the C-loop relative to the arginine in the XIPs and that the extracellular entrance to the pore therefore might be more rigid than that of other MIPs.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Interaction of loop C and helix E</p>
               </caption>
               <text>
                  <p><b>Interaction of loop C and helix E</b>. Detail from the structure of SoPIP2;1 illustrating how loop C and residues in helix E interacts via H-bonds. In XIPs N153 and S226 are replaced by cysteins suggesting a covalent linkage between loop C and helix E. Oxygens of water molecules at the ar/R region are represented by spheres and the discussed residues are depicted by sticks.</p>
               </text>
               <graphic file="1471-2229-8-45-5"/>
            </fig>
            <p>There is also a highly conserved motif with a proline at the end of helix 2, 7 amino acids before the first NPA-box (<b>P</b>ISGGHINP), also found in mammalian AQP5s. A corresponding motif can be found in helix 5 of many other plant MIPs, which is interesting as this reflects the symmetry of the MIP proteins, consisting of two direct repeats of sequence. It is also worth noting that, with the exception of PpXIPs, there is a lack of an otherwise highly conserved glycine in helix 5, allowing the close packing of helix 2 and 5 <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, which in most XIPs is replaced by either a leucine or an isoleucine. An alternative alignment that retains the conserved glycine, but introduces two extra amino acids between helix 5 and the second NPA box is possible, but not used in the analysis presented here. This alignment will also affect which amino acid is positioned in the H5 position of the ar/R filter (Table <tblr tid="T3">3</tblr>). In the chosen alignment a valine is the most frequent residue in the H5 position and in the alternative alignment threonine would be in the H5 position. At the H2 position most XIPs have an aliphatic amino acid, something that can also be found in some NIPs and SIPs <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. This suggests that XIPs are not primarily water channels, although substrate specificity experiments have to be carried out to establish this. In the XIPs from <it>P. patens </it>and <it>S. moellendorffii </it>there is a glutamine at the H2 respectively H5 position of the ar/R filter, also found in TIP4s and TIP5s of higher plants, suggesting that maybe these TIPs have taken over some function of the XIPs in primitive plants. Further studies of localization, specificity and expression patterns are needed in order to determine the function of this novel MIP subfamily.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>In this study we identified a surprisingly large number of MIP encoding genes in <it>P. patens</it>, forming a diverse superfamily with seven subfamilies. In total 23 PpMIPs were identified; eight PIPs, four TIPs, five NIPs and two SIPs, one GIP and three MIPs belonging to two different, novel subfamilies, the HIPs and the XIPs. HIPs are hitherto not found in any higher plants, whereas the XIPs seem to be present in many plant species, although not in monocots. Interestingly, specific groups within the subfamilies, like PIP1s, PIP2s, NIP3s and possibly SIP1s were already present in a common ancestor of higher plants and bryophytes. In contrast, the subgroups of TIPs probably evolved later. These results suggest that early land plants had a large and divergent MIP superfamily consisting of at least the seven subfamilies found in <it>P. patens </it>and that during the evolution of higher plants some subfamilies were lost (Fig. <figr fid="F6">6</figr>) whereas remaining subfamilies evolved further resulting in diversification and formation of subgroups within the subfamilies. We speculate that some of the new subgroups, or perhaps some other unrelated transporters have taken over the function of the lost MIP subfamilies in higher plants.</p>
         <fig id="F6">
            <title>
               <p>Figure 6</p>
            </title>
            <caption>
               <p>The evolution of the MIP superfamily in plants</p>
            </caption>
            <text>
               <p><b>The evolution of the MIP superfamily in plants</b>. A schematic drawing of a likely scenario for the evolution of the MIP superfamily in plants. The ancestral plant is proposed to have had all seven subfamilies of MIPs found in extant mosses. The GIP and HIP were lost during the evolution of higher plants and subsequently the XIP subfamily was lost in monocots.</p>
            </text>
            <graphic file="1471-2229-8-45-6"/>
         </fig>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Gene identification and annotation</p>
            </st>
            <p><it>Physcomitrella patens MIP </it>genes were identified by TBLASTN searches of the PpDB at the Joint Genome Institute <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> using the protein sequences of the complete set of 35 MIPs from <it>Arabidopsis thaliana </it>as queries <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Gene models overlapping with hits were manually inspected and kept based on subfamily sequence similarity or EST support. If no satisfying model existed, the genomic sequence was used to identify exons for the new or modified model (as specified in Table <tblr tid="T1">1</tblr>). The PpGIP1;1 sequence was also added to the sequences since it was previously identified as a PpMIP <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Protein sequences corresponding to the translation of the <it>PpMIP </it>genes were used in a second round of TBLASTN searches to identify more divergent MIP sequences in PpDB, but none were found. The resulting 23 PpMIPs were used in a multiple alignment of translated sequences, together with the 35 AtMIP and 33 ZmMIPs <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Alignments were manually inspected and adjusted and care was taken to keep the number of gaps low and to avoid gaps in functionally important features, such as the NPA-boxes and transmembrane regions. The alignment that forms the basis for all the phylogenetic analysis regarding the PpMIPs presented here is available as ALIGN_001168 in the EMBL-align database (which can be accessed either via the EMBL-EBI SRS homepage <abbrgrp><abbr bid="B55">55</abbr></abbrgrp> or FTP <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>).</p>
            <p>Orthologs of the unclassified PpHIP, PpXIP1;1 and PpXIP1;2 were searched for by TFASTX3 searches of the EMBL nucleotide sequence database <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> and TBLASTN searches of the nr/nt, est, gss and htgs databases at NCBI <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> using the translated sequence of the three <it>PpMIP</it>s. Translations representing hits from a wide variety of species were used in protein alignments together with either PpHIP1;1 or PpXIP1;1 and PpXIP1;2 and the PpPIPs and PpTIPs. The alignments were manually inspected and adjusted as mentioned above and used for phylogenetic analysis of PpHIP1;1 and the PpXIPs and are available in the EMBL-align database as ALIGN_001169 respectively ALIGN_001170.</p>
            <p>The translated sequence of one of the PpXIP orthologs found [GenBank:<ext-link ext-link-type="gen" ext-link-id="EG656577">EG656577</ext-link>] was used in additional TBLASTN searches of the nr/nt, est, gss and htgs databases at NCBI in order to find more homologs of this group. One ortholog found was from <it>Populus trichocarpa </it>and a translation of this sequence was used in a TBLASTN search of the <it>P. trichocarpa </it>genome at JGI to find paralogs. These paralogs together with a selection of homologs from the [GenBank:<ext-link ext-link-type="gen" ext-link-id="EG656577">EG656577</ext-link>] and PpXIP searches were used in a multiple sequence alignment of translated sequences together with 22 PpMIPs (all except the PpHIP). The alignment was manually inspected and adjusted in the same manner as the PpMIP-AtMIP-ZmMIP alignment. This alignment forms the basis for all the phylogenetic analysis regarding the XIP group of MIPs and is available as ALIGN_001171 in the EMBL-align database.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analysis</p>
            </st>
            <p>The PpMIP sequence alignment was analyzed by three different phylogenetic methods, Neighbour Joining (NJ), Maximum Parsimony (MP) and Bayesian inference (Bay). For all methods, gaps were treated as missing data. PAUP*4.0b10 <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> was used for the NJ and MP analysis. The default settings were used for both methods and bootstrapping with one thousand replicates for each method assessed the confidence of the best trees. Bayesian phylogenetic inferences were conducted using MrBayes 3.0.2 <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> using vague or uninformative prior probability distributions of the likelihood model under the JTT <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> +I+&#915; model. Two sets of four parallel Metropolis Coupled Monte Carlo Markov Chains, of which three were heated with 0.2 temperature increments, were run for 2 million generations starting from random trees. Each 100th tree was sampled. The first 25 % of sampled trees was discarded as burn in, and stationary phase was empirically determined by looking at the likelihood scores of the kept samples. Robustness of the inferred tree was evaluated using Bayesian posterior probabilities. A "method consensus" tree was constructed as an overview, in this tree only branches that had a bootstrap or posterior probability support of more than 50 % in at least two of the methods were kept and all other were collapsed.</p>
            <p>For the PpHIP1;1, PpXIPs and XIP-group alignments, PAUP*4.0b10 <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> was used for a NJ and MP analysis (gaps treated as missing data). The default settings were used for both methods and for the XIP-group alignment analysis, bootstrapping with one thousand replicates for each method assessed the confidence of the best trees. All trees from the PpMIP, PpHIP, PpXIPs and XIP family analyses are available in nexus format for viewing in Tree-View <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> [see Additional files <supplr sid="S2">2</supplr>, <supplr sid="S3">3</supplr>, <supplr sid="S4">4</supplr>, <supplr sid="S5">5</supplr>, <supplr sid="S6">6</supplr>, <supplr sid="S7">7</supplr>, <supplr sid="S8">8</supplr>, <supplr sid="S9">9</supplr>, <supplr sid="S10">10</supplr>, <supplr sid="S11">11</supplr>, <supplr sid="S12">12</supplr>, <supplr sid="S13">13</supplr>, <supplr sid="S14">14</supplr>].</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>Phylogenetic tree (in nexus format) using the Bayesian inference method and the dataset ALIGN_001168.</p>
               </text>
               <file name="1471-2229-8-45-S2.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p>Bootstrap majority consensus phylogenetic tree (in nexus format) using the Parsimony method and the dataset ALIGN_001168.</p>
               </text>
               <file name="1471-2229-8-45-S3.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p>Phylogenetic tree (in nexus format) using the Parsimony method and the dataset ALIGN_001168.</p>
               </text>
               <file name="1471-2229-8-45-S4.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S5">
               <title>
                  <p>Additional file 5</p>
               </title>
               <text>
                  <p>Bootstrap majority consensus phylogenetic tree (in nexus format) using the Neighbour Joining method and the dataset ALIGN_001168.</p>
               </text>
               <file name="1471-2229-8-45-S5.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S6">
               <title>
                  <p>Additional file 6</p>
               </title>
               <text>
                  <p>Phylogenetic tree (in nexus format) using the Neighbour Joining method and the dataset ALIGN_001168.</p>
               </text>
               <file name="1471-2229-8-45-S6.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S7">
               <title>
                  <p>Additional file 7</p>
               </title>
               <text>
                  <p>Phylogenetic tree (in nexus format) using the Parsimony method and the dataset ALIGN_001169.</p>
               </text>
               <file name="1471-2229-8-45-S7.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S8">
               <title>
                  <p>Additional file 8</p>
               </title>
               <text>
                  <p>Phylogenetic tree (in nexus format) using the Neighbour Joining method and the dataset ALIGN_001169.</p>
               </text>
               <file name="1471-2229-8-45-S8.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S9">
               <title>
                  <p>Additional file 9</p>
               </title>
               <text>
                  <p>Phylogenetic tree (in nexus format) using the Parsimony method and the dataset ALIGN_001170.</p>
               </text>
               <file name="1471-2229-8-45-S9.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S10">
               <title>
                  <p>Additional file 10</p>
               </title>
               <text>
                  <p>Phylogenetic tree (in nexus format) using the Neighbour Joining method and the dataset ALIGN_001170.</p>
               </text>
               <file name="1471-2229-8-45-S10.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S11">
               <title>
                  <p>Additional file 11</p>
               </title>
               <text>
                  <p>Bootstrap majority consensus phylogenetic tree (in nexus format) using the Parsimony method and the dataset ALIGN_001171.</p>
               </text>
               <file name="1471-2229-8-45-S11.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S12">
               <title>
                  <p>Additional file 12</p>
               </title>
               <text>
                  <p>Phylogenetic tree (in nexus format) using the Parsimony method and the dataset ALIGN_001171.</p>
               </text>
               <file name="1471-2229-8-45-S12.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S13">
               <title>
                  <p>Additional file 13</p>
               </title>
               <text>
                  <p>Bootstrap majority consensus phylogenetic tree (in nexus format) using the Neighbour Joining method and the dataset ALIGN_001171.</p>
               </text>
               <file name="1471-2229-8-45-S13.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S14">
               <title>
                  <p>Additional file 14</p>
               </title>
               <text>
                  <p>Phylogenetic tree (in nexus format) using the Neighbour Joining method and the dataset ALIGN_001171.</p>
               </text>
               <file name="1471-2229-8-45-S14.tre">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>J&#197;HD carried out the acquisition, analysis and interpretation of data and drafting of the manuscript. UJ conceived the study and helped with the interpretation of data. Both authors worked with the design of the study and with revising the manuscript and they both read and approved the final manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Note added in proof</p>
         </st>
         <p>During the publication of this work we successfully identified the HIP subfamily of MIPs in the spike moss <it>Selaginella moellendorffii</it>. PpHIP1;1 and the closest homolog in <it>S. moellendorffii </it>are highly similar (with 73.7 % amino acid identity) and have the same NPA-boxes and ar/R filter motives. This proves that the HIP subfamily is indeed a novel conserved subfamily of MIPs and not an anomaly only found in <it>Physcomitrella patens</it>.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We are grateful to the U.S. Department of Energy Joint Genome Institute for sequencing the genome of <it>Physcomitrella patens </it>and making the sequence available to the public. We would also like to thank Assoc. Prof. Nils Cronberg for valuable discussions on mosses and PhD Virginia Balbi and Laura Saavedra for the introduction to the PpDB at the Joint Genome Institute. This work was supported by grants from the Swedish Research Council for Environment, Agricultural Sciences and Spatial Planning (Formas; grants to U.J.).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Aquaporin water channels: molecular mechanisms for human diseases</p>
            </title>
            <aug>
               <au>
                  <snm>Agre</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kozono</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>FEBS Lett</source>
            <pubdate>2003</pubdate>
            <volume>555</volume>
            <issue>1</issue>
            <fpage>72</fpage>
            <lpage>78</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0014-5793(03)01083-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">14630322</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Aquaporins: Phylogeny, Structure, and Physiology of Water Channels</p>
            </title>
            <aug>
               <au>
                  <snm>Heymann</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Engel</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>News Physiol Sci</source>
            <pubdate>1999</pubdate>
            <volume>14</volume>
            <fpage>187</fpage>
            <lpage>193</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11390849</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>From structure to disease: the evolving tale of aquaporin biology</p>
            </title>
            <aug>
               <au>
                  <snm>King</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Kozono</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Agre</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nature reviews</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>9</issue>
            <fpage>687</fpage>
            <lpage>698</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid 