<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1471-2148-11-78</ui><ji>1471-2148</ji><fm>
<dochead>Research article</dochead>
<bibl>
<title>
<p>Evolution of <it>Parallel Spindles Like </it>genes in plants and highlight of unique domain architecture<sup>#</sup>
</p>
</title>
<aug>
<au id="A1"><snm>Cigliano</snm><mnm>Aiese</mnm><fnm>Riccardo</fnm><insr iid="I1"/><email>riccardo.aiesecigliano@unina.it</email></au>
<au id="A2"><snm>Sanseverino</snm><fnm>Walter</fnm><insr iid="I2"/><email>walter.sanseverino@unina.it</email></au>
<au id="A3"><snm>Cremona</snm><fnm>Gaetana</fnm><insr iid="I1"/><email>gaetana.cremona@unina.it</email></au>
<au id="A4"><snm>Consiglio</snm><mi>M</mi><fnm>Federica</fnm><insr iid="I1"/><email>marconsi@unina.it</email></au>
<au ca="yes" id="A5"><snm>Conicella</snm><fnm>Clara</fnm><insr iid="I1"/><email>conicell@unina.it</email></au>
</aug>
<insg>
<ins id="I1"><p>CNR - National Research Council of Italy, Institute of Plant Genetics, Research Division Portici, Via Universit&#224; 133, 80055 Portici. Italy</p></ins>
<ins id="I2"><p>DISSPAPA, Dept. of Soil, Plant and Environmental Sciences, University of Naples "Federico II", Via Universit&#224; 100, 80055 Portici. Italy</p></ins>
</insg>
<source>BMC Evolutionary Biology</source>
<issn>1471-2148</issn>
<pubdate>2011</pubdate>
<volume>11</volume>
<issue>1</issue>
<fpage>78</fpage>
<url>http://www.biomedcentral.com/1471-2148/11/78</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2148-11-78</pubid><pubid idtype="pmpid">21435253</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>26</day><month>11</month><year>2010</year></date></rec><acc><date><day>24</day><month>3</month><year>2011</year></date></acc><pub><date><day>24</day><month>3</month><year>2011</year></date></pub></history>
<cpyrt><year>2011</year><collab>Cigliano et al; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>Polyploidy has long been recognized as playing an important role in plant evolution. In flowering plants, the major route of polyploidization is suggested to be sexual through gametes with somatic chromosome number (2<it>n</it>). <it>Parallel Spindle1 </it>gene in <it>Arabidopsis thaliana </it>(<it>AtPS1</it>) was recently demonstrated to control spindle orientation in the 2nd division of meiosis and, when mutated, to induce 2<it>n </it>pollen. Interestingly, <it>AtPS1 </it>encodes a protein with a FHA domain and PINc domain putatively involved in RNA decay (i.e. Nonsense Mediated mRNA Decay). In potato, 2<it>n </it>pollen depending on parallel spindles was described long time ago but the responsible gene has never been isolated. The knowledge derived from <it>AtPS1 </it>as well as the availability of genome sequences makes it possible to isolate potato <it>PSLike </it>(<it>PSL</it>) and to highlight the evolution of <it>PSL </it>family in plants.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>Our work leading to the first characterization of <it>PSLs </it>in potato showed a greater <it>PSL </it>complexity in this species respect to <it>Arabidopsis thaliana</it>. Indeed, a genomic <it>PSL </it>locus and seven cDNAs affected by alternative splicing have been cloned. In addition, the occurrence of at least two other <it>PSL </it>loci in potato was suggested by the sequence comparison of alternatively spliced transcripts.</p>
<p>Phylogenetic analysis on 20 <it>Viridaeplantae </it>showed the wide distribution of <it>PSLs </it>throughout the species and the occurrence of multiple copies only in potato and soybean.</p>
<p>The analysis of PSL<sup>FHA </sup>and PSL<sup>PINc </sup>domains evidenced that, in terms of secondary structure, a major degree of variability occurred in PINc domain respect to FHA. In terms of specific active sites, both domains showed diversification among plant species that could be related to a functional diversification among <it>PSL </it>genes. In addition, some specific active sites were strongly conserved among plants as supported by sequence alignment and by evidence of negative selection evaluated as difference between non-synonymous and synonymous mutations.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>In this study, we highlight the existence of PSLs throughout <it>Viridaeplantae</it>, from mosses to higher plants. We provide evidence that <it>PSLs </it>occur mostly as singleton in the analyzed genomes except in soybean and potato both characterized by a recent whole genome duplication event. In potato, we suggest the candidate <it>PSL </it>gene having a role in 2<it>n </it>pollen that should be deeply investigated.</p>
<p>We provide useful insight into evolutionary conservation of FHA and PINc domains throughout plant PSLs which suggest a fundamental role of these domains for PSL function.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>Polyploidy represents the occurrence of more than two complete sets of chromosomes in an organism and has long been recognized as playing an especially important role in plant evolution <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>. In flowering plants, polyploidy extent has been largely underestimated in terms of its commonality. Indeed, major recent advances in genomic analysis has revealed that almost all angiosperms have experienced at least one round of whole genome duplication during their evolution.</p>
<p>The wide spreading of polyploidy throughout the angiosperms can be related to their highly plastic genome structure, as inferred from their tolerance to changes in chromosome number, genome size and epigenome <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>. Although information with regard to the modes of polyploidization is limited, the major route of polyploidization seems to be sexual through the functioning of gametes with somatic chromosome number (2<it>n </it>gametes) <abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>. Indeed, sexual polyploidization as compared to asexual would explain better the success of polyploid species in terms of higher fitness and more genetic flexibility. The control of 2<it>n </it>gamete formation has been generally attributed to the action of single recessive genes. These genes exhibit incomplete penetrance and variable expression that is significantly influenced by genetic, environmental and developmental factors <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp>. The molecular mechanisms leading to 2<it>n </it>gametes have only recently begun to be uncovered <abbrgrp>
<abbr bid="B5">5</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. In particular, d'Erfurth and colleagues <abbrgrp>
<abbr bid="B7">7</abbr>
</abbrgrp> isolated and characterized <it>Parallel Spindle1 </it>gene in <it>Arabidopsis thaliana </it>(<it>AtPS1</it>) that controls diploid pollen formation through spindle orientation in the second division of meiosis. The occurrence of parallel spindles at meiosis II is a frequently found mechanism for 2<it>n </it>pollen formation that was described in potato many decades ago <abbrgrp>
<abbr bid="B8">8</abbr>
<abbr bid="B9">9</abbr>
</abbrgrp>. In potato, <it>ps </it>mutants have been used for breeding purposes in order to introgress beneficial traits from diploid (2<it>n </it>= 2<it>x </it>= 24) relatives into cultivated strains <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>. However, the gene <it>ps </it>leading to 2<it>n </it>pollen via parallel spindles was not isolated, so far.</p>
<p>Interestingly, AtPS1 is a protein which contains contemporarily a ForkHead Associated domain (FHA), and a C-terminal PilT N-terminus domain (PINc). So far, the FHA domain has been found in more than 5600 different proteins from prokaryotes to higher eukaryotes involved in several processes including cell cycle control, DNA repair, protein degradation, transcription and pre-mRNA splicing <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>. FHA domain was shown to recognize phosphothreonine-containing epitopes <abbrgrp>
<abbr bid="B12">12</abbr>
</abbrgrp>. PINc domain has been found in more than 3600 proteins in all life kingdoms. PINc domain has RNA nuclease activity <abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>. In eukaryotes, PINc-containing proteins, such as human SMG6 and SMG5, were linked to <ul>N</ul>onsense-<ul>M</ul>ediated mRNA <ul>D</ul>ecay (NMD), that recognizes and rapidly degrades mRNAs containing <ul>P</ul>remature translation <ul>T</ul>ermination <ul>C</ul>odons (PTCs).</p>
<p>In this study, a sequence-homology-based strategy was carried out to isolate <it>PS </it>gene from a diploid potato. Through this approach, a genomic locus <it>PS-Like </it>(<it>PSL</it>) and seven cDNAs affected by alternative splicing have been cloned. The occurrence of at least two other <it>PSL </it>loci is suggested in potato. In order to shed light on the evolution and function of <it>PSL </it>genes in plants, a phylogenetic analysis of <it>PSL </it>genes was performed and FHA/PINc domains were compared. As far as we know, this is the first report about the isolation and characterization of <it>PSL </it>in a crop. We also demonstrate the conservation of this gene family throughout plants.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<sec>
<st>
<p>Cloning and characterization of <it>PSL </it>genes in potato</p>
</st>
<p>Using the predicted protein sequence of <it>AtPS1</it>, two ESTs were identified on DFCI Potato Gene Index (<url>http://compbio.dfci.harvard.edu/tgi</url>) named TC194262 and DN590600 corresponding to FHA and PINc domains, respectively. The primers designed on the above mentioned ESTs allowed to isolate in diploid potato a 3 Kbp genomic clone (<it>PSL1</it>) lacking UTR regions. In order to complete genomic sequence of <it>PSL1</it>, by querying Potato Genome Sequencing Consortium Database (<url>http://www.potatogenome.net/index.php</url>) and SOL Genomic Network (<url>http://solgenomics.net</url>) we retrieved a 3 Kbp region of <it>S. phureja </it>v3 scaffold PGSC0003DMS000001829 (<it>phuPSL</it>) and a tomato BAC clone AC211085.1 (<it>SlPSL1</it>) sharing 97% and 93% of sequence identity with <it>PSL1</it>, respectively. Primers designed on the tomato sequence were used to isolate a potato 5.3 Kbp <it>PSL1 </it>genomic region spanning 2 Kbp upstream of the start codon to about 200 bp downstream of the stop codon [GenBank:<ext-link ext-link-id="HQ418834" ext-link-type="gen">HQ418834</ext-link>]. <it>In silico </it>gene prediction showed a structure of <it>PSL1 </it>composed by six exons and five introns (Figure <figr fid="F1">1A</figr>) and a 2.4 Kbp hypothetical <it>PSL1 </it>cDNA (<it>PSL1</it>_pred) with a GC content of 42% encoding a 92.5 kDa protein of 823 aminoacids (pPSL1_pred).</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Potato <it>PSL </it>gene family</p></caption><text>
   <p><b>Potato <it>PSL </it>gene family</b>. (A) PSL1 gene structure (<it>PSL1_gen</it>) with UTRs reported in red, exons in orange and introns in grey is showed together with the schematic alignment of the cloned cDNAs and <it>PSL1_gen </it>sequences, with exons in white, introns in grey and nucleotide differences respect to <it>PSL1_gen </it>as vertical black lines. (B) Maximum Likelihood phylogenetic tree of <it>PSL </it>cDNAs is reported with bootstrap values for each node and branch lengths measured in bootstrap values. (C) The distance in terms of number of different nucleotides is reported for the cloned <it>PSL </it>cDNAs. (D) Schematic representation of PSL predicted proteins obtained after <it>in silico </it>translation of the corresponding cDNAs showing FHA and PINc domains as ovals and exagons, respectively.</p>
</text><graphic file="1471-2148-11-78-1" hint_layout="double"/></fig>
<p>Given the meiotic function of <it>AtPS1</it>, potato <it>PSL1 </it>cDNA was isolated from pre-bolting buds. Seven different cDNAs ranging from 2.3 to 2.7 Kbp were isolated including <it>PSL1a </it>of 2.4 Kbp corresponding to <it>PSL1</it>_pred. The sequence alignment between cDNAs and the genomic <it>PSL1 </it>indicated the presence of different groups of related <it>PSL </it>sequences encoded by more than one locus (Figure <figr fid="F1">1A</figr>). In order to investigate the relationship between the different <it>PSL </it>cDNAs, we conducted a phylogenetic analysis that suggested the existence of three different loci named <it>PSL1</it>, <it>PSL2 </it>and <it>PSL3 </it>(Figure <figr fid="F1">1B</figr>). On the basis of sequence similarity, we assigned <it>PSL1a</it>, <it>PSL1b </it>and <it>PSL1c </it>cDNAs [GenBank:<ext-link ext-link-id="HQ418835" ext-link-type="gen">HQ418835</ext-link>, GenBank:<ext-link ext-link-id="HQ418836" ext-link-type="gen">HQ418836</ext-link> and GenBank:<ext-link ext-link-id="HQ418837" ext-link-type="gen">HQ418837</ext-link>] to genomic <it>PSL1</it>, <it>PSL3a</it>, <it>PSL3b </it>and <it>PSL3c </it>cDNAs [GenBank:<ext-link ext-link-id="HQ418839" ext-link-type="gen">HQ418839</ext-link>, GenBank:<ext-link ext-link-id="HQ418840" ext-link-type="gen">HQ418840</ext-link> and GenBank:<ext-link ext-link-id="HQ418841" ext-link-type="gen">HQ418841</ext-link>] to <it>PSL3 </it>and the last cDNA to <it>PSL2 </it>[GenBank:<ext-link ext-link-id="HQ418838" ext-link-type="gen">HQ418838</ext-link>]. Moreover, the distance measured as the number of different nucleotides among the seven cDNAs showed that <it>PSL2 </it>was more similar to <it>PSL3 </it>than to <it>PSL1 </it>(Figure <figr fid="F1">1C</figr>). The nucleotide comparison between the predicted <it>phuPSL </it>cDNA and <it>PSL1a</it>, <it>PSL2 </it>and <it>PSL3c </it>cDNAs showing a similarity of 98.5%, 99.1% and 99.6%, respectively, suggested that <it>phuPSL </it>locus corresponds to <it>PSL3 </it>being the differences explained by the different genetic background.</p>
<p>Based on the sequences of cloned <it>PSL </it>cDNAs, <it>PSL1b </it>and <it>PSL1c</it>, <it>PSL3a </it>and <it>PSL3b </it>are alternative splicing forms of <it>PSL1 </it>and <it>PSL3 </it>since they retained complete or partial introns causing the formation of premature stop codons (PTCs) (Figure <figr fid="F1">1A</figr>). Moreover, the cloned <it>PSL2 </it>cDNA showed a PTC caused by the retention of the second intron. As a consequence, all the predicted PSL proteins, except pPSL1a and pPSL3c, had truncated or lacking PINc domain (Figure <figr fid="F1">1D</figr>).</p>
<p>In order to evaluate whether the alternative splicing <it>PSL </it>variants were possible target of degradation through NMD we calculated the distance between PTCs and the successive exon-exon junction. Being this distance more than 50-55 nt according to Nagi and Maquat <abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp> we could consider <it>PSL1b, PSL1c, PSL2</it>, <it>PSL3a </it>and <it>PSL3b </it>as target of NMD.</p>
</sec>
<sec>
<st>
<p>Phylogenetic analysis of <it>PSL </it>genes in sequenced <it>Viridaeplantae</it>
</p>
</st>
<p>In order to investigate the evolution of <it>PSL </it>family, a phylogenetic analysis was performed by a search on Interpro (<url>http://www.ebi.ac.uk/interpro</url>) for proteins with both FHA and PINc domains. Interestingly, these domains were contemporary present only in plants, except for the multidrug-efflux transporter NAEGRDRAFT_78193 from the amoeboid <it>Naegleria gruberi</it>.</p>
<p>PSL1a sequence was blasted against the Phytozome v6 database (<url>http://www.phytozome.net</url>) that contains the genomic sequences of 22 organisms and against the SOL Genomics Network containing the tomato genome assembly. Afterwards, 25 sequences of predicted PSL proteins were collected from 19 different organisms (Additional File <supplr sid="S1">1</supplr>), since <it>Physcomitrella patens</it>, <it>Ricinus communis</it>, <it>Volvox carteri</it>, <it>Zea mays </it>and <it>Chlamydomonas rehinardtii </it>seemingly lack <it>PSL </it>genes. The sequences were then aligned and the Maximum-Likelihood phylogenetic tree is shown in Figure <figr fid="F2">2</figr>. The distribution of PSLs is in agreement with the known phylogenetic relationships between species among dicots and monocots. Moreover, one <it>PSL </it>locus was found in the analyzed plant species, except for <it>Glycine max</it>. Indeed, four different <it>PSL </it>loci were identified in soybean and three of them encode alternative transcripts. Differently from potato, the alternative transcripts of soybean retain PINc domain being the splicing sites located at the 3'-end of mRNA (Figure <figr fid="F3">3</figr>).</p>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>PSL sequences used in this study</b>. Description of sequence names, accession numbers, organisms and source databases used in this study.</p>
</text>
<file name="1471-2148-11-78-S1.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Phylogenetic tree of plant PSL proteins</p></caption><text>
   <p><b>Phylogenetic tree of plant PSL proteins</b>. Maximum Likelihood phylogenetic tree of PSL predicted proteins from 20 plant species. Bootstrap values are shown for each node. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. At = <it>Arabidopsis thaliana</it>; Ac = <it>Aquilegia coerulea</it>; Al = <it>Arabidopsis lyrata</it>; Bd = <it>Brachypodium distachyon</it>; Br = <it>Brassica rapa</it>; Cp = <it>Carica papaya</it>; Cs = <it>Cucumis sativus</it>; Gm = <it>Glycine max</it>; Me = <it>Manihot esculenta</it>; Mg = <it>Mimulus guttatus</it>; Mt = <it>Medicago truncatula</it>; Os = <it>Oryza saliva</it>; Pp = <it>Prunus persica</it>; Pt = <it>Populus trichocarpa</it>; Sb = <it>Sorghum bicolour</it>; Si = <it>Setaria italica</it>; Sl = <it>Solanum lycopersicon</it>; Sm = <it>Selaginella moellendorffii</it>; Vv = <it>Vitis vinifera</it>.</p>
</text><graphic file="1471-2148-11-78-2" hint_layout="double"/></fig>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Alternative splicing of <it>GmPSL </it>genes</p></caption><text>
   <p><b>Alternative splicing of <it>GmPSL </it>genes</b>. <it>GmPSL1_gen</it>, <it>GmPSL2_gen</it>, <it>GmPSL3_gen</it>, <it>GmPSL4_gen </it>gene structures are reported with UTRs in red, exons in orange and introns in grey. The mRNAs are reported with UTRs in red, exons in orange and spliced introns as interrupted lines underneath each gene structure. Arrow indicates the lack of complete sequence.</p>
</text><graphic file="1471-2148-11-78-3" hint_layout="single"/></fig>
</sec>
<sec>
<st>
<p>Analysis of FHA and PINc secondary structure and active sites in PSL proteins</p>
</st>
<p>In order to assess the conservation degree of PSLs, we predicted and compared the secondary structure of FHA and PINc domains using the SMART (<url>http://smart.embl-heidelberg.de</url>) and PSIPRED (<url>http://bioinf.cs.ucl.ac.uk/psipred</url>) tools. It is reported that FHA domain is 80-100 aminoacid (aa) long folded into a 11-stranded beta sandwich, which sometimes contains small helical insertions between the loops connecting the strands. However, <it>in silico </it>analysis of FHA displays only 8 beta-strands (b-strands) out of 11 including the residues involved in phosphopeptide recognition and stabilisation of domain architecture <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>. Using the above mentioned tools on yeast RAD53p [NCBI:6325104], a well characterized FHA containing protein <abbrgrp>
<abbr bid="B12">12</abbr>
</abbrgrp>, the identified FHA region was 52 aa covering 6 beta-strands (data not shown). As shown in Figure <figr fid="F4">4</figr>, the length of the predicted FHA region in our dataset was 53 aa except for <it>Brassica rapa </it>(51 aa) and for <it>Glycine max </it>PSL2 (68 aa). While the majority of FHA domains showed 6 beta-strands, <it>Brassica rapa </it>FHA was predicted to be composed of 4 consecutive beta-strands followed by an alpha helical region. The group of monocots, <it>Brachypodium distachyon</it>, <it>Oryza sativa </it>and <it>Sorghum bicolor</it>, as well as dicot <it>Glycine max </it>(<it>GmPSL1</it>) showed a helical insertion between the 2nd and the 3rd beta-strand. In other species this helical insertion was predicted but at a low confidence value as estimated by PSIPRED. Afterward, we compared the active sites in yRAD53p with those present in PSL<sup>FHA </sup>domains through the protein alignment showed in Figure <figr fid="F5">5</figr>. It can be observed that glycine-5, arginine-6, serine-21 and histidine-24 in FHA domain are perfectly conserved. The arginine-19 seems to be absent in all plant sequences. In the analyzed species, asparagine-60 showed a substitution with histidine, characterized by a different polarity, except for soybean PSL4 glutamine-60 and <it>Brassica rapa </it>that seems to lack this site. Asparagine-66 is mostly substituted with the similar polar serine. Instead, AlPSL1, AtPSL1, GmPSL3 and GmPSL4 exhibited arginine-66 with different polarity while BrPSL1 glutammine-66 and MtPSL1 cysteine-66 both showing similar polarity of asparagine.</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Predicted secondary structures of PSLs<sup>FHA</sup></p></caption><text>
   <p><b>Predicted secondary structures of PSLs<sup>FHA</sup></b>. FHA domain predicted secondary structures of PSLs are reported with alpha-helices as violet tubes, beta-strands as yellow arrows and coiled regions as lines. Regions underlined with brackets show low significance as estimated by PSIPRED. The domain length is also reported. At = <it>Arabidopsis thaliana</it>; Ac = <it>Aquilegia coerulea</it>; Al = <it>Arabidopsis lyrata</it>; Bd = <it>Brachypodium distachyon</it>; Br = <it>Brassica rapa</it>; Cp = <it>Carica papaya</it>; Cs = <it>Cucumis sativus</it>; Gm = <it>Glycine max</it>; Me = <it>Manihot esculenta</it>; Mg = <it>Mimulus guttatus</it>; Mt = <it>Medicago truncatula</it>; Os = <it>Oryza saliva</it>; Pp = <it>Prunus persica</it>; Pt = <it>Populus trichocarpa</it>; Sb = <it>Sorghum bicolour</it>; Si = <it>Setaria italica</it>; Sl = <it>Solanum lycopersicon</it>; Sm = <it>Selaginella moellendorffii</it>; Vv = <it>Vitis vinifera</it>.</p>
</text><graphic file="1471-2148-11-78-4" hint_layout="double"/></fig>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Alignment of PSLs<sup>FHA </sup>and yRAD53p<sup>FHA</sup></p></caption><text>
   <p><b>Alignment of PSLs<sup>FHA </sup>and yRAD53p<sup>FHA</sup></b>. The protein domain alignment of PSLs<sup>FHA </sup>and yRAD53p<sup>FHA </sup>is reported showing the conservation of active sites labeled with a green octagon when the residues are conserved. A yellow or a red octagon mark an aminoacid substitution of same or different polarity, respectively. At = <it>Arabidopsis thaliana</it>; Ac = <it>Aquilegia coerulea</it>; Al = <it>Arabidopsis lyrata</it>; Bd = <it>Brachypodium distachyon</it>; Br = <it>Brassica rapa</it>; Cp = <it>Carica papaya</it>; Cs = <it>Cucumis sativus</it>; Gm = <it>Glycine max</it>; Me = <it>Manihot esculenta</it>; Mg = <it>Mimulus guttatus</it>; Mt = <it>Medicago truncatula</it>; Os = <it>Oryza saliva</it>; Pp = <it>Prunus persica</it>; Pt = <it>Populus trichocarpa</it>; Sb = <it>Sorghum bicolour</it>; Si = <it>Setaria italica</it>; Sl = <it>Solanum lycopersicon</it>; Sm = <it>Selaginella moellendorffii</it>; Vv = <it>Vitis vinifera</it>.</p>
</text><graphic file="1471-2148-11-78-5" hint_layout="double"/></fig>
<p>The analysis of PINc domain in our dataset started with the prediction of its secondary structure in human SMG6 (hSMG6) [UniProt:Q86US8]. It is reported that hSMG6<sup>PINc </sup>(hSMG6) is 182 aa folded into a 5-stranded parallel beta-sheets that is highly twisted and alpha-helices arranged on both sides of each beta-sheet for a total of 6. Three aspartate residues are essential for PINc activity in hSMG6, while a threonine or a serine in the sequence (T/S)XD might be involved in catalytic role on the basis of similarity with other PINc domains <abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B17">17</abbr>
</abbrgrp>. SMART predicted a PINc domain of 152 aa lacking the first and the last alpha-helices while PSIPRED predicted 4 beta-sheets and 4 alpha helices in hSMG6 (data not shown). In our dataset the PINc domain ranged from a minimum of 123 aa in <it>Cucumis sativus </it>to a maximum of 162 aa in <it>Brachypodium distachyon </it>(Figure <figr fid="F6">6</figr>). In our prediction, the number of beta-sheets ranged from 3 in GmPSL3 and CsPSL1 to 9 in CpPSL1. The number of predicted alpha-helices ranged from 3 in CsPSL1, OsPSL1 and PSL3c to 7 in BdPSL1 and CpPSL1. Afterward, we compared the active residues of PINc in PSL proteins. The alignment between PINc domain of PSL proteins and hSMG6 is shown in Figure <figr fid="F7">7</figr> (refer to Additional file <supplr sid="S2">2</supplr>: Alignment of PSL<sup>PINc </sup>and hSMG6<sup>PINc </sup>residues to look at raw alignment). BrPSL1, SmPSL1 and PSL3c show the three expected aspartate residues at positions 6, 194 and 233 of the alignment. SiPSL1 and SbPSL1 have a substitution of two aspartate residues with asparagine-6, glutammate-194. When a substitution occurs, the aspartate-194 is mostly replaced with glutammate-194 except for CpPSL1 showing a different polarity residue (lysine-194) and MgPSL1 showing an alyphatic residue (alanine-194). The aspartate-233 is widely conserved except in BdPSL1, OsPSL1 and SiPSL1 where it is substituted with a serine-233. Most of the PSL proteins show a catalytic serine-231 in the described pattern SXD (where X can be D, N, E or S in this study) instead of threonine-231 in hSMG6, PtPSL1, SiPSL1, SmPSL1 and VvPSL1. GmPSL3 and SbPSL1 are the only proteins lacking the C-terminal extremity of PINc interrupting at leucine-203 and lysine-227 respectively thus missing the catalytic threonine/serine-231 and the aspartate-233.</p>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>Predicted secondary structure of PSLs<sup>PINc</sup></p></caption><text>
   <p><b>Predicted secondary structure of PSLs<sup>PINc</sup></b>. PINc domain predicted secondary structures of PSLs are reported with alpha-helices as violet tubes, beta-strands as yellow arrows and coiled regions as lines. Regions underlined with brackets show low significance as estimated by PSIPRED. The domain length is also reported. At = <it>Arabidopsis thaliana</it>; Ac = <it>Aquilegia coerulea</it>; Al = <it>Arabidopsis lyrata</it>; Bd = <it>Brachypodium distachyon</it>; Br = <it>Brassica rapa</it>; Cp = <it>Carica papaya</it>; Cs = <it>Cucumis sativus</it>; Gm = <it>Glycine max</it>; Me = <it>Manihot esculenta</it>; Mg = <it>Mimulus guttatus</it>; Mt = <it>Medicago truncatula</it>; Os = <it>Oryza saliva</it>; Pp = <it>Prunus persica</it>; Pt = <it>Populus trichocarpa</it>; Sb = <it>Sorghum bicolour</it>; Si = <it>Setaria italica</it>; Sl = <it>Solanum lycopersicon</it>; Sm = <it>Selaginella moellendorffii</it>; Vv = <it>Vitis vinifera</it>.</p>
</text><graphic file="1471-2148-11-78-6" hint_layout="double"/></fig>
<fig id="F7"><title><p>Figure 7</p></title><caption><p>Comparison of catalytic residues among PSLs<sup>PINc </sup>and hSMG6<sup>PINc</sup></p></caption><text>
   <p><b>Comparison of catalytic residues among PSLs<sup>PINc </sup>and hSMG6<sup>PINc</sup></b>. A schematic protein domain alignment of PSLs<sup>PINc </sup>and hSMG6<sup>PINc </sup>is reported showing the conservation of catalytic residues. Black lines represent gaps in the alignment. Active sites are labeled with a green octagon when the residues are conserved among PSLs<sup>PINc </sup>and hSMG6<sup>PINc</sup>. A yellow or a red octagon mark an aminoacid substitution of same or different polarity, respectively. At = <it>Arabidopsis thaliana</it>; Ac = <it>Aquilegia coerulea</it>; Al = <it>Arabidopsis lyrata</it>; Bd = <it>Brachypodium distachyon</it>; Br = <it>Brassica rapa</it>; Cp = <it>Carica papaya</it>; Cs = <it>Cucumis sativus</it>; Gm = <it>Glycine max</it>; Me = <it>Manihot esculenta</it>; Mg = <it>Mimulus guttatus</it>; Mt = <it>Medicago truncatula</it>; Os = <it>Oryza saliva</it>; Pp = <it>Prunus persica</it>; Pt = <it>Populus trichocarpa</it>; Sb = <it>Sorghum bicolour</it>; Si = <it>Setaria italica</it>; Sl = <it>Solanum lycopersicon</it>; Sm = <it>Selaginella moellendorffii</it>; Vv = <it>Vitis vinifera</it>.</p>
</text><graphic file="1471-2148-11-78-7" hint_layout="double"/></fig>
<suppl id="S2">
<title>
<p>Additional file 2</p>
</title>
<text>
<p>
<b>Alignment of PSL<sup>PINc </sup>and hSMG6<sup>PINc </sup>residues</b>. The protein domain alignment of PSLs<sup>PINc </sup>and hSMG6<sup>PINc </sup>is reported showing the conservation of catalytic residues. Dotted lines represent gaps in the alignment. Active sites are labeled with a green octagon when the residues are conserved among PSLs<sup>PINc </sup>and hSMG6<sup>PINc</sup>. A yellow or a red octagon mark an aminoacid substitution of same or different polarity, respectively.</p>
</text>
<file name="1471-2148-11-78-S2.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Selective pressure among amino acid sites in the <it>PSL </it>family</p>
</st>
<p>In order to test for presence of positive selection at individual amino acid codons, the site specific models implemented in DataMonkey 2010 webserver (<url>http://www.datamonkey.org</url>) <abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp> were used. The Integrative Selection Analysis of FEL, SLAC and FER alghoritms evidenced no significant positive selected sites (dN-dS &gt; 0). Conversely, 217 significant negative selected sites (dN-dS &lt; 0) were identified. The region between FHA to PINc domains included few negative selected sites which were mostly located near the functional domains. The codons encoding the highly conserved active sites glycine-5, arginine-6, serine-21 and histidine-24 in FHA and aspartate-6, serine/threonine-231 and aspartate-233 in PINc were also subjected to negative selection (Figure <figr fid="F8">8</figr>). These observations support the results obtained through sequence alignment and evidence an occurrence of negative pressure upon non-synonymous mutations in <it>PSL </it>genes. In particular, the regions next to the functional domains and the codons for active sites were affected.</p>
<fig id="F8"><title><p>Figure 8</p></title><caption><p>Distribution of negatively selected codons in PSL<sup>FHA </sup>and PSL<sup>PINc</sup></p></caption><text>
   <p><b>Distribution of negatively selected codons in PSL<sup>FHA </sup>and PSL<sup>PINc</sup></b>. A schematic representation of FHA and PINc domains is reported showing negatively selected sites as grey diamonds. Red diamonds mark active sites subjected to negative selection.</p>
</text><graphic file="1471-2148-11-78-8" hint_layout="single"/></fig>
</sec>
</sec>
<sec>
<st>
<p>Discussion</p>
</st>
<p>In this work, we isolated PSL genes in a diploid potato (2<it>n </it>= 2<it>x </it>= 24) and through <it>in silico </it>analysis we identified PSL in other plant species. The function of PSL in plants can be inferred from <it>Arabidopsis </it>study on <it>AtPS1 </it>gene that appears responsible for the spindle orientation at meiosis II playing a regulatory function, likely via RNA decay <abbrgrp>
<abbr bid="B7">7</abbr>
</abbrgrp>.</p>
<p>In all the analysed species, except soybean and potato, <it>PSL </it>loci appear as singleton behaving as resistant to duplication. It is known that angiosperms are mostly paleopolyploids <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>, many having survived multiple duplication events <abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp>. Analysis of genome sequences shows that some genes duplicate and persist as multiple copies after whole genome duplication (WGD) while other genes are iteratively returned to singleton status. Moreover, it seems that the singleton status is consistently restored for some functional gene groups like those involved in DNA repair or signal transduction <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>. The evolutive history of potato and soybean could explain the expansion of <it>PSL </it>genes occurring in these species that have experienced recent WGD. In cultivated potato (2<it>n </it>= 4<it>x </it>= 48), WGD is reported to have occurred about 10 million of years ago (<it>mya</it>) after the divergence from <it>Solanum lycopersicum </it>
<abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>. In soybean (2<it>n </it>= 2<it>x </it>= 40), which seems to be an ancient allopolyploid on the basis of two different centromeric repeat classes <abbrgrp>
<abbr bid="B23">23</abbr>
</abbrgrp>, two rounds of WGD happened the latter aging 10-15 <it>mya </it>
<abbrgrp>
<abbr bid="B24">24</abbr>
<abbr bid="B25">25</abbr>
</abbrgrp>. In addition, the location of the four soybean <it>PSLs </it>on different chromosomes reinforce their origin from WGD rather than tandem duplications.</p>
<p>The evident feature of PSL proteins is the contemporary presence of FHA and PINc domains. FHA is reported as a phosphothreonine (pT) binding domain showing a 11 beta-sandwich secondary structure, also containing small helical insertions between the beta-strands <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>. The FHA active sites are usually located in the loops connecting b3/b4, b4/b5 and b6/b7 strands. RAD53p<sup>FHA </sup>arginine-70, serine-85 and asparagine-107 (corresponding to arginine-6, serine-21 and asparagine-60 in Figure <figr fid="F5">5</figr>) are involved in the interaction with the phosphopeptide backbone. Arginine-83 (corresponding to arginine-19 in Figure <figr fid="F5">5</figr>) is the most important aminoacid for FHA binding specificity. Indeed, its conversion to glycine shifts the binding from pTXXD to pTXXI peptides in RAD53p<sup>FHA </sup>interaction experiments using surface plasmon resonance <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>. Glycine-69 and histidine-88 (corresponding to glycine-5 and histidine-24 in Figure <figr fid="F5">5</figr>) stabilize the architecture of the binding site. The remaining conserved residue, asparagine-112 (corresponding to asparagine-65 in Figure <figr fid="F5">5</figr>), is remote from the peptide binding site and serves to tether the beta turn between b7/8 to b10 <abbrgrp>
<abbr bid="B12">12</abbr>
<abbr bid="B15">15</abbr>
</abbrgrp>. The comparison of predicted secondary structure in our dataset showed a wide conservation among plant species regarding the number and the position of the beta-sheets and the presence of an helical insertion between the second and the third beta-sheet as a characteristic feature of monocot PSL proteins. The comparison of active sites showed that two PSL<sup>FHA </sup>residues involved in pT binding as well those involved in stabilisation of the architecture of the binding site are fully conserved except for asparagine-65 that is replaced with a different polarity residue (arginine) in AlPSL1, AtPS1, GmPSL3 and GmPSL4 with unknown possible effect on domain architecture. Moreover, as compared to RAD53p<sup>FHA</sup>, a conserved substitution in plants is a histidine instead of asparagine-60, known to be important for the selectivity of binding of phosphothreonine upon phosphoserine. The different polarity between these two residues might suggest a functional diversification of this active site. However, the consequences of this substitution in PSL<sup>FHA </sup>domain in terms of ligand binding cannot be easily predicted and should be assessed by protein:protein interaction analysis. It is also intriguing the lack of arginine-19 in plants raising the question of binding selectivity for plant PSL<sup>FHA</sup>.</p>
<p>As regards PINc domain, we made reference to human SMG6<sup>PINc </sup>which has RNAse activity and it is composed of alternating beta-sheets and alpha-helices. It is reported that hSMG6 is involved in NMD together with hSMG5 and hSMG7 <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp>. In Arabidopsis, <it>AtSMG7 </it>was proved to be involved in NMD and to be required for meiotic spindle organization in meiosis II <abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>. As reported by Glavan and colleagues <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>, hSMG6 and other PINc domains show three conserved aspartic residues at positions 1251, 1353 and 1392 (corresponding to aspartate-6, -194 and -233 in Additional File <supplr sid="S2">2</supplr>) involved in Mg<sup>2+ </sup>binding. Threonine or serine embedded in the motif (T/S) XD is proposed to be the catalytic site on the basis of sequence alignment of PIN domains and it is located at residue-1390 (corresponding to threonine-231 in Figure <figr fid="F7">7</figr>) in hSMG6<sup>PINc</sup>.</p>
<p>Our results showed differences of secondary structures in PSL<sup>PINc </sup>domains even between phylogenetically close organisms. Moreover, the typical alternation between beta-sheets and alpha-helices does not seem to be respected. In spite of these differences, the active sites showed a wide conservation among plants. The aspartate-6 is conserved in all proteins except for SbPSL1 and SiPSL1 where an asparagine is present. This substitution maybe occurred after the divergence of BEP from PACCAD clades among <it>Poaceae </it>given its absence in <it>Oryza sativa</it>, <it>Brachypodium distachyon </it>and <it>Selaginella moellendorffii</it>. The residue-194 showed the major degree of variation among the species mostly characterized by glutamate instead of aspartate. This substitution is not related to phylogenesis, being glutamate present in both monocots and dicots while aspartate-194 is shared by the <it>Brassicaceae </it>BrPSL1, the <it>Selaginellaceae </it>SmPSL1 and the <it>Solanaceae </it>PSL3c. In <it>Carica papaya</it>, <it>Mimulus guttatus </it>and <it>Medicago truncatula </it>the change involved aminoacids with a different polarity such as lysine, alanine, and asparagine. The aspartate-233 is widely conserved among PSL proteins, except those of <it>Poaceae </it>OsPSL1, SiPSL1 and BdPSL1 showing a serine-233, likely a substitution occurred after the separation of monocots from dicots. GmPSL3 and SbPSL1 lack this residue due to PINc domain truncation at leucine-203 and lysine-227, respectively. As compared to hSMG6, the majority of plants showed a serine instead of threonine-231 that is present only in PtPSL1, SiPSL1, SmPSL1 and VvPSL1. This substitution should not compromise the PINc activity since serine and threonine are the most represented residues at this position in PINc domains of different organisms <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp>. Based on the evidence that the three aspartate residues of PINc domain are crucial for RNase activity, we can argue that PSLs lacking one of these residues or showing aminoacid with different polarity (BdPSL1, CpPSL1, CsPSL1, GmPSL3, MgPSL1, MtPSL1, SbPSL1, SiPSL1, OsPSL1) have no enzymatic activity. However, we cannot exclude that they are partners of other proteins retaining RNAse activity. Indeed, in human, it is known that hSMG5 lacking two aspartate residues respect to hSMG6 has no enzymatic activity but the interaction between hSMG5 and hSMG7 led to a functional nuclease activity of SMG7-SMG5 complex <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>.</p>
<p>Functional analysis of <it>Arabidopsis PS1 </it>reinforced the evidence that the defects in meiotic spindle orientation in meiosis II led to the formation of diplopollen. Among the analysed species, <it>Manihot esculenta </it>(2<it>n </it>= 2<it>x </it>= 36) was reported to produce 2<it>n </it>pollen but the cytological mechanisms underlying its formation were not deeply investigated <abbrgrp>
<abbr bid="B27">27</abbr>
</abbrgrp>. Since the predicted MePSL protein has FHA and PINc active sites similar to those of other species, it is likely that the mechanism leading to 2<it>n </it>pollen in <it>Manihot esculenta </it>does not involve parallel spindles.</p>
<p>In the potato genotype analysed in this study, neither spindle defects nor 2<it>n </it>pollen have been reported <abbrgrp>
<abbr bid="B28">28</abbr>
</abbrgrp>. In this genotype, we identified three <it>PSL </it>loci and seven transcripts. Based on AtPS1 characterized by FHA and PINc domains we can suspect that PSL1a and PSL3c, carrying both domains, are functional proteins. In addition, it can be speculated that PSL1a that evidences the same PINc active residues of AtPS1 is the strongest candidate for the regulation of spindle orientation in meiosis II. The landscape of alternative splicing in potato PSL is not surprising since it has been already observed for genes involved in <it>Arabidopsis </it>meiosis. For instance, <it>AtSPO11-1</it>, involved in double strand breaks (DSBs) required for meiotic recombination, exhibits up to ten splicing forms showing PTCs <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp>. As inferred for <it>AtSPO11-1, PSLs </it>are possible target of NMD that could act as a post-transcriptional regulatory pathway for the proper expression of PSL. Alternative splicing was observed also in <it>Glycine max </it>but the lack of PTCs exclude NMD regulation of PSL transcripts. Defining the ligands of FHA and PINc domains and proving PSL as a component in NMD are essential to link PSLs to plant evolution by polyploidization via 2<it>n </it>gametes.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>In this study, we show that PSLs are common genes across <it>Viridaeplantae</it>. We provide evidence that <it>PSLs </it>occur mostly as singleton in the analyzed genomes except in soybean and potato both characterized by a recent event of whole genome duplication. We provide useful insight into evolutionary preservation of FHA and PINc domains throughout plant PSL genes, suggesting a fundamental role of these domains for PSL function. FHA appeared to be highly conserved, while PINc secondary structure and specific active sites showed a less conserved landscape, suggesting a functional diversification among PSL genes.</p>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<sec>
<st>
<p>Plant material</p>
</st>
<p>A previously described <abbrgrp>
<abbr bid="B30">30</abbr>
</abbrgrp> diploid clone of potato (named T710) coming from hybridization between <it>Solanum tuberosum </it>haploid USW3304 (2<it>n </it>= 2<it>x </it>= 24) and <it>S. chacoense </it>(2<it>n </it>= 2<it>x </it>= 24) has been used to isolate <it>PSL </it>genes.</p>
</sec>
<sec>
<st>
<p>Potato <it>PSL </it>cloning and sequence analysis</p>
</st>
<p>Plant genomic DNA was isolated from leaves using the DNeasy Plant Mini Kit (QIAGEN <url>http://www1.qiagen.com</url>). Bacterial plasmid DNA was isolated using QIAprep Spin Miniprep Kit (QIAGEN). Total RNA from prebolting buds was extracted using the RNeasy Plant Mini Kit (QIAGEN) and then treated with DNase I (Invitrogen, <url>http://www.invitrogen.com</url>) to remove residual genomic DNA. Primer pairs for cloning designed with PRIMER3 (<url>http://frodo.wi.mit.edu/cgi-bin/primer3/primer3.cgi</url>) are listed in Additional file <supplr sid="S3">3</supplr>: Primers used in this study.</p>
<suppl id="S3">
<title>
<p>Additional file 3</p>
</title>
<text>
<p>
<b>Primers used in this study</b>. Primers used for the isolation of <it>PSL </it>genomic clone and cDNAs</p>
</text>
<file name="1471-2148-11-78-S3.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<p>
<it>PSL1 </it>genomic fragment was amplified by PCR with primers PS_F1 and PS_R1 and full sequence with primers PS_F2 and PS_R2. The coding region of <it>PSL </it>transcripts was amplified by RT-PCR by using SuperScriptTM III Reverse Transcriptase (Invitrogen) with an oligo- dT<sub>12-18 </sub>(Invitrogen). The cDNA was subjected to PCR with PS_CF and PS_CR primers (Additional File <supplr sid="S3">3</supplr>). PCR products have been cloned into PCR2.1 with T/A Cloning kit (Invitrogen). Nucleotide sequencing was carried out by Eurofins MWG Operon sequencing service (Germany).</p>
<p>
<it>PSL1 </it>gene structure and cDNA predictions were carried out using FGENESH online tool (<url>http://linux1.softberry.com/berry.phtml?topic=fgenesh&amp;group=programs&amp;subgroup=gfind</url>) selecting Tomato as organism. <it>In silico </it>translation of cDNA sequences was carried out using the Expasy Translation tool (<url>http://www.expasy.ch/tools/dna.html</url>).</p>
</sec>
<sec>
<st>
<p>Identification of unannotated <it>PSL </it>genes</p>
</st>
<p>The PSL protein sequences were identified and collected by TBLASTN <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp> search against the Phytozome v6 database (<url>http://www.phytozome.net/</url>), SGN (SOL Genomic Network, <url>http://sgn.cornell.edu/</url>) and Potato Genome Sequencing Consortium (PGSC, <url>http://potatogenomics.plantbiology.msu.edu/index.php?p=blast</url>).</p>
</sec>
<sec>
<st>
<p>Molecular Phylogenetic analysis by Maximum Likelihood method</p>
</st>
<p>The PSL protein sequences were firstly aligned by MUSCLE <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp> using the default settings of MEGA5 <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>. The best model for Maximum Likelihood phylogeny analysis was chosen testing all the available models in ProtTest version 2.4 <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp> with slow optimization strategy and selecting the one with highest AICc value. The evolutionary history was then inferred by using the Maximum Likelihood method according to Jones et al. w/freq. model <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>. The bootstrap consensus tree was inferred from 100 replicates <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>. Initial tree(s) for the heuristic search were obtained automatically as following. When the number of common sites is &lt;100 or less than one fourth of the total number of sites, the maximum parsimony method was used, otherwise BIONJ method with MCL distance matrix was used. A discrete Gamma distribution was used to model evolutionary rate differences among sites (4 categories, +G, parameter = 1.3900). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 9.0566% sites). All ambiguous positions were removed for each sequence pair. Evolutionary analyses were conducted in MEGA5.</p>
</sec>
<sec>
<st>
<p>PSL domain analysis</p>
</st>
<p>The locations of FHA and PINc domains within <it>PSL </it>genes were detected using SMART <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp>. Secondary structure prediction was performed using domains from PSL sequences as input into the PSIPRED secondary structure prediction server <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp>. The program MUSCLE <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp> was used to do multiple sequence alignments of FHA and PINc domains in PSL proteins, yRAD53p [NCBI:6325104] and hSMG6 [UniProt:Q86US8]. Conservation of phosphothreonine-binding residues in FHA and of RNAse activity residues in PINc were determined by alignment with yRAD53p and hSMG6, respectively.</p>
</sec>
<sec>
<st>
<p>Comparison of dN-dS values between <it>PSL </it>sequences</p>
</st>
<p>The coding sequences of PSLs were obtained from the databases reported in Additional File <supplr sid="S1">1</supplr>. The terminal codon was manually removed, then the codon alignment was performed by MUSCLE <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp> using the default settings of MEGA5 <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp> (Tamura <it>et al.</it>, personal communication). To select the best substitution model we used JmodelTest 0.1.1 <abbrgrp>
<abbr bid="B39">39</abbr>
<abbr bid="B40">40</abbr>
</abbrgrp> with default settings. The GTR <abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp> model was selected as the best fitting as evidenced by AICc values.</p>
<p>The codon alignment was then uploaded on DataMonkey 2010 server <abbrgrp>
<abbr bid="B18">18</abbr>
<abbr bid="B42">42</abbr>
</abbrgrp> and dN-dS evaluated using GTR as substitution model, and SLAC (default settings except for Global dN/dS value = estimated with CI), FEL and REL <abbrgrp>
<abbr bid="B43">43</abbr>
</abbrgrp> alghoritms. Codons subjected to evolutive pressure were identified with Integrative Selection Analysis selecting significance levels for SLAC p &lt; 0.1, FEL p &lt; 0.1 and REL Bayes Factor &lt; 50. Only codons with a significant dN-dS value according to all the three methods were reported.</p>
</sec>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>RAC performed the molecular research and the protein domain analysis, participated in phylogenetic analysis, study design and drafted the manuscript. WS carried out the phylogenetic analysis and participated in bioinformatic analysis design. GC participated in molecular analysis. FMC performed the molecular analysis design and revised the manuscript. CC conceived of and coordinated the study, and revised the manuscript. All authors read and approved the final manuscript.</p>
</sec>
<sec>
<st>
<p>Note</p>
</st>
<p>#Contribution n 358 from CNR - National Research Council of Italy, Institute of Plant Genetics, Research Division Portici.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>We wish to thank Ms. Rosa Paparo for helpful technical assistance, Prof. Domenico Carputo and Dr. Raphael Mercier for useful comments on the manuscript. This work was partially funded by Galileo Grant from University Italy-France.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Polyploidy in Plants</p></title><aug><au><snm>Tate</snm><fnm>JA</fnm></au><au><snm>Soltis</snm><fnm>DE</fnm></au><au><snm>Soltis</snm><fnm>PS</fnm></au></aug><source>The Evolution of the Genome</source><publisher>Elsevier Inc</publisher><editor>Gregory T</editor><pubdate>2005</pubdate><fpage>371</fpage><lpage>414</lpage><xrefbib><pubid idtype="doi">full_text</pubid></xrefbib></bibl><bibl id="B2"><title><p>Genomic plasticity and the diversity of polyploid plants</p></title><aug><au><snm>Leitch</snm><fnm>AR</fnm></au><au><snm>Leitch</snm><fnm>IJ</fnm></au></aug><source>Science</source><pubdate>2008</pubdate><volume>320</volume><fpage>481</fpage><lpage>483</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1153585</pubid><pubid idtype="pmpid" link="fulltext">18436776</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Tansley Review No-78 - Gametes with the somatix chromosome-number - mechanisms of their formation and rolein the evolution of autopolyploid plants</p></title><aug><au><snm>Bretagnolle</snm><fnm>F</fnm></au><au><snm>Thompson</snm><fnm>J</fnm></au></aug><source>New Phytol</source><pubdate>1995</pubdate><volume>129</volume><issue>1</issue><fpage>1</fpage><lpage>22</lpage><xrefbib><pubid idtype="doi">10.1111/j.1469-8137.1995.tb03005.x</pubid></xrefbib></bibl><bibl id="B4"><title><p>Meiotic mutations and crop improvement</p></title><aug><au><snm>Consiglio</snm><fnm>F</fnm></au><au><snm>Carputo</snm><fnm>D</fnm></au><au><snm>Frusciante</snm><fnm>L</fnm></au><au><snm>Monti</snm><fnm>LM</fnm></au><au><snm>Conicella</snm><fnm>C</fnm></au></aug><source>Plant Breeding reviews</source><publisher>Purdue University, John Wiley &amp; Sons, Inc</publisher><editor>Janick J</editor><pubdate>2007</pubdate><volume>28</volume><fpage>163</fpage><lpage>203</lpage><xrefbib><pubid idtype="doi">full_text</pubid></xrefbib></bibl><bibl id="B5"><title><p>SWITCH1 (SWI1): a novel protein required for the establishment of sister chromatid cohesion and for bivalent formation at meiosis</p></title><aug><au><snm>Mercier</snm><fnm>R</fnm></au><au><snm>Vezon</snm><fnm>D</fnm></au><au><snm>Bullier</snm><fnm>E</fnm></au><au><snm>Motamayor</snm><fnm>J</fnm></au><au><snm>Sellier</snm><fnm>A</fnm></au><au><snm>Lefevre</snm><fnm>F</fnm></au><au><snm>Pelletier</snm><fnm>G</fnm></au><au><snm>Horlow</snm><fnm>C</fnm></au></aug><source>Gene Dev</source><pubdate>2001</pubdate><volume>15</volume><issue>14</issue><fpage>1859</fpage><lpage>1871</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.203201</pubid><pubid idtype="pmcid">312743</pubid><pubid idtype="pmpid">11459834</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Gamete formation without meiosis in Arabidopsis</p></title><aug><au><snm>Ravi</snm><fnm>M</fnm></au><au><snm>Marimuthu</snm><fnm>MPA</fnm></au><au><snm>Siddiqi</snm><fnm>I</fnm></au></aug><source>Nature</source><pubdate>2008</pubdate><volume>451</volume><fpage>1121</fpage><lpage>1124</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature06557</pubid><pubid idtype="pmpid" link="fulltext">18272967</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Mutations in AtPS1 (Arabidopsis thaliana parallel spindle 1) lead to the production of diploid pollen grains</p></title><aug><au><snm>d&apos;Erfurth</snm><fnm>I</fnm></au><au><snm>Jolivet</snm><fnm>S</fnm></au><au><snm>Froger</snm><fnm>N</fnm></au><au><snm>Catrice</snm><fnm>O</fnm></au><au><snm>Novatchkova</snm><fnm>M</fnm></au><au><snm>Simon</snm><fnm>M</fnm></au><au><snm>Jenczewski</snm><fnm>E</fnm></au><au><snm>Mercier</snm><fnm>R</fnm></au></aug><source>PLoS Genet</source><pubdate>2008</pubdate><volume>4</volume><fpage>e1000274</fpage><xrefbib><pubidlist><pubid idtype="pmcid">2581889</pubid><pubid idtype="pmpid">19043546</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Cytological studies on the development of pollen grains in different races of solanum</p></title><aug><au><snm>Fukuda</snm><fnm>Y</fnm></au></aug><source>Bot Mag Tokyo</source><pubdate>1927</pubdate><volume>41</volume><fpage>459</fpage><lpage>474</lpage></bibl><bibl id="B9"><title><p>The inheritance of three mechanisms of diplandroid (2<it>n </it>pollen) formation in diploid potatoes</p></title><aug><au><snm>Mok</snm><fnm>D</fnm></au><au><snm>Peloquin</snm><fnm>J</fnm></au></aug><source>Heredity</source><pubdate>1975</pubdate><volume>35</volume><fpage>1</fpage><lpage>8</lpage><xrefbib><pubid idtype="doi">10.1038/hdy.1975.100</pubid></xrefbib></bibl><bibl id="B10"><title><p>Meiotic mutants in potato. Valuable variants</p></title><aug><au><snm>Peloquin</snm><fnm>SJ</fnm></au><au><snm>Boiteux</snm><fnm>LS</fnm></au><au><snm>Carputo</snm><fnm>D</fnm></au></aug><source>Genetics</source><pubdate>1999</pubdate><volume>153</volume><fpage>1493</fpage><lpage>1499</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1460881</pubid><pubid idtype="pmpid">10581260</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>The FHA domain mediates phosphoprotein interactions</p></title><aug><au><snm>Li</snm><fnm>J</fnm></au><au><snm>Lee</snm><fnm>GI</fnm></au><au><snm>Van Doren</snm><fnm>SR</fnm></au><au><snm>Walker</snm><fnm>JC</fnm></au></aug><source>J Cell Sci</source><pubdate>2000</pubdate><volume>113</volume><fpage>4143</fpage><lpage>4149</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11069759</pubid></xrefbib></bibl><bibl id="B12"><title><p>The FHA domain</p></title><aug><au><snm>Durocher</snm><fnm>D</fnm></au><au><snm>Jackson</snm><fnm>SP</fnm></au></aug><source>FEBS Lett</source><pubdate>2002</pubdate><volume>513</volume><fpage>58</fpage><lpage>66</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0014-5793(01)03294-X</pubid><pubid idtype="pmpid" link="fulltext">11911881</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>The coupling of alternative splicing and nonsense-mediated mRNA decay</p></title><aug><au><snm>Lareau</snm><fnm>LF</fnm></au><au><snm>Brooks</snm><fnm>AN</fnm></au><au><snm>Soergel</snm><fnm>DAW</fnm></au><au><snm>Meng</snm><fnm>Q</fnm></au><au><snm>Brenner</snm><fnm>SE</fnm></au></aug><source>Adv Exp Med Biol</source><pubdate>2007</pubdate><volume>623</volume><fpage>190</fpage><lpage>211</lpage><xrefbib><pubidlist><pubid idtype="doi">full_text</pubid><pubid idtype="pmpid">18380348</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>A rule for termination-codon position within intron-containing genes: when nonsense affects RNA abundance</p></title><aug><au><snm>Nagy</snm><fnm>E</fnm></au><au><snm>Maquat</snm><fnm>LE</fnm></au></aug><source>Trends Biochem Sci</source><pubdate>1998</pubdate><volume>23</volume><issue>6</issue><fpage>198</fpage><lpage>199</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0968-0004(98)01208-0</pubid><pubid idtype="pmpid" link="fulltext">9644970</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>The molecular basis of FHA domain:phosphopeptide binding specificity and implications for phospho-dependent signaling mechanisms</p></title><aug><au><snm>Durocher</snm><fnm>D</fnm></au><au><snm>Taylor</snm><fnm>IA</fnm></au><au><snm>Sarbassova</snm><fnm>D</fnm></au><au><snm>Haire</snm><fnm>LF</fnm></au><au><snm>Westcott</snm><fnm>SL</fnm></au><au><snm>Jackson</snm><fnm>SP</fnm></au><au><snm>Smerdon</snm><fnm>SJ</fnm></au><au><snm>Yaffe</snm><fnm>MB</fnm></au></aug><source>Mol Cell</source><pubdate>2000</pubdate><volume>6</volume><issue>5</issue><fpage>1169</fpage><lpage>1182</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1097-2765(00)00114-3</pubid><pubid idtype="pmpid" link="fulltext">11106755</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Distant structural homology leads to the functional characterization of an archaeal PIN domain as an exonuclease</p></title><aug><au><snm>Arcus</snm><fnm>VL</fnm></au><au><snm>B&#228;ckbro</snm><fnm>K</fnm></au><au><snm>Roos</snm><fnm>A</fnm></au><au><snm>Daniel</snm><fnm>EL</fnm></au><au><snm>Baker</snm><fnm>EN</fnm></au></aug><source>J Biol Chem</source><pubdate>2004</pubdate><volume>279</volume><issue>16</issue><fpage>16471</fpage><lpage>16478</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M313833200</pubid><pubid idtype="pmpid" link="fulltext">14734548</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Structures of the PIN domains of SMG6 and SMG5 reveal a nuclease within the mRNA surveillance complex</p></title><aug><au><snm>Glavan</snm><fnm>F</fnm></au><au><snm>Behm-Ansmant</snm><fnm>I</fnm></au><au><snm>Izaurralde</snm><fnm>E</fnm></au><au><snm>Conti</snm><fnm>E</fnm></au></aug><source>The EMBO Journal</source><pubdate>2006</pubdate><volume>25</volume><issue>21</issue><fpage>5117</fpage><lpage>5125</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.emboj.7601377</pubid><pubid idtype="pmcid">1630413</pubid><pubid idtype="pmpid">17053788</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Datamonkey: rapid detection of selective pressure on individual sites of codon alignments</p></title><aug><au><snm>Pond</snm><fnm>SLK</fnm></au><au><snm>Frost</snm><fnm>SDW</fnm></au></aug><source>Bioinformatics</source><pubdate>2005</pubdate><volume>21</volume><issue>10</issue><fpage>2531</fpage><lpage>2533</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bti320</pubid><pubid idtype="pmpid" link="fulltext">15713735</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events</p></title><aug><au><snm>Bowers</snm><fnm>JE</fnm></au><au><snm>Chapman</snm><fnm>BA</fnm></au><au><snm>Rong</snm><fnm>J</fnm></au><au><snm>Paterson</snm><fnm>AH</fnm></au></aug><source>Nature</source><pubdate>2003</pubdate><volume>422</volume><issue>6930</issue><fpage>433</fpage><lpage>438</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature01521</pubid><pubid idtype="pmpid" link="fulltext">12660784</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics</p></title><aug><au><snm>Paterson</snm><fnm>AH</fnm></au><au><snm>Bowers</snm><fnm>JE</fnm></au><au><snm>Chapman</snm><fnm>BA</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2004</pubdate><volume>101</volume><issue>26</issue><fpage>9903</fpage><lpage>9908</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0307901101</pubid><pubid idtype="pmcid">470771</pubid><pubid idtype="pmpid">15161969</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Evolutionary genetics of genome merger and doubling in plants</p></title><aug><au><snm>Doyle</snm><fnm>JJ</fnm></au><au><snm>Flagel</snm><fnm>LE</fnm></au><au><snm>Paterson</snm><fnm>AH</fnm></au><au><snm>Rapp</snm><fnm>RA</fnm></au><au><snm>Soltis</snm><fnm>DE</fnm></au><au><snm>Soltis</snm><fnm>PS</fnm></au><au><snm>Wendel</snm><fnm>JF</fnm></au></aug><source>Annu Rev Genet</source><pubdate>2008</pubdate><volume>42</volume><fpage>443</fpage><lpage>461</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.genet.42.110807.091524</pubid><pubid idtype="pmpid" link="fulltext">18983261</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Plants with double genomes might have had a better chance to survive the Cretaceous-Tertiary extinction event</p></title><aug><au><snm>Fawcett</snm><fnm>JA</fnm></au><au><snm>Maere</snm><fnm>S</fnm></au><au><snm>Van de Peer</snm><fnm>Y</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2009</pubdate><volume>106</volume><issue>14</issue><fpage>5737</fpage><lpage>5742</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0900906106</pubid><pubid idtype="pmcid">2667025</pubid><pubid idtype="pmpid">19325131</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Molecular and chromosomal evidence for allopolyploidy in soybean</p></title><aug><au><snm>Gill</snm><fnm>N</fnm></au><au><snm>Findley</snm><fnm>S</fnm></au><au><snm>Walling</snm><fnm>JG</fnm></au><au><snm>Hans</snm><fnm>C</fnm></au><au><snm>Ma</snm><fnm>J</fnm></au><au><snm>Doyle</snm><fnm>J</fnm></au><au><snm>Stacey</snm><fnm>G</fnm></au><au><snm>Jackson</snm><fnm>SA</fnm></au></aug><source>Plant Physiol</source><pubdate>2009</pubdate><volume>151</volume><issue>3</issue><fpage>1167</fpage><lpage>1174</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1104/pp.109.137935</pubid><pubid idtype="pmcid">2773056</pubid><pubid idtype="pmpid">19605552</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Paleopolyploidy and gene duplication in soybean and other legumes</p></title><aug><au><snm>Shoemaker</snm><fnm>RC</fnm></au><au><snm>Schlueter</snm><fnm>J</fnm></au><au><snm>Doyle</snm><fnm>JJ</fnm></au></aug><source>Curr Opin Plant Biol</source><pubdate>2006</pubdate><volume>9</volume><issue>2</issue><fpage>104</fpage><lpage>109</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.pbi.2006.01.007</pubid><pubid idtype="pmpid" link="fulltext">16458041</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>The nonsense-mediated decay RNA surveillance pathway</p></title><aug><au><snm>Chang</snm><fnm>Y-F</fnm></au><au><snm>Imam</snm><fnm>JS</fnm></au><au><snm>Wilkinson</snm><fnm>MF</fnm></au></aug><source>Annu Rev Biochem</source><pubdate>2007</pubdate><volume>76</volume><fpage>51</fpage><lpage>74</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.biochem.76.050106.093909</pubid><pubid idtype="pmpid" link="fulltext">17352659</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Arabidopsis SMG7 protein is required for exit from meiosis</p></title><aug><au><snm>Riehs</snm><fnm>N</fnm></au><au><snm>Akimcheva</snm><fnm>S</fnm></au><au><snm>Puizina</snm><fnm>J</fnm></au><au><snm>Bulankova</snm><fnm>P</fnm></au><au><snm>Idol</snm><fnm>RA</fnm></au><au><snm>Siroky</snm><fnm>J</fnm></au><au><snm>Schleiffer</snm><fnm>A</fnm></au><au><snm>Schweizer</snm><fnm>D</fnm></au><au><snm>Shippen</snm><fnm>DE</fnm></au><au><snm>Riha</snm><fnm>K</fnm></au></aug><source>J Cell Sci</source><pubdate>2008</pubdate><volume>121</volume><fpage>2208</fpage><lpage>2216</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1242/jcs.027862</pubid><pubid idtype="pmpid" link="fulltext">18544632</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>A cytogenetic study of bilateral sexual polyploidization in cassava (<it>Manihot esculenta </it>Crantz)</p></title><aug><au><snm>Ogburia</snm><fnm>M</fnm></au><au><snm>Yabuya</snm><fnm>T</fnm></au><au><snm>Adachi</snm><fnm>T</fnm></au></aug><source>Plant Breeding</source><pubdate>2002</pubdate><volume>121</volume><fpage>278</fpage><lpage>280</lpage><xrefbib><pubid idtype="doi">10.1046/j.1439-0523.2002.00694.x</pubid></xrefbib></bibl><bibl id="B28"><title><p>Cytological evidences of SDR-FDR mixture in the formation of 2n eggs in a potato diploid clone</p></title><aug><au><snm>Conicella</snm><fnm>C</fnm></au><au><snm>Barone</snm><fnm>A</fnm></au><au><snm>Giudice</snm><fnm>A</fnm></au><au><snm>Frusciante</snm><fnm>L</fnm></au><au><snm>Monti</snm><fnm>LM</fnm></au></aug><source>Theor Appl Genet</source><pubdate>1991</pubdate><volume>81</volume><issue>1</issue><fpage>59</fpage><lpage>63</lpage><xrefbib><pubid idtype="doi">10.1007/BF00226112</pubid></xrefbib></bibl><bibl id="B29"><title><p>Molecular characterisation of two paralogous SPO11 homologues in Arabidopsis thaliana</p></title><aug><au><snm>Hartung</snm><fnm>F</fnm></au><au><snm>Puchta</snm><fnm>H</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2000</pubdate><volume>28</volume><issue>7</issue><fpage>1548</fpage><lpage>1554</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/28.7.1548</pubid><pubid idtype="pmcid">102794</pubid><pubid idtype="pmpid">10710421</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Selection of potato diploid hybrids for 2<it>n </it>gamete production</p></title><aug><au><snm>Barone</snm><fnm>C</fnm></au><au><snm>Frusciante</snm><fnm></fnm></au></aug><source>Journal of Genetics and Breeding</source><pubdate>1993</pubdate><volume>47</volume><fpage>313</fpage><lpage>318</lpage></bibl><bibl id="B31"><title><p>Protein database searches for multiple alignments</p></title><aug><au><snm>Altschul</snm><fnm>SF</fnm></au><au><snm>Lipman</snm><fnm>DJ</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1990</pubdate><volume>87</volume><issue>14</issue><fpage>5509</fpage><lpage>5513</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.87.14.5509</pubid><pubid idtype="pmcid">54354</pubid><pubid idtype="pmpid">2196570</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>MUSCLE: multiple sequence alignment with high accuracy and high throughput</p></title><aug><au><snm>Edgar</snm><fnm>RC</fnm></au></aug><source>NAR</source><pubdate>2004</pubdate><volume>32</volume><issue>5</issue><fpage>1792</fpage><lpage>1797</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkh340</pubid><pubid idtype="pmcid">390337</pubid><pubid idtype="pmpid">15034147</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0</p></title><aug><au><snm>Tamura</snm><fnm>K</fnm></au><au><snm>Dudley</snm><fnm>J</fnm></au><au><snm>Nei</snm><fnm>M</fnm></au><au><snm>Kumar</snm><fnm>S</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2007</pubdate><volume>24</volume><fpage>1596</fpage><lpage>1599</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msm092</pubid><pubid idtype="pmpid" link="fulltext">17488738</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>ProtTest: Selection of best-fit models of protein evolution</p></title><aug><au><snm>Abascal</snm><fnm>F</fnm></au><au><snm>Zardoya</snm><fnm>R</fnm></au><au><snm>Posada</snm><fnm>D</fnm></au></aug><source>Bioinformatics</source><pubdate>2005</pubdate><volume>21</volume><fpage>2104</fpage><lpage>2105</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bti263</pubid><pubid idtype="pmpid" link="fulltext">15647292</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>The rapid generation of mutation data matrices from protein sequences</p></title><aug><au><snm>Jones</snm><fnm>DT</fnm></au><au><snm>Taylor</snm><fnm>WR</fnm></au><au><snm>Thornton</snm><fnm>JM</fnm></au></aug><source>Comput Appl Biosci</source><pubdate>1992</pubdate><volume>8</volume><issue>3</issue><fpage>275</fpage><lpage>282</lpage><xrefbib><pubid idtype="pmpid">1633570</pubid></xrefbib></bibl><bibl id="B36"><title><p>Confidence limits on phylogenies: An approach using the bootstrap</p></title><aug><au><snm>Felsenstein</snm><fnm></fnm></au></aug><source>Evolution</source><pubdate>1985</pubdate><volume>39</volume><fpage>783</fpage><lpage>791</lpage><xrefbib><pubid idtype="doi">10.2307/2408678</pubid></xrefbib></bibl><bibl id="B37"><title><p>SMART, a simple modular architecture research tool: identification of signaling domains</p></title><aug><au><snm>Schultz</snm><fnm>J</fnm></au><au><snm>Milpetz</snm><fnm>F</fnm></au><au><snm>Bork</snm><fnm>P</fnm></au><au><snm>Ponting</snm><fnm>CP</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1998</pubdate><volume>95</volume><issue>11</issue><fpage>5857</fpage><lpage>5864</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.95.11.5857</pubid><pubid idtype="pmcid">34487</pubid><pubid idtype="pmpid">9600884</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>The PSIPRED protein structure prediction server</p></title><aug><au><snm>McGuffin</snm><fnm>LJ</fnm></au><au><snm>Bryson</snm><fnm>K</fnm></au><au><snm>Jones</snm><fnm>DT</fnm></au></aug><source>Bioinformatics</source><pubdate>2000</pubdate><volume>16</volume><issue>4</issue><fpage>404</fpage><lpage>405</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/16.4.404</pubid><pubid idtype="pmpid" link="fulltext">10869041</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>jModelTest: Phylogenetic Model Averaging</p></title><aug><au><snm>Posada</snm><fnm>D</fnm></au></aug><source>Mol Biol Evol</source><inpress/></bibl><bibl id="B40"><title><p>A simple, fast and accurate method to estimate large phylogenies by maximum-likelihood</p></title><aug><au><snm>Guindon</snm><fnm>S</fnm></au><au><snm>Gascuel</snm><fnm>O</fnm></au></aug><source>Syst Biol</source><pubdate>2003</pubdate><volume>52</volume><fpage>696</fpage><lpage>704</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1080/10635150390235520</pubid><pubid idtype="pmpid" link="fulltext">14530136</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Some Probabilistic and Statistical Problems in the Analysis of DNA Sequences</p></title><aug><au><snm>Tavar&#233;</snm><fnm>S</fnm></au></aug><source>Lectures on Mathematics in the Life Sciences</source><editor>AMS</editor><pubdate>1986</pubdate><volume>17</volume><fpage>57</fpage><lpage>86</lpage></bibl><bibl id="B42"><title><p>HyPhy: hypothesis testing using phylogenies</p></title><aug><au><snm>Pond</snm><fnm>SLK</fnm></au><au><snm>Muse</snm><fnm>SV</fnm></au></aug><source>Bioinformatics</source><pubdate>2005</pubdate><volume>21</volume><issue>5</issue><fpage>676</fpage><lpage>679</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bti079</pubid><pubid idtype="pmpid" link="fulltext">15509596</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>Not So Different After All: A Comparison of Methods for Detecting Amino Acid Sites Under Selection</p></title><aug><au><snm>Pond</snm><fnm>SLK</fnm></au><au><snm>Frost</snm><fnm>SDW</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2005</pubdate><volume>22</volume><issue>5</issue><fpage>1208</fpage><lpage>1222</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msi105</pubid><pubid idtype="pmpid" link="fulltext">15703242</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>