<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-8-190</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Evolution of SET-domain protein families in the unicellular and multicellular Ascomycota fungi</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Veerappan</snm>
               <mi>S</mi>
               <fnm>Chendhore</fnm>
               <insr iid="I1"/>
               <email>cveerappan@gmail.com</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Avramova</snm>
               <fnm>Zoya</fnm>
               <insr iid="I1"/>
               <email>zavramova2@unl.edu</email>
            </au>
            <au id="A3">
               <snm>Moriyama</snm>
               <mi>N</mi>
               <fnm>Etsuko</fnm>
               <insr iid="I2"/>
               <email>emoriyama2@unl.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, NE, USA</p>
            </ins>
            <ins id="I2">
               <p>School of Biological Sciences and Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, NE, USA</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2008</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>190</fpage>
         <url>http://www.biomedcentral.com/1471-2148/8/190</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18593478</pubid>
               <pubid idtype="doi">10.1186/1471-2148-8-190</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>23</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>01</day>
               <month>7</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>01</day>
               <month>7</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Veerappan et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The evolution of multicellularity is accompanied by the occurrence of differentiated tissues, of organismal developmental programs, and of mechanisms keeping the balance between proliferation and differentiation. Initially, the SET-domain proteins were associated exclusively with regulation of developmental genes in metazoa. However, finding of <it>SET-domain </it>genes in the unicellular yeasts <it>Saccharomyces cerevisiae </it>and <it>Schizosaccharomyces pombe </it>suggested that SET-domain proteins regulate a much broader variety of biological programs. Intuitively, it is expected that the numbers, types, and biochemical specificity of SET-domain proteins of multicellular <it>versus </it>unicellular forms would reflect the differences in their biology. However, comparisons across the unicellular and multicellular domains of life are complicated by the lack of knowledge of the ancestral <it>SET-domain </it>genes. Even within the crown group, different biological systems might use the epigenetic 'code' differently, adapting it to organism-specific needs. Simplifying the model, we undertook a systematic phylogenetic analysis of one monophyletic fungal group (Ascomycetes) containing unicellular yeasts, Saccharomycotina (hemiascomycetes), and a filamentous fungal group, Pezizomycotina (euascomycetes).</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Systematic analysis of the <it>SET-domain </it>genes across an entire eukaryotic phylum has outlined clear distinctions in the <it>SET-domain </it>gene collections in the unicellular and in the multicellular (filamentous) relatives; diversification of <it>SET-domain </it>gene families has increased further with the expansion and elaboration of multicellularity in animal and plant systems. We found several ascomycota-specific <it>SET-domain </it>gene groups; each was unique to either Saccharomycotina or Pezizomycotina fungi. Our analysis revealed that the numbers and types of <it>SET-domain </it>genes in the Saccharomycotina did not reflect the habitats, pathogenicity, mechanisms of sexuality, or the ability to undergo morphogenic transformations. However, novel genes have appeared for functions associated with the transition to multicellularity. Descendents of most of the <it>SET</it>-<it>domain </it>gene families found in the filamentous fungi could be traced in the genomes of extant animals and plants, albeit as more complex structural forms.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p><it>SET-domain </it>genes found in the filamentous species but absent from the unicellular sister group reflect two alternative evolutionary events: deletion from the yeast genomes or appearance of novel structures in filamentous fungal groups. There were no Ascomycota-specific <it>SET-domain </it>gene families (<it>i.e</it>., absent from animal and plant genomes); however, plants and animals share <it>SET-domain </it>gene subfamilies that do not exist in the fungi. Phylogenetic and gene-structure analyses defined several animal and plant <it>SET</it>-<it>domain </it>genes as sister groups while those of fungal origin were basal to them. Plants and animals also share SET-domain subfamilies that do not exist in fungi.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The genes encoding proteins that contain SET-domain sequences (<it>SET-domain </it>genes) are ancient, existing in the Bacterial Domain of life <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, but have proliferated and evolved novel functions connected with the appearance of eukaryotes. Because SET-domain proteins modify chromatin by methylating specific lysines on the histone tails <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>, it is not surprising that <it>SET-domain </it>genes are present in eukaryotes from the simple unicellular organisms to the multicellular animals and plants. The pattern and the complexity of epigenetic marks 'written' by the SET-domain proteins correlate with the increased requirements of multicellular organisms <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp> including regulation of proliferation, ontogenesis, adhesion-mediated silencing, and disease <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>.</p>
         <p>The fungal division Ascomycota provides a unique model to trace the fate of the <it>SET-domain </it>genes in connection to multicellularity within one monophyletic group. It contains the unicellular Saccharomycotina (hemiascomycetes) and the multicellular filamentous fungi Pezizomycotina (or euascomycetes) as sister-groups. <it>Schizosaccharomyces pombe </it>(<it>S. pombe</it>), which belongs to the Taphrinomycotina (or archiascomycete), is an outgroup <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. The fungi selected from the Saccharomycotina for this study include species existing as unicellular yeasts [<it>Saccharomyces cerevisiae </it>(<it>S. cerevisiae</it>), <it>Candida glabrata (C. glabrata), Debaryomyces hansenii </it>(<it>D. hansenii</it>), and <it>Yarrowia lipolytica </it>(<it>Y</it>.<it> lipolytica</it>)], as a permanently filamentous yeast, <it>Ashbya gossypii </it>(<it>A. gossypii</it>), or as an organism that changes morphology from the yeast to the filamentous forms in response to environmental cues, the dimorphic <it>Candida albicans </it>(<it>C. albicans</it>).</p>
         <p>The Saccharomycotina has diverged from the filamentous fungi around 400 million years ago and at estimated >1 billion years ago from animals and plants <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>. Substantial events might have taken place during such periods of time; thereby, by comparing the <it>SET-domain </it>genes in unicellular and filamentous Ascomycetes, it might be possible to identify families that have evolved in connection with the appearance of the filamentous (multicellular) forms. <it>Neurospora crassa (N. crassa) </it>is a standard model for the Pezizomycotina (euascomycetes) group because it is a generalist species, less specialized in its biology than many pathogens, symbionts, and fungi of narrow habitat <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. <it>N. crassa </it>is a bearer of the ancestral characteristics, allowing comparisons of <it>SET-domain </it>gene representation in the closely related pathogen <it>Magnaporthe grisea </it>(<it>M. grisea</it>), in the slightly more distant <it>Fusarium graminearum </it>(<it>F</it>. <it>graminearum</it>), and in the more distantly related <it>Aspergillus fumigatus </it>(<it>A. fumigatus</it>).<it> SET-domain </it>genes found in <it>N. crassa </it>and in the other filamentous fungi, but not in their unicellular relatives, could illustrate evolution of genes connected with transition to multicellular functions; unshared <it>SET-domain </it>genes between nonpathogenic and pathogenic relatives might be related to pathogenicity. Extended comparisons with SET-domain types and families present in the metazoan and plant genomes could outline evolutionary relationships between plant, animal, and fungal kingdoms. A few multicellular genomes, including an invertebrate (<it>Drosophila melanogaster</it>), a mammal (<it>Mus musculus</it>), and a plant (<it>Arabidopsis thaliana</it>), are provided as a reference and are not discussed in detail.</p>
         <p>The main questions asked here were: <it>first</it>, whether the type and the number of <it>SET-domain </it>genes in species from the same phylum would correlate with their existence as unicellular or multicellular (filamentous) forms; <it>second</it>, whether the ability of some yeasts to acquire dimorphic forms, to exist as a permanently filamentous yeast, or to occupy a specific niche (<it>i.e</it>., to act in fermentation or as pathogens) would be reflected by the <it>SET-domain </it>gene collection within the genome; <it>third</it>, whether presence of certain <it>SET</it>-<it>domain </it>genes in a fungal genome would suggest occurrence of novel <it>SET-domain </it>genes or loss of existing <it>SET-domain </it>genes; and <it>fourth</it>, to examine the phylogenetic relationships between the <it>SET</it>-<it>domain </it>gene families of yeast, of higher filamentous fungi, of animals, and of plants to see if their evolution would parallel the transition steps from unicellular, to simple multicellular, and to more complex multicellular systems.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Overall genome representation, distribution, and phylogenetic analysis of the fungal SET-domain proteins</p>
            </st>
            <p>Current phylogenetic studies of the kingdom Fungi define Ascomycota as a monophyletic group <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. For an insight into the evolution of fungal <it>SET-domain </it>genes and their relationship with the genes of higher eukaryotes, we reconstructed phylogenetic trees using the highly conserved SET-domain region (~150 amino acids) <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. First, we performed a series of similarity searches including profile hidden Markov models against fourteen genomes: eleven fungal (<it>Ascomycetes</it>), two animal (one mammalian and one invertebrate), and one plant (<it>Arabidopsis</it>) genomes (see Additional file <supplr sid="S1">1</supplr>). One hundred and eighty two SET-domain sequences were identified (see Additional file <supplr sid="S2">2</supplr>). Phylogenetic analyses were performed to identify the SET-domain protein families and subfamilies. Maximum likelihood phylogeny reconstructed from the selected 113 representative sequences is shown in Fig. <figr fid="F1">1</figr>. The phylogenetic clustering based on SET-domain sequences reflected presence/absence of group-specific architectural motifs (see Additional file <supplr sid="S3">3</supplr>). We also note that the SET-domain based phylogeny does not seem to support consistently any particular evolutionary relationships among the three kingdoms: fungi, animals, and plants (see also Fig. <figr fid="F2">2</figr>).</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Fourteen genomes used in this study.</p>
               </text>
               <file name="1471-2148-8-190-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>The draft phylogeny including all 182 SET-domain sequences found in this study.</p>
               </text>
               <file name="1471-2148-8-190-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p>Gene architecture of SET-domain protein families.</p>
               </text>
               <file name="1471-2148-8-190-S3.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Maximum likelihood phylogeny of 113 representative SET-domain sequences</p>
               </caption>
               <text>
                  <p><b>Maximum likelihood phylogeny of 113 representative SET-domain sequences. </b>Bootstrap values for the major SET-domain families that are higher than 60% by either of the maximum likelihood (ML) or the maximum parsimony (MP) methods are shown at the node (the two % values are ML/MP). Internal branches supporting the major SET-domain families with higher than 80% ML bootstrap values are also indicated by thick lines. Within the major SET-domain protein groups, bootstrap values by the ML analysis greater than 60%, 70%, 80%, and 90% are indicated by stars (*), filled circles (&#9679;), filled squares (&#9632;), and filled triangles (&#9650;), respectively. SET-domain protein subgroups discussed in the main text are indicated by numbers. Representative domain names are shown for the major SET-domain protein families (see Additional file <supplr sid="S3">3</supplr> for the domain names). Species abbreviations are given in Additional file <supplr sid="S1">1</supplr>. For this ML phylogeny, the gamma shape parameter and the proportion of invariant sites were estimated to be 0.779 and 0, respectively.</p>
               </text>
               <graphic file="1471-2148-8-190-1"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Distribution of SET-domain genes in the three kingdoms</p>
               </caption>
               <text>
                  <p>Distribution of SET-domain genes in the three kingdoms. Saccharomycotina and Pezizomycotina fungi are shown as "Sac" and "Pez", respectively, among the fungal kingdom. Species or group-specific duplication events of SET-domain genes are illustrated by dashed arrows. SET5/6 related filamentous fungi are shown as "SET-fil". Arrows marked with * indicate that such relationship is not significant and inconclusive. SET-MYND and SUV3-9 are found in <it>S. pombe </it>as well as Pezizomycotina fungi, but missing from Saccharomycotina fungi. This is indicated by the arrows marked with "(-Sac)".</p>
               </text>
               <graphic file="1471-2148-8-190-2"/>
            </fig>
            <p>The distribution of the SET-domain proteins across 14 genomes is summarized in Fig. <figr fid="F3">3</figr>. All 108 SET-domain sequences retrieved from the fungal genomes that belong to the group of the histone methyltransferases are included. SET-domain containing proteins from the RuBisCo, cytochrome C, and the recently discovered ribosomal protein lysine methyltransferases <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> are excluded. Among the SET-domain sequences of animal and plant origin, included are only those that are used as reference for the fungal proteins.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Distribution of SET-domain families in fourteen genomes</p>
               </caption>
               <text>
                  <p><b>Distribution of SET-domain families in fourteen genomes.</b> Shaded in yellow are genes found in all tested species; peach-colored genes were found only in the Saccharomycotina. Genes found in tested genomes except in the Saccharomycotina are shaded in turquoise, while those found only in multicellular species are shown in pink. Genes specific for the filamentous fungi are shown in grey. A related gene found in <it>Arabidopsis </it>is shaded in grey as well. Metazoa-specific genes are shaded in Bordeaux red. Footnotes: <sup>a </sup>genome size; <sup>b </sup>approximate numbers of predicted open reading frames; <sup>c </sup>the <it>Su(var)3-9 </it>gene in <it>S. pombe </it>is known as the <it>Clr4</it>; <sup>d </sup>one of the two copies is known as the <it>DIM-5 </it>gene in <it>N. crassa </it>and the second shows a weak similarity to G9a; <sup>e </sup>filamentous fungal genes belonging to the large SET5/6 family; <sup>f </sup>"unknown" genes have no significant support to cluster with any SET-domain family identified in this study; <sup>g </sup>numbers in parentheses indicate numbers of total SET-domain genes found in these animal/plant genomes <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>.</p>
               </text>
               <graphic file="1471-2148-8-190-3"/>
            </fig>
            <p>The sizes of the fourteen genomes and the total numbers of ORFs are not linearly correlated; the genomes of the filamentous fungi are approximately three times the size of the yeasts (except <it>Y. lipolytica</it>) while the overall number of identified ORFs is only 1.6-to-2 fold higher than in yeasts. On the other hand, the numbers of SET-domain containing genes in the Pezizomycotina species (13&#8211;23) are about 3-fold higher compared to those of the yeasts (5&#8211;6; Fig. <figr fid="F3">3</figr>). Non-linear increasement in the number of <it>SET-domain </it>genes is even more pronounced in the animal and plant genomes: about 70 predicted <it>SET-domain </it>genes in the human and mouse genomes, around 40 in <it>D. melanogaster </it><abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, and around 40 in <it>A. thaliana </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Increased numbers and diversification of encoded SET-domain proteins reflect, perhaps, increased diversity and complexity of multicellular programs.</p>
            <p>Despite the smaller number of ORFs than most Saccharomycotina fungi <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, the single-living Taphrinomycotina (archiascomycete), <it>S. pombe</it>, carries three <it>SET-domain </it>genes (<it>SET9, MYND-SET</it>, and a <it>Su(var)3-9 </it>homologue). These <it>SET-domain </it>genes are present in the Pezizomycotina and in animals/plant, but remarkably absent from the other yeasts (except <it>Y. lipolytica</it>, which carries a <it>SET9 </it>gene) (Fig. <figr fid="F3">3</figr>). Presence of these genes supports a conclusion that <it>S. pombe </it>and <it>Y. lipolytica </it>share many common features with the filamentous relatives <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. Lastly, <it>S. pombe </it>and <it>Y. lipolytica </it>carry one copy of a <it>SET-domain </it>gene of unclear origins. These <it>S. pombe</it>-specific and the <it>Y. lipolytica</it>-specific putative SET-domain proteins, unrelated to each other or to any of the 182 sequences analyzed, are annotated as 'unknown' (Fig. <figr fid="F3">3</figr>) and were excluded from further analyses.</p>
            <p>Saccharomycotina genomes have been subjected to expansions (whole-genome duplications) and deletions (reductive evolution), which have shaped the genome sizes of extant yeasts <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp> varying between 9.2 Mb (<it>A. gossypii</it>) to 20 Mb (<it>Y. lipolytica</it>) and overall putative gene numbers between ~4,700 and ~7,000 (Fig. <figr fid="F3">3</figr>). However, the total number of the <it>SET-domain </it>genes in all tested Saccharomycotina is remarkably constant: five or six. Most yeast <it>SET-domain </it>genes are single-copy representatives of distinct subgroups. The <it>SET4 </it>gene, an apparent duplication of <it>SET3 </it>(see below for further discussion) accounts for the difference in the number of <it>SET-domain </it>genes of <it>S. cerevisiae </it>and <it>C. glabrata versus </it>other yeasts; <it>SET6 </it>might have been deleted from the genome of <it>Y. lipolytica </it>or, alternatively, a duplication of <it>SET5 </it>to yield <it>SET6 </it>might have taken place, after the separation of <it>Y. lipolytica </it>from the other yeasts <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. We note also that both <it>SET5 </it>and <it>SET6 </it>are absent from <it>S. pombe</it>, indicating that these genes encode functions limited to the specific needs of the Saccharomycotina group.</p>
         </sec>
         <sec>
            <st>
               <p>Distribution of each SET-domain group among fungi</p>
            </st>
            <p>Analyses of the distribution of the fungal <it>SET-domain </it>types will be carried out in the context of the transition from unicellular to multicellular filamentous, and to animal/plant multicellular systems. We shall follow genes that are present in: all studied genomes, Saccharomycotina-specific genes, families that are excluded from the Saccharomycotina, and those that are found in specific fungal groups.</p>
            <sec>
               <st>
                  <p>1. <it>SET-domain </it>genes present in all studied genomes</p>
               </st>
               <p>Genes preserved in all tested genomes, across the kingdoms, suggest that they are involved in 'core' cell functions rather than in functions associated with multicellularity. Proteins that satisfy this criterion belong to SET1, SET2, and SET3 subfamilies (Fig. <figr fid="F3">3</figr>). It is specifically noted that these subfamilies belong in larger families containing groups that might have evolved later, possibly in connection with multicellularity. Fungal SET-domain proteins are analyzed in reference with the <it>S. cerevisiae </it>families because they have been studied best and because the <it>SET-domain </it>genes of this yeast have been a model and a reference for histone lysine methyltransferase analysis in multicellular systems.</p>
               <sec>
                  <st>
                     <p>The SET1/TRITHORAX family</p>
                  </st>
                  <p>The <it>SET1 </it>gene of <it>S. cerevisiae </it>encodes a member of the large TRITHORAX family (named after the <it>Drosophila </it>Trithorax protein). The SET domain and the adjacent cysteine-rich motif (post-SET) are the two most highly conserved sequences defining a protein's belonging to the family (Fig. <figr fid="F4">4</figr>). Two subfamilies, the SET1 and the Trithorax (TRX), are distinguished here. We note that in all fungi examined (Saccharomycotina, Pezizomycotina, as well as <it>S. pombe</it>), the family is represented by a single copy of the SET1-type; by comparison, animal/plant genomes contain several genes from this family, including the SET1- and the TRX-subtypes. Distribution of SET1/TRX proteins is summarized also in Fig. <figr fid="F2">2a</figr>.</p>
                  <fig id="F4">
                     <title>
                        <p>Figure 4</p>
                     </title>
                     <caption>
                        <p>Domain architecture of the SET1/TRX family (a: the SET1 subfamily, b: the TRX subfamily)</p>
                     </caption>
                     <text>
                        <p><b>Domain architecture of the SET1/TRX family (a: the SET1 subfamily, b: the TRX subfamily).</b> The divergent RNA recognition motifs (RRM) found in <it>Arabidopsis </it>and yeast (not identified in domain databases) are indicated by the pale color in the figure. Domains are not drawn to scale. For the TRX subfamily, representatives of two major animal subgroups differing by the positioning of the FYRN-FYRC (DAST) domains are shown. Note the presence of conserved domains between the animal and the plant representatives. For more structures see Additional file <supplr sid="S3">3</supplr>.</p>
                     </text>
                     <graphic file="1471-2148-8-190-4"/>
                  </fig>
               </sec>
               <sec>
                  <st>
                     <p>The SET1 subfamily</p>
                  </st>
                  <p>The ancestral gene encoding SET1 in <it>S. cerevisiae </it>has been conserved throughout the evolution and is present in all species examined in this study (Fig. <figr fid="F3">3</figr>). The SET-post-SET sequences (located at the C-terminus) are highly conserved in all examined SET1 proteins (Fig. <figr fid="F4">4a</figr>, see also Additional file <supplr sid="S3">3</supplr>). In the Saccharomycotina, the upstream regions are highly conserved as well: <it>C. glabrata </it>and <it>A. gossippi </it>are 56% similar (37% identical) and 54% similar (34% identical) to the upstream <it>S. cerevisiae </it>sequence, respectively; the region is more diverged between the Saccharomycotina, <it>S. pombe</it>, and the Pezizomycotina fungi, the similarity with <it>S. cerevisiae </it>ranging from 41 to 39%, respectively. The upstream regions contain an RRM (RNA-recognition motif) <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>, which may affect the level of K4 methylation <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. Most likely, it does not bind RNA but may interact with protein, instead <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. Congruent with the domain architecture, the fungal SET1 proteins form a well-supported group (SET1 group 3 in Fig. <figr fid="F1">1</figr>) with the <it>S. pombe </it>SET1 as the most outgroup. SET1-related proteins, with conserved RRM upstream of the SET domain, are present also in the animal and plant genomes suggesting preservation of the <it>SET1</it>-encoded function in the evolution from fungi to metazoa/plants. One gene copy in <it>Drosophila </it>encodes a SET1 homologue (EAL24599, identified by the Heterochromatin Genome Project, which was recently released and not included in the sequence set we analyzed), whereas in mouse, as well as in human, the gene has undergone duplication (see Additional file <supplr sid="S2">2</supplr>); in <it>Arabidopsis</it>, the <it>ATXR7 </it>gene (<it>At5g42400</it>) encodes a SET1 counterpart (BAB10481). SET1-homologues of animal and plant origin cluster in SET1 group 2 with bootstrap supporting values between 60% and 70% (Fig. <figr fid="F1">1</figr>).</p>
                  <p>Thereby, if the conserved <it>SET1</it>-related genes from the genomes of unicellular fungi, of filamentous fungi, and of metazoa/plant were orthologues, they would play 'core' cellular roles not connected with the occurrence of multicellularity. This does not preclude involvement of SET1-related genes in regulation of development in multicellular organisms.</p>
               </sec>
               <sec>
                  <st>
                     <p>The Trithorax (TRX) subfamily</p>
                  </st>
                  <p>This group, shown as SET1 groups 1 and 4 in Fig. <figr fid="F1">1</figr>, contains no gene of fungal origin and will not be discussed in detail. However, we emphasize that in animal/plant genomes, an ancestral <it>SET1-</it>related gene has multiplied and diversified its structure and, most likely, function. The PHD (plant homeodomain) and the FYRN- and FYRC-domains (collectively called <ul>D</ul>omain <ul>a</ul>ssociated with <ul>S</ul>ET in <ul>T</ul>rithorax, DAST, in <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>) are considered to be signature motifs for the members of the Trithorax subfamily (Fig. <figr fid="F4">4b</figr>). It is noted that the two subgroups of animal Trithorax proteins differ by the position of the FYRN- and FYRC-domains (juxtaposed or separated), while the plant Trithorax homologues, ATX3, ATX4, and ATX5, do not carry DAST motifs <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Hallmark of the SET1/Trithorax family proteins is their biochemical activity methylating histone H3-lysine4 (H3K4). SET1 is responsible for the overall chromatin mono-, di-, and tri-methylation of H3K4 in <it>S. cerevisiae</it>, while known animal and plant Trithorax enzymes modify only a limited fraction of target nucleosomes <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. The roles of the additional motifs are largely unknown but acquisition of new building blocks reflects the evolution of the proteins in parallel with the requirements for novel functions emerging in animals and plants.</p>
               </sec>
               <sec>
                  <st>
                     <p>The SET2/ASH1 family</p>
                  </st>
                  <p>All proteins from this family carry a SET domain (sufficiently different from that of SET1 family) preceded by a signature cysteine-rich peptide called AWS (<ul>a</ul>ssociated <ul>w</ul>ith <ul>S</ul>ET) (Fig. <figr fid="F5">5</figr>, see also Additional file <supplr sid="S3">3</supplr>). A single <it>SET2 </it>gene is present in the Saccharomycotina fungi and in <it>S. pombe</it>, two <it>SET2</it>-related genes are found in the filamentous fungi, and multiple copies are present in animal/plant genomes. Conserved in all examined species, the SET2/ASH1 proteins form a monophyletic group divided further into two major subfamilies as summarized in Fig. <figr fid="F2">2b</figr>.</p>
                  <fig id="F5">
                     <title>
                        <p>Figure 5</p>
                     </title>
                     <caption>
                        <p>Domain architecture of the SET2/ASH1 family (a: the SET2 subfamilies, b: the ASH-1 subfamily)</p>
                     </caption>
                     <text>
                        <p><b>Domain architecture of the SET2/ASH1 family (a: the SET2 subfamilies, b: the ASH-1 subfamily).</b> Domains are not drawn to scale. For more structures see Additional file <supplr sid="S3">3</supplr>.</p>
                     </text>
                     <graphic file="1471-2148-8-190-5"/>
                  </fig>
               </sec>
               <sec>
                  <st>
                     <p>The SET2 subfamily</p>
                  </st>
                  <p>Apparently, these genes are of an ancient origin existing before the divergence of the three kingdoms. They are present in all examined genomes (Fig. <figr fid="F3">3</figr>, SET2 group 2 in Fig. <figr fid="F1">1</figr>). In addition to the AWS-SET domains, all fungal members as well as animal (both vertebrate and invertebrate) members of the SET2-subgroup 2 carry a downstream WW-peptide participating, most likely, in protein-protein interactions and the SRI motif mediating RNA polymerase II interaction (Fig. <figr fid="F5">5a</figr>) <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. It is conceivable, then, that <it>SET2 </it>subtype genes have been conserved for RNase polymerase II-interaction as a core cellular function and, thus, unlinked with multicellularity. We note also that the sole <it>S. pombe </it>SET2 copy (NP_594980) does not carry the WW motif. The <it>Arabidopsis </it>SET2 homologue candidates either cluster with animal SET2 proteins (AAC34358) or are located as the outgroup (AAF04434, as well as CAA18207 and AAC23419) (Fig. <figr fid="F1">1</figr>). The SET2 subfamily also includes metazoa-specific groups: NSD (nuclear receptor binding SET domain protein) and WHSC (Wolf-Hirschhorn syndrome candidate) (represented by NP_001001735 in Fig. <figr fid="F1">1</figr>; see also Fig. <figr fid="F2">2b</figr>).</p>
               </sec>
               <sec>
                  <st>
                     <p>The ASH1 subfamily</p>
                  </st>
                  <p>The second <it>SET2 </it>gene from the filamentous fungi was defined as an <it>ASH1</it>-family member by phylogenetic analysis (SET2 group 1 in Fig. <figr fid="F1">1</figr>). The family is named after the ASH1 protein of <it>Drosophila</it>. Notably, the fungal ASH1-members do not carry recognizable structural motifs other than SET, post-SET, and AWS, while animal and plant members contain additional domains (Fig. <figr fid="F5">5b</figr>, see also Additional file <supplr sid="S3">3</supplr>).</p>
                  <p>The SET2 family cluster is well supported (91% bootstrap value by maximum likelihood phylogeny) indicating that the SET2-ASH1 paralogues are phylogenetically and structurally closely related. It is noteworthy then, that the proteins from the two subfamilies have histone methyltransferase activities with different substrate specificities. The SET2 methyltransferases of <it>S. cerevisiae </it>and <it>S. pombe </it>are responsible for mono-, di-, and tri-methylation of H3K36 <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>; the SET2 orthologue in <it>N. crassa</it>, required for normal growth and expression of genes in the asexual and sexual differentiation pathways, also methylates H3K36 <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. However, the epigenetic activator ASH1 of <it>Drosophila </it>is a multi-catalytic histone methyl-transferase (HMTase) methylating H3K4, H3K9, and H4K20 <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, while the activity of an ASH1-related protein has not been reported for any orthologue of fungal origin.</p>
               </sec>
               <sec>
                  <st>
                     <p>The SET3 family</p>
                  </st>
                  <p><it>SET3</it>-related genes are found as single copies across the genomes of yeasts, multicellular fungi and animal genomes suggesting that a <it>SET3 </it>gene has existed in the common ancestor before the separation of the animal and fungal domains of life (summarized also in Fig. <figr fid="F2">2c</figr>). The fungal SET3 proteins form a well-supported cluster with subgroups, each carrying proteins from the Saccharomycotina fungi, the Pezizomycotina (filamentous) fungi, or animals. Amino acid substitutions in the catalytic site by arginine (R) residues are a hallmark of SET3-type proteins. The unusual SET domain (<b>RRSC</b>QPN, see Additional file <supplr sid="S4">4</supplr>) divergent from the motif critically involved in the HMTase enzyme function (<b>NHSC</b>DPN) accounts for the lack of histone methyltransferase activity of the <it>S. cerevisiae </it>SET3. However, it is a component of a histone deacetylase complex involved in the meiosis-specific repression of sporulation genes <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. In this context, it is difficult to predict the role of metazoan SET3 counterparts but their conservation implies biological significance.</p>
                  <suppl id="S4">
                     <title>
                        <p>Additional file 4</p>
                     </title>
                     <text>
                        <p>SET3 and SET4 multiple sequence alignment.</p>
                     </text>
                     <file name="1471-2148-8-190-S4.pdf">
                        <p>Click here for file</p>
                     </file>
                  </suppl>
                  <p>Two <it>Arabidopsis </it>proteins, ATXR5 and ATXR6 (NP_196541 and NP_197821, respectively), were reported earlier to be clustering with the <it>S. cerevisiae </it>SET3 and SET4 proteins <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. However, in our analysis, the <it>Arabidopsis </it>proteins did not segregate with the yeast SET3/4 (see the draft phylogeny in Additional file <supplr sid="S2">2</supplr>). Furthermore, a detailed comparative analysis with the yeast sequences did not reveal significant conservation. Most importantly, the <it>Arabidopsis </it>SET domain sequences do not carry the hallmark amino acid substitutions. ATXR5 and ATXR6 interact with the proliferating cell nuclear antigen (PCNA) and are critically involved in DNA replication, DNA repair, maintenance and heterochromatin formation <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>.</p>
               </sec>
               <sec>
                  <st>
                     <p>The SET4 family</p>
                  </st>
                  <p>The SET4 proteins are phylogenetically related to SET3, clustering together with 95% bootstrap value (Fig. <figr fid="F1">1</figr>). However, SET4 proteins are found only in <it>S. cerevisiae </it>and its closest relatives. A whole-genome duplication event in the history of <it>Saccharomyces </it>has generated 'twin genes' found in the genomes of extant species <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> and it is plausible that <it>SET3 </it>and <it>SET4 </it>genes are a consequence of this event. To elucidate the timing of the putative SET3/4 duplication, we performed additional similarity searches in <it>S. cerevisiae </it>and nine other Saccharomycotina species. SET4 proteins were found in <it>S. bayanus</it>, <it>S. castelli</it>, in addition to <it>S. cerevisiae </it>and <it>C. glabrata</it>. The result is consistent with reports placing the whole genome duplication after the divergence of related species: <it>A. gossypii</it>, <it>K. lactis</it>, and <it>K. waltii </it><abbrgrp><abbr bid="B24">24</abbr><abbr bid="B42">42</abbr></abbrgrp>. The maximum likelihood phylogenetic tree is also consistent with a SET 3/4 duplication occurring in the ancestral lineage leading to <it>Saccharomyces </it>species and <it>C. glabrata </it>(the shaded cluster in Fig. <figr fid="F6">6</figr>). Furthermore, the SET3 and SET4 sequences are more highly conserved between the <it>S. cerevisiae </it>and <it>S. bayanus </it>compared to <it>S. castelli </it>suggesting that <it>SET3 </it>gene duplication could place the whole-genome duplication event before the divergence of <it>C. glabrata </it>and the <it>Saccharomyces </it>species.</p>
                  <fig id="F6">
                     <title>
                        <p>Figure 6</p>
                     </title>
                     <caption>
                        <p>Maximum likelihood phylogeny of the SET3/4 families among ten Saccharomycotina species</p>
                     </caption>
                     <text>
                        <p><b>Maximum likelihood phylogeny of the SET3/4 families among ten Saccharomycotina species.</b> Bootstrap values greater than 60% are shown. The genomic position is indicated with accession numbers for the sequences from unannotated genomes. The gamma-shape parameter and the proportion of invariant sites were estimated to be 0.879 and 0, respectively.</p>
                     </text>
                     <graphic file="1471-2148-8-190-6"/>
                  </fig>
               </sec>
               <sec>
                  <st>
                     <p>The JmjC family</p>
                  </st>
                  <p>Although JmjC-SET/SET3 cluster is not supported by bootstrap analysis (Fig. <figr fid="F1">1</figr>), the structural relationship between the SET domains of JmjC-SET and the yeast SET3/4 families is reflected by the characteristic R-substitutions in the catalytically relevant sequences (see Additional files <supplr sid="S4">4</supplr> and <supplr sid="S5">5</supplr>). In contrast to SET3/4 families, this family is found only in filamentous fungi. It is represented by a single gene in <it>N. crassa </it>and in <it>A. fumigatus</it>, by a duplication in <it>M. grisea</it>, but has not been identified in <it>F. graminearum</it>. Therefore, the activity encoded by <it>JmjC</it>-<it>SET </it>plays subphylum-specific role but may be species-specific as well.</p>
                  <suppl id="S5">
                     <title>
                        <p>Additional file 5</p>
                     </title>
                     <text>
                        <p>The multiple sequence alignment of 113 representative SET-domain sequences.</p>
                     </text>
                     <file name="1471-2148-8-190-S5.pdf">
                        <p>Click here for file</p>
                     </file>
                  </suppl>
                  <p>None of the known JmjC proteins appears together with a SET-domain outside the filamentous fungi. The striking feature of this combination is that the JmjC domain is a catalytic module for histone lysine-demethylation <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. Although their activity is evolutionarily conserved from human to yeast, as shown recently for seven JmjC proteins from <it>S. pombe </it><abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, none of these proteins appears together with a SET. It re-enforces the conclusion that the combination of the two motifs is a feature uniquely occurring only in the filamentous fungal proteins. Whether the SET domain of JmjC has a methyltransferase activity has not been demonstrated. However, its possible relatedness with the SET domain from the SET3/4 families suggests that it might not be active as a histone methyltransferase.</p>
               </sec>
            </sec>
            <sec>
               <st>
                  <p>2. The SET5, SET6, and related families</p>
               </st>
               <p>Based on our phylogenetic analysis (Fig. <figr fid="F1">1</figr>), the distribution of SET5, SET6, and related families is summarized also in Fig. <figr fid="F2">2d</figr>. The yeasts' SET5 and SET6 families belong in a larger cluster that includes families represented across the kingdoms (SET-MYND), as well as families specific for the filamentous fungi (SET-TPR), and species-specific groups (SET-Mg and SET-Dm). Apparently, the cluster has originated from an ancestral gene that has undertaken different evolutionary paths at the separation of the fungi from the other kingdoms and at the separation of the Ascomycetal sister groups.</p>
               <sec>
                  <st>
                     <p>The SET5 and SET6 families</p>
                  </st>
                  <p><it>SET5 </it>and <it>SET6</it>, each segregating into a well-supported cluster, are yeast-specific genes absent from the genome of <it>S. pombe</it>. <it>SET6</it>, implicated in ergosterol biosynthesis <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, is absent from the more distant <it>Y. lipolytica </it>suggesting that the gene encodes a narrow Saccharomycotina-specific function. <it>SET5 </it>and <it>SET6 </it>encode related ~500 amino acid proteins containing the SET domain as the only recognizable conserved structural motif. A signature feature is the long SET-I (insertion region of ~125 a.a.) producing a split SET domain. No role has been established for SET5 yet but, apparently, it is yeast-specific.</p>
               </sec>
               <sec>
                  <st>
                     <p>Filamentous fungal genes related to the SET5/SET6 families</p>
                  </st>
                  <p>Several copies of genes encoding SET-domain proteins similar to the SET5/6 protein group were retrieved from the genomes of the filamentous fungi (shown in the draft phylogeny in Additional file <supplr sid="S2">2</supplr>). These genes encode relatively short (300&#8211;500 a.a.) proteins containing only SET domains. They were not included in Fig. <figr fid="F1">1</figr> because their sequences could not be aligned with confidence. No function is known for any of these proteins, but absence of apparent orthologues from yeast, as well as from animal/plant genomes, illustrates a highly specific evolution of these genes in the filamentous fungal genomes. In addition, a highly supported (99% bootstrap value) cluster of five <it>SET-</it>domain genes was found in <it>M. grisea</it>, encoding species-specific (SET-Mg) proteins. It is interesting to note also the <it>D. melanogaster </it>specific cluster (SET-Dm) in this group suggesting an intriguing property of the <it>SET5/6</it>-related genes towards proliferation and adaptation for species-specific needs by diverse organisms.</p>
               </sec>
               <sec>
                  <st>
                     <p>The TPR-SET family</p>
                  </st>
                  <p>Found only in the filamentous fungi, this family is a combination of a SET 5/6-related domain with an upstream <ul>t</ul>etratrico- <ul>p</ul>eptide <ul>r</ul>epeats (TPR, involved in protein-protein interactions) <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>; two members of the family are present in <it>M. grisea </it>and <it>F. graminearum </it>genomes. We note that one <it>Arabidopsis </it>gene (<it>At1g26760</it>) encodes a SET-domain protein combined with TPR (AAF87042), which clusters with the fungal TPR-SET cluster (Fig. <figr fid="F1">1</figr>). However, this relationship is not strongly supported, making their evolutionary relationship unresolved.</p>
               </sec>
               <sec>
                  <st>
                     <p>The MYND-SET family</p>
                  </st>
                  <p>This family is noteworthy because it contains genes found in <it>S. pombe</it>, in the filamentous group, in metazoa, and in <it>Arabidopsis</it>, but not in the Saccharomycotina fungi. The SET domains of the MYND-SET family form a well-supported cluster within the larger SET5/SET6 family (Fig. <figr fid="F1">1</figr>), suggesting shared ancestry. MYND-SET proteins carry a Zn-binding domain called the MYND-finger (<ul>my</ul>eloid, <ul>n</ul>ervy, and <ul>D</ul>EAF-1 factor) involved in H3K36- and in H3K4-specific methylation <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>. The metazoan subfamily, SMYD, is critically involved in suppressing cell proliferation and carcinogenesis <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>. No role is known for any MYND-SET protein of either plant or fungal origin.</p>
               </sec>
            </sec>
            <sec>
               <st>
                  <p>3. Gene families conserved in all genomes, except in the Saccharomycotina</p>
               </st>
               <p>The SUV3-9 and SET9 (SUV4-20) families are represented in the genomes of the unicellular <it>S. pombe</it>, in the Pezizomycotina, as well as in the animal and plant genomes, but are not found in the Saccharomycotina. Absence from the entire Saccharomycotina group suggests that these genes are unlikely to be involved in 'core' functions but, rather, in mechanisms used by the fission, filamentous, and multicellular eukaryotes; alternatively, these orthologous genes might be used differently and, consequently, be involved in different cellular processes in the fungi than in animals and plants (discussed below).</p>
               <sec>
                  <st>
                     <p>The SUV3-9 family</p>
                  </st>
                  <p>Genes of this family have undergone extensive proliferation in animal and plant genomes, particularly in <it>Arabidopsis </it>(Fig. <figr fid="F3">3</figr>) <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The family is divided into multiple subgroups illustrating its internal heterogeneity (Fig. <figr fid="F1">1</figr>) <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B50">50</abbr></abbrgrp>. Here, analysis will be limited to the two subtypes, Su(var)3-9 and G9a, because they are relevant for Ascomycota (summarized also in Fig. <figr fid="F2">2e</figr>).</p>
               </sec>
               <sec>
                  <st>
                     <p>The Su(var)3-9 subfamily</p>
                  </st>
                  <p>The defining feature of the proteins (group 3 in the SUV3-9 cluster in Fig. <figr fid="F1">1</figr>) is the Su(var)3-9-type of SET domain and the <ul>pr</ul>e- <ul>S</ul>ET (PRS) motif located immediately upstream of the SET domain (Fig. <figr fid="F7">7a</figr>, see also Additional file <supplr sid="S3">3</supplr>). PRS contains nine invariant cysteine residues, coordinating three zinc ions, involved in the structural stability of the SET domain <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. The Su(var)3-9 protein discovered initially in <it>D. melanogaster </it>(NP_524357) is the founding member of the family. It belongs in a subgroup that carries an additional CHROMO domain. It is important to point out that the proteins of filamentous fungal origin (group 4), as well as the <it>Arabidopsis </it>family members (group 1; or the SUVR subgroup in <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>), do not have the CHROMO domain but, nonetheless, belong in the Su(var)3-9 subfamily based on the SET-domain phylogeny (Fig. <figr fid="F1">1</figr>) and the pre-SET domain (Fig. <figr fid="F7">7a</figr>). A notable exception among the fungi is the <it>S. pombe </it>protein, which contains a chromodomain.</p>
                  <fig id="F7">
                     <title>
                        <p>Figure 7</p>
                     </title>
                     <caption>
                        <p>Domain architecture of the SUV39/G9a family (a: the SUV39 subfamily, b: the G9a subfamily)</p>
                     </caption>
                     <text>
                        <p><b>Domain architecture of the SUV39/G9a family (a: the SUV39 subfamily, b: the G9a subfamily).</b> Domains are not drawn to scale. For more structures see Additional file <supplr sid="S3">3</supplr>.</p>
                     </text>
                     <graphic file="1471-2148-8-190-7"/>
                  </fig>
                  <p>The Su(var)3-9 subfamily proteins catalyze methylation of K9 of histone H3 <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> essential for heterochromatin formation in metazoa <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> and in <it>S. pombe </it><abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, and link histone H3K9 methylation with DNA methylation in <it>N. crassa </it><abbrgrp><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>. Absence of <it>Su(var)3-9 </it>genes from the Saccharomycotina fungi has had enormous consequences for the evolution of heterochromatin-like gene-silencing mechanisms in yeast (see Discussion).</p>
               </sec>
               <sec>
                  <st>
                     <p>The G9a subfamily</p>
                  </st>
                  <p>Discovered initially in the human major histocompatibility complex locus, the <it>G9a </it>genes are ubiquitously expressed. It is represented by a <it>M. musculus </it>sequence (NP_766133) in the SUV3-9 group 3 in Fig. <figr fid="F1">1</figr> (it forms a small cluster including <it>D. melanogaster </it>sequence in Additional file <supplr sid="S2">2</supplr>). The mammalian G9a proteins contain pre-SET and SET domains of the Su(var)3-9-type, plus ankyrin (ANK) repeats (Fig. <figr fid="F7">7b</figr>, see also Additional file <supplr sid="S3">3</supplr>) with a strong HMTase activity towards H3K9. In contrast to its paralogue Su(var)3-9, G9a plays a role as a transcriptional suppressor of genes in euchromatic regions but not in heterochromatin <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. <it>Arabidopsis </it>G9a proteins (group 2 in the SUV3-9 cluster in Fig. <figr fid="F1">1</figr>) carry SRA (SET and RING finger associated domain) instead of the ANK repeats. These <it>Arabidopsis </it>genes have proliferated (at least 10 genes belong to the this subfamily <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>) and are involved in mediating DNA methylation <abbrgrp><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr></abbrgrp>. Among the examined fungi, an <it>N. crassa </it>protein (EAA30745) showed weak similarities against mammalian G9a SET-domain sequences (~30% identities). This fungal protein has only the SET domain (Fig. <figr fid="F7">7b</figr>). The phylogenetic relationship of the <it>N. crassa </it>sequence (EAA30745) has not been resolved (see the draft phylogeny in Additional file <supplr sid="S2">2</supplr>).</p>
               </sec>
               <sec>
                  <st>
                     <p>The SET9 (SUV4-20) family</p>
                  </st>
                  <p>The fungal <it>SET9 </it>genes belong to the <it>SUV4-20 </it>family characterized in metazoa <abbrgrp><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. Due to the lack of confidence in alignment, this family is not included in Fig. <figr fid="F1">1</figr>. In the draft phylogeny (see Additional file <supplr sid="S2">2</supplr>), these sequences are clustered and indicated in blue. Putative SET9 homologues are present in <it>S. pombe </it>(NP_588078) as well as in the filamentous fungi. Members of the family are absent from yeasts, except <it>Y. lipolytica </it>(XP_504243). <it>SET9</it>/<it>Su(var)4-20 </it>genes have not beenfound in <it>Arabidopsis </it>(nor in rice nor maize).</p>
                  <p>Proteins from the SUV4-20 family carry only a SET domain located at the N-terminal half (see Additional file <supplr sid="S3">3</supplr>). These genes encode histone methyl transferases, specifically tri-methylating H4K20, an evolutionarily conserved epigenetic mark for heterochromatin <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. Remarkably, in the fission yeast, SET9 generates methylated H4K20 but it is used as a sign for DNA repair mechanisms, not for heterochromatin <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>.</p>
               </sec>
            </sec>
            <sec>
               <st>
                  <p>4. The Enhancer of zeste (E(Z)) family: a SET-domain family found only in the filamentous fungi and in higher multicellular organisms</p>
               </st>
               <p>In our analysis, <it>E(z)-</it>type genes were not found in the Saccharomycotina nor in the <it>S. pombe </it>genome. It is tempting to suggest that the ancestral <it>E(z) </it>gene has appeared as a novel SET-domain function inherited by extant filamentous fungi, animals, and plants (see also Fig. <figr fid="F2">2g</figr>). The structure of the E(z)-SET domain and its biochemical activity (see Discussion) are consistent with the idea that the gene has appeared later in the evolution.</p>
               <p>Three proteins found in the filamentous fungi (XP_381075, EAA35807, and XP_369092) cluster with the animal/plant E(z) family but with low bootstrap values (51% by the maximum likelihood and 79% by the maximum parsimony). No <it>E(z)</it>-related gene was identified in <it>A. fumigatus</it>, possibly representing gene-loss (incomplete genome sequence is also possible). The fungal proteins do not carry additional structural domains, which are uniquely conserved in the E(z) proteins of animal and plant origin (see Additional file <supplr sid="S3">3</supplr>) and belonging of the fungal genes to the <it>E(z)</it>-type group is inconclusive (marked with a star in Fig. <figr fid="F2">2g</figr>). E(z) methylates specifically H3K27 within Polycomb PRC2 complexes, a landmark for gene silencing mechanisms <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp>. Absence of <it>E(z)</it>-related genes in the unicellular yeasts indicates that Polycomb mechanisms do not operate in these systems.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Correlations between genome evolution, overall gene content, and organismal complexity, revealed in whole-genome comparative analyses, have outlined evolutionary trends associated with the occurrence of multicellularity <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B65">65</abbr></abbrgrp>. It was suggested that core biological functions, common for both unicellular and multicellular organisms, would be carried out by a comparable number of orthologous proteins, while specialized processes unique to multicellulars would use novel proteins <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. Our analyses support a positive correlation between the numbers and types of <it>SET-domain </it>genes found in multicellular versus unicellular fungi; a similar tendency was observed when the simpler multicellular (filamentous) fungi were compared with multicellular animal and plant genomes. Analysis of the distribution of individual <it>SET-domain </it>fungal genes, however, revealed some unexpected trends, particularly within the Saccharomycotina group.</p>
         <sec>
            <st>
               <p>Morphology, habitats, mating mechanisms and pathogenicity of yeasts and their <it>SET-domain </it>gene contents</p>
            </st>
            <p>Despite a broad range of lifestyles, life cycles, and mating mechanisms <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>, the numbers of <it>SET-domain </it>genes in the respective yeast species is remarkably constant and the gene structure is highly conserved. Neither the large <it>Y. lipolytica </it>genome size nor the extensive gene-loss in <it>C. glabrata </it>is reflected by the numbers of the <it>SET-domain </it>genes suggesting that in yeasts, the collection of <it>SET-domain </it>genes has been selectively maintained. The permanently filamentous yeast <it>A. gossypii </it>displays high genomic synteny with <it>S. cerevisiae </it>but has a very different morphology and occupies a specific niche: a plant pathogen. <it>A. gossypii </it>grows as multinucleated hyphae in the subtropics, while <it>S. cerevisiae </it>proliferates as single cells associated with sugar-containing fruits <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B67">67</abbr><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr></abbrgrp>. <it>C. glabrata</it>, closely related to <it>S. cerevisiae</it>, is a pathogen living on mucosal human tissues, while the more distant pathogenic <it>C. albicans </it>grows as yeast but may switch to hyphal growth when exposed to host serum. Genes from pathogenic species that differ from <it>S. cerevisiae </it>might have a role in the virulence <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B70">70</abbr></abbrgrp>. Despite the strikingly different biology of thee yeasts, their <it>SET-domain </it>genes are fully shared suggesting orthologous functions. In such a context, the yeast <it>SET-domain </it>genes cannot be considered critical for the dimorphic transitions, for hyphae formation, for long-range nuclear dynamics, or as factors contributing to the pathogenicity.</p>
            <p><it>SET-domain </it>genes do not seem to be underlying differences in the life cycles and sexual mechanisms adopted by the individual species, either. <it>D. hansenii </it>is homothallic with essentially haplontic life cycle and <it>Y. lipolytica </it>is heterothallic (self-sterile) with a haplo-diplontic cycle, while sexual cycles are unknown for <it>C. glabrata </it><abbrgrp><abbr bid="B23">23</abbr><abbr bid="B67">67</abbr></abbrgrp> and, until recently, for <it>C. albicans </it><abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. Although <it>C. albicans </it>has mating-type-like (MTL) genes that resemble the mating-type genes of <it>S. cerevisiae</it>, <it>C. albicans </it>is able to mate under anaerobic conditions reflecting its adaptation as an anaerobic parasite <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>. Finally, <it>D. hansenii </it>and <it>Y. lipolytica </it>have one mating type-locus, whereas <it>C. glabrata </it>possesses three mating type-like loci with configurations similar to that of <it>S. cerevisiae </it><abbrgrp><abbr bid="B73">73</abbr></abbrgrp>. Given the importance of the diversity of sexual mechanisms for the evolution of the species, it is remarkable that these mechanisms are, most likely, not connected with the evolution of the yeast <it>SET-domain </it>genes. We emphasize, however, that this conclusion does not preclude participation of <it>SET-domain </it>genes in these processes; rather, it suggests that the different biology of the species is not connected to specific diversifications of the <it>SET-domain </it>genes <it>per se</it>. It agrees with observations from whole-genome analyses that almost identical gene sets control diverse cellular functions in yeasts suggesting that orthologous genes might not play identical cellular roles in different yeast systems <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B67">67</abbr><abbr bid="B69">69</abbr></abbrgrp>. Differences, like the presence of SET4 in <it>S. cerevisiae </it>and <it>C. glabrata </it>and the absence of SET6 from <it>Y. lipolytica </it>are, most likely, connected to the fermentative lifestyle <abbrgrp><abbr bid="B74">74</abbr></abbrgrp> and to metabolic specificities of organisms, rather than with differences in the morphology, in sexual mechanisms, or in pathogenicity.</p>
         </sec>
         <sec>
            <st>
               <p>Evolution of the activating and repressive marks in the fungi; Consequences of lost <it>SET-domain </it>genes in Saccharomycotina</p>
            </st>
            <p>The <it>SET9 </it>(<it>SUV4-20</it>), <it>SET-MYND</it>, and <it>Su(var)3-9 </it>genes, found in <it>S. pombe</it>, in filamentous fungi, in animals, and in plants, but absent from the Saccharomycotina, suggest that the encoded activities are not involved in 'core' functions. They are involved in generating marks for silencing mechanisms implying that these silencing mechanisms do not exist in the Saccharomycotina. SET9 (SUV4-20) tri-methylates H4K20, a mark for pericentric heterochromatin <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> and MYND-SET family members (studied only in animals so far) generate H3K36 marks involved in suppression of cell proliferation <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>, while SUV39 methylates H3K9 critical for heterochromatin formation <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. Recent findings have suggested that epigenetic 'ON' marks are reduced in evolution, while the 'OFF' signs have significantly increased; unicellular organisms contain more marks associated with transcriptional activation, whereas mammals contain more modifications associated with repression <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. Indeed, the only known histone methylation marks in <it>S. cerevisiae </it>(H3K4, H3K36 and H3K79) and the enzymes generating them (SET1, SET2, and DOT1, an unrelated to SET domain that methylases K79) are associated with gene-activation <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. Absence of <it>SET9, MYND- SET</it>, and <it>Su(var)3-9 </it>genes from the Saccharomycotina is consistent with this evolutionary trend. However, their presence in <it>S. pombe </it>(and of <it>SET9 </it>in <it>Y. lipolytica</it>) suggests that the 'silencing' marks established by the encoded activities are not signature features of multicellular genomes; moreover, the epigenetic marks might be 'read' differently. For example, SET9 of <it>S. pombe </it>methylates H4K20 but it is a sign for DNA damage response in <it>S. pombe </it><abbrgrp><abbr bid="B62">62</abbr></abbrgrp> rather than for heterochromatin <abbrgrp><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. Absence of genes encoding silencing marks in the Saccharomycotina, thereby, raises an important question described next.</p>
         </sec>
         <sec>
            <st>
               <p>How do yeasts silence genes and genomic regions?</p>
            </st>
            <p>Answers may be found in the remarkable ability of yeasts to adopt available means to achieve ends that are functionally similar but molecularly different from mechanisms employed by other systems. The most striking example is the loss of the <it>Su(var)3-9 </it>gene from the Saccharomycotina because loss of SUVR39 protein entailed loss of the epigenetic H3K9me mark, and of the entire machinery involved in making heterochromatin <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. Nonetheless, silencing processes involving the MAT cassettes, the ribosomal loci, and the telomeric regions are achieved through mechanisms similar to the assembly of heterochromatin <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>. Even more surprising is that close <it>S. cerevisiae </it>relatives do not use the same tools but have evolved species-tailored mechanisms for achieving effects functionally similar to heterochromatin. There is no Sir1 (Silent information regulator 1) in <it>C. glabrata</it>, no Sir1 and Sir3 in <it>A. gossypii</it>; neither Sir nor the RNAi-pathways are conserved in <it>D. hansenii </it>and none of the <it>S. cerevisiae </it>heterochromatin factors was found in <it>Y. lipolytica </it><abbrgrp><abbr bid="B67">67</abbr></abbrgrp>. Collectively, the data illustrate the great evolutionary divergence of 'invented' mechanisms producing silencing effects in systems that have lost the epigenetic H3K9me mark.</p>
            <p>Furthermore, loss of <it>SET9 </it>and <it>MYND-SET </it>related genes encoding gene-silencing functions has resulted in evolving mechanisms that take advantage of the degree (mono-, di-, or tri-) methylation of the lysine NH2- groups to achieve different transcriptional outcomes for pertinent genes. In <it>S. cerevisiae</it>, di- (H3K4me2) or tri- (H3K4me3) methylated lysines are associated with non-active, or actively transcribed sequences, respectively <abbrgrp><abbr bid="B77">77</abbr><abbr bid="B78">78</abbr></abbrgrp>; in <it>Arabidopsis</it>, H3K4me2 marks are found at genes, independently of whether they were transcribed or not, while the H3K4me3 marks were enriched at active loci <abbrgrp><abbr bid="B79">79</abbr></abbrgrp>. In metazoa, both modifications are associated with actively transcribed genes but are differentially distributed along the gene sequence with the H3K4me3 marks accumulated at the transcription start-sites <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B33">33</abbr></abbrgrp>.</p>
            <p>Thereby, the amount of methyl tags on the same lysine residue may function as repressive or activating signs in <it>S. cerevisiae</it>, while the fission, filamentous, and higher multicellular systems use additional signs carried by a larger number and diversity of methylation marks.</p>
         </sec>
         <sec>
            <st>
               <p><it>SET-domain </it>genes specific for the filamentous fungi</p>
            </st>
            <p>Consistent with their greater morphological and developmental complexity, the filamentous fungi have a greater number of genes than their unicellular relatives. Conserved proteins among the filamentous fungi may be implicated in their morphogenesis, while non-conserved, species-specific genes might be associated with pathogenic capabilities <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B80">80</abbr><abbr bid="B81">81</abbr><abbr bid="B82">82</abbr><abbr bid="B83">83</abbr><abbr bid="B84">84</abbr><abbr bid="B85">85</abbr></abbrgrp>. According to these criteria, the different types and numbers of <it>SET-domain </it>genes in the Pezizomycotina compared to the yeasts (Fig. <figr fid="F3">3</figr>) might encode functions linked with hyphal growth, mycelia development, or other processes underlying cellular morphogenesis and virulence. Furthermore, the genes from the JmjC-SET, TPR-SET, and SET5/6 related filamentous fungal genes encode fungal functions specific for the entire filamentous group, while the cluster of five-related genes present in <it>M. grisea</it>, but not in <it>N. crassa</it>, might be associated with life as a pathogen. Whether the <it>M. grisea</it>-specific SET-domain cluster is related to its pathogenicity is unknown. Variations in the <it>JmjC-SET </it>and the <it>TPR-SET </it>gene numbers in particular genomes (Fig. <figr fid="F3">3</figr>) are, most likely, due to species-specific deletion/duplication events. Whether these differences in copy numbers are linked with the pathogenicity of the species has not been demonstrated.</p>
            <p>We note also the phylogenetic relationship between the filamentous fungi-specific TPR-SET and SET5/6 related proteins, and the Saccharomycotina-specific SET5 and SET6 proteins (Fig. <figr fid="F1">1</figr>). It illustrates the divergent evolutionary paths of a common ancestor at the separation of the two Ascomycota subgroups. Similarly, the phylogenetically related SET domains of the JmjC-SET and SET3/SET4 subgroups suggest that they result from subphylum-specific divergence of a shared ancestor. The <it>SET3</it>-lineages has been conserved throughout the evolution and its duplication in <it>S. cerevisiae </it>and <it>C. glabrata </it>has produced <it>SET4</it>, while duplication in the filamentous group has produced the JmjC-SET cluster (Fig. <figr fid="F2">2c</figr>).</p>
         </sec>
         <sec>
            <st>
               <p><it>SET-domain </it>genes found exclusively in multicellular organisms</p>
            </st>
            <p><it>SET-domain </it>genes of multicellular organisms not found in any unicellular fungi may be involved in multicellular, rather than in 'core' functions. Members of the ASH1, the G9A subfamilies, and of the E(z) family are such candidates.</p>
            <p>In contrast to the 'core' function encoded by <it>SET2</it>, the <it>ASH1</it>-related copy found in the filamentous fungi has relatives only in multicellular organisms. The descendants of the ancestral <it>ASH1 </it>lineage in extant animal and plant genomes has evolved further by acquiring additional structural motifs. This process could be linked with the evolution of more complex morphology and communication systems operating in the higher multicellular systems.</p>
            <p>No paralogues of the <it>E(z) </it>genes have been recognized in the genomes of unicellular yeasts suggesting a later occurrence in the evolution. The structure of the E(z)-type SET-domain peptide may illustrate a discrete step in the evolution of the SET-domain protein structure/function. A Zn-finger involved in substrate specificity of histone methyltransferases <abbrgrp><abbr bid="B86">86</abbr></abbrgrp> is formed by the Cys in the consensus NHXC sequence and the post-SET domain motif (CxCxxxxC) conserved in a subset of SET-domain proteins. Loss of this structure is, most likely, a secondary event triggered by a single substitution of the C in the NHXC box leading to the loss of the post-SET domain and subsequent divergence of the biochemical specificity.</p>
            <p>Occurrence of E(z) might be linked with the transition to multicellularity, as suggested by the lack of K27 marks and <it>E(z) </it>genes from the unicellular yeasts, <it>Dictyostelium</it>, and <it>Chlamydomonas</it>. However, abundant H3K27me marks were found in <it>Tetrahymena </it><abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, suggesting that H3K27me marks have evolved for species-specific needs as well. It will be very informative to establish the roles of the <it>E(z) </it>genes in <it>Tetrahymena</it>, particularly in view of the ability of animals and plant E(z) paralogues to assemble specific Polycomb-group complexes <abbrgrp><abbr bid="B87">87</abbr><abbr bid="B88">88</abbr><abbr bid="B89">89</abbr><abbr bid="B90">90</abbr></abbrgrp>. Furthermore, H3K27me and H3K4me marks establish a bivalent chromatin state at the nucleosomes of developmentally regulated genes in animal stem cells <abbrgrp><abbr bid="B91">91</abbr></abbrgrp> and in <it>Arabidopsis </it><abbrgrp><abbr bid="B92">92</abbr></abbrgrp>, defining them as histone marks for differentiation processes.</p>
            <p>Particularly important is the presence of conserved peptide domains related to the SANT-domain family, present in the E(z) proteins of animal and plant origins. Furthermore, in the reconstructed phylogeny based solely on SET-domain sequences, the E(z) animal and plant proteins cluster together with very high bootstrap values (99%). The conserved structure suggests that the last shared ancestor for the <it>E(z)-</it>lineage carrying these domains has existed before the divergence of the animal and plant genes. This putative ancestor has either originated after the separation of the fungal lineage, or has been deleted in the fungal ancestor. A similar pattern was noted for the evolution of the <it>TRX </it>genes, suggesting that the PcG/TrxG mechanism has been evolutionarily conserved in the animal and plant kingdoms but not in the fungal kingdom.</p>
         </sec>
         <sec>
            <st>
               <p>Origin of the 'orphan' <it>SET-domain </it>genes in the filamentous fungi</p>
            </st>
            <p>Recent large-scale comparison of sequence data has found over 40 <it>N. crassa </it>genes with no identifiable <it>S. cerevisiae </it>homologues <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. It was suggested that these "orphan" genes might have resulted from either loss of related genes from the <it>S. cerevisiae </it>lineage, from horizontal gene transfer into the <it>N. crassa </it>lineage, from newly generated 'innovative' genes, or from exceptional gene-divergence in <it>S. cerevisiae </it><abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Our analysis is congruent with processes of deletion from the Saccharomycotina genomes and with the assembly of novel genes in the filamentous fungi, as credible fates for the evolution of orphan <it>SET-domain </it>genes.</p>
            <p>The <it>SET9</it>, <it>SET-MYND</it>, and <it>Su(ver)3-9 </it>genes appear to have been lost from the Saccharomycotina genomes or from the lineage leading to the Saccharomycotina. Supporting evidence comes from the finding of highly conserved sequences for each of these three genes in <it>Rhizopus orizae</it>, which is an outgroup to Ascomycota and a member of the basal group, Zygomycetes. Existence of the orthologous protein candidates in the <it>R. orizae </it>genome (RO3G_14129: 48% identical and 66% similar to <it>S. pombe</it>'s SET9, RO3G_13695: 25% identical and 41% similar to <it>S. pombe</it>'s MYND-SET, and RO3G_16553: 43% identical and 59% similar to <it>S. pombe</it>'s <it>Su(ver)3-9</it>) suggests that ancestors of these genes have existed before the divergence of the Ascomycetes, and subsequently have been lost from the Saccharomycotina. The yeast <it>SET9-</it>gene lineage has been lost even later, after the divergence from <it>Y. lipolytica</it>.</p>
            <p>An interesting feature in the evolution of the <it>Su(var)3-9 </it>gene is the presence of the chromodomain in animal and <it>S. pombe </it>proteins. Because a chromodomain is absent from the proteins of the filamentous fungi and plants, it was suggested that the ancestral form, before the divergence of animal and fungal lineages, has carried the combination of the two motifs but the chromodomain has been lost from the filamentous fungi after their separation from the fission yeast <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. However, absence of a chromodomain in the <it>Su(var)3-9 </it>orthologue in <it>R. orizae </it>and in <it>Ustillago </it>(a member of the sister group Basidomycetes) suggests an alternative scenario: the ancient fungal <it>Su(var)3-9 </it>lineage did not have a chromodomain and acquisition of chromodomain encoding sequences by the <it>S. pombe </it>and animal <it>Su(var)3-9 </it>genes were two independent events. <it>Su(var)3-9</it>-related genes have remarkably proliferated in <it>Arabidopsis</it>, but no plant homologue carries a chromodomain.</p>
            <p>Homologues of <it>E(z), SET2(ASH1</it>)-like, <it>G9a</it>-like, <it>JmjC-SET, and TPR-SET </it>genes were not found in the <it>R. orizae </it>genome. It is plausible that these genes have occurred in the Pezizomycotina, required by multicellularity-related functions. The mechanism of genetic innovation in the filamentous lineage is unclear. Predicted trends underlying the occurrence of novel proteins may include evolution of new protein architecture from preexisting domains (including reshuffling of existing domains) and/or expansion of particular domain families by series of duplications, followed by specialization, to meet the specific needs of a species. It is clear that the <it>JmjC-SET and TPR-SET </it>genes have been retained in the Pezizomycetes for uniquely fungal processes, while the <it>E(z), SET2 (ASH1</it>), and <it>G9a</it>-lineages have been inherited in the animal and plant ancestors, where they have evolved further (Fig. <figr fid="F2">2</figr>).</p>
            <p>The limited number of animal and plant <it>SET-domain </it>genes included here represent descendants of lineages found in the unicellular fungal group (<it>SET1, SET2, SET3, Su(var)4-20, Su(var)3-9</it>, and <it>MYND-SET</it>) and of lineages found in the filamentous group (<it>ASH1</it>, <it>G9a</it>, and <it>E(z)</it>). Distinct subgroups, shared by animals and plants, have evolved within the larger families existing in the fungi (<it>i.e</it>., Trithorax, E(z)), as well as plant- or animal-kingdom specific lineages <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B93">93</abbr><abbr bid="B94">94</abbr><abbr bid="B95">95</abbr></abbrgrp>. Furthermore, some animal-specific SET-domain families (SET8) are distributed throughout metazoan genomes, while others (SET7) are found only in vertebrates.</p>
         </sec>
         <sec>
            <st>
               <p>In Summary</p>
            </st>
            <p>The phylogenetic analysis allowed us to trace clear distinctions between species-, subphylum-, and kingdom-specific SET domains, as well as to recognize factors involved in core-cellular roles <it>versus </it>those likely to be associated with multicellular requirements. Surprisingly, the collection of <it>SET-</it>domain genes within yeast did not appear critical for differences in lifestyles, abilities to morph, sexual mechanisms, and pathogenicity of hemiascomycetes. However, <it>SET4, SET5</it>, and <it>SET6 </it>encode Saccharomycotina-specific functions and appearance of <it>SET4 </it>parallels the genome duplication of <it>Saccharomycetes </it>believed to be important for their fermentative abilities <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. <it>SET-domain </it>genes found in the filamentous species, but absent from the unicellular sister group, reflect two evolutionary events: deletion from the yeasts genomes (<it>SET9, MYND-SET</it>, and <it>Su(var)3-9</it>) and appearance of novel structures. The latter group involves genes (<it>JmjC-SET and TPR-SET</it>) originating for the subphylum Pezizomycetes-specific roles and genes apparently connected with the occurrence of multicellularity; descendants of these genes are found also in animal and plant genomes (<it>E(z), SET2(ASH1</it>), and <it>G9a</it>). There is no Ascomycota-specific SET-domain family or gene (present in fungal genomes but absent from animals and plants), while there are families found exclusively in plant and animal genomes, as well as plant-specific or animal-specific subgroups. Animal and plant <it>SET-domain </it>genes are ancestrally related with complex fungal proteins with diverse modular elements. A combinatorial assembly of various peptide domains generates enormous possibilities for variation and precision required for functioning and adaptation of multicellular organisms.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Sequences used</p>
            </st>
            <p>The SET-domain sequences were searched from fourteen complete genomes listed in Additional file <supplr sid="S1">1</supplr>. Twenty-four SET-domain sequences used as queries were collected from various sources, as listed in Additional file <supplr sid="S6">6</supplr>.</p>
            <suppl id="S6">
               <title>
                  <p>Additional file 6</p>
               </title>
               <text>
                  <p>SET-domain query sequences used to search new SET-domain proteins.</p>
               </text>
               <file name="1471-2148-8-190-S6.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>SET-domain protein mining</p>
            </st>
            <p>Four search methods were used to mine new SET-domain proteins from the fourteen genomes: BLAST protein similarity searches were conducted by BLASTP <abbrgrp><abbr bid="B96">96</abbr></abbrgrp> using the 24 SET-domain sequences as the queries against the non-redundant database available at National Center for Biotechnology Information (NCBI) with the default settings. To find similar protein regions from unannotated genomic regions, TBLASTN <abbrgrp><abbr bid="B96">96</abbr></abbrgrp> was used to perform similarity searches against nucleotide sequences of the fourteen genomes translated in all six frames. More sensitive searches were performed using the position specific iteration BLAST (PSI-BLAST) <abbrgrp><abbr bid="B97">97</abbr></abbrgrp>. Each query was used against individual genomes with the inclusion E-value threshold of 0.001 and four search iterations.</p>
            <sec>
               <st>
                  <p>Profile hidden Markov model searches</p>
               </st>
               <p>Profile hidden Markov models (HMMs), probabilistic models of multiple sequence alignments, were built and used to search for sequences with remote similarities <abbrgrp><abbr bid="B98">98</abbr></abbrgrp>. Using the sequences obtained from the BLAST searches and the query sequences, we selected 27 well-aligned sequences (see Additional file <supplr sid="S7">7</supplr>). A profile HMM was built using these sequences with the Sequence Alignment and Modeling System (SAM) <abbrgrp><abbr bid="B99">99</abbr><abbr bid="B100">100</abbr></abbrgrp>. Two programs of the SAM package were used: <it>buildmodel </it>for building the profile HMMs and <it>hmmscore </it>(with <it>-sw 2 </it>and-<it>calibrate 1 </it>options) for searching similar protein sequences from the genomes. The searches were conducted in each genome individually. The resulting hits from each organism were analyzed for the presence of the SET-domain and previously unidentified sequences were collected. We did not use a strict E-value threshold. Rather each hit within the default E-value threshold (10) was examined one by one for the existence of the SET-domain.</p>
               <suppl id="S7">
                  <title>
                     <p>Additional file 7</p>
                  </title>
                  <text>
                     <p>SET-domain proteins used for building the profile hidden Markov models.</p>
                  </text>
                  <file name="1471-2148-8-190-S7.pdf">
                     <p>Click here for file</p>
                  </file>
               </suppl>
               <p>After these similarity searches, 214 non-redundant hits were compiled from the twelve genomes (data not shown). Each of these 214 sequences was examined to confirm the presence of the SET domain by searching the Conserved Domain Database (CDD) available from NCBI <abbrgrp><abbr bid="B101">101</abbr></abbrgrp>, as well as the Simple Modular Architecture Research Tool (SMART) database <abbrgrp><abbr bid="B102">102</abbr><abbr bid="B103">103</abbr></abbrgrp>. Some dubious hits including too highly diverged sequences and those with very short SET-domain-like sequences were removed. Three fungal SET-MYND sequences (EAA36113, XP_381344, and XP_360530; see Additional file <supplr sid="S2">2</supplr>) were further used to search more SET-MYND sequences from fungal genomes using BLAST. After these analyses, we obtained 182 non-redundant SET-domain sequences. These sequences were used in our further analyses.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Multiple alignments of SET-domain sequences</p>
            </st>
            <p>CLUSTALX (version 1.83) <abbrgrp><abbr bid="B104">104</abbr></abbrgrp> was used to generate multiple alignments of SET-domain sequences (with the GONNET series protein weight matrices and gap opening penalty = 10 and gap extension penalty = .20). Due to the highly variable length of the SET-I region, poorly conserved sites across the sequences were removed. Some other highly variable positions were also removed and the alignments were adjusted manually (see Additional file <supplr sid="S8">8</supplr>).</p>
            <suppl id="S8">
               <title>
                  <p>Additional file 8</p>
               </title>
               <text>
                  <p>The multiple sequence alignment of all 182 non-redundant SET-domain sequences.</p>
               </text>
               <file name="1471-2148-8-190-S8.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analyses</p>
            </st>
            <p>A draft phylogeny was reconstructed using all of the 182 SET-domain sequences found in this study using the maximum likelihood method implemented in PHYML (version 2.4.4) <abbrgrp><abbr bid="B105">105</abbr></abbrgrp>. This draft phylogeny (see Additional file <supplr sid="S2">2</supplr>) was used in the further analyses. Protein domain architectures for each protein group are shown in Additional file <supplr sid="S3">3</supplr>. In order to produce a more reliable multiple alignment and phylogenies, we reduced the number of sequences by choosing representative SET-domain sequences using the draft phylogeny as a guide tree. Poorly aligned sequences and those not clustering clearly with any known major SET-domain families were removed. All fungal sequences were retained, while only one each representative hit from plants and animals was chosen from each SET-domain cluster of SET1, SET2, Su(var)3-9, and E(z). The selected representative sequences are indicated in the phylogeny in red in Additional file <supplr sid="S2">2</supplr>. The final multiple alignment including 113 sequences is shown in Additional file <supplr sid="S5">5</supplr>. Phylogenetic reconstruction was done using the maximum likelihood method (implemented in PHYML version 2.4.4) and the maximum parsimony method (implemented in PHYLIP version 3.65). The neighbor-joining method was not used because the estimated distance matrix using the JTT substitution model (implemented in PHYLIP 3.65 <abbrgrp><abbr bid="B106">106</abbr></abbrgrp>) generated estimation errors due to too many substitutions. For the maximum likelihood method, two sets of trees were reconstructed: one with no invariable site and a constant substitution rate among sites, and the other with the proportion of invariable sites and the gamma shape parameter estimated from the data. Reconstructed phylogenies were largely consistent. In our further analysis, we used the maximum likelihood phylogeny using the estimated proportion of invariable sites and gamma shape parameter. For the maximum parsimony method, the input sequence order was jumbled 10 times (see Additional file <supplr sid="S9">9</supplr>). Phylogenetic confidence was estimated by the bootstrap analysis <abbrgrp><abbr bid="B107">107</abbr></abbrgrp> with 500 pseudoreplicates for all phylogenetic analysis.</p>
            <suppl id="S9">
               <title>
                  <p>Additional file 9</p>
               </title>
               <text>
                  <p>The maximum parsimony tree reconstructed from the 113 representative SET-domain sequences.</p>
               </text>
               <file name="1471-2148-8-190-S9.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>CSV collected data, carried out all of the bioinformatics analyses, and drafted the manuscript. ZA conceived of the study, contributed the discussion, and revised the manuscript. ENM conceived of the study, supervised the entire process of the study, and revised the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors are enormously indebted to Drs. A. Atkin and K. Nickerson (UNL) for the critical reading of the manuscript and helpful suggestions. The work was partially supported by an REU/NSF grant (0553630) to ZA.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Origin of the Bacterial SET Domain Genes: Vertical or Horizontal?</p>
            </title>
            <aug>
               <au>
                  <snm>Alvarez-Venegas</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sadder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tikhonov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Avramova</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <fpage>482</fpage>
            <lpage>497</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17148507</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Regulation of chromatin structure by site-specific histone H3 methyltransferases</p>
            </title>
            <aug>
               <au>
                  <snm>Rea</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Eisenhaber</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>O'Carroll</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Strahl</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>ZW</fnm>
               </au>
               <au>
                  <snm>Schmid</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Opravil</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mechtler</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Allis</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Jenuwein</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>406</volume>
            <fpage>593</fpage>
            <lpage>599</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10949293</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Translating the Histone Code</p>
            </title>
            <aug>
               <au>
                  <snm>Jenuwein</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Allis</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>298</volume>
            <fpage>1074</fpage>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Chromatin modifications and their function</p>
            </title>
            <aug>
               <au>
                  <snm>Kouzarides</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2007</pubdate>
            <volume>128</volume>
            <fpage>693</fpage>
            <lpage>705</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17320507</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Organismal differences in post-translational modifications in histones H3 and H4</p>
            </title>
            <aug>
               <au>
                  <snm>Garcia</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Hake</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Diaz</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Kauer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Recht</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shabanowitz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mishra</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Strahl</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Allis</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>DF</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2007</pubdate>
            <volume>282</volume>
            <fpage>7641</fpage>
            <lpage>7655</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17194708</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Identification of histone H3 lysine 36 acetylation as a highly conserved histone modification</p>
            </title>
            <aug>
               <au>
                  <snm>Morris</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Garcia</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Hake</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Diaz</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Shabanowitz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Allis</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Lieb</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Strahl</snm>
                  <fnm>BD</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2007</pubdate>
            <volume>282</volume>
            <fpage>7632</fpage>
            <lpage>7640</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17189264</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Knockdown of ALR (MLL2) reveals ALR target genes and leads to alterations in cell adhesion and growth</p>
            </title>
            <aug>
               <au>
                  <snm>Issaeva</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Zonis</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Rozovskaia</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Orlovsky</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Croce</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Mazo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Eisenbach</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Canaani</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2007</pubdate>
            <volume>27</volume>
            <fpage>1889</fpage>
            <lpage>1903</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1820476</pubid>
                  <pubid idtype="pmpid" link="fulltext">17178841</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Polycomb silencers control cell fate, development and cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Sparmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>van Lohuizen</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nat Rev Cancer</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>846</fpage>
            <lpage>856</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17060944</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Su(var) genes regulate the balance between euchromatin and heterochromatin in Drosophila</p>
            </title>
            <aug>
               <au>
                  <snm>Ebert</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schotta</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lein</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kubicek</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Krauss</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Jenuwein</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Reuter</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2004</pubdate>
            <volume>18</volume>
            <fpage>2973</fpage>
            <lpage>2983</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">534657</pubid>
                  <pubid idtype="pmpid" link="fulltext">15574598</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Epigenetic inheritance of cell differentiation status</p>
            </title>
            <aug>
               <au>
                  <snm>Ng</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Gurdon</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Cell Cycle</source>
            <pubdate>2008</pubdate>
            <volume>7</volume>
            <fpage>1173</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">18418041</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Histone tail modifications and noncanonical functions of histones: perspectives in cancer epigenetics</p>
            </title>
            <aug>
               <au>
                  <snm>Hadnagy</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Beaulieu</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Balicki</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Mol Cancer Ther</source>
            <pubdate>2008</pubdate>
            <volume>7</volume>
            <fpage>740</fpage>
            <lpage>748</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18413789</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A fungal phylogeny based on 42 complete genomes derived from supertree and combined gene analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Fitzpatrick</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Logue</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Stajich</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>99</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1679813</pubid>
                  <pubid idtype="pmpid" link="fulltext">17121679</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Loss of the flagellum happened only once in the fungal lineage: phylogenetic structure of kingdom Fungi inferred from RNA polymerase II subunit genes</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>YJ</fnm>
               </au>
               <au>
                  <snm>Hodson</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>BD</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>74</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1599754</pubid>
                  <pubid idtype="pmpid" link="fulltext">17010206</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Detecting morphological convergence in true fungi, using 18S rRNA gene sequence data</p>
            </title>
            <aug>
               <au>
                  <snm>Berbee</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>JW</fnm>
               </au>
            </aug>
            <source>Biosystems</source>
            <pubdate>1992</pubdate>
            <volume>28</volume>
            <fpage>117</fpage>
            <lpage>125</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1292657</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Phylogeny and systematics of the fungi with special reference to the Ascomycota and Basidiomycota</p>
            </title>
            <aug>
               <au>
                  <snm>Prillinger</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lopandic</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Schweigkofler</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Deak</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Aarts</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Bauer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sterflinger</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kraus</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Maraz</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Chem Immunol</source>
            <pubdate>2002</pubdate>
            <volume>81</volume>
            <fpage>207</fpage>
            <lpage>295</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12102002</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Large-scale comparison of fungal sequence information: mechanisms of innovation in Neurospora crassa and gene loss in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Braun</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Halpern</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Natvig</snm>
                  <fnm>DO</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>416</fpage>
            <lpage>430</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10779483</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Lessons from the genome sequence of Neurospora crassa: tracing the path from genomic blueprint to multicellular organism</p>
            </title>
            <aug>
               <au>
                  <snm>Borkovich</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Alex</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Yarden</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Freitag</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Read</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Seiler</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bell-Pedersen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Paietta</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Plesofsky</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Plamann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goodrich-Tanrikulu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schulte</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Mannhaupt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Nargang</snm>
                  <fnm>FE</fnm>
               </au>
               <au>
                  <snm>Radford</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Selitrennikoff</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Galagan</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Dunlap</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Loros</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Catcheside</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Inoue</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Aramayo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Polymenis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Selker</snm>
                  <fnm>EU</fnm>
               </au>
               <au>
                  <snm>Sachs</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Marzluf</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ebbole</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Zelter</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kalkman</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>O'Rourke</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bowring</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Yeadon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ishii</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sakai</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Pratt</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>2004</pubdate>
            <volume>68</volume>
            <fpage>1</fpage>
            <lpage>108</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">362109</pubid>
                  <pubid idtype="pmpid" link="fulltext">15007097</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The Drosophila trithorax proteins contain a novel variant of the nuclear receptor type DNA binding domain and an ancient conserved motif found in other chromosomal proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Stassen</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chinwalla</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Harte</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Mech Dev</source>
            <pubdate>1995</pubdate>
            <volume>52</volume>
            <fpage>209</fpage>
            <lpage>223</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8541210</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>A novel SET domain methyltransferase in yeast: Rkm2-dependent trimethylation of ribosomal protein L12ab at lysine 10</p>
            </title>
            <aug>
               <au>
                  <snm>Porras-Yakushi</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Whitelegge</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2006</pubdate>
            <volume>281</volume>
            <fpage>35835</fpage>
            <lpage>35845</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17005568</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Histone methylation in transcriptional control</p>
            </title>
            <aug>
               <au>
                  <snm>Kouzarides</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>198</fpage>
            <lpage>209</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11893494</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The Arabidopsis thaliana genome contains at least 29 active genes encoding SET domain proteins that can be assigned to four evolutionarily conserved classes</p>
            </title>
            <aug>
               <au>
                  <snm>Baumbusch</snm>
                  <fnm>LO</fnm>
               </au>
               <au>
                  <snm>Thorstensen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Krauss</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Naumann</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Assalkhou</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Schulz</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Reuter</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Aalen</snm>
                  <fnm>RB</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>4319</fpage>
            <lpage>4333</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">60187</pubid>
                  <pubid idtype="pmpid" link="fulltext">11691919</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The genome sequence of Schizosaccharomyces pombe</p>
            </title>
            <aug>
               <au>
                  <snm>Wood</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Gwilliam</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rajandream</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Lyne</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lyne</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sgouros</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Peat</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hayles</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Basham</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bowman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brooks</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chillingworth</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Connor</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cronin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Feltwell</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gentles</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Goble</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hamlin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hidalgo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hodgson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Holroyd</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hornsby</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Howarth</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Huckle</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jagels</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Leather</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>McDonald</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>McLean</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mooney</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Moule</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Niblett</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Odell</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>O'Neil</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pearson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Quail</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Rabbinowitsch</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Rutter</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Saunders</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Seeger</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sharp</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Skelton</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Simmonds</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Squares</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Squares</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stevens</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Tivey</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Walsh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Warren</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Whitehead</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Woodward</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Volckaert</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Aert</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Robben</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Grymonprez</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Weltjens</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Vanstreels</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rieger</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schafer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Muller-Auer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gabel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fuchs</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dusterhoft</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fritzc</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Holzer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Moestl</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hilbert</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Borzym</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Langer</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Beck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lehrach</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Reinhardt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Pohl</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Eger</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Zimmermann</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Wedler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wambutt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Purnelle</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Goffeau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cadieu</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dreano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gloux</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lelaure</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Mottier</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Galibert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Aves</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Xiang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Lucas</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rochet</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gaillardin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tallada</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Garzon</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Thode</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Daga</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Cruzado</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jimenez</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sanchez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>del Rey</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Benito</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dominguez</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Revuelta</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Moreno</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Armstrong</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Forsburg</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Cerutti</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lowe</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>McCombie</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Potashkin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shpakovski</snm>
                  <fnm>GV</fnm>
               </au>
               <au>
                  <snm>Ussery</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>BG</fnm>
               </au>
               <au>
                  <snm>Nurse</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>415</volume>
            <fpage>871</fpage>
            <lpage>880</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11859360</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Genome evolution in yeasts</p>
            </title>
            <aug>
               <au>
                  <snm>Dujon</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sherman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Durrens</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Casaregola</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lafontaine</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>De Montigny</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Marck</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Neuveglise</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Talla</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Goffard</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Frangeul</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Aigle</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Anthouard</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Babour</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barbe</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Barnay</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Blanchin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Beckerich</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Beyne</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bleykasten</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Boisrame</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Boyer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cattolico</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Confanioleri</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>De Daruvar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Despons</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fabre</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Fairhead</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ferry-Dumazet</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Groppi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hantraye</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hennequin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jauniaux</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Joyet</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kachouri</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kerrest</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Koszul</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lemaire</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lesur</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nicaud</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Nikolski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Oztas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ozier-Kalogeropoulos</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Pellenz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Potier</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Richard</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Straub</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Suleau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Swennen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tekaia</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Wesolowski-Louvel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Westhof</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wirth</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zeniou-Meyer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zivanovic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bolotin-Fukuhara</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Thierry</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bouchier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Caudron</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Scarpelli</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gaillardin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Weissenbach</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wincker</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Souciet</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>430</volume>
            <fpage>35</fpage>
            <lpage>44</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15229592</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>After the duplication: gene loss and adaptation in Saccharomyces genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Cliften</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>172</volume>
            <fpage>863</fpage>
            <lpage>872</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1456250</pubid>
                  <pubid idtype="pmpid" link="fulltext">16322519</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>High conservation of the Set1/Rad6 axis of histone 3 lysine 4 methylation in budding and fission yeasts</p>
            </title>
            <aug>
               <au>
                  <snm>Roguev</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schaft</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Shevchenko</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Aasland</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>AF</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2003</pubdate>
            <volume>278</volume>
            <fpage>8487</fpage>
            <lpage>8493</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12488447</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4</p>
            </title>
            <aug>
               <au>
                  <snm>Roguev</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schaft</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Shevchenko</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pijnappel</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Wilm</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Aasland</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>AF</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>2001</pubdate>
            <volume>20</volume>
            <fpage>7137</fpage>
            <lpage>7148</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">125774</pubid>
                  <pubid idtype="pmpid" link="fulltext">11742990</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Global loss of Set1-mediated H3 Lys4 trimethylation is associated with silencing defects in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Fingerman</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Briggs</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2005</pubdate>
            <volume>280</volume>
            <fpage>28761</fpage>
            <lpage>28765</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15964832</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Histone trimethylation by Set1 is coordinated by the RRM, autoinhibitory, and catalytic domains</p>
            </title>
            <aug>
               <au>
                  <snm>Schlichter</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cairns</snm>
                  <fnm>BR</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>2005</pubdate>
            <volume>24</volume>
            <fpage>1222</fpage>
            <lpage>1231</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">556409</pubid>
                  <pubid idtype="pmpid" link="fulltext">15775977</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>The multiple faces of Set1</p>
            </title>
            <aug>
               <au>
                  <snm>Deh&#233;</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>G&#233;li</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Biochem Cell Biol</source>
            <pubdate>2006</pubdate>
            <volume>84</volume>
            <fpage>536</fpage>
            <lpage>548</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16936826</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Structural characterization of Set1 RNA recognition motifs and their role in histone H3 lysine 4 methylation</p>
            </title>
            <aug>
               <au>
                  <snm>Tr&#233;saugues</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Deh&#233;</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Gu&#233;rois</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rodriguez-Gil</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Varlet</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Salah</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Pamblanco</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Luciano</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Quevillon-Cheruel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sollier</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Leulliot</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Couprie</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tordera</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Zinn-Justin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chavez</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>van Tilbeurgh</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>G&#233;li</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2006</pubdate>
            <volume>359</volume>
            <fpage>1170</fpage>
            <lpage>1181</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16787775</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Two Arabidopsis homologs of the animal trithorax genes: a new structural domain is a signature feature of the trithorax gene family</p>
            </title>
            <aug>
               <au>
                  <snm>Alvarez-Venegas</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Avramova</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2001</pubdate>
            <volume>271</volume>
            <fpage>215</fpage>
            <lpage>221</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11418242</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Methylation of lysine 4 on histone H3: intricacy of writing and reading a single epigenetic mark</p>
            </title>
            <aug>
               <au>
                  <snm>Ruthenburg</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Allis</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Wysocka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2007</pubdate>
            <volume>25</volume>
            <fpage>15</fpage>
            <lpage>30</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17218268</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Active genes are tri-methylated at K4 of histone H3</p>
            </title>
            <aug>
               <au>
                  <snm>Santos-Rosa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bannister</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Sherriff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bernstein</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Emre</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Mellor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kouzarides</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>419</volume>
            <fpage>407</fpage>
            <lpage>411</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12353038</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>SET-domain proteins of the Su(var)3-9, E(z) and trithorax families</p>
            </title>
            <aug>
               <au>
                  <snm>Alvarez-Venegas</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Avramova</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2002</pubdate>
            <volume>285</volume>
            <fpage>25</fpage>
            <lpage>37</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12039029</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Set2 is a nucleosomal histone H3-selective methyltransferase that mediates transcriptional repression</p>
            </title>
            <aug>
               <au>
                  <snm>Strahl</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Grant</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Briggs</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>ZW</fnm>
               </au>
               <au>
                  <snm>Bone</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Caldwell</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Mollah</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Shabanowitz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Allis</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2002</pubdate>
            <volume>22</volume>
            <fpage>1298</fpage>
            <lpage>1306</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">134702</pubid>
                  <pubid idtype="pmpid" link="fulltext">11839797</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Association of the histone methyltransferase Set2 with RNA polymerase II plays a role in transcription elongation</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Moazed</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gygi</snm>
                  <fnm>SP</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2002</pubdate>
            <volume>277</volume>
            <fpage>49383</fpage>
            <lpage>49388</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12381723</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Methylation of histone H3 lysine 36 is required for normal development in Neurospora crassa</p>
            </title>
            <aug>
               <au>
                  <snm>Adhvaryu</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Strahl</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Selker</snm>
                  <fnm>EU</fnm>
               </au>
            </aug>
            <source>Eukaryot Cell</source>
            <pubdate>2005</pubdate>
            <volume>4</volume>
            <fpage>1455</fpage>
            <lpage>1464</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1214527</pubid>
                  <pubid idtype="pmpid" link="fulltext">16087750</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Histone methylation by the Drosophila epigenetic transcriptional regulator Ash1</p>
            </title>
            <aug>
               <au>
                  <snm>Beisel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Imhof</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Greene</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kremmer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sauer</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>419</volume>
            <fpage>857</fpage>
            <lpage>862</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12397363</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>The S. cerevisiae SET3 complex includes two histone deacetylases, Hos2 and Hst1, and is a meiotic-specific repressor of the sporulation gene program</p>
            </title>
            <aug>
               <au>
                  <snm>Pijnappel</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Schaft</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Roguev</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shevchenko</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tekotte</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wilm</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rigaut</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Seraphin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Aasland</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>AF</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2001</pubdate>
            <volume>15</volume>
            <fpage>2991</fpage>
            <lpage>3004</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">312828</pubid>
                  <pubid idtype="pmpid" link="fulltext">11711434</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Comparative analysis of SET domain proteins in maize and Arabidopsis reveals multiple duplications preceding the divergence of monocots and dicots</p>
            </title>
            <aug>
               <au>
                  <snm>Springer</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Napoli</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Selinger</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Pandey</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cone</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Chandler</snm>
                  <fnm>VL</fnm>
               </au>
               <au>
                  <snm>Kaeppler</snm>
                  <fnm>HF</fnm>
               </au>
               <au>
                  <snm>Kaeppler</snm>
                  <fnm>SM</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2003</pubdate>
            <volume>132</volume>
            <fpage>907</fpage>
            <lpage>925</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">167030</pubid>
                  <pubid idtype="pmpid" link="fulltext">12805620</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Two cell-cycle regulated SET-domain proteins interact with proliferating cell nuclear antigen (PCNA) in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Raynaud</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sozzani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Glab</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Domenichini</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Perennes</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Cella</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kondorosi</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bergounioux</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2006</pubdate>
            <volume>47</volume>
            <fpage>395</fpage>
            <lpage>407</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16771839</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Kellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>BW</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>428</volume>
            <fpage>617</fpage>
            <lpage>624</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15004568</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Reversal of histone lysine trimethylation by the JMJD2 family of histone demethylases</p>
            </title>
            <aug>
               <au>
                  <snm>Whetstine</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Nottke</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lan</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Huarte</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Smolikov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Spooner</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Colaiacovo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shi</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>125</volume>
            <fpage>467</fpage>
            <lpage>481</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16603238</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Structural insights into histone demethylation by JMJD2 family members</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Whetstine</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hong</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Davrazou</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kutateladze</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Simpson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mao</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hagman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hansen</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>125</volume>
            <fpage>691</fpage>
            <lpage>702</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16677698</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>The fission yeast Jmj2 reverses histone H3 Lysine 4 trimethylation</p>
            </title>
            <aug>
               <au>
                  <snm>Huarte</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lan</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Vaughn</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Zaratiegui</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Martienssen</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Buratowski</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shi</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2007</pubdate>
            <volume>282</volume>
            <fpage>21662</fpage>
            <lpage>21670</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17550896</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Functional dissection of protein complexes involved in yeast chromosome biology using a genetic interaction map</p>
            </title>
            <aug>
               <au>
                  <snm>Collins</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Maas</snm>
                  <fnm>NL</fnm>
               </au>
               <au>
                  <snm>Roguev</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fillingham</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chu</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Schuldiner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gebbia</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Recht</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shales</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ding</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ingvarsdottir</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Andrews</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Boone</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Berger</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Hieter</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>GW</fnm>
               </au>
               <au>
                  <snm>Ingles</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Emili</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Allis</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Toczyski</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Weissman</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Greenblatt</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Krogan</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>446</volume>
            <fpage>806</fpage>
            <lpage>810</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17314980</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>TPR proteins: the versatile helix</p>
            </title>
            <aug>
               <au>
                  <snm>D'Andrea</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Regan</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>2003</pubdate>
            <volume>28</volume>
            <fpage>655</fpage>
            <lpage>662</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14659697</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>SMYD3 encodes a histone methyltransferase involved in the proliferation of cancer cells</p>
            </title>
            <aug>
               <au>
                  <snm>Hamamoto</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Furukawa</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Morita</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Iimura</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>FP</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yagyu</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Nat Cell Biol</source>
            <pubdate>2004</pubdate>
            <volume>6</volume>
            <fpage>731</fpage>
            <lpage>740</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15235609</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Identification and characterization of Smyd2: a split SET/MYND domain-containing histone H3 lysine 36-specific methyltransferase that interacts with the Sin3 histone deacetylase complex</p>
            </title>
            <aug>
               <au>
                  <snm>Brown</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Sims</snm>
                  <fnm>RJ</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Gottlieb</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Tucker</snm>
                  <fnm>PW</fnm>
               </au>
            </aug>
            <source>Mol Cancer</source>
            <pubdate>2006</pubdate>
            <volume>5</volume>
            <fpage>26</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1524980</pubid>
                  <pubid idtype="pmpid" link="fulltext">16805913</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>The evolution of the histone methyltransferase gene Su(var)3-9 in metazoans includes a fusion with and a re-fission from a functionally unrelated gene</p>
            </title>
            <aug>
               <au>
                  <snm>Krauss</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Fassl</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fiebig</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Patties</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Sass</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>18</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1435931</pubid>
                  <pubid idtype="pmpid" link="fulltext">16512904</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Structural basis for the product specificity of histone lysine methyltransferases</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Khan</snm>
                  <fnm>SI</fnm>
               </au>
               <au>
                  <snm>Horton</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Tamaru</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Selker</snm>
                  <fnm>EU</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2003</pubdate>
            <volume>12</volume>
            <fpage>177</fpage>
            <lpage>185</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12887903</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Selective recognition of methylated lysine 9 on histone H3 by the HP1 chromo domain</p>
            </title>
            <aug>
               <au>
                  <snm>Bannister</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Zegerman</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Partridge</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Miska</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Allshire</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Kouzarides</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>410</volume>
            <fpage>120</fpage>
            <lpage>124</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11242054</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>The Clr7 and Clr8 directionality factors and the Pcu4 cullin mediate heterochromatin formation in the fission yeast Schizosaccharomyces pombe</p>
            </title>
            <aug>
               <au>
                  <snm>Thon</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hansen</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Altes</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Sidhu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Verhein-Hansen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bonaduce</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Klar</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
            <volume>171</volume>
            <fpage>1583</fpage>
            <lpage>1595</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1456086</pubid>
                  <pubid idtype="pmpid" link="fulltext">16157682</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Trimethylated lysine 9 of histone H3 is a mark for DNA methylation in Neurospora crassa</p>
            </title>
            <aug>
               <au>
                  <snm>Tamaru</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>McMillen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>PB</fnm>
               </au>
               <au>
                  <snm>Nakayama</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Grewal</snm>
                  <fnm>SI</fnm>
               </au>
               <au>
                  <snm>Allis</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Selker</snm>
                  <fnm>EU</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>34</volume>
            <fpage>75</fpage>
            <lpage>79</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12679815</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>HP1 is essential for DNA methylation in neurospora</p>
            </title>
            <aug>
               <au>
                  <snm>Freitag</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Khlafallah</snm>
                  <fnm>TK</fnm>
               </au>
               <au>
                  <snm>Read</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Selker</snm>
                  <fnm>EU</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2004</pubdate>
            <volume>13</volume>
            <fpage>427</fpage>
            <lpage>434</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14967149</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>G9a histone methyltransferase plays a dominant role in euchromatic histone H3 lysine 9 methylation and is essential for early embryogenesis</p>
            </title>
            <aug>
               <au>
                  <snm>Tachibana</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sugimoto</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nozaki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ueda</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ohta</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ohki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fukuda</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Takeda</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Niida</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kato</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Shinkai</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2002</pubdate>
            <volume>16</volume>
            <fpage>1779</fpage>
            <lpage>1791</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">186403</pubid>
                  <pubid idtype="pmpid" link="fulltext">12130538</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Dual histone H3 methylation marks at lysines 9 and 27 required for interaction with CHROMOMETHYLASE3</p>
            </title>
            <aug>
               <au>
                  <snm>Lindroth</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Shultis</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jasencakova</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Fuchs</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Schubert</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Patnaik</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pradhan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Goodrich</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schubert</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Jenuwein</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Khorasanizadeh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jacobsen</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>2004</pubdate>
            <volume>23</volume>
            <fpage>4286</fpage>
            <lpage>4296</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">524392</pubid>
                  <pubid idtype="pmpid" link="fulltext">15457214</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Locus-specific control of DNA methylation by the Arabidopsis SUVH5 histone methyltransferase</p>
            </title>
            <aug>
               <au>
                  <snm>Ebbs</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Bender</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2006</pubdate>
            <volume>18</volume>
            <fpage>1166</fpage>
            <lpage>1176</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1456864</pubid>
                  <pubid idtype="pmpid" link="fulltext">16582009</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>The SRA methyl-cytosine-binding domain links DNA and histone methylation</p>
            </title>
            <aug>
               <au>
                  <snm>Johnson</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Bostick</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Kraft</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Henderson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Callis</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jacobsen</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>379</fpage>
            <lpage>384</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1850948</pubid>
                  <pubid idtype="pmpid" link="fulltext">17239600</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Histone H4 hyperacetylation precludes histone H4 lysine 20 trimethylation</p>
            </title>
            <aug>
               <au>
                  <snm>Sarg</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Helliger</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Talasz</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Koutzamani</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lindner</snm>
                  <fnm>HH</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2004</pubdate>
            <volume>279</volume>
            <fpage>53458</fpage>
            <lpage>53464</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15456746</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>A silencing pathway to induce H3-K9 and H4-K20 trimethylation at constitutive heterochromatin</p>
            </title>
            <aug>
               <au>
                  <snm>Schotta</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lachner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sarma</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ebert</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sengupta</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Reuter</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Reinberg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jenuwein</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2004</pubdate>
            <volume>18</volume>
            <fpage>1251</fpage>
            <lpage>1262</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">420351</pubid>
                  <pubid idtype="pmpid" link="fulltext">15145825</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Histone modification-dependent and -independent pathways for recruitment of checkpoint protein Crb2 to double-strand breaks</p>
            </title>
            <aug>
               <au>
                  <snm>Du</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Russell</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2006</pubdate>
            <volume>20</volume>
            <fpage>1583</fpage>
            <lpage>1596</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1482479</pubid>
                  <pubid idtype="pmpid" link="fulltext">16778077</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>The SET-domain protein superfamily: protein lysine methyltransferases</p>
            </title>
            <aug>
               <au>
                  <snm>Dillon</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Trievel</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>227</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1273623</pubid>
                  <pubid idtype="pmpid" link="fulltext">16086857</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Role of histone H3 lysine 27 methylation in Polycomb-group silencing</p>
            </title>
            <aug>
               <au>
                  <snm>Cao</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Xia</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Erdjument-Bromage</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tempst</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>1039</fpage>
            <lpage>1043</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12351676</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>A molecular timescale of eukaryote evolution and the rise of complex multicellular life</p>
            </title>
            <aug>
               <au>
                  <snm>Hedges</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Blair</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Venturi</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Shoe</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <fpage>2</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">341452</pubid>
                  <pubid idtype="pmpid" link="fulltext">15005799</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Origin of multicellular eukaryotes - insights from proteome comparisons</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>688</fpage>
            <lpage>694</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10607613</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Comparative genomics in hemiascomycete yeasts: evolution of sex, silencing, and subtelomeres</p>
            </title>
            <aug>
               <au>
                  <snm>Fabre</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Therizols</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lafontaine</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Dujon</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fairhead</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <fpage>856</fpage>
            <lpage>873</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15616141</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>The Ashbya gossypii genome as a tool for mapping the ancient Saccharomyces cerevisiae genome</p>
            </title>
            <aug>
               <au>
                  <snm>Dietrich</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Voegeli</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brachat</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lerch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gates</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Steiner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mohr</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pohlmann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Luedi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Choi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wing</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Flavier</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gaffney</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Philippsen</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>304</volume>
            <fpage>304</fpage>
            <lpage>307</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15001715</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Homologues of yeast polarity genes control the development of multinucleated hyphae in Ashbya gossypii</p>
            </title>
            <aug>
               <au>
                  <snm>Philippsen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kaufmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schmitz</snm>
                  <fnm>HP</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>8</volume>
            <fpage>370</fpage>
            <lpage>377</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16023404</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>The diploid genome sequence of Candida albicans</p>
            </title>
            <aug>
               <au>
                  <snm>Jones</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Federspiel</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Chibana</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Dungan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kalman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Magee</snm>
                  <fnm>BB</fnm>
               </au>
               <au>
                  <snm>Newport</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Thorstenson</snm>
                  <fnm>YR</fnm>
               </au>
               <au>
                  <snm>Agabian</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Magee</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Scherer</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>7329</fpage>
            <lpage>7334</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">409918</pubid>
                  <pubid idtype="pmpid" link="fulltext">15123810</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Mating in Candida albicans and the search for a sexual cycle</p>
            </title>
            <aug>
               <au>
                  <snm>Bennett</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>AD</fnm>
               </au>
            </aug>
            <source>Annu Rev Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>59</volume>
            <fpage>233</fpage>
            <lpage>255</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15910278</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>In vivo and in vitro anaerobic mating in Candida albicans</p>
            </title>
            <aug>
               <au>
                  <snm>Dumitru</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Navarathna</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Semighini</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Elowsky</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Dumitru</snm>
                  <fnm>RV</fnm>
               </au>
               <au>
                  <snm>Dignard</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Whiteway</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Atkin</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Nickerson</snm>
                  <fnm>KW</fnm>
               </au>
            </aug>
            <source>Eukaryot Cell</source>
            <pubdate>2007</pubdate>
            <volume>6</volume>
            <fpage>465</fpage>
            <lpage>472</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1828919</pubid>
                  <pubid idtype="pmpid" link="fulltext">17259544</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Three mating type-like loci in Candida glabrata</p>
            </title>
            <aug>
               <au>
                  <snm>Srikantha</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lachke</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Soll</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Eukaryot Cell</source>
            <pubdate>2003</pubdate>
            <volume>2</volume>
            <fpage>328</fpage>
            <lpage>340</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">154844</pubid>
                  <pubid idtype="pmpid" link="fulltext">12684382</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Molecular evidence for an ancient duplication of the entire yeast genome</p>
            </title>
            <aug>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Shields</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>387</volume>
            <fpage>708</fpage>
            <lpage>713</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9192896</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>Histone lysine methylation: a signature for chromatin function</p>
            </title>
            <aug>
               <au>
                  <snm>Sims</snm>
                  <fnm>RJ</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Nishioka</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Reinberg</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>629</fpage>
            <lpage>639</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14585615</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>The establishment, inheritance, and function of silenced chromatin in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Rusche</snm>
                  <fnm>LN</fnm>
               </au>
               <au>
                  <snm>Kirchmaier</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Rine</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Annu Rev Biochem</source>
            <pubdate>2003</pubdate>
            <volume>72</volume>
            <fpage>481</fpage>
            <lpage>516</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12676793</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>Methylation of histone H3 Lys 4 in coding regions of active genes</p>
            </title>
            <aug>
               <au>
                  <snm>Bernstein</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Humphrey</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Erlich</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bouman</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Kouzarides</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>8695</fpage>
            <lpage>8700</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">124361</pubid>
                  <pubid idtype="pmpid" link="fulltext">12060701</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B78">
            <title>
               <p>Histone H3 lysine 4 methylation is mediated by Set1 and required for cell growth and rDNA silencing in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Briggs</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Bryk</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Strahl</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Cheung</snm>
                  <fnm>WL</fnm>
               </au>
               <au>
                  <snm>Davie</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Dent</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Winston</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Allis</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2001</pubdate>
            <volume>15</volume>
            <fpage>3286</fpage>
            <lpage>3295</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">312847</pubid>
                  <pubid idtype="pmpid" link="fulltext">11751634</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B79">
            <title>
               <p>Methylation patterns of histone H3 Lys 4, Lys 9 and Lys 27 in transcriptionally active and inactive Arabidopsis genes and in atx1 mutants</p>
            </title>
            <aug>
               <au>
                  <snm>Alvarez-Venegas</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Avramova</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>5199</fpage>
            <lpage>5207</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1214549</pubid>
                  <pubid idtype="pmpid" link="fulltext">16157865</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B80">
            <title>
               <p>The genome sequence of the filamentous fungus Neurospora crassa</p>
            </title>
            <aug>
               <au>
                  <snm>Galagan</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Calvo</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Borkovich</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Selker</snm>
                  <fnm>EU</fnm>
               </au>
               <au>
                  <snm>Read</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Jaffe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>FitzHugh</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Smirnov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Purcell</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rehman</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Elkins</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Engels</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Endrizzi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Qui</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ianakiev</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bell-Pedersen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Werner-Washburne</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Selitrennikoff</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Kinsey</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Braun</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Zelter</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schulte</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Kothe</snm>
                  <fnm>GO</fnm>
               </au>
               <au>
                  <snm>Jedd</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mewes</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Staben</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Greenberg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Foley</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Naylor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stange-Thomann</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gnerre</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kamal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kamvysselis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mauceli</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bielke</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rudd</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Frishman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Krystofova</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rasmussen</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Metzenberg</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Perkins</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Kroken</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cogoni</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Macino</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Catcheside</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Pratt</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Osmani</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>DeSouza</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Glass</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Orbach</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Berglund</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Voelker</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yarden</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Plamann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Seiler</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dunlap</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Radford</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Aramayo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Natvig</snm>
                  <fnm>DO</fnm>
               </au>
               <au>
                  <snm>Alex</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Mannhaupt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ebbole</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Freitag</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Sachs</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Nusbaum</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>422</volume>
            <fpage>859</fpage>
            <lpage>868</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12712197</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B81">
            <title>
               <p>Deep and comparative analysis of the mycelium and appressorium transcriptomes of Magnaporthe grisea using MPSS, RL-SAGE, and oligoarray methods</p>
            </title>
            <aug>
               <au>
                  <snm>Gowda</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Venu</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Raghupathy</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Nobuta</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wing</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Stahlberg</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Couglan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Haudenschild</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Dean</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nahm</snm>
                  <fnm>BH</fnm>
               </au>
               <au>
                  <snm>Meyers</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>GL</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>310</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1764740</pubid>
                  <pubid idtype="pmpid" link="fulltext">17156450</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B82">
            <title>
               <p>The genome sequence of the rice blast fungus Magnaporthe grisea</p>
            </title>
            <aug>
               <au>
                  <snm>Dean</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Talbot</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Ebbole</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Farman</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Mitchell</snm>
                  <fnm>TK</fnm>
               </au>
               <au>
                  <snm>Orbach</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Thon</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kulkarni</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Read</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Carbone</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Oh</snm>
                  <fnm>YY</fnm>
               </au>
               <au>
                  <snm>Donofrio</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Jeong</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Soanes</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Djonovic</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kolomiets</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rehmeyer</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Harding</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lebrun</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Bohnert</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Coughlan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Calvo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Nicol</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Purcell</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nusbaum</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Galagan</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>BW</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>434</volume>
            <fpage>980</fpage>
            <lpage>986</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15846337</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B83">
            <title>
               <p>Genetic control of fungal differentiation: the three sporulation pathways of Neurospora crassa</p>
            </title>
            <aug>
               <au>
                  <snm>Springer</snm>
                  <fnm>ML</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>1993</pubdate>
            <volume>15</volume>
            <fpage>365</fpage>
            <lpage>374</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8357339</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B84">
            <title>
               <p>Functional characterization and localization of the Aspergillus nidulans formin SEPA</p>
            </title>
            <aug>
               <au>
                  <snm>Sharpless</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>Mol Biol Cell</source>
            <pubdate>2002</pubdate>
            <volume>13</volume>
            <fpage>469</fpage>
            <lpage>479</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">65642</pubid>
                  <pubid idtype="pmpid" link="fulltext">11854405</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B85">
            <title>
               <p>Transcriptome analysis of Aspergillus nidulans exposed to camptothecin-induced DNA damage</p>
            </title>
            <aug>
               <au>
                  <snm>Malavazi</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Savoldi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Di Mauro</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Menck</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>GH</fnm>
               </au>
            </aug>
            <source>Eukaryot Cell</source>
            <pubdate>2006</pubdate>
            <volume>5</volume>
            <fpage>1688</fpage>
            <lpage>1704</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1595335</pubid>
                  <pubid idtype="pmpid" link="fulltext">17030995</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B86">
            <title>
               <p>SET domains and histone methylation</p>
            </title>
            <aug>
               <au>
                  <snm>Xiao</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Gamblin</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>699</fpage>
            <lpage>705</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14675547</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B87">
            <title>
               <p>The functions of E(Z)/EZH2-mediated methylation of lysine 27 in histone H3</p>
            </title>
            <aug>
               <au>
                  <snm>Cao</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>155</fpage>
            <lpage>164</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15196462</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B88">
            <title>
               <p>Epigenetic regulation of cellular memory by the Polycomb and Trithorax group proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Ringrose</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Paro</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>38</volume>
            <fpage>413</fpage>
            <lpage>443</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15568982</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B89">
            <title>
               <p>Division of labor in polycomb group repression</p>
            </title>
            <aug>
               <au>
                  <snm>Levine</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>IF</fnm>
               </au>
               <au>
                  <snm>Kingston</snm>
                  <fnm>RE</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>2004</pubdate>
            <volume>29</volume>
            <fpage>478</fpage>
            <lpage>485</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15337121</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B90">
            <title>
               <p>Epigenetic control of plant development by Polycomb-group proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Schubert</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Clarenz</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Goodrich</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Curr Opin Plant Biol</source>
            <pubdate>2005</pubdate>
            <volume>8</volume>
            <fpage>553</fpage>
            <lpage>561</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16043386</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B91">
            <title>
               <p>A bivalent chromatin structure marks key developmental genes in embryonic stem cells</p>
            </title>
            <aug>
               <au>
                  <snm>Bernstein</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Mikkelsen</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Kamal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Huebert</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Cuff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fry</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Meissner</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wernig</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Plath</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wagschal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Feil</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>125</volume>
            <fpage>315</fpage>
            <lpage>326</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16630819</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B92">
            <title>
               <p>The Arabidopsis homologs of trithorax (ATX1) and enhancer of zeste (CLF) establish 'bivalent chromatin marks' at the silent AGAMOUS locus</p>
            </title>
            <aug>
               <au>
                  <snm>Saleh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Al-Abdallat</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ndamukong</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Alvarez-Venegas</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Avramova</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>6290</fpage>
            <lpage>6296</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2094062</pubid>
                  <pubid idtype="pmpid" link="fulltext">17881378</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B93">
            <title>
               <p>NSD1 is essential for early post-implantation development and has a catalytically active SET domain</p>
            </title>
            <aug>
               <au>
                  <snm>Rayasam</snm>
                  <fnm>GV</fnm>
               </au>
               <au>
                  <snm>Wendling</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Angrand</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Mark</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Niederreither</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Song</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lerouge</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hager</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>Chambon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Losson</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>2003</pubdate>
            <volume>22</volume>
            <fpage>3153</fpage>
            <lpage>3163</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">162140</pubid>
                  <pubid idtype="pmpid" link="fulltext">12805229</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B94">
            <title>
               <p>Mitotic-specific methylation of histone H4 Lys 20 follows increased PR-Set7 expression and its localization to mitotic chromosomes</p>
            </title>
            <aug>
               <au>
                  <snm>Rice</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Nishioka</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sarma</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Steward</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Reinberg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Allis</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2002</pubdate>
            <volume>16</volume>
            <fpage>2225</fpage>
            <lpage>2230</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">186671</pubid>
                  <pubid idtype="pmpid" link="fulltext">12208845</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B95">
            <title>
               <p>Specificity and mechanism of the histone methyltransferase Pr-Set7</p>
            </title>
            <aug>
               <au>
                  <snm>Xiao</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Jing</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kelly</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Muskett</snm>
                  <fnm>FW</fnm>
               </au>
               <au>
                  <snm>Frenkiel</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Sarma</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Reinberg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gamblin</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2005</pubdate>
            <volume>19</volume>
            <fpage>1444</fpage>
            <lpage>1454</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1151661</pubid>
                  <pubid idtype="pmpid" link="fulltext">15933069</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B96">
            <title>
               <p>Basic local alignment search tool</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1990</pubdate>
            <volume>215</volume>
            <fpage>403</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2231712</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B97">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B98">
            <title>
               <p>Profile hidden Markov models</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>755</fpage>
            <lpage>763</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9918945</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B99">
            <title>
               <p>Hidden Markov models in computational biology. Applications to protein modeling</p>
            </title>
            <aug>
               <au>
                  <snm>Krogh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mian</snm>
                  <fnm>IS</fnm>
               </au>
               <au>
                  <snm>Sjolander</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1994</pubdate>
            <volume>235</volume>
            <fpage>1501</fpage>
            <lpage>1531</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8107089</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B100">
            <title>
               <p>Hidden Markov models for sequence analysis: extension and analysis of the basic method</p>
            </title>
            <aug>
               <au>
                  <snm>Hughey</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Krogh</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1996</pubdate>
            <volume>12</volume>
            <fpage>95</fpage>
            <lpage>107</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8744772</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B101">
            <title>
               <p>CDD: a Conserved Domain Database for protein classification</p>
            </title>
            <aug>
               <au>
                  <snm>Marchler-Bauer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Cherukuri</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>DeWeese-Scott</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Geer</snm>
                  <fnm>LY</fnm>
               </au>
               <au>
                  <snm>Gwadz</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hurwitz</snm>
                  <fnm>DI</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Ke</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Lanczycki</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Liebert</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Marchler</snm>
                  <fnm>GH</fnm>
               </au>
               <au>
                  <snm>Mullokandov</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shoemaker</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Simonyan</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Song</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Thiessen</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Yamashita</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Yin</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>SH</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D192</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540023</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608175</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B102">
            <title>
               <p>SMART 4.0: towards genomic data integration</p>
            </title>
            <aug>
               <au>
                  <snm>Letunic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Copley</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ciccarelli</snm>
                  <fnm>FD</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Schultz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>D142</fpage>
            <lpage>4</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308822</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681379</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B103">
            <title>
               <p>SMART: a web-based tool for the study of genetically mobile domains</p>
            </title>
            <aug>
               <au>
                  <snm>Schultz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Copley</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>231</fpage>
            <lpage>234</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102444</pubid>
                  <pubid idtype="pmpid" link="fulltext">10592234</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B104">
            <title>
               <p>The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Plewniak</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Jeanmougin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>4876</fpage>
            <lpage>4882</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">147148</pubid>
                  <pubid idtype="pmpid" link="fulltext">9396791</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B105">
            <title>
               <p>A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Guindon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gascuel</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2003</pubdate>
            <volume>52</volume>
            <fpage>696</fpage>
            <lpage>704</lpage>
            <xrefbib>
               <pubid idtype="pmpid">14530136</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B106">
            <title>
               <p>PHYLIP (Phylogeny Inference Package) Version 3.6</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <url>http://evolution.genetics.washington.edu/phylip.html</url>
         </bibl>
         <bibl id="B107">
            <title>
               <p>Confidence limits on phylogenies: An approach using the bootstrap</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Evolution</source>
            <pubdate>1985</pubdate>
            <volume>39</volume>
            <fpage>783</fpage>
            <lpage>791</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>
