<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-7-197</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>An inventory of mucin genes in the chicken genome shows that the mucin domain of Muc13 is encoded by multiple exons and that ovomucin is part of a locus of related gel-forming mucins</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Lang</snm>
               <fnm>Tiange</fnm>
               <insr iid="I1"/>
               <email>tiange.lang@medkem.gu.se</email>
            </au>
            <au id="A2">
               <snm>Hansson</snm>
               <mi>C</mi>
               <fnm>Gunnar</fnm>
               <insr iid="I1"/>
               <email>gunnar.hansson@medkem.gu.se</email>
            </au>
            <au id="A3" ca="yes">
               <snm>Samuelsson</snm>
               <fnm>Tore</fnm>
               <insr iid="I1"/>
               <email>tore.samuelsson@medkem.gu.se</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Medical Biochemistry, Goteborg University, Goteborg, Sweden</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>197</fpage>
         <url>http://www.biomedcentral.com/1471-2164/7/197</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16887038</pubid>
               <pubid idtype="doi">10.1186/1471-2164-7-197</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>11</day>
               <month>5</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>03</day>
               <month>8</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>03</day>
               <month>8</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Lang et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Mucins are large glycoproteins that cover epithelial surfaces of the body. All mucins contain at least one PTS domain, a region rich in proline, threonine and serine. Mucins are also characterized by von Willebrand D (VWD) domains or SEA domains. We have developed computational methods to identify mucin genes and proteins based on these properties of the proteins. Using such methods we are able to characterize different organisms where genome sequence is available with respect to their mucin repertoire.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We have here made a comprehensive analysis of potential mucins encoded by the chicken (<it>Gallus gallus</it>) genome. Three transmembrane mucins (Muc4, Muc13, and Muc16) and four gel-forming mucins (Muc6, Muc2, Muc5ac, and Muc5b) were identified. The gel-forming mucins are encoded within a locus similar to the corresponding human mucins. However, the chicken has an additional gene inserted between <it>Muc2 </it>and <it>Muc5ac </it>that encodes the the &#945;-subunit of ovomucin, a protein similar to Muc2, but it is lacking a PTS domain. We also show that the &#946;-subunit of ovomucin is the orthologue of human MUC6. The transmembrane <it>Muc13 </it>gene is in chicken as well as in mammals adjacent to the <it>HEG </it>(heart of glass) gene. HEG has PTS, EGF and transmembrane domains like Muc13, suggesting that these two proteins are evolutionary related. Unlike previously known mucins, the PTS domain of Muc13 is encoded by multiple exons, where each exon encodes a repeat unit of the PTS domain.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We report new mucin homologues in chicken and this information will aid in understanding the evolution of mucins in vertebrates. The fact that ovomucin, a protein not found in mammals, was located in the same locus as other gel-forming mucins provides strong support that these proteins are evolutionary related. Furthermore, a relationship of HEG and the transmembrane Muc13 is suggested on the basis of their biochemical properties and their presence in the same locus. Finally, our finding that the chicken Muc13 is distributed between multiple exons raises the interesting possibility that the length of the PTS domain could be controlled by alternative splicing.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="refman"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The mucosal surfaces are all covered by mucus largely made up of the large glycoproteins referred to as mucins. Mucins play an important role in protection, but some mucins also take part in cell surface signaling and are important for cancer development and progression. Typical for the mucins are the large mucin (PTS) domains rich in the amino acids Ser, Thr and Pro, often characterized by perfect or imperfect tandem repeats <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Most mucins also have other characteristic domains such as von Willebrand D (VWD) or SEA (sea urchin sperm protein-enterokinase-agrin) domains. We have developed bioinformatics methods to identify and characterize mucin genes based on these distinct properties of mucins <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Using such methods, we recently carried out an analysis of the puffer fish <it>Fugu rubripes </it><abbrgrp><abbr bid="B2">2</abbr></abbrgrp>.</p>
         <p>There are two major types of mucins, membrane-bound and secreted. In human, nine membrane-bound (MUC1, MUC3A, MUC3B, MUC4, MUC12, MUC13, MUC16 and MUC17) <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp> and seven secreted mucins (MUC2, MUC5B, MUC5AC, MUC6, MUC7, MUC19 and MUC20) <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp> have been identified. The secreted mucins can be further sub-divided as being either gel-forming (MUC2, MUC5B, MUC5AC, MUC6 and MUC19) or not (MUC7 and MUC20). The ability to form gels is dependent on the capacity of monomers to form polymeric structures. Gel-forming mucins have three VWD domains in their N-terminal ends that are involved in polymerization through intermolecular disulfide-bonds. They also have a cysteine-knot (CK) domain at their C-terminal ends (reviewed in <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>). The VWD domain was first identified in the prepro-von Willebrand factor <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, hence its name. The gel forming mucins and the von Willebrand factor dimerize with the help of their C-terminal VWD domains in the endoplasmic reticulum (ER) <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp> and oligomerize through their N-terminal VWD domains in the acidic compartments of the Golgi complex <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B20">20</abbr></abbrgrp>. The human transmembrane mucins are all characterized by either a SEA domain or a special variant of the VWD domain that is lacking cysteines. Several of the human transmembrane mucins are known to or predicted to be cleaved in their SEA or VWD domains <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
         <p>To understand the evolution of mucins, we are systematically examining the distribution and structure of mucins in different organisms. The results of such analysis will ultimately provide a better understanding of the function of the human mucins. It is also important to study mucins from organisms such as <it>C. elegans</it>, <it>Drosophila</it>, zebrafish and mouse as these are important experimental model systems. The previously analyzed puffer fish <it>Fugu rubripes </it><abbrgrp><abbr bid="B2">2</abbr></abbrgrp> has a gene repertoire similar in size to that of man, but according to our analysis it seems to lack several of the mucins found in the human genome. In particular, this is the case for the transmembrane mucins as only one such gene was identified in the fish whereas the human genome encodes at least nine different.</p>
         <p>Sequencing and annotating mucin genes is notoriously difficult due to their large size and repetitive nature. Therefore, the identification and classification of putative novel mucins requires a variety of bioinformatics tools as well as expert biological knowledge. Continuing our analysis of animal mucin genes, we now report on novel mucins identified in the chicken genome. Previously, a chicken MUC4-related protein was known and Muc16 was identified by Duraisamy et al <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. In addition, two chicken mucin-related proteins have been reported and are referred to as the &#945;- and &#946;-subunits of ovomucin, the major component of egg white and responsible for its gel-like properties <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. We now show that the previously reported &#946;-subunit is the chicken orthologue of human MUC6. We also report on a chicken Muc13 gene. This gene has an unusual organization as the tandem repeats of the PTS domain is encoded by multiple exons where each exon encodes one repeat.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Mucin genes may be reliably predicted using bioinformatics methods</p>
            </st>
            <p>To identify mucins we have used a method (PTSpred) to predict PTS/mucin domains where we analyzed both predicted proteins as well as genomic sequences translated in all six possible reading frames. We have previously applied this method to analyze the puffer fish <it>Fugu rubripes </it><abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. We have here used that method to analyze human and chicken proteins as well as genomic sequences. We have also taken advantage of the fact that all mucins (MUC7 and MUC20 excluded) contain either von Willebrand D (VWD) or SEA domains. Thus, we have analyzed proteins with Pfam models of the VWD and SEA domains and Genewise <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> was used to screen genomic sequences using the same models.</p>
            <p>We considered a protein sequence to be a potential mucin if it contained at least one PTS domain as well as a VWD or SEA domain. Such candidates were further evaluated by phylogenetic analyses of SEA/VWD domains. A protein was considered a strong mucin candidate only in case the phylogenetic analysis supported a relationship between its SEA or VWD domain(s) to those of previously characterized mammalian mucins.</p>
            <p>To test the efficiency of our computational methods we first analyzed available human protein sequences as well as the human genome assembly (for details see Materials and Methods). All human mucins, except MUC7 and MUC20, contain PTS domains as well as either VWD or SEA domains. In summary, our methods successfully identified all of these previously known mucins. In addition, we identified MUC19 <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B25">25</abbr></abbrgrp> that was not known at the time we carried out this work. With the help of EST, mRNA, protein and genome sequences we were also able to reconstruct more complete and accurate human mucin protein sequences and elucidate gene structures (T. Lang et al., unpublished). These results show that our computational methods are reliable in terms of mucin gene predictions and that rigorous analysis of available sequence information is necessary in order to derive reliable predictions of gene and protein sequences.</p>
         </sec>
         <sec>
            <st>
               <p>Prediction of chicken mucin genes</p>
            </st>
            <p>We have now analyzed the chicken genome for mucin genes making use of the assembled genomic sequence and the proteins predicted by ENSEMBL <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. The genome assembly used in this work is expected to be approximately 90% complete. We have used methods described above for screening of the human and <it>F. rubripes </it>genomes. Most of the VWD and SEA domains identified in searches with hmmer and Genewise could be attributed either to mucins or to other previously known human proteins containing these domains. The predicted chicken mucin genes were characterized by a variety of bioinformatics tools and comparisons with known mucin genes and proteins from other species. For instance, all the sequences of the known mucin genes were aligned to the chicken genome. In this way, we could not only identify the human homologues, but also obtain a more complete sequence and understanding of the predicted chicken mucin. For more information on our current assembly of chicken mucins genes, including a comparison to the ENSEMBL predictions, the reader is referred to our mucin web site <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. A summary of our current inventory of mucins in man, mouse, chicken and the fish <it>Fugu rubripes </it>is shown in Table <tblr tid="T1">1</tblr>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Mucins identified in man, mouse, chicken and puffer fish</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>Mucin</p>
                     </c>
                     <c ca="left">
                        <p>Type<sup>a</sup></p>
                     </c>
                     <c ca="left">
                        <p>Characteristic Pfam domain</p>
                     </c>
                     <c ca="left">
                        <p>Chicken</p>
                     </c>
                     <c ca="left">
                        <p>Human</p>
                     </c>
                     <c ca="left">
                        <p>Mouse</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>F. rubripes</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC1</p>
                     </c>
                     <c ca="left">
                        <p>TM</p>
                     </c>
                     <c ca="left">
                        <p>SEA</p>
                     </c>
                     <c ca="left">
                        <p>? (0/10)<sup>d</sup></p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>?</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC2</p>
                     </c>
                     <c ca="left">
                        <p>G</p>
                     </c>
                     <c ca="left">
                        <p>VWD</p>
                     </c>
                     <c ca="left">
                        <p>+ (23/53)</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+<sup>b</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC3</p>
                     </c>
                     <c ca="left">
                        <p>TM</p>
                     </c>
                     <c ca="left">
                        <p>SEA</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+?<sup>c</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC4</p>
                     </c>
                     <c ca="left">
                        <p>TM</p>
                     </c>
                     <c ca="left">
                        <p>VWD</p>
                     </c>
                     <c ca="left">
                        <p>+ (0/23)</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC5AC</p>
                     </c>
                     <c ca="left">
                        <p>G</p>
                     </c>
                     <c ca="left">
                        <p>VWD</p>
                     </c>
                     <c ca="left">
                        <p>+ (8/42)</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC5B</p>
                     </c>
                     <c ca="left">
                        <p>G</p>
                     </c>
                     <c ca="left">
                        <p>VWD</p>
                     </c>
                     <c ca="left">
                        <p>+ (11/37)</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC6</p>
                     </c>
                     <c ca="left">
                        <p>G</p>
                     </c>
                     <c ca="left">
                        <p>VWD</p>
                     </c>
                     <c ca="left">
                        <p>+ (19/31)</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC7</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC10</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC12</p>
                     </c>
                     <c ca="left">
                        <p>TM</p>
                     </c>
                     <c ca="left">
                        <p>SEA</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+?<sup>c</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC13</p>
                     </c>
                     <c ca="left">
                        <p>TM</p>
                     </c>
                     <c ca="left">
                        <p>SEA</p>
                     </c>
                     <c ca="left">
                        <p>+ (13/24)</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC14</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC15</p>
                     </c>
                     <c ca="left">
                        <p>TM</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC16</p>
                     </c>
                     <c ca="left">
                        <p>TM</p>
                     </c>
                     <c ca="left">
                        <p>SEA</p>
                     </c>
                     <c ca="left">
                        <p>+ (0/16)</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC17</p>
                     </c>
                     <c ca="left">
                        <p>TM</p>
                     </c>
                     <c ca="left">
                        <p>SEA</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+?<sup>c</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC19</p>
                     </c>
                     <c ca="left">
                        <p>G</p>
                     </c>
                     <c ca="left">
                        <p>VWD</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MUC20</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c ca="left">
                        <p>+</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ovomucin</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>VWD</p>
                     </c>
                     <c ca="left">
                        <p>+ (45/45)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a </sup>'TM' refers to transmembrane domain and 'G' gel-forming mucin</p>
                  <p><sup>b</sup>All VWD-containing mucins in <it>Fugu rubripes </it>were named Muc2, although the evolutionary relationship of these mucins to the human mucins MUC2/5AC/5B/6 is not clear.</p>
                  <p><sup>c</sup>The mucin gene cluster in mouse for the <it>Muc3/Muc12/Muc17 </it>mucins is incompletely sequenced. A mouse mucin has been described as Muc3 [42], but is most likely the orthologue of the human MUC17.</p>
                  <p><sup>d</sup>The numbers within parentheses indicate the number of exons supported by chicken ESTs as compared to the total number of exons. For Muc1, Muc5ac, and Muc16 the indicated total number of exons is a minimum number as the complete gene structure is not known.</p>
               </tblfn>
            </tbl>
            <p>A total of eight strong mucin candidates in chicken were identified, all with PTS domains and with either VWD (5) or SEA (3) domains. Analysis of the proteins with VWD domains revealed <it>Muc2</it>, <it>Muc5ac</it>, <it>Muc5b</it>, and <it>Muc6 </it>genes that are located in a cluster on chicken chromosome 5 and are discussed in more detail below. In addition we found a homologue to Muc4 on chromosome 9. We observed that a protein predicted by ENSEMBL contained the major portion of Muc4, including a part of the PTS domain followed by AMOP, VWD, EGF1, EGF2, and TM domains characteristic for the human MUC4 <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The missing N-terminal part, including a signal sequence and the major part of the PTS domain, was reconstructed from the genome sequence. The resulting protein sequence is partially identical to a protein previously described as Muc4-related (Genbank: <ext-link ext-link-type="gen" ext-link-id="XP_426704.1">XP_426704.1</ext-link>) <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. The VWD domain of the human MUC4 is unusual in that it lacks cysteines, and this is also true for the chicken Muc4. The sequences of the human and chicken VWD domains are similar (Fig. <figr fid="F1">1</figr>) and taken together our information about the chicken protein strongly supports an orthologous relationship to human MUC4.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Phylogenetic tree of von Willebrand D domains in human and chicken mucins</p>
               </caption>
               <text>
                  <p>Phylogenetic tree of von Willebrand D domains in human and chicken mucins. A neighbor-joining tree was obtained by ClustalW using 1000 bootstrap replicates. Bootstrap percentages above 50 are shown. Groups containing the VWD1, VWD2, VWD3 and VWD4 domains of mucin type are shown with a shaded background. Animals represented are human (h), mouse (m) and chicken (c).</p>
               </text>
               <graphic file="1471-2164-7-197-1"/>
            </fig>
            <p>An analysis of predicted proteins with SEA domains identified chicken Muc13 and Muc16 homologues as well as a weak Muc1 candidate. Muc13 is described further below. The chicken Muc16 protein, previously identified by Duraisamy et al <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, is encoded on chromosome 28 and has a PTS domain followed by at least four SEA domains. The assignment as Muc16 based on phylogenetic analysis (Fig. <figr fid="F2">2</figr>) agrees with previous results <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and is also consistent with the fact that human MUC16 is the only mucin known to have multiple SEA domains <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Phylogenetic tree of SEA domains in human, mouse, chicken and zebrafish</p>
               </caption>
               <text>
                  <p>Phylogenetic tree of SEA domains in human, mouse, chicken and zebrafish. A neighbor-joining tree was obtained by ClustalW using 1000 bootstrap replicates. Bootstrap percentages above 50 are shown. The groups containing the Muc1, Muc13 and Muc16 mucins are shown with a shaded background. Animals represented are human (h), mouse (m), chicken (c) and zebrafish (z).</p>
               </text>
               <graphic file="1471-2164-7-197-2"/>
            </fig>
            <p>Two different proteins with SEA domains related to human MUC1 were identified. One of these were previously analyzed and it was proposed that it is more closely related to a heparin sulfate proteoglycan than to mammalian Muc1 <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. The other MUC1-related protein was analyzed here. However, it did not convincingly cluster with the SEA domains of other Muc1 proteins ('unknown' in Fig. <figr fid="F2">2</figr>). Furthermore, the SEA domain of this protein is preceded by a PTS domain, but a transmembrane domain characteristic of MUC1 could not be identified. Finally, the N-terminal region of this putative mucin gene cannot be analyzed due to a gap in the genomic sequence. Therefore, it is not possible at this stage to predict the existence of a chicken Muc1.</p>
            <p>We expect the predicted genes to be bona fide mucin genes because of the strong similarity to mucins from other species with respect to protein sequence, protein domain structure as well as gene structure. In general, it is difficult to distinguish between bona fide genes and pseudogenes. However, an analysis of available chicken ESTs provides evidence of expression for a majority of mucins genes that we have identified (Table <tblr tid="T1">1</tblr>). Thus, only in the case of Muc4, Muc16 and for the protein distantly related to Muc1 we were not able to find a corresponding EST sequence. The absence of EST support is not conclusive, as the available chicken EST data is not expected to be comprehensive.</p>
            <p>At the same time it must be pointed out that there are limitations to our approach. We are not able to effectively identify mucins that are lacking VWD and SEA domains, mainly because that PTSpred will give rise to a number of false positive sequences. In addition, we might fail to detect mucin candidates because genome assemblies are incomplete, particularly with respect to mucin genes, and because of limitations in gene prediction procedures.</p>
         </sec>
         <sec>
            <st>
               <p>The ovomucin gene is part of a gene cluster with gel-forming mucins Muc2, Muc5ac, Muc5b and Muc6</p>
            </st>
            <p>Five VWD-containing proteins were found within a region of chromosome 5, covering 12 million bases. The domain structure of the proteins on chromosome 5 suggested that this region has an organization similar to the human <it>MUC2/5AC/5B/6 </it>gene cluster. The relative gene order and polarity was identical to the corresponding human mucins as shown in Fig. <figr fid="F3">3</figr>. Thus, the <it>Muc6 </it>mucin is positioned next to and in the opposite direction to <it>Muc2</it>, <it>Muc5ac </it>and <it>Muc5b</it>.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Organization of the gene cluster for gel-forming mucins in chicken as compared to man</p>
               </caption>
               <text>
                  <p>Organization of the gene cluster for gel-forming mucins in chicken as compared to man. The orientation of the genes is indicated by arrows. The major difference between the two organisms is the presence of the ovomucin gene in chicken.</p>
               </text>
               <graphic file="1471-2164-7-197-3"/>
            </fig>
            <p>The domain structures of the individual chicken gel-forming mucins were analyzed and the results are shown in Fig. <figr fid="F4">4</figr>. Typically, these mucins have three VWD domains followed by alternating PTS and CysD domains, and at the C-terminal end a cysteine-knot (CK) domain. The Muc2 and ovomucin proteins have an additional VWD domain. The chicken Muc2 ortholog was identified as the protein most similar to the human MUC2. However, the central part of the predicted molecule contained at least three CysD and four PTS domains, whereas the human protein only has two CysD and two PTS domains. A gap in the genomic sequence precludes further comparison and a conclusion as to the differences. The chicken Muc5ac and Muc5b proteins have a similar domain structure with central repeated CysD and PTS domains as in the human orthologues. However, chicken Muc5B lacks the C-terminal VWD domain in contrast to the human orthologue. The chicken Muc5ac genomic sequence has a large gap in its 3'-end preventing further comparison. Also for Muc6 the domain structure is identical to the human orthologue (Fig. <figr fid="F4">4</figr>), but a gap in the 3' genomic sequence makes it impossible to compare this region.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Domain organization of mucins in the chicken gel-forming mucin cluster</p>
               </caption>
               <text>
                  <p>Domain organization of mucins in the chicken gel-forming mucin cluster. Dotted lines indicate a gap in the genome assembly and when such gaps occur, a minimum size of the protein is indicated.</p>
               </text>
               <graphic file="1471-2164-7-197-4"/>
            </fig>
            <p>All VWD domains identified by a screen of the chicken genome with Genewise were also compared to previously known VWD domains using BLAST and ClustalW. The phylogenetic tree from a ClustalW analysis is shown in Fig. <figr fid="F1">1</figr>. Interestingly, all the mucin VWD domains are clustered in a characteristic manner based on their position in the mucins as we have previously shown for <it>Fugu rubripes </it>mucins <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. The different VWD domains numbered 1&#8211;4 in Fig. <figr fid="F1">1</figr> are clearly homologous such that the chicken VWD-1 is most closely related to the human VWD-1, etc. The grouping of the human and chicken VWD domains strongly supports our assignment of the chicken mucins as Muc2, Muc5ac, Muc5b and Muc6, respectively.</p>
            <p>When the chromosome 5 locus of chicken is compared to the corresponding locus in human, the most notable difference is the presence of one additional gene in the chicken. The predicted protein contains four VWD domains organized as for the gel-forming mucins (Fig. <figr fid="F4">4</figr>). This protein was recently cloned by Watanabe <it>et al </it><abbrgrp><abbr bid="B23">23</abbr></abbrgrp> and referred to as the &#945;-subunit of ovomucin. An additional subunit called &#946; has also been described <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Both subunits are abundant in egg white and are responsible for its gel-like properties. However, from the results presented here it is obvious that the &#946;-subunit of ovomucin is an orthologue to the human MUC6. In the following, the &#945;-subunit of ovomucin is simply referred to as ovomucin as this protein is specific to the chicken mucin locus. Ovomucin has a similar domain structure as the other proteins in the cluster except that it does not contain the PTS domain characteristic of mucins (Fig. <figr fid="F4">4</figr>).</p>
            <p>Interestingly, from the phylogenetic tree in Fig. <figr fid="F1">1</figr> it seems that the VWD domains of ovomucin are more deeply branched than Muc2/5ac/5b, suggesting that this ovomucin is a more ancient protein. It will be interesting to further study this issue by identifying homologues of the gel-forming mucins in other species. Preliminary results suggest that there are ovomucin homologues in <it>X. tropicalis </it>and in the fishes <it>F. rubripes</it>, <it>T. nigroviridis </it>and <it>D. rerio </it>(zebrafish). However, ovomucin is not present in man and rodents. The tree in Fig. <figr fid="F1">1</figr> also seems to suggest that Muc6 is more deeply branched than the other gel-forming mucins of the same locus, raising the possibility that this protein is the ancestral form of the Muc2/5ac/5b/Muc6 proteins.</p>
         </sec>
         <sec>
            <st>
               <p>The PTS domain of Muc13 is encoded by multiple exons where each exon corresponds to a repeated unit</p>
            </st>
            <p>A gene encoding the chicken Muc13 orthologue was identified on chromosome 7. The protein has an N-terminal signal sequence, followed by one PTS, one SEA, one transmembrane domain and a cytoplasmic tail. There is a gap in the genome assembly encoding the PTS domain and therefore the full sequence of this domain cannot be predicted. However, the PTS domain is composed of at least 12 repeats, each 20 amino acids in length (Fig. <figr fid="F5">5</figr>).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Comparison of the MUC13 sequence in man and chicken</p>
               </caption>
               <text>
                  <p>Comparison of the MUC13 sequence in man and chicken. <it>A</it>. Genomic organization of exons and introns and the domains encoded by the exons. <it>B</it>. Amino acid sequences of the PTS (mucin) domains of the chicken and human MUC13. For the chicken PTS each line corresponds to one exon. <it>C</it>. Alignment of the amino acid sequence C-terminal of the PTS domain. Identical and similar amino acids are indicated with black and grey, respectively, and domains are shown under each sequence.</p>
               </text>
               <graphic file="1471-2164-7-197-5"/>
            </fig>
            <p>Typical for the PTS domains of previously known mucins are that these are built from tandem repeats that often show a remarkable length polymorphism (VNTR, variable number of tandem repeats) <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B28">28</abbr></abbrgrp>. The mechanism and functional significance of this variability in length is currently not known, but there are several indications that such variation is associated with disease. The allele length of <it>MUC1 </it>has been linked to susceptibility to <it>Helicobacter pylori </it>infection and gastric cancer <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. Furthermore, it has recently been suggested that the allele length of <it>MUC1 </it>influences the expression of tumor associated carbohydrate antigens and possibly also the aggressiveness of gastric cancer <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
            <p>For all previously described mucins, including the human MUC13, the PTS domain is found within a single large exon. However, the chicken Muc13 PTS domain is encoded by multiple exons. There is a chicken EST (Genbank accession <ext-link ext-link-type="gen" ext-link-id="AJ452523">AJ452523</ext-link>) that gives support to this conclusion. As with most other mucins, the chicken Muc13 PTS domain contains repeats. It is interesting to note that the sequences encoded by the exons are nearly identical, i.e. the sequence encoded by one exon corresponds to a repeat unit of the PTS domain (Fig. <figr fid="F5">5</figr>).</p>
            <p>The chicken Muc13 tandem repeats thus have a different genomic organization as compared to higher animals. An analysis of zebrafish proteins (unpublished) identified a Muc13 homologue (Fig. <figr fid="F2">2</figr>) with its gene encoding the PTS domain divided into several exons. These findings suggest that this organization of the PTS domain represents an ancestral design of the vertebrate Muc13 gene and perhaps of other mucins.</p>
            <p>The genomic organization of the Muc13 gene raises the possibility that a variation in length of the PTS domain may be accomplished not only by recombination events, as is the case for the human MUC1 polymorphism, but also by a regulation of splicing of mucin domain exons. This will allow a length variation not only between individuals, but also within one and the same individual.</p>
         </sec>
         <sec>
            <st>
               <p>Relationship of Muc13 and HEG</p>
            </st>
            <p>The HEG (heart of glass) gene was first identified in zebrafish where it regulates the growth of the heart <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Three isoforms are generated, one of which is predicted to be transmembrane and two secreted. Homologues have been identified in vertebrates, including man, and we now identified a chicken homologue. We observed that this protein is encoded by a gene adjacent to the <it>Muc13 </it>gene on chromosome 7 and these two genes have the same polarity. Synteny between chicken and man extends even beyond these two genes in both directions. Analysis of the human and mouse genomes shows that the <it>HEG </it>and <it>Muc13 </it>genes are organized in the same manner in these animals. Although Muc13 has an SEA domain which is absent in HEG, there are interesting similiarities between the two proteins as both have transmembrane, EGF and PTS domains. This observation together with the fact that the genes are in the same locus suggest an evolutionary relationship.</p>
         </sec>
         <sec>
            <st>
               <p>The evolution of vertebrate mucins</p>
            </st>
            <p>The results of our inventory of mucins in vertebrates are summarized in Table <tblr tid="T1">1</tblr>. In our previous study of <it>F. rubripes </it>mucins, we concluded that this vertebrate has a set of gel-forming mucins comparable to those of man and rodents. Further analysis of <it>F. rubripes</it>, <it>Tetraodon nigroviridis</it>, and <it>Danio rerio </it>as well as <it>Ciona intestinalis </it>suggest that some of the mucins are quite different to the higher vertebrate mucins (in preparation). In the chicken however, we found obvious homologues of the primate and rodent Muc2, Muc5ac, Muc5b, and Muc6. They are homologous both with respect to sequence of the VWD domains as well as to their localization and direction in the gene cluster. Interestingly, the chicken cluster contains an additional gene encoding ovomucin, found in egg white. This gene seems to be present also in frogs and fishes (Lang, T., et al., unpublished observation), but has disappeared during the development of mammals and might not be needed by animals where the fertilized egg is developed within the female body. A more detailed study of the phylogenetic distribution of ovomucin will probably give more clues as to its evolutionary history.</p>
            <p>Whereas fish, chicken, and man have a reasonably similar set of gel-forming mucins, many of the transmembrane type mucins are missing in chicken and fish. In particular, this is true for fish as only genes encoding Muc13 and a MUC1-like protein was identified in <it>F. rubripes</it>. In the chicken, also the transmembrane mucins Muc4 and Muc16 were identified. Still the transmembrane mucins homologous to the human MUC3, MUC12, and MUC17 seem to be missing in chicken as well as in fishes. These mucins might therefore be a more recent development in vertebrates. A more detailed view of the phylogeny of these proteins will be crucial to better understand the evolution of mucins. Therefore, it is necessary to carry out a careful inventory of mucins in more animals.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We have identified several novel mucin homologues in chicken. We have shown that chicken has a set of mucins comparable to that of human although we fail to identify a homologue to the gel-forming MUC19 and to the transmembrane MUC3, MUC12, MUC15 and MUC17 proteins.</p>
         <p>Ovomucin, similar to Muc2 but without a PTS domain, is a protein found in chicken but not in mammals. We now have shown that the gene encoding ovomucin is part of a locus highly homologous to a human locus containing the <it>Muc6</it>, <it>Muc2</it>, <it>Muc5ac</it>, and <it>Muc5b </it>genes. We have also demonstrated that the protein referred to as the &#946;-subunit of ovomucin is a protein homologous to human MUC6.</p>
         <p>The chicken transmembrane mucin Muc13, as well as the homologues in man and mouse, contains SEA, EGF and PTS domains on the extracellular side of the membrane. Both in chicken and mammals the <it>HEG </it>gene was found to be located adjacent to the <it>Muc13 </it>gene. HEG is a transmembrane protein with EGF and PTS domains as Muc13, although no SEA domain can be identified in HEG. Therefore, an evolutionary relationship between Muc13 and HEG is implied.</p>
         <p>Finally, we have shown that the PTS domain of Muc13 is encoded by multiple exons, where each exon encodes a repeat unit of the PTS domain. This is in contrast to previously described PTS domains that are all encoded by one exon only. Allelic polymorphism affecting the length of the PTS domain is observed in human mucins. The gene organization in chicken suggests that a variability in the PTS domain could also be accomplished within an individual through alternative splicing.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Sources of sequence information</p>
            </st>
            <p>As sources for protein and genomic sequences we used UCSC <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, Ensembl <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, NCBI <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, and Celera <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Protein domain profiles were from Pfam <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. We made use of the ENSEMBL version of the chicken genome, 27.1d. The genomic DNA sequence had 111864 contigs, with a total of 1,08 &#215; 10<sup>9 </sup>bases.</p>
         </sec>
         <sec>
            <st>
               <p>Bioinformatic methods</p>
            </st>
            <p>For identification of PTS domains we made use of PTSpred that can be used to search both DNA and protein sequences <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. To identify Pfam domains we used for protein sequences hmmpfam of the hmmer package <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> and for nucleotide sequences Genewise <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Transmembrane domains were identified by TMHMM <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> and signal sequences by SignalP <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. For exon prediction Genscan <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> was used. Alignments of proteins and DNA were done by BLAST, ClustalW <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> or programs of the GCG package (GCG, Madison, WI). The repetitive nature of the PTS domains was analyzed with Dotplot of the GCG package. In house Perl scripts were used for additional tasks.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of chicken PTS, VWD and SEA domains of chicken</p>
            </st>
            <p>Two different sets of proteins were considered, on the one hand, proteins predicted by ENSEMBL and on the other hand, proteins predicted by <it>ab initio </it>methods. When analyzing the ENSEMBL proteins the PTS domain, VWD and SEA domain analysis identified 146, 53 and 26 proteins, respectively. Analysis of the corresponding set of proteins predicted by ab initio methods resulted in 78 PTS, 52 VWD and 17 SEA domain candidates. Ten different proteins had both PTS and either VWD or SEA domains. Two were identified as related to otogelin. The remaining eight proteins were identified as Muc1 (weak candidate), Muc2, Muc4, Muc5ac, Muc5b, Muc6, Muc13, and Muc16 and are described under Results and Discussion. We also used Genewise to scan the chicken genome sequence but that analysis did not result in any additional strong mucin candidates.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>TL carried out all bioinformatics analyses and prepared all figures. GH and TS conceived of the study and drafted the manuscript jointly. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>Tiange Lang has been supported by the Swedish Knowledge Foundation through the Industrial PhD program in Medical Bioinformatics at the Centre for Medical Innovations (CMI) at the Karolinska Institute and by a grant from the Sahlgren's Hospital (grant to Nils Lycke). The project was supported by The Swedish Research Council (No. 7461).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Mucin in cancer: protection and control of the cell surface</p>
            </title>
            <aug>
               <au>
                  <snm>Hollingsworth</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Swanson</snm>
                  <fnm>BJ</fnm>
               </au>
            </aug>
            <source>Nat Rev Cancer</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <fpage>45</fpage>
            <lpage>60</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrc1251</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681689</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Bioinformatic identification of polymerizing and transmembrane mucins in the puffer fish Fugu rubripes.</p>
            </title>
            <aug>
               <au>
                  <snm>Lang</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Alexandersson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hansson</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Samuelsson</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Glycobiology</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>521</fpage>
            <lpage>527</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/glycob/cwh066</pubid>
                  <pubid idtype="pmpid" link="fulltext">15044386</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Molecular cloning and expression of human tumor-associated polymorphic epithelial mucin</p>
            </title>
            <aug>
               <au>
                  <snm>Gendler</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Lancaster</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Taylor-Papadimitriou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Duhig</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Peat</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Burchell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pemberton</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lalani</snm>
                  <fnm>EN</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1990</pubdate>
            <volume>265 nr.25</volume>
            <fpage>15286</fpage>
            <lpage>15293</lpage>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Multiple transcripts of MUC3: Evidence for two genes MUC3A and MUC3B</p>
            </title>
            <aug>
               <au>
                  <snm>Pratt</snm>
                  <fnm>WS</fnm>
               </au>
               <au>
                  <snm>Crawley</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hicks</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ho</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nash</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>YS</fnm>
               </au>
               <au>
                  <snm>Gum</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Swallow</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>Biochem Biophys Res Commun</source>
            <pubdate>2000</pubdate>
            <volume>275</volume>
            <fpage>916</fpage>
            <lpage>923</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/bbrc.2000.3406</pubid>
                  <pubid idtype="pmpid" link="fulltext">10973822</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Two novel mucin genes down-regulated in colorectal cancer identified by differential display</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>McGuckin</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Gotley</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Eyre</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Sutherland</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Antalis</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Cancer Res</source>
            <pubdate>1999</pubdate>
            <volume>59</volume>
            <fpage>4083</fpage>
            <lpage>4089</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10463611</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Complete sequence of the human mucin MUC4: a putative cell membrane-associated mucin</p>
            </title>
            <aug>
               <au>
                  <snm>Moniaux</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Nollet</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Degand</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Laine</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Aubert</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Biochem J</source>
            <pubdate>1999</pubdate>
            <volume>338</volume>
            <fpage>325</fpage>
            <lpage>333</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1220057</pubid>
                  <pubid idtype="pmpid" link="fulltext">10024507</pubid>
                  <pubid idtype="doi">10.1042/0264-6021:3380325</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>MUC13, a novel human cell surface mucin expressed by epithelial and hemopoietic cells</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Wreschner</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Tran</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Eyre</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Sutherland</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>McGuckin</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2001</pubdate>
            <volume>276</volume>
            <fpage>18327</fpage>
            <lpage>18336</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M008850200</pubid>
                  <pubid idtype="pmpid" link="fulltext">11278439</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Ovarian cancer antigen CA125 is encoded by the MUC16 mucin gene</p>
            </title>
            <aug>
               <au>
                  <snm>Yin</snm>
                  <fnm>BWT</fnm>
               </au>
               <au>
                  <snm>Dnistrian</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lloyd</snm>
                  <fnm>KO</fnm>
               </au>
            </aug>
            <source>Int J Cancer</source>
            <pubdate>2002</pubdate>
            <volume>98</volume>
            <fpage>737</fpage>
            <lpage>740</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/ijc.10250</pubid>
                  <pubid idtype="pmpid" link="fulltext">11920644</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>MUC17, a novel membrane-tethered mucin</p>
            </title>
            <aug>
               <au>
                  <snm>Gum</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Crawley</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Hicks</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Szymkowski</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>YS</fnm>
               </au>
            </aug>
            <source>Biochem Biophys Res Commun</source>
            <pubdate>2002</pubdate>
            <volume>291</volume>
            <fpage>466</fpage>
            <lpage>475</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/bbrc.2002.6475</pubid>
                  <pubid idtype="pmpid" link="fulltext">11855812</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Molecular cloning of human intestinal mucin (MUC2) cDNA. Identification of the amino terminus and overall sequence similarity to prepro-von Willebrand factor</p>
            </title>
            <aug>
               <au>
                  <snm>Gum</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Hicks</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Toribara</snm>
                  <fnm>NW</fnm>
               </au>
               <au>
                  <snm>Siddiki</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>YS</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1994</pubdate>
            <volume>269</volume>
            <fpage>2440</fpage>
            <lpage>2446</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8300571</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Genomic organization of the human mucin gene MUC5B-cDNA and genomic sequences upstream of the large central exon</p>
            </title>
            <aug>
               <au>
                  <snm>Desseyn</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Buisine</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Porchet</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Aubert</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Laine</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1998</pubdate>
            <volume>273</volume>
            <fpage>30157</fpage>
            <lpage>30164</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.273.46.30157</pubid>
                  <pubid idtype="pmpid" link="fulltext">9804771</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Cloning of the amino-terminal and 5'-flanking region of the human MUC5AC mucin gene and transcriptional up-regulation by bacterial exoproducts</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>DZ</fnm>
               </au>
               <au>
                  <snm>Gallup</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Szymkowski</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Basbaum</snm>
                  <fnm>CB</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1998</pubdate>
            <volume>273</volume>
            <fpage>6812</fpage>
            <lpage>6820</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.273.12.6812</pubid>
                  <pubid idtype="pmpid" link="fulltext">9506983</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Human gastric mucin</p>
            </title>
            <aug>
               <au>
                  <snm>Toribara</snm>
                  <fnm>NW</fnm>
               </au>
               <au>
                  <snm>Roberton</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>HO</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Kuo</snm>
                  <fnm>WL</fnm>
               </au>
               <au>
                  <snm>Gum</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hicks</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Gum</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Byrd</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Siddiki</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>YS</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1993</pubdate>
            <volume>268 nr.8</volume>
            <fpage>5879</fpage>
            <lpage>5885</lpage>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Molecular cloning, sequence, and specificity of expression of the gene encoding the low molecular weight human salivary mucin (MUC7)</p>
            </title>
            <aug>
               <au>
                  <snm>Bobek</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Biesbrock</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1993</pubdate>
            <volume>268</volume>
            <fpage>20563</fpage>
            <lpage>20569</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7690757</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Genome-Wide Search and Identification of a Novel Gel-Forming Mucin MUC19/Muc19 in Glandular Tissues</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Kalaslavadi</snm>
                  <fnm>TB</fnm>
               </au>
               <au>
                  <snm>Hamati</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Nehrke</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Le</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Ann</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Am J Resp Cell Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>30</volume>
            <fpage>155</fpage>
            <lpage>165</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1165/rcmb.2003-0103OC</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>MUC20 Suppresses the Hepatocyte Growth Factor-Induced Grb2-Ras Pathway by Binding to a Multifunctional Docking Site of Met</p>
            </title>
            <aug>
               <au>
                  <snm>Higuchi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Orita</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Katsuya</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yamasaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Akiyama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yamamoto</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Saito</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2004</pubdate>
            <volume>24</volume>
            <fpage>7456</fpage>
            <lpage>7468</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">506992</pubid>
                  <pubid idtype="pmpid" link="fulltext">15314156</pubid>
                  <pubid idtype="doi">10.1128/MCB.24.17.7456-7468.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>The structure and assembly of secreted mucins</p>
            </title>
            <aug>
               <au>
                  <snm>Perez-Vilar</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1999</pubdate>
            <volume>274</volume>
            <fpage>31751</fpage>
            <lpage>31754</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.274.45.31751</pubid>
                  <pubid idtype="pmpid" link="fulltext">10542193</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>von Willebrand factor</p>
            </title>
            <aug>
               <au>
                  <snm>Sadler</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1991</pubdate>
            <volume>266</volume>
            <fpage>22777</fpage>
            <lpage>22780</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">1744071</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Dimerization of the human MUC2 mucin in the endoplasmic reticulum is followed by a N-glycosylation-dependent transfer of the mono- and dimers to the Golgi apparatus</p>
            </title>
            <aug>
               <au>
                  <snm>Asker</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Axelsson</snm>
                  <fnm>MAB</fnm>
               </au>
               <au>
                  <snm>Olofsson</snm>
                  <fnm>SO</fnm>
               </au>
               <au>
                  <snm>Hansson</snm>
                  <fnm>GC</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1998</pubdate>
            <volume>273</volume>
            <fpage>18857</fpage>
            <lpage>18863</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.273.30.18857</pubid>
                  <pubid idtype="pmpid" link="fulltext">9668061</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>O-glycosylated MUC2 monomer and dimer from LS 174T cells are water-soluble, whereas larger MUC2 species formed early during biosynthesis are insoluble and contain nonreducible intermolecular bonds</p>
            </title>
            <aug>
               <au>
                  <snm>Axelsson</snm>
                  <fnm>MAB</fnm>
               </au>
               <au>
                  <snm>Asker</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hansson</snm>
                  <fnm>GC</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1998</pubdate>
            <volume>273</volume>
            <fpage>18864</fpage>
            <lpage>18870</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.273.30.18864</pubid>
                  <pubid idtype="pmpid" link="fulltext">9668062</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Auto-proteolysis coupled to protein folding in the SEA domain of the membrane-bound MUC1 mucin</p>
            </title>
            <aug>
               <au>
                  <snm>Macao</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Johansson</snm>
                  <fnm>DGA</fnm>
               </au>
               <au>
                  <snm>Hansson</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>H&#228;rd</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nature Struct Mol Biol</source>
            <pubdate>2006</pubdate>
            <volume>13</volume>
            <fpage>71</fpage>
            <lpage>76</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1038/nsmb1035</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Distinct evolution of the human carcinoma-associated transmembrane mucins, MUC1, MUC4 AND MUC16</p>
            </title>
            <aug>
               <au>
                  <snm>Duraisamy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ramasamy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kharbanda</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kufe</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2006</pubdate>
            <volume>373</volume>
            <fpage>28</fpage>
            <lpage>34</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gene.2005.12.021</pubid>
                  <pubid idtype="pmpid" link="fulltext">16500040</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Amino acid sequence of a-subunit in hen egg white ovomucin deduced from cloned cDNA</p>
            </title>
            <aug>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shimoyamada</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Onizuka</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Akiyama</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Niwa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ido</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tsuge</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>DNA Sequence</source>
            <pubdate>2004</pubdate>
            <volume>15</volume>
            <fpage>251</fpage>
            <lpage>261</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10425170410001723921</pubid>
                  <pubid idtype="pmpid" link="fulltext">15620212</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>GeneWise and Genomewise</p>
            </title>
            <aug>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>988</fpage>
            <lpage>995</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">479130</pubid>
                  <pubid idtype="pmpid" link="fulltext">15123596</pubid>
                  <pubid idtype="doi">10.1101/gr.1865504</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>The Gene Encoding Mouse Muc19: cDNA, Genomic Organization and Relationship to Smgc</p>
            </title>
            <aug>
               <au>
                  <snm>Culp</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Latchney</snm>
                  <fnm>LR</fnm>
               </au>
               <au>
                  <snm>Fallon</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Denny</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Denny</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Couwenhoven</snm>
                  <fnm>RI</fnm>
               </au>
               <au>
                  <snm>Chuang</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Physiol Genomics</source>
            <pubdate>2004</pubdate>
            <volume>19</volume>
            <fpage>303</fpage>
            <lpage>318</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1152/physiolgenomics.00161.2004</pubid>
                  <pubid idtype="pmpid" link="fulltext">15340121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Ensembl [http://www.ensembl.org/]</p>
            </title>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Mucin web site [http://www.medkem.gu.se/mucinbiology/databases]</p>
            </title>
         </bibl>
         <bibl id="B28">
            <title>
               <p>The human tumour-associated epithelial mucins are coded by an expressed hypervariable gene locus PUM</p>
            </title>
            <aug>
               <au>
                  <cnm>Swallow.DM</cnm>
               </au>
               <au>
                  <snm>Gendler</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Griffiths</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Corney</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Taylor-Papadimitriou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bramwell</snm>
                  <fnm>ME</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1987</pubdate>
            <volume>328</volume>
            <fpage>82</fpage>
            <lpage>84</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/328082a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">3600778</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>MUC1 gene polymorphism in the gastric carcinogenesis pathway</p>
            </title>
            <aug>
               <au>
                  <snm>Silva</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Carvalho</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Peixoto</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Seixas</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Almeida</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Carneiro</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Mesquira</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Figueiredo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nogeira</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Swallow</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Amorim</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>David</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Eur J Human Gen</source>
            <pubdate>2001</pubdate>
            <volume>9</volume>
            <fpage>548</fpage>
            <lpage>552</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1038/sj.ejhg.5200677</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Altered expression and allelic association of the hypervariable membrane mucin MUC1 in Helicobacter pylori gastritis</p>
            </title>
            <aug>
               <au>
                  <snm>Vinall</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Novelli</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Daniels</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hilkens</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sarner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Swallow</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>Gastroenterology</source>
            <pubdate>2002</pubdate>
            <volume>123</volume>
            <fpage>41</fpage>
            <lpage>49</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1053/gast.2002.34157</pubid>
                  <pubid idtype="pmpid" link="fulltext">12105832</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Thomsen-Friedenreich antigen expression in gastric carcinomas is associated with MUC1 mucin VNTR polymorphism</p>
            </title>
            <aug>
               <au>
                  <snm>Santos-Silva</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Fonseca</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Caffrey</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Carvalho</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Mesquita</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Reis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Almeida</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>David</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hollingsworth</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Glycobiology</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>511</fpage>
            <lpage>517</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/glycob/cwi027</pubid>
                  <pubid idtype="pmpid" link="fulltext">15604091</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>heart of glass regulates the concentric growth of the heart in zebrafish</p>
            </title>
            <aug>
               <au>
                  <snm>Mably</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Mohideen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Burns</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>JN</fnm>
               </au>
               <au>
                  <snm>Fishman</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>2138</fpage>
            <lpage>2147</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cub.2003.11.055</pubid>
                  <pubid idtype="pmpid" link="fulltext">14680629</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>UCSC Genome Bioinformatics [http://genome.ucsc.edu/]</p>
            </title>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Ensembl [http://www.ensembl.org/]</p>
            </title>
         </bibl>
         <bibl id="B35">
            <title>
               <p>NCBI, National Center for Biotechnology information [http://www.ncbi.nlm.nih.gov/]</p>
            </title>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Celera Genomics [http://www.celera.com/]</p>
            </title>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Pfam [http://www.sanger.ac.uk/Software/Pfam/]</p>
            </title>
         </bibl>
         <bibl id="B38">
            <title>
               <p>HMMER: profile HMMs for protein sequence analysis [http://hmmer.wustl.edu/]</p>
            </title>
         </bibl>
         <bibl id="B39">
            <title>
               <p>A hidden Markov model for predicting transmembrane helices in protein sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Krogh</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Int Conf Intell Syst Mol Biol</source>
            <pubdate>1998</pubdate>
            <volume>6</volume>
            <fpage>175</fpage>
            <lpage>182</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9783223</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites</p>
            </title>
            <aug>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Engelbrecht</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Prot Eng</source>
            <pubdate>1997</pubdate>
            <volume>10</volume>
            <fpage>1</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/protein/10.1.1</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Prediction of complete gene structures in human genomic DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Burge</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Karlin</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1997</pubdate>
            <volume>268</volume>
            <fpage>78</fpage>
            <lpage>94</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1997.0951</pubid>
                  <pubid idtype="pmpid" link="fulltext">9149143</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalities and weight matrix choice</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucl Acid Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
         </bibl>
         <bibl id="B43">
            <title>
               <p>The carboxyl-terminal sequence of rat intestinal mucin RMuc3 contains a putative transmembrane region and two EGF-like motifs</p>
            </title>
            <aug>
               <au>
                  <snm>Khatri</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Forstner</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Forstner</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Bioch Biophys Acta</source>
            <pubdate>1997</pubdate>
            <volume>1326</volume>
            <fpage>7</fpage>
            <lpage>11</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>

