<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1471-2164-13-87</ui><ji>1471-2164</ji><fm>
<dochead>Research article</dochead>
<bibl>
<title>
<p>Antigen-presenting genes and genomic copy number variations in the Tasmanian devil MHC</p>
</title>
<aug>
<au id="A1"><snm>Cheng</snm><fnm>Yuanyuan</fnm><insr iid="I1"/><email>yuanyuan.cheng@sydney.edu.au</email></au>
<au id="A2"><snm>Stuart</snm><fnm>Andrew</fnm><insr iid="I2"/><email>abs004@gmail.com</email></au>
<au id="A3"><snm>Morris</snm><fnm>Katrina</fnm><insr iid="I1"/><email>katrina.morris@sydney.edu.au</email></au>
<au id="A4"><snm>Taylor</snm><fnm>Robyn</fnm><insr iid="I3"/><email>Robyn.Taylor@dpipwe.tas.gov.au</email></au>
<au id="A5"><snm>Siddle</snm><fnm>Hannah</fnm><insr iid="I1"/><insr iid="I4"/><email>hvs26@cam.ac.uk</email></au>
<au id="A6"><snm>Deakin</snm><fnm>Janine</fnm><insr iid="I5"/><email>Janine.Deakin@anu.edu.au</email></au>
<au id="A7"><snm>Jones</snm><fnm>Menna</fnm><insr iid="I6"/><email>Menna.Jones@utas.edu.au</email></au>
<au id="A8"><snm>Amemiya</snm><mi>T</mi><fnm>Chris</fnm><insr iid="I2"/><email>camemiya@benaroyaresearch.org</email></au>
<au id="A9" ca="yes"><snm>Belov</snm><fnm>Katherine</fnm><insr iid="I1"/><insr iid="I7"/><email>kathy.belov@sydney.edu.au</email></au>
</aug>
<insg>
<ins id="I1"><p>Faculty of Veterinary Science, University of Sydney, Sydney, NSW, Australia</p></ins>
<ins id="I2"><p>Genome Resource Center, Benaroya Research Institute at Virginia Mason, Seattle, WA, USA</p></ins>
<ins id="I3"><p>Department of Primary Industries, Parks, Water and Environment, Prospect, TAS, Australia</p></ins>
<ins id="I4"><p>Department of Pathology, University of Cambridge, Cambridge, UK</p></ins>
<ins id="I5"><p>Research School of Biology, The Australian National University, Canberra, ACT, Australia</p></ins>
<ins id="I6"><p>School of Zoology, University of Tasmania, Hobart, TAS, Australia</p></ins>
<ins id="I7"><p>Faculty of Veterinary Science, University of Sydney, RMC Gunn B19, Sydney, NSW 2006, Australia</p></ins>
</insg>
<source>BMC Genomics</source>
<section><title><p>Non-human and non-rodent vertebrate genomics</p></title></section>
<issn>1471-2164</issn>
<pubdate>2012</pubdate>
<volume>13</volume>
<issue>1</issue>
<fpage>87</fpage>
<url>http://www.biomedcentral.com/1471-2164/13/87</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-13-87</pubid><pubid idtype="pmpid">22404855</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>1</day><month>8</month><year>2011</year></date></rec><acc><date><day>12</day><month>3</month><year>2012</year></date></acc><pub><date><day>12</day><month>3</month><year>2012</year></date></pub></history>
<cpyrt><year>2012</year><collab>Cheng et al; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<kwdg>
<kwd>MHC</kwd>
<kwd>Tasmanian devil</kwd>
<kwd>Copy number variation</kwd>
<kwd>Devil facial tumour disease</kwd>
</kwdg>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>The Tasmanian devil (<it>Sarcophilus harrisii</it>) is currently under threat of extinction due to an unusual fatal contagious cancer called Devil Facial Tumour Disease (DFTD). DFTD is caused by a clonal tumour cell line that is transmitted between unrelated individuals as an allograft without triggering immune rejection due to low levels of Major Histocompatibility Complex (MHC) diversity in Tasmanian devils.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>Here we report the characterization of the genomic regions encompassing MHC Class I and Class II genes in the Tasmanian devil. Four genomic regions approximately 960 kb in length were assembled and annotated using BAC contigs and physically mapped to devil Chromosome 4q. 34 genes and pseudogenes were identified, including five Class I and four Class II loci. Interestingly, when two haplotypes from two individuals were compared, three genomic copy number variants with sizes ranging from 1.6 to 17 kb were observed within the classical Class I gene region. One deletion is particularly important as it turns a Class Ia gene into a pseudogene in one of the haplotypes. This deletion explains the previously observed variation in the Class I allelic number between individuals. The frequency of this deletion is highest in the northwestern devil population and lowest in southeastern areas.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>The third sequenced marsupial MHC provides insights into the evolution of this dynamic genomic region among the diverse marsupial species. The two sequenced devil MHC haplotypes revealed three copy number variations that are likely to significantly affect immune response and suggest that future work should focus on the role of copy number variations in disease susceptibility in this species.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>The Tasmanian devil (<it>Sarcophilus harrisii</it>), an endemic species on the island state of Tasmania, Australia, is the largest remaining carnivorous marsupial in the world <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>. Tasmanian devils ("devils" for short) were found on mainland Australia up to 3,000 to 4,000 years ago <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>. The Tasmanian population has been isolated for over 12,000 years and has undergone two population crashes, due to the current disease epidemic and in around 1900 <abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>. As a result, the devil population has an overall low level of genetic diversity <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp>. Currently, the devil faces extinction due to the emergence of a fatal contagious cancer - Devil Facial Tumour Disease (DFTD). DFTD was first detected in 1996 at Mt William National Park in the northeast of Tasmania (Figure <figr fid="F1">1</figr>) <abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>. Since then, it has rapidly spread south and westwards to over 85% of the original devil distributional range and caused severe population declines <abbrgrp>
<abbr bid="B5">5</abbr>
</abbrgrp>. Research has revealed that DFTD is a clonal cell line, likely of Schwann cell origin, which is transmitted between individuals as an allograft by cellular inoculation (e.g. biting) <abbrgrp>
<abbr bid="B6">6</abbr>
<abbr bid="B7">7</abbr>
</abbrgrp>. The "grafted" tumour cells can overcome immunological barriers of the host and adapt in unrelated individuals without inducing immune rejection <abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp>. The low level of genetic diversity at the Major Histocompatibility Complex (MHC) is believed to have contributed to this process <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Map of Tasmania showing the sampling sites, location of the first citing of DFTD in 1996 (Mt William), and the present location of the disease front</p></caption><text>
   <p><b>Map of Tasmania showing the sampling sites, location of the first citing of DFTD in 1996 (Mt William), and the present location of the disease front</b>. The black segment in the pie charts shows the proportion of individuals with a deletion at <it>Saha-UA</it>, and the grey segment shows the proportion with intact <it>Saha-UA </it>gene.</p>
</text><graphic file="1471-2164-13-87-1" hint_layout="single"/></fig>
<p>The MHC is one of the most studied gene regions in vertebrates due to its critical roles in disease resistance and transplantation success. MHC genes have been cloned and characterized from representatives of all vertebrate classes except agnathan fish <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>. Based on the structure and function of their encoded proteins, MHC genes are grouped into three classes (Class I, II and III) <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>. Class I and II genes are further classified by function as classical and nonclassical, with classical Class I and II genes encoding cell surface molecules that present antigens to T lymphocytes <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>. The MHC Class I genes are encoded by an &#945; chain, which associates with a &#946;<sub>2</sub>-microglobulin chain to become a functional Class I molecule <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>. Classical Class I (Class Ia) molecules are ubiquitously expressed in all tissue types and function in the recognition and destruction of foreign, virus-infected or malignant cells by cytotoxic T cells <abbrgrp>
<abbr bid="B12">12</abbr>
</abbrgrp>. Nonclassical Class I (Class Ib) genes encode Class I-like molecules with varied functions, and generally exhibit lower expression levels, tissue specific expression and/or lower levels of polymorphism <abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>. MHC Class II molecules are heterodimers of an &#945; chain and a &#946; chain, both of which are encoded in the MHC. They are expressed in cells participating in immune responses such as B lymphocytes, dendritic cells and macrophages. These molecules present antigens derived from intravesicular and extracellular pathogens to CD4 helper T cells, which release signals to trigger antibody production and inflammatory responses that kill the pathogens <abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp>. In most species, there are multiple closely related MHC gene paralogues, which result from gene duplication events during the evolution of MHC <abbrgrp>
<abbr bid="B15">15</abbr>
<abbr bid="B16">16</abbr>
</abbrgrp>. Due to selective pressures from ever-changing pathogens in the environment, these antigen-presenting MHC genes evolve rapidly and are usually highly polymorphic in their peptide-binding regions (exon 2 and 3 of Class I genes; exon 2 of Class II genes), enabling the immune system to recognize an extensive range of pathogenic antigens <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>.</p>
<p>Previously, we isolated and characterized devil MHC Class I and Class II &#946; chain transcripts <abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp> and demonstrated low levels of genetic variation in these sequences, especially in eastern Tasmania <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>. In the northwest, a slightly higher level of diversity was observed using single-strand conformation polymorphism (SSCP) typing, while sequencing revealed variation in Class I allelic numbers in different individuals <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>, but due to the inherent difficulties in studying the MHC in non-model species, we were not able to rule out null alleles or experimental artefacts. Here we describe the MHC Class I and Class II regions of the Tasmanian devil based on the construction of BAC contigs. We confirm that variation in the number of Class I genes occurs due to a deletion within a Class Ia locus.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<sec>
<st>
<p>Characterization of MHC regions</p>
</st>
<p>The MHC regions of two individuals were characterized. Cedric, whose parents came from DFTD-free northwestern Tasmania (father from Arthur River and mother from Woolnorth; Figure <figr fid="F1">1</figr>), produced an antibody response when injected with irradiated DFTD cells (Alex Kreiss, University of Tasmania, personal communication) and at the beginning of the project was thought to be a "resistant" animal because he did not develop DFTD following a disease challenge. Since then Cedric has succumbed to DFTD, but remains one of very few animals to be able to mount an antibody response to DFTD. The other individual, named Spirit, was originally from Bangor (Figure <figr fid="F1">1</figr>) and was euthanized in 2008 due to multiple DFTD lesions and metastases to the lungs.</p>
<p>Four genomic regions of approximately 960 kb in length were assembled and annotated based on ten fully sequenced BAC clones (seven from Cedric and three from Spirit) (Figure <figr fid="F2">2</figr>). 34 genes and pseudogenes were predicted, comprising five Class I genes, four Class II genes, four antigen-processing genes, eight Class III genes and thirteen other genes and pseudogenes. GenBank accession numbers of the BAC clones and coordinates of the predicted genes are shown in Table <tblr tid="T1">1</tblr>. The annotated genes were mostly named after their orthologues in the human with the exceptions of MHC Class I and II genes, which were given species-specific names based on the nomenclature proposed by Klein and colleagues <abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp> and their evolutionary relationships with previously characterized marsupial genes <abbrgrp>
<abbr bid="B21">21</abbr>
<abbr bid="B22">22</abbr>
</abbrgrp>.</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Schematic diagram of Tasmanian devil genomic regions containing MHC genes</p></caption><text>
   <p><b>Schematic diagram of Tasmanian devil genomic regions containing MHC genes</b>. Arrows represent annotated genes. BAC clones used for sequence assembly are indicated by lines below the annotation. For the Class I Region 1, alignment of two haplotypes is shown, as significant differences are found between them. The three gaps within haplotype 2 (and BAC C158O23 and C161A10) represent three segments that are not present in this haplotype as compared to haplotype 1. Asterisks indicate positions of 12 putative LINE fragments.</p>
</text><graphic file="1471-2164-13-87-2" hint_layout="double"/></fig>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>List of annotated BAC clones.</p></caption><tblbdy cols="6">
      <r>
         <c ca="left">
            <p>
               <b>BAC</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Accession</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Start</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>End</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Strand</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Description</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C22M16</p>
         </c>
         <c ca="left">
            <p>FQ482140</p>
         </c>
         <c ca="left">
            <p>24985</p>
         </c>
         <c ca="left">
            <p>22341</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Proteasome subunit, beta type, 8</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>42720</p>
         </c>
         <c ca="left">
            <p>30519</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Transporter 1, ATP-binding cassette, sub-family B</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>43296</p>
         </c>
         <c ca="left">
            <p>46824</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>Proteasome subunit, beta type, 9</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>67202</p>
         </c>
         <c ca="left">
            <p>64208</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>MHC Class I antigen Saha-UK</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>98510</p>
         </c>
         <c ca="left">
            <p>88331</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Transporter 2, ATP-binding cassette, sub-family B</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>125729</p>
         </c>
         <c ca="left">
            <p>116120</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Peptidase inhibitor 16-like</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C158O23</p>
         </c>
         <c ca="left">
            <p>FQ482137</p>
         </c>
         <c ca="left">
            <p>98067</p>
         </c>
         <c ca="left">
            <p>99377</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>MHC Class I pseudogene</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C161A10</p>
         </c>
         <c ca="left">
            <p>FQ482138</p>
         </c>
         <c ca="left">
            <p>97718</p>
         </c>
         <c ca="left">
            <p>77164</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Mitochondrial carrier homolog 1</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>122296</p>
         </c>
         <c ca="left">
            <p>155379</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>FYVE, RhoGEF and PH domain containing 2</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>S211H13</p>
         </c>
         <c ca="left">
            <p>FQ482144</p>
         </c>
         <c ca="left">
            <p>67289</p>
         </c>
         <c ca="left">
            <p>70233</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>MHC Class I antigen Saha-UA</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>118698</p>
         </c>
         <c ca="left">
            <p>121642</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>MHC Class I antigen Saha-UB</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>S421K17</p>
         </c>
         <c ca="left">
            <p>FQ482146</p>
         </c>
         <c ca="left">
            <p>67403</p>
         </c>
         <c ca="left">
            <p>64222</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Transporter 2, ATP-binding cassette, sub-family B pseudogene</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>36094</p>
         </c>
         <c ca="left">
            <p>33148</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>MHC Class I antigen Saha-UC</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C491I11</p>
         </c>
         <c ca="left">
            <p>FQ482142</p>
         </c>
         <c ca="left">
            <p>8066</p>
         </c>
         <c ca="left">
            <p>18154</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>Ubiquitin specific peptidase 49</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>26396</p>
         </c>
         <c ca="left">
            <p>24778</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Translocase of outer mitochondrial membrane 6</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>32637</p>
         </c>
         <c ca="left">
            <p>30374</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Prickle homolog 4 (Drosophila)</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>53046</p>
         </c>
         <c ca="left">
            <p>56295</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>MHC Class I antigen Saha-UD</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>70180</p>
         </c>
         <c ca="left">
            <p>88114</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>Fibroblast growth factor receptor substrate 3</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>105924</p>
         </c>
         <c ca="left">
            <p>114685</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>Progastricsin (pepsinogen C)</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>S50J3</p>
         </c>
         <c ca="left">
            <p>FQ482147</p>
         </c>
         <c ca="left">
            <p>154766</p>
         </c>
         <c ca="left">
            <p>172999</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>Progastricsin (pepsinogen C) pseudogene</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C120J14</p>
         </c>
         <c ca="left">
            <p>FQ790236</p>
         </c>
         <c ca="left">
            <p>47684</p>
         </c>
         <c ca="left">
            <p>38237</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>MHC Class II DA &#946; chain 1</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>119765</p>
         </c>
         <c ca="left">
            <p>110505</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>MHC Class II DA &#946; chain 2</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C105P19</p>
         </c>
         <c ca="left">
            <p>FQ790235</p>
         </c>
         <c ca="left">
            <p>83441</p>
         </c>
         <c ca="left">
            <p>74884</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>MHC Class II DA &#946; chain 3</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>148648</p>
         </c>
         <c ca="left">
            <p>135902</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Guanylate cyclase activator 1A</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C12G5</p>
         </c>
         <c ca="left">
            <p>FQ790241</p>
         </c>
         <c ca="left">
            <p>13651</p>
         </c>
         <c ca="left">
            <p>9537</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>MHC Class II DA &#945; chain</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>60582</p>
         </c>
         <c ca="left">
            <p>59696</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Protein EURL homolog</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>73613</p>
         </c>
         <c ca="left">
            <p>71308</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Vesicle-associated membrane protein 4-like</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>84027</p>
         </c>
         <c ca="left">
            <p>87750</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>NOTCH4 pseudogene</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>108416</p>
         </c>
         <c ca="left">
            <p>112090</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>Pre-B-cell leukemia homeobox 2</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>114221</p>
         </c>
         <c ca="left">
            <p>117186</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>Advanced glycosylation end product-specific receptor</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>127655</p>
         </c>
         <c ca="left">
            <p>131100</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>1-acylglycerol-3-phosphate O-acyltransferase 1</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>135753</p>
         </c>
         <c ca="left">
            <p>133198</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Epidermal growth factor-like protein 8</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>141212</p>
         </c>
         <c ca="left">
            <p>138220</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Palmitoyl-protein thioesterase 2</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>148877</p>
         </c>
         <c ca="left">
            <p>150447</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>Proline-rich transmembrane protein 1</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>Genes in the BAC overlapping regions are only shown once. Letters C and S in BAC clone names indicate Cedric's and Spirit's library, respectively</p>
   </tblfn></tbl>
<p>Two of the analysed regions contain Class I genes (Region 1 and 2) and the other two contain Class II (Region 3 and 4) (Figure <figr fid="F2">2</figr>). All four regions were physically mapped to the same area on devil Chromosome 4q, indicating the genomic location of the devil MHC (see Figure <figr fid="F3">3</figr> for Region 1 and 3, and Additional file <supplr sid="S1">1</supplr>: Figure S1 for Region 2 and 4). Based on comparisons to the MHC of the grey short-tailed opossum (<it>Monodelphis domestica</it>; herein referred to as "opossum") and the tammar wallaby (<it>Macropus eugenii</it>), Region 3 is likely located between Region 1 and Region 4, which encompasses a suite of well-conserved Class III genes (Figure <figr fid="F4">4</figr>). In Region 2, Class I <it>Saha-UD </it>is flanked by five non-MHC genes <it>USP49, TOMM6, PRICKLE4, FRS3 </it>and <it>PGC</it>, which are found ~12 MB proximal from the MHC in human and ~16 Mb distal from the MHC in opossum; however, according to FISH mapping result, these genes are closer to the MHC in the devil than in opossum (Additional file <supplr sid="S1">1</supplr>: Figure S1).</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>FISH image showing genomic locations of Tasmanian devil MHC Class I and II genes</p></caption><text>
   <p><b>FISH image showing genomic locations of Tasmanian devil MHC Class I and II genes</b>.</p>
</text><graphic file="1471-2164-13-87-3" hint_layout="double"/></fig>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>Figure S1 FISH image showing genomic locations of BAC clone C12G5 and C491I11</b>. The individual used in this slide has a balanced translocation between Chr.1 and Chr.3.</p>
</text>
<file name="1471-2164-13-87-S1.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Comparison of MHC regions containing Class I or II genes between Tasmanian devil, grey short-tailed opossum <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> and tammar wallaby <abbrgrp><abbr bid="B35">35</abbr></abbrgrp></p></caption><text>
   <p><b>Comparison of MHC regions containing Class I or II genes between Tasmanian devil, grey short-tailed opossum </b><abbrgrp><abbr bid="B21">21</abbr></abbrgrp><b>and tammar wallaby </b><abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Class I, II and antigen-processing genes are represented with red, blue and purple arrows, respectively. Class II regions of tammar wallaby are not shown due to high complexity.</p>
</text><graphic file="1471-2164-13-87-4" hint_layout="double"/></fig>
</sec>
<sec>
<st>
<p>Class I genes</p>
</st>
<p>A total of five Class I genes were identified and named <it>Saha-UA</it>, <it>UB</it>, <it>UC</it>, <it>UD </it>and <it>UK</it>. Except <it>Saha-UD</it>, they are all closely linked (Region 1 in Figure <figr fid="F2">2</figr>) and are localised with <it>PSMB8</it>, <it>PSMB9</it>, <it>TAP1 </it>and <it>TAP2 </it>genes, which are involved in antigen processing and transportation. <it>Saha-UA</it>, <it>UB</it>, <it>UC </it>and <it>UD </it>have evolved from recent gene duplications in the devil lineage (shown in Figure <figr fid="F5">5</figr>) and are not orthologous to Class I genes in the opossum or the tammar wallaby. <it>Saha-UK </it>is orthologous to <it>Modo-UK </it>in the opossum and <it>Maeu-UK </it>in the tammar wallaby (Figure <figr fid="F5">5</figr>), both of which have been suggested to be nonclassical genes <abbrgrp>
<abbr bid="B22">22</abbr>
<abbr bid="B23">23</abbr>
</abbrgrp>.</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Phylogenetic analysis of Tasmanian devil MHC Class I sequences variants</p></caption><text>
   <p><b>Phylogenetic analysis of Tasmanian devil MHC Class I sequences variants</b>. The phylogenetic relationship was inferred using the Neighbour-Joining method <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. The percentage of replicate trees in which the associated sequences clustered together in the bootstrap test (1,000 replicates) are displayed next to the branches, indicating the level of reliability of the phylogeny <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. Bootstrap frequencies lower than 50% are not shown. Phylogenetic analysis was conducted in MEGA5 <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. <it>Modo-UK </it>[GenBank:<ext-link ext-link-id="EU886686" ext-link-type="gen">EU886686</ext-link>] and <it>Maeu-UK </it>[GenBank:<ext-link ext-link-id="CU463018" ext-link-type="gen">CU463018</ext-link>] are orthologous <it>UK </it>genes in the grey short-tailed opossum and tammar wallaby. <it>Modo-UA </it>[GenBank:<ext-link ext-link-id="DQ067089" ext-link-type="gen">DQ067089</ext-link>], <it>UB </it>[GenBank:<ext-link ext-link-id="NM_001079820" ext-link-type="gen">NM_001079820</ext-link>] and <it>UC </it>[GenBank:<ext-link ext-link-id="NM_001079819" ext-link-type="gen">NM_001079819</ext-link>] are Class I genes in the opossum.</p>
</text><graphic file="1471-2164-13-87-5" hint_layout="double"/></fig>
<p>The genomic data allowed us to assign MHC alleles to loci using phylogenetics (Figure <figr fid="F5">5</figr>, Additional file <supplr sid="S2">2</supplr>: Figure S2). Table <tblr tid="T2">2</tblr> describes the alleles sequenced on each of the two haplotypes, and by deduction the alleles found on the un-sequenced haplotypes of Cedric and Spirit. We were able to resolve that previously isolated devil Class I sequence variants <it>SahaI*27 </it>and <it>28</it>, <it>35</it>, <it>46</it>, and <it>32 </it>belong to locus <it>Saha-UC</it>, <it>UA</it>, <it>UB </it>and <it>UD</it>, respectively. Meanwhile, five novel alleles, named <it>SahaI*86 - 90</it>, were found and assigned to genes. Alleles from <it>Saha-UD</it>, which were previously called group 2 alleles <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>, are divergent from other genes and form a discrete phylogenetic clade with 94% bootstrap support. <it>Saha-UA</it>, <it>UB </it>and <it>UC </it>are closely related, with <it>UA </it>and <it>UB </it>alleles interspersed within the same clade. At nucleotide level, the sequence identity between these three genes is extremely high (&gt; 97.7%), with <it>Saha-UA </it>and <it>UB </it>sharing up to 99% nucleotide identity in their introns (Table <tblr tid="T3">3</tblr>).</p>
<suppl id="S2">
<title>
<p>Additional file 2</p>
</title>
<text>
<p>
<b>Figure S2 Alignment of sequences used to generate Figure </b>
<figr fid="F5">5</figr>
<b>, including all Tasmanian devil Class I alleles identified so far</b>.</p>
</text>
<file name="1471-2164-13-87-S2.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<tbl id="T2"><title><p>Table 2</p></title><caption><p>MHC Class I haplotypes of two model Tasmanian devils - Cedric and Spirit</p></caption><tblbdy cols="5">
      <r>
         <c ca="left">
            <p>
               <b>Class Ia loci</b>
            </p>
         </c>
         <c cspan="2" ca="center">
            <p>
               <b>Cedric's haplotypes </b>
            </p>
         </c>
         <c cspan="2" ca="center">
            <p>
               <b>Spirit's haplotypes </b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>a (BAC sequenced)</p>
         </c>
         <c ca="center">
            <p>b</p>
         </c>
         <c ca="center">
            <p>a (BAC sequenced)</p>
         </c>
         <c ca="center">
            <p>b</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Saha-UA</it>
            </p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>SahaI*35</p>
         </c>
         <c ca="left">
            <p>SahaI*35</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Saha-UB</it>
            </p>
         </c>
         <c ca="left">
            <p>SahaI*86</p>
         </c>
         <c ca="left">
            <p>SahaI*87</p>
         </c>
         <c ca="left">
            <p>SahaI*46</p>
         </c>
         <c ca="left">
            <p>SahaI*90</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Saha-UC</it>
            </p>
         </c>
         <c ca="left">
            <p>SahaI*88</p>
         </c>
         <c ca="left">
            <p>SahaI*89</p>
         </c>
         <c ca="left">
            <p>SahaI*28</p>
         </c>
         <c ca="left">
            <p>SahaI*27</p>
         </c>
      </r>
   </tblbdy></tbl>
<tbl id="T3"><title><p>Table 3</p></title><caption><p>Comparison of Tasmanian devil MHC Class Ia genes.</p></caption><tblbdy cols="5">
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="3" ca="center">
            <p>
               <b>Sequence identity</b>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Gene</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Length (bp)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>
                  <it>Saha-UA</it>
               </b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>
                  <it>Saha-UB</it>
               </b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>
                  <it>Saha-UC</it>
               </b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Saha-UA</it>
            </p>
         </c>
         <c ca="left">
            <p>2921</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>99.0%</p>
         </c>
         <c ca="left">
            <p>97.8%</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Saha-UB</it>
            </p>
         </c>
         <c ca="left">
            <p>2921</p>
         </c>
         <c ca="left">
            <p>98.9%</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>97.7%</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Saha-UC</it>
            </p>
         </c>
         <c ca="left">
            <p>2923</p>
         </c>
         <c ca="left">
            <p>98.3%</p>
         </c>
         <c ca="left">
            <p>98.3%</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>Sequence identity in exons (shown below the diagonal) and introns (shown above the diagonal) are calculated separately</p>
   </tblfn></tbl>
<tbl id="T4"><title><p>Table 4</p></title><caption><p>Comparison of Tasmanian devil MHC DAB genes.</p></caption><tblbdy cols="5">
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <b>Sequence identity</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Gene</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Length (bp)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>
                  <it>Saha-DAB1</it>
               </b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>
                  <it>Saha-DAB2</it>
               </b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>
                  <it>Saha-DAB3</it>
               </b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Saha-DAB1</it>
            </p>
         </c>
         <c ca="left">
            <p>9448</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>95.7%</p>
         </c>
         <c ca="left">
            <p>61.9%</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Saha-DAB2</it>
            </p>
         </c>
         <c ca="left">
            <p>9261</p>
         </c>
         <c ca="left">
            <p>96.9%</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>62.8%</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Saha-DAB3</it>
            </p>
         </c>
         <c ca="left">
            <p>8558</p>
         </c>
         <c ca="left">
            <p>95.3%</p>
         </c>
         <c ca="left">
            <p>96.1%</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>Sequence identity in exons (shown below the diagonal) and introns (shown above the diagonal) are calculated separately Sequence identity</p>
   </tblfn></tbl>
<p>The sequence 200 bp upstream of the translation start sites was searched for gene regulatory elements, as promoters of MHC Class I genes in mammals are mostly contained within this range <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp>. In <it>Saha-UA</it>, <it>UB </it>and <it>UC</it>, this region is highly similar, with only one single nucleotide variation in the TATA sites. Putative sites of an enhancer A, an interferon stimulated response element (ISRE), the S-X-Y motifs, CAAT and TATA boxes were identified (Figure <figr fid="F6">6</figr>). <it>Saha-UD </it>was not included in the alignment in Figure <figr fid="F6">6</figr> due to low sequence identity. Compared to <it>Saha-UA</it>, <it>UB </it>and <it>UC</it>, regulatory elements including enhancers, ISRE, S and X motifs are not conserved at this locus, which may indicate different expression or function of this gene.</p>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>Putative promoter elements of Tasmanian devil MHC Class Ia genes</p></caption><text>
   <p><b>Putative promoter elements of Tasmanian devil MHC Class Ia genes</b>. The boxed sequences indicate putative sites of an enhancer A, an interferon stimulated response element (ISRE), the S-X-Y motifs and the CAAT and TATA boxes.</p>
</text><graphic file="1471-2164-13-87-6" hint_layout="single"/></fig>
</sec>
<sec>
<st>
<p>Comparison of two MHC class I haplotypes</p>
</st>
<p>The MHC region containing the Class Ia genes in Cedric and Spirit (Region 1 in Figure <figr fid="F2">2</figr>) contains three indels. Spirit's haplotype has an intact <it>Saha-UA </it>gene, whereas Cedric's has a deletion (~1646 bp in size) that results in the loss of a large portion (from exon 2 to intron 5) of <it>Saha-UA </it>and renders it a pseudogene. Furthermore, Cedric's haplotype also lacks two other long segments that are present in Spirit's, one (~12230 bp) lying between the pseudogene and <it>Saha-UB</it>, and the other (~16970 bp) between <it>Saha-UB </it>and <it>Saha-UC</it>. Further investigation on this genomic region revealed 12 putative Long INterspersed Element (LINE) segments interspersing the MHC genes (Figure <figr fid="F2">2</figr>), nine of which closely flank (within 2 kb) the indels and <it>Saha-UA</it>, <it>UB </it>and <it>UC</it>. Two of the putative LINEs are located inside the indels and only found in Spirit's haplotype.</p>
<p>A PCR test was developed to detect the presence or absence of the deletion in <it>Saha</it>-<it>UA </it>in devils. Haplotypes without the deletion generate a single band of 1917 bp in size, which contains amplicons from <it>Saha-UA</it>, <it>UB </it>and <it>UC </it>genes. A second band (271 bp) is generated when a haplotype contains the deletion in <it>Saha-UA</it>, representing an amplicon from only the pseudogene, which has arisen due to the deletion (Figure <figr fid="F7">7</figr>).</p>
<fig id="F7"><title><p>Figure 7</p></title><caption><p>Gel image showing two types of Tasmanian devil MHC Class I haplotypes</p></caption><text>
   <p><b>Gel image showing two types of Tasmanian devil MHC Class I haplotypes</b>. Gel lanes: A - Spirit's type with intact Class I gene <it>Saha-UA</it>; B - Cedric's type with a deletion in <it>Saha-UA</it>; C - negative control; D - HyperLadder I (Bioline).</p>
</text><graphic file="1471-2164-13-87-7" hint_layout="single"/></fig>
</sec>
<sec>
<st>
<p>Population study</p>
</st>
<p>The frequency of the <it>Saha-UA </it>deletion was assessed in samples from seven geographic areas across Tasmania (see pie charts in Figure <figr fid="F1">1</figr>). 72 individuals were investigated: 25 from DFTD-affected areas in eastern Tasmania (five from each of St Marys, Narawntapu, Bronte Park, Buckland and Fentonbury); 12 from West Pencil Pine, an area on the disease front; and 35 from the DFTD-free northwestern coast region. The deletion is most prevalent in the northwest (found in 54.8% of tested individuals), followed by West Pencil Pine (41.7%), Bronte Park (40%), Narawntapu (40%) and St Marys (20%). None of individuals from Fentonbury and Buckland contained the deletion.</p>
</sec>
<sec>
<st>
<p>Class II genes</p>
</st>
<p>Four Class II genes were identified, all belonging to the marsupial Class II <it>DA </it>gene family. Previously isolated DAB transcripts were aligned against the three &#946; chain paralogues and 100% match was found between transcripts SahaDAB*01 [GenBank:<ext-link ext-link-id="EF591102" ext-link-type="gen">EF591102</ext-link>], SahaDAB*03 [GenBank:<ext-link ext-link-id="EF591104" ext-link-type="gen">EF591104</ext-link>] and SahaDAB*05 [GenBank:<ext-link ext-link-id="EF591105" ext-link-type="gen">EF591105</ext-link>] and genes <it>Saha-DAB1</it>, <it>DAB2 </it>and <it>DAB3</it>, respectively. Sequence comparison of these three genes is shown in Table <tblr tid="T4">4</tblr>. <it>Saha-DAB1 </it>and <it>DAB2 </it>share high nucleotide identity in both exons (96.9%) and introns (95.7%). They are both similar to <it>Saha-DAB3 </it>in exon sequences, but differ from it significantly in the introns due to multiple deletions/insertions of nucleotide fragments. In the 5' untranslated region, putative sites of S-X-Y motifs are identified in both &#945; and &#946; chain genes, though TATA and CAAT elements are not found in these genes (Figure <figr fid="F8">8</figr>).</p>
<fig id="F8"><title><p>Figure 8</p></title><caption><p>Upstream sequences of Tasmanian devil MHC Class II &#946; chain and &#945; chain genes</p></caption><text>
   <p><b>Upstream sequences of Tasmanian devil MHC Class II &#946; chain and &#945; chain genes</b>. The boxed sequences indicate putative sites of S-X-Y motifs.</p>
</text><graphic file="1471-2164-13-87-8" hint_layout="single"/></fig>
</sec>
</sec>
<sec>
<st>
<p>Discussion</p>
</st>
<sec>
<st>
<p>Function of MHC genes</p>
</st>
<p>Based on the genomic characterization of the MHC genes, as well as expression and diversity studies, we are now in the position to assign previously identified MHC alleles to loci and make informed deductions about the likely role of these genes in immune response.</p>
<p>Four Class II loci are described in this study, <it>Saha-DAA</it>, <it>DAB1</it>, <it>DAB2 </it>and <it>DAB3</it>. Transcripts from the three &#946; chain genes have previously been described <abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp>. These loci likely encode functional antigen-presenting molecules based on homology to other members of the <it>DA </it>gene family, which have been studied in a variety of marsupial species [reviewed in <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp>]. No other Class II gene families have been identified in the devil, indicating that either the devil only has one functional Class II gene family or the other genes are too divergent from known marsupial Class II genes to be detected by the probes.</p>
<p>Amongst the five characterized Class I loci, we propose that <it>Saha-UA</it>, <it>UB </it>and <it>UC </it>are functional Class Ia genes on the basis of three facts. First, their transcripts have been amplified from all tissue types examined so far, including blood, spleen, skin, liver, kidney and DFTD cells <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B18">18</abbr>
</abbrgrp>. Second, genetic variation, though the level is not high, is present at these loci, particularly in the putative peptide-binding regions (Figure <figr fid="F9">9</figr>). Third, promoter elements in the 5' untranslated region are well conserved between these genes and the Class Ia genes in the opossum and the tammar wallaby <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>, sharing 91.8% and 83.6% nucleotide identity respectively. Whether the single nucleotide variation in TATA box affects the expression level of <it>Saha-UA </it>remains uncertain. The number of Class Ia loci in the devil is comparable to those in the opossum (one) and the tammar wallaby (three).</p>
<fig id="F9"><title><p>Figure 9</p></title><caption><p>Amino acid alignment of partial &#945;1 domain of <it>Saha-UA, UB </it>and <it>UC </it>alleles</p></caption><text>
   <p><b>Amino acid alignment of partial &#945;1 domain of <it>Saha-UA, UB </it>and <it>UC </it>alleles</b>. Asterisks indicate putative peptide-binding sites in a Class I molecule <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>.</p>
</text><graphic file="1471-2164-13-87-9" hint_layout="single"/></fig>
<p>
<it>Saha-UK </it>has been found transcribed in the blood and spleen and is likely a Class Ib gene based on orthology with the Class Ib <it>Modo-UK </it>in the opossum and <it>Maeu-UK </it>in the tammar wallaby. It has been suggested that the high conservation of this gene over extended evolutionary periods indicates that it may serve a critical, marsupial-specific function <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>.</p>
<p>
<it>Saha-UD</it>, which has been found transcribed in the blood, spleen and DFTD cells, shows features of a Class Ib gene. This locus has significantly lower levels of polymorphism than <it>Saha-UA</it>, <it>UB </it>and <it>UC</it>, and its 5' regulatory elements are divergent. Within the &#945;1 domain, <it>Saha-UD </it>alleles share extremely high sequence identity (&gt; 97.7%) and only one of the 15 putative peptide-binding amino acid residues is polymorphic (Figure <figr fid="F10">10</figr>). Elucidation of the functional role of <it>Saha-UD </it>in the devil remains to be determined.</p>
<fig id="F10"><title><p>Figure 10</p></title><caption><p>Amino acid alignment of partial &#945;1 domain of <it>Saha-UD </it>alleles</p></caption><text>
   <p><b>Amino acid alignment of partial &#945;1 domain of <it>Saha-UD </it>alleles</b>. Asterisks indicate putative peptide-binding sites in a Class I molecule <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>.</p>
</text><graphic file="1471-2164-13-87-10" hint_layout="single"/></fig>
<p>Genomic characterization allowed us to assign previously characterized "group 1" alleles to Class Ia loci <it>Saha-UA</it>, <it>UB </it>and <it>UC</it>, and "group 2" alleles to <it>Saha-UD </it>(<abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>, Figure <figr fid="F5">5</figr>). The previous description of individuals containing only "group 1" or "group 2" alleles <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp> was likely due to experimental artefact caused by nucleotide mismatches (two in <it>Saha-UA</it>, <it>UB </it>and <it>UC</it>; four in <it>Saha-UD</it>) within the PCR primer hybridisation sites. Here, we propose that Class Ia loci <it>Saha-UB </it>and <it>UC </it>are likely present in all devils, whereas <it>Saha-UA </it>is lost in certain haplotypes. We suggest that in future studies new primers should be designed for <it>Saha-UD </it>and the three Class Ia genes separately to ensure high primer efficiency.</p>
</sec>
<sec>
<st>
<p>Copy number variations (CNVs) in the MHC</p>
</st>
<p>A genomic CNV is a duplication or deletion of a genomic segment larger than 1 kb in size <abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>. In addition to single nucleotide polymorphisms, CNVs represent a major class of genetic variation and are widespread in the human genome <abbrgrp>
<abbr bid="B27">27</abbr>
<abbr bid="B28">28</abbr>
</abbrgrp>. Large-scale genome-wide disease association studies have revealed a number of CNVs in the HLA that are associated with Crohn's disease, rheumatoid arthritis and type 1 diabetes <abbrgrp>
<abbr bid="B26">26</abbr>
<abbr bid="B29">29</abbr>
</abbrgrp>.</p>
<p>When we first proposed the hypothesis of "Class I gene copy number variation" in the devil, we were referring to the concept of CNV in a narrow sense, namely variation in the number of Class I loci <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>. With the new findings from this study, it is necessary to expand the concept to take into account other CNVs of non-coding sequences lying within the devil MHC.</p>
<p>Three genomic copy number variants that intersperse with the MHC Class Ia genes were identified between Cedric's and Spirit's haplotypes. The one within <it>Saha-UA </it>is convincingly a deletion, while the other two are difficult to classify as either duplications or deletions. The loss of functional Class I genes due to CNVs is not specific to the devil. A case has been reported for a human haplotype where a 4 kb deletion turns <it>HLA-A </it>into a pseudogene <abbrgrp>
<abbr bid="B30">30</abbr>
</abbrgrp>. The mechanisms underlying the generation of these CNVs in the devil MHC are not clear, though a possible explanation may be implied from the presence of the putative LINE fragments in the affected regions encompassing the three CNVs and <it>Saha-UA</it>, <it>UB </it>and <it>UC </it>genes (Figure <figr fid="F2">2</figr>). These retrotransposons may play a role in causing the CNVs as evidence has been found in primate genomes that LINEs are centrally involved in the generation of CNVs and can mediate deletions up to 18 kb in size <abbrgrp>
<abbr bid="B31">31</abbr>
<abbr bid="B32">32</abbr>
</abbrgrp>.</p>
<p>The higher frequency of the <it>Saha-UA </it>deletion in the northwest may indicate that the eastern and western populations have been exposed to different selective pressures. It is tempting to continue to speculate that the deletion of <it>Saha-UA</it>, which was found in Cedric and occurs in high frequency in the northwest of Tasmania, may provide animals with some level of resilience to DFTD. This is consistent with the observation that the spread of DFTD is slowing down as the disease front reaches genetically disparate populations in the northwest and that the incidence of disease in these populations is much lower than in similar populations in the east <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>. This gene deletion may be advantageous for the following reasons. First, the loss of <it>Saha-UA </it>may increase the MHC antigenic dissimilarity between host and tumour, as a <it>Saha-UA </it>allele, <it>SahaI*35</it>, has been found to be transcribed in the tumour <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>. Second, the CNVs may alter the expression level of the adjacent MHC genes by affecting regulatory elements such as promoters and enhancers as well as inhibitory elements. In fact, it has been found that CNVs can even influence the expression of genes that are up to 1 Mb away <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>. Here we have only looked at two devil MHC haplotypes and have only scratched the surface of the genetic variation that could be present amongst devil genomes. The role of this variation in the varied susceptibility/resistance of devils to DFTD needs to be further investigated.</p>
</sec>
<sec>
<st>
<p>Comparison with the opossum and tammar wallaby MHC</p>
</st>
<p>Previous comparative studies revealed that the MHC of the opossum and the tammar wallaby exhibit distinct features in gene content and organisation <abbrgrp>
<abbr bid="B21">21</abbr>
<abbr bid="B35">35</abbr>
</abbrgrp>. Sequencing of Tasmanian devil MHC regions has provided us a better understanding of the marsupial MHC.</p>
<p>In the tammar wallaby, all Class Ia genes (<it>UA, UB </it>and <it>UC</it>) are un-linked to the MHC region <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>, whereas in the opossum the only confirmed Class Ia gene (<it>UA1</it>) is located within the MHC and close to antigen-processing genes (<abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>, Figure <figr fid="F4">4</figr>). The organisation of Class Ia genes in the devil is similar to that of the opossum, with <it>UA, UB </it>and <it>UC </it>adjacent to <it>TAP1, TAP2, PSMB9 </it>and <it>PSMB8</it>. This indicates that the dispersal of Class I genes across the genome in the tammar wallaby is not a common characteristic of the Australasian marsupial and has occurred after the divergence of devil and tammar wallaby's common ancestor at ~66 million years before present <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>.</p>
<p>Four Class II gene families have been characterized in the opossum - <it>DA, DB, DC </it>and <it>DM </it>
<abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>. <it>DA</it>, <it>DB </it>and <it>DM </it>have also been identified in the tammar wallaby. The Class II genes have undergone large-scale expansion in the tammar wallaby, resulting in up to 10 <it>DAB </it>loci <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>. However, such an expansion of Class II genes is not seen in the devil, with devils having a single <it>DAA </it>and three <it>DAB </it>genes. On the contrary, the devil genome may have undergone gene deletions that have lead to loss of the <it>DB </it>and <it>DC </it>gene families.</p>
<p>In this study, we have focused on characterizing MHC Class Ia and II genes in the devil. Future work will involve characterization of the functional roles of the putative Class Ib genes, as well as genes involved in the antigen processing machinery.</p>
</sec>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>Four Tasmanian devil genomic regions containing five MHC Class I genes and four MHC Class II genes were characterized by BAC based sequencing, which allowed us to assign previously sequenced MHC alleles to loci. We propose that <it>Saha-UA</it>, <it>UB </it>and <it>UC </it>are Class Ia genes, <it>Saha-UK </it>is a transcribed Class Ib gene and the role of <it>Saha-UD </it>remains to be determined. The expression of <it>Saha-UA</it>, <it>UB </it>and <it>UC </it>may be influenced by three genomic CNVs that are found within or adjacent to these loci. Future studies should focus on the role of CNVs in the MHC in susceptibility/resistance of devils to DFTD.</p>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<sec>
<st>
<p>Construction of bacterial artificial chromosome (BAC) libraries</p>
</st>
<p>Two BAC libraries were produced. The first one, designated VMRC-49, was constructed from whole blood genomic DNA of Cedric (three-year-old male). The second library, designated VMRC-50, was constructed from genomic DNA extracted from the liver of Spirit (two-year-old male). The genomes of both animals were recently sequenced <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp>. The genome assembly was not able to provide an accurate picture of the MHC, which contains multiple closely related genes that have arisen from recent gene duplications. Therefore, a BAC clone sequencing project was required.</p>
<p>The BAC libraries were produced in the Genome Resource Center at the Benaroya Research Institute at Virginia Mason, Seattle, USA. Detailed procedures of library construction have been described previously <abbrgrp>
<abbr bid="B38">38</abbr>
<abbr bid="B39">39</abbr>
</abbrgrp>. The quality of the DNA was checked by running a pulsed field gel electrophoresis (PFGE) on a CHEF-DR<sup>&#174; </sup>III system (BioRad). The DNA was partially digested in an <it>EcoRI/EcoRI-methylase </it>competition reaction and size fractionated by analytical PFGE on a BioRad CHEF Mapper<sup>&#174; </sup>XA system. DNA fragments from the appropriate size fraction were ligated into the CopyControl&#8482; pCC1BAC&#8482; vector from Epicentre Technologies and transformed into Invitrogen ElectroMAX&#8482; DH10B&#8482; T1 Phage-Resistant <it>E. coli </it>cells. Transformants were arrayed into 384-well LB/chloramphenicol/glycerin microtiter plates (Genetix) using colony-picking robots (Norgren Systems) and subsequently gridded onto 22 &#215; 22 cm high-density nylon filters with a Total Array System (BioRobotics Ltd.). The average insert size was estimated to be ~140 kb. A total of 672 microtiter plates, which contain 258,048 BAC clones, and 14 high-density filters were produced for library VMRC-49. Assuming the size of the devil genome is similar to that of the opossum genome, which is around 3.6 Gb, this library will represent 10x coverage of the devil genome. For library VMRC-50, 432 plates containing 165,888 BAC clones and nine high-density filters were generated, estimated to represent 6.5x whole genome coverage.</p>
</sec>
<sec>
<st>
<p>Characterization of MHC-positive BAC clones</p>
</st>
<sec>
<st>
<p>MHC probes</p>
</st>
<p>MHC Class I and Class II &#946; chain probes for library screening were designed based on devil cDNA sequences <abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp>. Two Class I probes were used to screen both libraries. The first one was a 274 bp fragment from Class I gene exon 2, which was amplified using PCR primers and conditions described previously <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>. The second Class I probe was a 191 bp fragment from exon 4, amplified using forward primer 5' -CAGTGCCGGGCCCAGGACTTTTA -3' and reverse primer 5' -CCCTCGTGCTGAACTCGGCAGGT -3'. PCR was carried out on devil genomic DNA in a total volume of 25 &#956;l, which contains 1x High Fidelity Buffer (Invitrogen) consisting of 60 mM Tris-HCl (pH 8.9) and 18 mM (NH<sub>4</sub>)<sub>2</sub>SO<sub>4</sub>, 2.5 mM MgSO<sub>4</sub>, 0.2 mM each dNTP, 0.8 &#956;M each primer, and 1.5 U of Platinum <it>Taq </it>DNA Polymerase High Fidelity (Invitrogen). PCR amplifications were performed on a BioRad MJ Mini Personal Thermal Cycler at the following conditions: 100&#176;C hot lid; 94&#176;C initial denaturation for 3 min; 32 cycles of 94&#176;C denaturation for 30 sec, 60&#176;C annealing for 30 sec, 72&#176;C extension for 30 sec; and 72&#176;C final extension for 10 min. Library VMRC-49 was also screened with two Class II probes for &#946; chain and &#945; chain genes. The &#946; chain probe was a 237 bp fragment from devil <it>DAB </it>gene exon 3, amplified with forward primer 5' - AGCCCGAGGTGACTGTGTATC -3' and reverse primer 5'- CGTGGCAGGTGTAGACATCTC -3'. PCR conditions were same as above. The &#945; chain probe was designed from the exon 2 of a tammar wallaby <it>DAA </it>gene ([GenBank:<ext-link ext-link-id="CU464025" ext-link-type="gen">CU464025</ext-link>], position 141572-141757) and amplified from tammar wallaby genomic DNA using the same PCR reagents and conditions as described above. All PCR amplicons were isolated by running a 1.8% agarose gel using HyperLadder IV (Bioline) as size marker, and purified from the gel using MoBio UltraClean 15 DNA Purification Kit.</p>
</sec>
<sec>
<st>
<p>Library screening</p>
</st>
<p>Radioactively labelled probes were synthesized from approximately 50 ng of PCR amplified MHC gene fragments with either [&#945;-<sup>32</sup>P]dCTP or [&#945;-<sup>32</sup>P]dATP (PerkinElmer) using Random Primed DNA Labeling Kit from Roche Applied Science. Unincorporated dNTPs were removed from the probe with Illustra ProbeQuant G-50 Micro Columns from GE Healthcare. Hybridisation of BAC filters with MHC probes was carried out overnight at 60&#176;C in Amersham Rapid-hyb&#8482; Buffer (GE Healthcare). Excess background and non-specific binding was minimized by washing the filters at 60&#176;C with a series of buffers (2x SSC, 2x SSC with 0.1% SDS, 1x SSC with 0.1% SDS, and 0.5x SSC with 0.1% SDS). Amersham Hyperfilm MP (GE Healthcare) was used to visualize positive clones after one to three days of exposure to the filters.</p>
</sec>
<sec>
<st>
<p>BAC clone sequencing</p>
</st>
<p>BAC DNA of the positive clones was purified from 100 ml LB/chloramphenicol bacterial culture using QIAGEN Large-Construct Kit. BACs containing unique MHC Class I or II genes were confirmed by direct end sequencing of BAC DNA with MHC primers, using standard sequencing service at the Australian Genome Research Facility Ltd. Fingerprinting analysis and complete sequencing of MHC-positive BAC clones was conducted at the Wellcome Trust Sanger Institute, Cambridge, UK. Traditional Sanger sequencing method was employed to ensure high assembly accuracy of paralogous MHC genes.</p>
</sec>
<sec>
<st>
<p>BAC sequence annotation</p>
</st>
<p>BAC sequences were aligned with human genomic and transcript sequences, non-human reference RNA sequences, and known or predicted opossum and tammar wallaby transcripts using web-based BLAST programs <abbrgrp>
<abbr bid="B40">40</abbr>
</abbrgrp>. Genes were annotated manually based on the best BLAST hits and in accordance with the recommendations of the Human and vertebrate analysis and annotation guidelines <abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp>. Overlapping BAC sequences were identified and aligned using BLASTN and ClustalW programs <abbrgrp>
<abbr bid="B42">42</abbr>
</abbrgrp>. A wallaby LINE-1 segment [GenBank:<ext-link ext-link-id="DQ275763" ext-link-type="gen">DQ275763</ext-link>] was used to search for putative LINE segments.</p>
</sec>
</sec>
<sec>
<st>
<p>Fluorescent <it>in situ </it>hybridisation (FISH)</p>
</st>
<p>BAC clones containing MHC Class I or II genes were physically mapped to a male devil karyotype following the protocol described previously by Alsop and colleagues <abbrgrp>
<abbr bid="B43">43</abbr>
</abbrgrp>. Approximately 1 mg of BAC DNA was used to produce probes that were labelled by nick translation with either SpectrumOrange dUTP or SpectrumGreen dUTP (Abbott Molecular Inc.). Labelled probes were hybridised overnight to devil chromosomes, which were denatured for 1 min 40 sec. Slides were washed once at 60&#176;C in 0.4x SSC with 0.3% Tween20 for 2 min and then once in 2x SSC with 0.1% Tween20 for 30 sec at room temperature. Chromosomes were counterstained in DAPI and mounted in VECTASHIELD<sup>&#174; </sup>Mounting Medium from Vector Laboratories Inc. A Zeiss Axioplan2 epifluorescence microscope was used to visualize the fluorescent signals. Images of DAPI stained metaphase chromosomes and fluorescent signals were captured using a SPOT RT Monochrome CCD charge-coupled device camera (Diagnostic Instruments Inc.) and merged using IP Lab imaging software (Scanalytics Inc.).</p>
</sec>
<sec>
<st>
<p>Sequencing and analysis of MHC class I alleles</p>
</st>
<p>To test the Class I gene number variation hypothesis, it was necessary to ensure that all Class I loci in both Cedric and Spirit were characterized. To do this, we isolated and sequenced all Class I alleles from both individuals and assigned them to genes. We amplified exon 2 of the Class I genes from genomic DNA using protocols developed by Siddle and colleagues <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp> with modifications in the reverse primer sequence (5'- CTCGCTCTGGTTGTAGTAGCC 3'). Two independent PCRs were performed for each individual. The PCR amplicons were gel-purified and cloned in a pGEM-T Easy Vector (Promega)/JM109 High Efficiency Competent Cells (Promega) cloning system. 32 positive clones were picked for each individual and plasmids were extracted using QIAGEN DirectPrep 96 MiniPrep Kit on a QIAvac Multiwell vacuum manifold (QIAGEN). Plasmids were sequenced with T7 primer at the Australian Genome Research Facility, Sydney, Australia. Sequences were quality-checked using Sequencher 4.1.4 (Gene Codes) and aligned with previous identified devil Class I alleles <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp> in BioEdit 7.0.9 <abbrgrp>
<abbr bid="B44">44</abbr>
</abbrgrp>. To minimize errors yielded during PCR, cloning and sequencing, new sequence variants were determined to be real alleles only if they were found in more than one PCR amplification. Alleles were assigned to genomic loci based on nucleotide sequence similarity. Evolutionary relationships of devil Class I sequences were analysed by constructing a Neighbour-Joining phylogenetic tree with MEGA5 <abbrgrp>
<abbr bid="B45">45</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>PCR primer design</p>
</st>
<p>Two pairs of new PCR primers were designed using program Oligo 6.7 (Molecular Biology Insights). One pair (forward 5'- TTTGCAAGCTTCCATGTCTCT -3', reverse 5'-CACTTGTGCTTGGAGTTCAGA -3') amplified partial exon 2 to exon 3 of the Class I gene <it>Saha-UK </it>from devil cDNA. The other pair was designed to detect a deletion in Class I gene <it>Saha-UA</it>, with the forward primer 5'- TGTCCCCCCCTCCGTCTCAG -3'binding to the end of intron 1 and the reverse 5'- CAGGAGAGGAGACCACACTAAGAT -3'binding to the end of intron 5. In about 5% of samples, this set of primers was found to work less efficiently, which could be due to nucleotide variations in primer binding sites. PCR conditions for both primer sets were the same as the one provided above. PCR products were sequenced to confirm correct amplification sites.</p>
</sec>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>YC, AS, KM, RT constructed the BAC libraries under the supervision of CTA. YC carried out all laboratory experiments, analysed the data and wrote the first draft of the manuscript. HS provided supervision in the lab. JD supervised FISH experiments. MJ provided devil samples. KB conceived and designed the project and revised the manuscript together with YC. All authors read drafts of the manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>This work was funded by an ARC Future Fellowship to KB (FT0992212), the Eric Guiler fund and the Tasmanian Department of Primary Industries, Parks, Water and the Environment. YC was supported by an Endeavour International Postgraduate Research Scholarship, KM by an Australian Postgraduate Award and an ARC Linkage Grant. We thank Carol Churcher, Lucy Matthews, David Willey, Claire Murnane, Nicola Corton, Jennifer Harrow and Charlie Steward at the Wellcome Trust Sanger Institute, Cambridge for mapping, shotgun library making, capillary sequencing and finishing the BAC clones, Elizabeth Murchison for establishing the collaboration with the Sanger Institute, and the Save the Tasmanian Devil Program team and Rodrigo Hamede and Shelly Lachish at the University of Tasmania for sample collection collection and Ke-jun Wei from ANU for DNA extraction.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Large marsupial carnivores</p></title><aug><au><snm>Jones</snm><fnm>ME</fnm></au></aug><source>The Encyclopedia of Mammals</source><publisher>Oxford, UK: Oxford University Press</publisher><editor>Macdonald DW</editor><pubdate>2001</pubdate><fpage>814</fpage><lpage>817</lpage></bibl><bibl id="B2"><title><p>Tasmanian devil (<it>Sarcophilus harrisi</it>) extinction on the Australian mainland in the mid-Holocene: multicausality and ENSO intensification</p></title><aug><au><snm>Brown</snm><fnm>OJF</fnm></au></aug><source>Alcheringa: An Australasian Journal of Palaeontology</source><pubdate>2006</pubdate><volume>30</volume><fpage>49</fpage><lpage>57</lpage><xrefbib><pubid idtype="doi">10.1080/03115510609506855</pubid></xrefbib></bibl><bibl id="B3"><title><p>Emerging disease and population decline of an island endemic, the Tasmanian devil <it>Sarcophilus harrisi</it></p></title><aug><au><snm>Hawkins</snm><fnm>CE</fnm></au><au><snm>Baars</snm><fnm>C</fnm></au><au><snm>Hesterman</snm><fnm>H</fnm></au><au><snm>Hocking</snm><fnm>GJ</fnm></au><au><snm>Jones</snm><fnm>ME</fnm></au><au><snm>Lazenby</snm><fnm>B</fnm></au><au><snm>Mann</snm><fnm>D</fnm></au><au><snm>Mooney</snm><fnm>N</fnm></au><au><snm>Pemberton</snm><fnm>D</fnm></au><au><snm>Pyecroft</snm><fnm>S</fnm></au><etal/></aug><source>Biological Conservation</source><pubdate>2006</pubdate><volume>131</volume><fpage>307</fpage><lpage>324</lpage><xrefbib><pubid idtype="doi">10.1016/j.biocon.2006.04.010</pubid></xrefbib></bibl><bibl id="B4"><title><p>Genetic diversity and population structure of Tasmanian devils, the largest marsupial carnivore</p></title><aug><au><snm>Jones</snm><fnm>ME</fnm></au><au><snm>Paetkau</snm><fnm>D</fnm></au><au><snm>Geffen</snm><fnm>E</fnm></au><au><snm>Moritz</snm><fnm>C</fnm></au></aug><source>Molecular Ecology</source><pubdate>2004</pubdate><volume>13</volume><fpage>2197</fpage><lpage>2209</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-294X.2004.02239.x</pubid><pubid idtype="pmpid" link="fulltext">15245394</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Seasonal, demographic and density-related patterns of contact between Tasmanian devils: Implications for transmission of Devil Facial Tumour Disease</p></title><aug><au><snm>Hamede</snm><fnm>R</fnm></au><au><snm>McCallum</snm><fnm>H</fnm></au><au><snm>Jones</snm><fnm>ME</fnm></au></aug><source>Austral Ecology</source><pubdate>2008</pubdate><volume>33</volume><fpage>614</fpage><lpage>622</lpage><xrefbib><pubid idtype="doi">10.1111/j.1442-9993.2007.01827.x</pubid></xrefbib></bibl><bibl id="B6"><title><p>Allograft theory: transmission of devil facial-tumour disease</p></title><aug><au><snm>Pearse</snm><fnm>AM</fnm></au><au><snm>Swift</snm><fnm>K</fnm></au></aug><source>Nature</source><pubdate>2006</pubdate><volume>439</volume><fpage>549</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/439549a</pubid><pubid idtype="pmpid" link="fulltext">16452970</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>The Tasmanian Devil Transcriptome Reveals Schwann Cell Origins of a Clonally Transmissible Cancer</p></title><aug><au><snm>Murchison</snm><fnm>EP</fnm></au><au><snm>Tovar</snm><fnm>C</fnm></au><au><snm>Hsu</snm><fnm>A</fnm></au><au><snm>Bender</snm><fnm>HS</fnm></au><au><snm>Kheradpour</snm><fnm>P</fnm></au><au><snm>Rebbeck</snm><fnm>CA</fnm></au><au><snm>Obendorf</snm><fnm>D</fnm></au><au><snm>Conlan</snm><fnm>C</fnm></au><au><snm>Bahlo</snm><fnm>M</fnm></au><au><snm>Blizzard</snm><fnm>CA</fnm></au><etal/></aug><source>Science</source><pubdate>2010</pubdate><volume>327</volume><fpage>84</fpage><lpage>87</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1180616</pubid><pubid idtype="pmcid">2982769</pubid><pubid idtype="pmpid" link="fulltext">20044575</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>The Immune Response of the Tasmanian Devil (<it>Sarcophilus harrisi</it>) and Devil Facial Tumour Disease</p></title><aug><au><snm>Woods</snm><fnm>G</fnm></au><au><snm>Kreiss</snm><fnm>A</fnm></au><au><snm>Belov</snm><fnm>K</fnm></au><au><snm>Siddle</snm><fnm>H</fnm></au><au><snm>Obendorf</snm><fnm>D</fnm></au><au><snm>Muller</snm><fnm>H</fnm></au></aug><source>EcoHealth</source><pubdate>2007</pubdate><volume>4</volume><fpage>338</fpage><lpage>345</lpage><xrefbib><pubid idtype="doi">10.1007/s10393-007-0117-1</pubid></xrefbib></bibl><bibl id="B9"><title><p>Transmission of a fatal clonal tumor by biting occurs due to depleted MHC diversity in a threatened carnivorous marsupial</p></title><aug><au><snm>Siddle</snm><fnm>HV</fnm></au><au><snm>Kreiss</snm><fnm>A</fnm></au><au><snm>Eldridge</snm><fnm>MDB</fnm></au><au><snm>Noonan</snm><fnm>E</fnm></au><au><snm>Clarke</snm><fnm>CJ</fnm></au><au><snm>Pyecroft</snm><fnm>S</fnm></au><au><snm>Woods</snm><fnm>GM</fnm></au><au><snm>Belov</snm><fnm>K</fnm></au></aug><source>Proceedings of the National Academy of Sciences</source><pubdate>2007</pubdate><volume>104</volume><fpage>16221</fpage><lpage>16226</lpage><xrefbib><pubid idtype="doi">10.1073/pnas.0704580104</pubid></xrefbib></bibl><bibl id="B10"><aug><au><snm>Janeway</snm><fnm>CA</fnm></au><au><snm>Travers</snm><fnm>P</fnm></au><au><snm>Walport</snm><fnm>M</fnm></au><au><snm>Shlomchik</snm><fnm>M</fnm></au></aug><source>Immunobiology: the immune system in health and disease</source><publisher>New York, USA: Garland Publishing</publisher><pubdate>2001</pubdate></bibl><bibl id="B11"><title><p>Histocompatibility Antigens Structure and Function</p></title><aug><au><snm>Parham</snm><fnm>P</fnm></au><au><snm>Strominger</snm><fnm>JL</fnm></au></aug><source>Receptors and Recognition Series B</source><publisher>New York: Chapman and Hall</publisher><pubdate>1982</pubdate><volume>14</volume></bibl><bibl id="B12"><title><p>Mechanisms of MHC class I-restricted antigen processing and cross-presentation</p></title><aug><au><snm>Cresswell</snm><fnm>P</fnm></au><au><snm>Ackerman</snm><fnm>AL</fnm></au><au><snm>Giodini</snm><fnm>A</fnm></au><au><snm>Peaper</snm><fnm>DR</fnm></au><au><snm>Wearsch</snm><fnm>PA</fnm></au></aug><source>Immunological Reviews</source><pubdate>2005</pubdate><volume>207</volume><fpage>145</fpage><lpage>157</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.0105-2896.2005.00316.x</pubid><pubid idtype="pmpid" link="fulltext">16181333</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Antigen presentation by non-classical class I molecules</p></title><aug><au><snm>Stroynowski</snm><fnm>I</fnm></au><au><snm>Lindahl</snm><fnm>KF</fnm></au></aug><source>Current Opinion in Immunology</source><pubdate>1994</pubdate><volume>6</volume><fpage>38</fpage><lpage>44</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0952-7915(94)90031-0</pubid><pubid idtype="pmpid">8172679</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>MHC class II-restricted antigen processing and presentation</p></title><aug><au><snm>Pieters</snm><fnm>J</fnm></au></aug><source>Advances in Immunology</source><pubdate>2000</pubdate><volume>75</volume><fpage>159</fpage><lpage>208</lpage><xrefbib><pubid idtype="pmpid">10879284</pubid></xrefbib></bibl><bibl id="B15"><title><p>Evolution by the birth-and-death process in multigene families of the vertebrate immune system</p></title><aug><au><snm>Nei</snm><fnm>M</fnm></au><au><snm>Gu</snm><fnm>X</fnm></au><au><snm>Sitnikova</snm><fnm>T</fnm></au></aug><source>Proceedings of the National Academy of Sciences</source><pubdate>1997</pubdate><volume>94</volume><fpage>7799</fpage><lpage>7806</lpage><xrefbib><pubid idtype="doi">10.1073/pnas.94.15.7799</pubid></xrefbib></bibl><bibl id="B16"><title><p>Mini-review: Defense strategies and immunity-related genes</p></title><aug><au><snm>Trowsdale</snm><fnm>J</fnm></au><au><snm>Parham</snm><fnm>P</fnm></au></aug><source>European Journal of Immunology</source><pubdate>2004</pubdate><volume>34</volume><fpage>7</fpage><lpage>17</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/eji.200324693</pubid><pubid idtype="pmpid" link="fulltext">14971025</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><aug><au><snm>Hughes</snm><fnm>AL</fnm></au></aug><source>Adaptive Evolution of Genes and Genomes</source><publisher>New York, USA: Oxford University Press</publisher><pubdate>1999</pubdate></bibl><bibl id="B18"><title><p>Characterization of major histocompatibility complex class I and class II genes from the Tasmanian devil (<it>Sarcophilus harrisi</it>)</p></title><aug><au><snm>Siddle</snm><fnm>HV</fnm></au><au><snm>Sanderson</snm><fnm>C</fnm></au><au><snm>Belov</snm><fnm>K</fnm></au></aug><source>Immunogenetics</source><pubdate>2007</pubdate><volume>59</volume><fpage>753</fpage><lpage>760</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00251-007-0238-2</pubid><pubid idtype="pmpid" link="fulltext">17673996</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>MHC gene copy number variation in Tasmanian devils: implications for the spread of a contagious cancer</p></title><aug><au><snm>Siddle</snm><fnm>HV</fnm></au><au><snm>Marzec</snm><fnm>J</fnm></au><au><snm>Cheng</snm><fnm>Y</fnm></au><au><snm>Jones</snm><fnm>M</fnm></au><au><snm>Belov</snm><fnm>K</fnm></au></aug><source>Proceedings of the Royal Society B</source><pubdate>2010</pubdate><volume>277</volume><fpage>2001</fpage><lpage>2006</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1098/rspb.2009.2362</pubid><pubid idtype="pmcid">2880097</pubid><pubid idtype="pmpid" link="fulltext">20219742</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Nomenclature for the major histocompatibility complexes of different species: a proposal</p></title><aug><au><snm>Klein</snm><fnm>J</fnm></au><au><snm>Bontrop</snm><fnm>RE</fnm></au><au><snm>Dawkins</snm><fnm>RL</fnm></au><au><snm>Erlich</snm><fnm>HA</fnm></au><au><snm>Gyllensten</snm><fnm>UB</fnm></au><au><snm>Heise</snm><fnm>ER</fnm></au><au><snm>Jones</snm><fnm>PP</fnm></au><au><snm>Parham</snm><fnm>P</fnm></au><au><snm>Wakeland</snm><fnm>EK</fnm></au><au><snm>Watkins</snm><fnm>DI</fnm></au></aug><source>Immunogenetics</source><pubdate>1990</pubdate><volume>31</volume><fpage>217</fpage><lpage>219</lpage><xrefbib><pubid idtype="pmpid">2329006</pubid></xrefbib></bibl><bibl id="B21"><title><p>Reconstructing an Ancestral Mammalian Immune Supercomplex from a Marsupial Major Histocompatibility Complex</p></title><aug><au><snm>Belov</snm><fnm>K</fnm></au><au><snm>Deakin</snm><fnm>JE</fnm></au><au><snm>Papenfuss</snm><fnm>AT</fnm></au><au><snm>Baker</snm><fnm>ML</fnm></au><au><snm>Melman</snm><fnm>SD</fnm></au><au><snm>Siddle</snm><fnm>HV</fnm></au><au><snm>Gouin</snm><fnm>N</fnm></au><au><snm>Goode</snm><fnm>DL</fnm></au><au><snm>Sargeant</snm><fnm>TJ</fnm></au><au><snm>Robinson</snm><fnm>MD</fnm></au><etal/></aug><source>PLoS Biology</source><pubdate>2006</pubdate><volume>4</volume><fpage>e46</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pbio.0040046</pubid><pubid idtype="pmcid">1351924</pubid><pubid idtype="pmpid" link="fulltext">16435885</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>MHC-linked and un-linked class I genes in the wallaby</p></title><aug><au><snm>Siddle</snm><fnm>HV</fnm></au><au><snm>Deakin</snm><fnm>JE</fnm></au><au><snm>Coggill</snm><fnm>P</fnm></au><au><snm>Hart</snm><fnm>E</fnm></au><au><snm>Cheng</snm><fnm>Y</fnm></au><au><snm>Wong</snm><fnm>ESW</fnm></au><au><snm>Harrow</snm><fnm>J</fnm></au><au><snm>Beck</snm><fnm>S</fnm></au><au><snm>Belov</snm><fnm>K</fnm></au></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>10</volume><fpage>310</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-10-310</pubid><pubid idtype="pmcid">2719672</pubid><pubid idtype="pmpid" link="fulltext">19602235</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Evolution of the opossum major histocompatibility complex: evidence for diverse alternative splice patterns and low polymorphism among class I genes</p></title><aug><au><snm>Baker</snm><fnm>ML</fnm></au><au><snm>Melman</snm><fnm>SD</fnm></au><au><snm>Huntley</snm><fnm>J</fnm></au><au><snm>Miller</snm><fnm>RD</fnm></au></aug><source>Immunology</source><pubdate>2009</pubdate><volume>128</volume><fpage>e418</fpage><lpage>e431</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-2567.2008.02994.x</pubid><pubid idtype="pmcid">2753908</pubid><pubid idtype="pmpid" link="fulltext">19191910</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Evolution of class I promoter sequences: relationship to function and diversity</p></title><aug><au><snm>Vallejo</snm><fnm>AN</fnm></au><au><snm>Pease</snm><fnm>LR</fnm></au></aug><source>Immunological Reviews</source><pubdate>1995</pubdate><volume>143</volume><fpage>249</fpage><lpage>262</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1600-065X.1995.tb00678.x</pubid><pubid idtype="pmpid">7558079</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>The Marsupial Major Histocompatibility Complex</p></title><aug><au><snm>Cheng</snm><fnm>Y</fnm></au><au><snm>Siddle</snm><fnm>HV</fnm></au><au><snm>Belov</snm><fnm>K</fnm></au></aug><source>Marsupial Genetics and Genomics</source><publisher>Dordrecht, Springer</publisher><editor>Deakin JE, Waters PD, Graves JAM</editor><pubdate>2010</pubdate></bibl><bibl id="B26"><title><p>Genome-Wide Analysis of Copy Number Variation in Type 1 Diabetes</p></title><aug><au><snm>Grayson</snm><fnm>BL</fnm></au><au><snm>Smith</snm><fnm>ME</fnm></au><au><snm>Thomas</snm><fnm>JW</fnm></au><au><snm>Wang</snm><fnm>L</fnm></au><au><snm>Dexheimer</snm><fnm>P</fnm></au><au><snm>Jeffrey</snm><fnm>J</fnm></au><au><snm>Fain</snm><fnm>PR</fnm></au><au><snm>Nanduri</snm><fnm>P</fnm></au><au><snm>Eisenbarth</snm><fnm>GS</fnm></au><au><snm>Aune</snm><fnm>TM</fnm></au></aug><source>PLoS ONE</source><pubdate>2010</pubdate><volume>5</volume><fpage>e15393</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0015393</pubid><pubid idtype="pmcid">2981564</pubid><pubid idtype="pmpid" link="fulltext">21085585</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Detection of large-scale variation in the human genome</p></title><aug><au><snm>Iafrate</snm><fnm>AJ</fnm></au><au><snm>Feuk</snm><fnm>L</fnm></au><au><snm>Rivera</snm><fnm>MN</fnm></au><au><snm>Listewnik</snm><fnm>ML</fnm></au><au><snm>Donahoe</snm><fnm>PK</fnm></au><au><snm>Qi</snm><fnm>Y</fnm></au><au><snm>Scherer</snm><fnm>SW</fnm></au><au><snm>Lee</snm><fnm>C</fnm></au></aug><source>Nature Genetics</source><pubdate>2004</pubdate><volume>36</volume><fpage>949</fpage><lpage>951</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1416</pubid><pubid idtype="pmpid" link="fulltext">15286789</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Large-Scale Copy Number Polymorphism in the Human Genome</p></title><aug><au><snm>Sebat</snm><fnm>J</fnm></au><au><snm>Lakshmi</snm><fnm>B</fnm></au><au><snm>Troge</snm><fnm>J</fnm></au><au><snm>Alexander</snm><fnm>J</fnm></au><au><snm>Young</snm><fnm>J</fnm></au><au><snm>Lundin</snm><fnm>P</fnm></au><au><snm>M&#229;n&#233;r</snm><fnm>S</fnm></au><au><snm>Massa</snm><fnm>H</fnm></au><au><snm>Walker</snm><fnm>M</fnm></au><au><snm>Chi</snm><fnm>M</fnm></au><etal/></aug><source>Science</source><pubdate>2004</pubdate><volume>305</volume><fpage>525</fpage><lpage>528</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1098918</pubid><pubid idtype="pmpid" link="fulltext">15273396</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls</p></title><aug><au><cnm>The Wellcome Trust Case Control Consortium</cnm></au></aug><source>Nature</source><pubdate>2010</pubdate><volume>464</volume><fpage>713</fpage><lpage>720</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08979</pubid><pubid idtype="pmcid">2892339</pubid><pubid idtype="pmpid" link="fulltext">20360734</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Inactivation of a functional HLA-A gene: A 4-kb deletion turns HLA-A*24 into a pseudogene</p></title><aug><au><snm>Voorter</snm><fnm>CEM</fnm></au><au><snm>Lauterbach</snm><fnm>N</fnm></au><au><snm>Tilanus</snm><fnm>MGJ</fnm></au></aug><source>Human Immunology</source><pubdate>2010</pubdate><volume>71</volume><fpage>1197</fpage><lpage>1202</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.humimm.2010.09.010</pubid><pubid idtype="pmpid" link="fulltext">20858522</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Genomic rearrangements by LINE-1 insertion-mediated deletion in the human and chimpanzee lineages</p></title><aug><au><snm>Han</snm><fnm>K</fnm></au><au><snm>Sen</snm><fnm>SK</fnm></au><au><snm>Wang</snm><fnm>J</fnm></au><au><snm>Callinan</snm><fnm>PA</fnm></au><au><snm>Lee</snm><fnm>J</fnm></au><au><snm>Cordaux</snm><fnm>R</fnm></au><au><snm>Liang</snm><fnm>P</fnm></au><au><snm>Batzer</snm><fnm>MA</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2005</pubdate><volume>33</volume><fpage>4040</fpage><lpage>4052</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gki718</pubid><pubid idtype="pmcid">1179734</pubid><pubid idtype="pmpid" link="fulltext">16034026</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>L1 recombination-associated deletions generate human genomic variation</p></title><aug><au><snm>Han</snm><fnm>K</fnm></au><au><snm>Lee</snm><fnm>J</fnm></au><au><snm>Meyer</snm><fnm>TJ</fnm></au><au><snm>Remedios</snm><fnm>P</fnm></au><au><snm>Goodwin</snm><fnm>L</fnm></au><au><snm>Batzer</snm><fnm>MA</fnm></au></aug><source>Proceedings of the National Academy of Sciences</source><pubdate>2008</pubdate><volume>105</volume><fpage>19366</fpage><lpage>19371</lpage><xrefbib><pubid idtype="doi">10.1073/pnas.0807866105</pubid></xrefbib></bibl><bibl id="B33"><title><p>Reduced Effect of Tasmanian Devil Facial Tumor Disease at the Disease Front</p></title><aug><au><snm>Hamede</snm><fnm>R</fnm></au><au><snm>Lachish</snm><fnm>S</fnm></au><au><snm>Belov</snm><fnm>K</fnm></au><au><snm>Woods</snm><fnm>G</fnm></au><au><snm>Kreiss</snm><fnm>A</fnm></au><au><snm>Pearse</snm><fnm>A-M</fnm></au><au><snm>Lazenby</snm><fnm>B</fnm></au><au><snm>Jones</snm><fnm>M</fnm></au><au><snm>McCallum</snm><fnm>H</fnm></au></aug><source>Conservation Biology</source><pubdate>2011</pubdate><note>early view DOI:10.1111/j.1523-1739.2011.01747.x</note></bibl><bibl id="B34"><title><p>Relative Impact of Nucleotide and Copy Number Variation on Gene Expression Phenotypes</p></title><aug><au><snm>Stranger</snm><fnm>BE</fnm></au><au><snm>Forrest</snm><fnm>MS</fnm></au><au><snm>Dunning</snm><fnm>M</fnm></au><au><snm>Ingle</snm><fnm>CE</fnm></au><au><snm>Beazley</snm><fnm>C</fnm></au><au><snm>Thorne</snm><fnm>N</fnm></au><au><snm>Redon</snm><fnm>R</fnm></au><au><snm>Bird</snm><fnm>CP</fnm></au><au><snm>de Grassi</snm><fnm>A</fnm></au><au><snm>Lee</snm><fnm>C</fnm></au><etal/></aug><source>Science</source><pubdate>2007</pubdate><volume>315</volume><fpage>848</fpage><lpage>853</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1136678</pubid><pubid idtype="pmcid">2665772</pubid><pubid idtype="pmpid" link="fulltext">17289997</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>The tammar wallaby major histocompatibility complex shows evidence of past genomic instability</p></title><aug><au><snm>Siddle</snm><fnm>H</fnm></au><au><snm>Deakin</snm><fnm>J</fnm></au><au><snm>Coggill</snm><fnm>P</fnm></au><au><snm>Wilming</snm><fnm>L</fnm></au><au><snm>Harrow</snm><fnm>J</fnm></au><au><snm>Kaufman</snm><fnm>J</fnm></au><au><snm>Beck</snm><fnm>S</fnm></au><au><snm>Belov</snm><fnm>K</fnm></au></aug><source>BMC Genomics</source><pubdate>2011</pubdate><volume>12</volume><fpage>421</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-12-421</pubid><pubid idtype="pmcid">3179965</pubid><pubid idtype="pmpid" link="fulltext">21854592</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>DNA-hybridisation Studies of Marsupials and their Implications for Metatherian Classification</p></title><aug><au><snm>Kirsch</snm><fnm>JAW</fnm></au><au><snm>Lapointe</snm><fnm>F-J</fnm></au><au><snm>Springer</snm><fnm>MS</fnm></au></aug><source>Australian Journal of Zoology</source><pubdate>1997</pubdate><volume>45</volume><fpage>211</fpage><lpage>280</lpage><xrefbib><pubid idtype="doi">10.1071/ZO96030</pubid></xrefbib></bibl><bibl id="B37"><title><p>Genetic diversity and population structure of the endangered marsupial <it>Sarcophilus harrisii </it>(Tasmanian devil)</p></title><aug><au><snm>Miller</snm><fnm>W</fnm></au><au><snm>Hayes</snm><fnm>VM</fnm></au><au><snm>Ratan</snm><fnm>A</fnm></au><au><snm>Petersen</snm><fnm>DC</fnm></au><au><snm>Wittekindt</snm><fnm>NE</fnm></au><au><snm>Miller</snm><fnm>J</fnm></au><au><snm>Walenz</snm><fnm>B</fnm></au><au><snm>Knight</snm><fnm>J</fnm></au><au><snm>Qi</snm><fnm>J</fnm></au><au><snm>Zhao</snm><fnm>F</fnm></au><etal/></aug><source>Proceedings of the National Academy of Sciences</source><pubdate>2011</pubdate><note>published ahead of print June 27, 2011, doi:2010.1073/pnas.1102838108</note></bibl><bibl id="B38"><title><p>Construction of P1 artificial chromosome (PAC) libraries from lower vertebrates</p></title><aug><au><snm>Amemiya</snm><fnm>CT</fnm></au><au><snm>Ota</snm><fnm>T</fnm></au><au><snm>Litman</snm><fnm>GW</fnm></au></aug><source>Nonmammalian Genomic Analyses: A Practical Guide</source><publisher>San Diego: Academic Press</publisher><editor>Birren B, Lai E</editor><pubdate>1996</pubdate><fpage>223</fpage><lpage>256</lpage></bibl><bibl id="B39"><title><p>An improved approach for construction of bacterial artificial chromosome libraries</p></title><aug><au><snm>Oosegawa</snm><fnm>K</fnm></au><au><snm>Woon</snm><fnm>PY</fnm></au><au><snm>Zhao</snm><fnm>B</fnm></au><au><snm>Frengen</snm><fnm>E</fnm></au><au><snm>Tateno</snm><fnm>M</fnm></au><au><snm>Catanese</snm><fnm>JJ</fnm></au><au><snm>de Jong</snm><fnm>PJ</fnm></au></aug><source>Genomics</source><pubdate>1998</pubdate><volume>52</volume><fpage>1</fpage><lpage>8</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/geno.1998.5423</pubid><pubid idtype="pmpid" link="fulltext">9740665</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Basic local alignment search tool</p></title><aug><au><snm>Altschul</snm><fnm>SF</fnm></au><au><snm>Gish</snm><fnm>W</fnm></au><au><snm>Miller</snm><fnm>W</fnm></au><au><snm>Myers</snm><fnm>EW</fnm></au><au><snm>Lipman</snm><fnm>DJ</fnm></au></aug><source>Journal of Molecular Biology</source><pubdate>1990</pubdate><volume>215</volume><fpage>403</fpage><lpage>410</lpage><xrefbib><pubid idtype="pmpid">2231712</pubid></xrefbib></bibl><bibl id="B41"><title><p>Human and Vertebrate Analysis and Annotation Guidelines</p></title><url>http://www.sanger.ac.uk/research/projects/vertebrategenome/havana/assets/guidelines.pdf</url></bibl><bibl id="B42"><title><p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matrix choice</p></title><aug><au><snm>Thompson</snm><fnm>JD</fnm></au><au><snm>Higgins</snm><fnm>DG</fnm></au><au><snm>Gibson</snm><fnm>TJ</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>1994</pubdate><volume>22</volume><fpage>4673</fpage><lpage>4680</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/22.22.4673</pubid><pubid idtype="pmcid">308517</pubid><pubid idtype="pmpid" link="fulltext">7984417</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>Characterizing the chromosomes of the Australian model marsupial <it>Macropus eugeni </it>(tammar wallaby)</p></title><aug><au><snm>Alsop</snm><fnm>AE</fnm></au><au><snm>Miethke</snm><fnm>P</fnm></au><au><snm>Rofe</snm><fnm>R</fnm></au><au><snm>Koina</snm><fnm>E</fnm></au><au><snm>Sankovic</snm><fnm>N</fnm></au><au><snm>Deakin</snm><fnm>JE</fnm></au><au><snm>Haines</snm><fnm>H</fnm></au><au><snm>Rapkins</snm><fnm>RW</fnm></au><au><snm>Graves</snm><fnm>JAM</fnm></au></aug><source>Chromosome Research</source><pubdate>2005</pubdate><volume>13</volume><fpage>627</fpage><lpage>636</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s10577-005-0989-2</pubid><pubid idtype="pmpid" link="fulltext">16170627</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Bioedit: a user friendly biological sequence alignment editor and analysis program for windows 95/98/NT</p></title><aug><au><snm>Hall</snm><fnm>T</fnm></au></aug><source>Nucleic Acids Symposium Series</source><pubdate>1999</pubdate><volume>41</volume><fpage>95</fpage><lpage>98</lpage></bibl><bibl id="B45"><title><p>MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0</p></title><aug><au><snm>Tamura</snm><fnm>K</fnm></au><au><snm>Dudley</snm><fnm>J</fnm></au><au><snm>Nei</snm><fnm>M</fnm></au><au><snm>Kumar</snm><fnm>S</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2007</pubdate><note>10.1093/molbev/msm1092</note></bibl><bibl id="B46"><title><p>The neighbor-joining method: A new method for reconstructing phylogenetic trees</p></title><aug><au><snm>Saitou</snm><fnm>N</fnm></au><au><snm>Nei</snm><fnm>M</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>1987</pubdate><volume>4</volume><fpage>406</fpage><lpage>425</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">3447015</pubid></xrefbib></bibl><bibl id="B47"><title><p>Confidence limits on phylogenies: An approach using the bootstrap</p></title><aug><au><snm>Felsenstein</snm><fnm>J</fnm></au></aug><source>Evolution</source><pubdate>1985</pubdate><volume>39</volume><fpage>783</fpage><lpage>791</lpage><xrefbib><pubid idtype="doi">10.2307/2408678</pubid></xrefbib></bibl><bibl id="B48"><title><p>Structure, function, and diversity of class I major histocompatibility complex molecules</p></title><aug><au><snm>Bjorkman</snm><fnm>PJ</fnm></au><au><snm>Parham</snm><fnm>P</fnm></au></aug><source>Annual Review of Biochemistry</source><pubdate>1990</pubdate><volume>59</volume><fpage>253</fpage><lpage>288</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.bi.59.070190.001345</pubid><pubid idtype="pmpid" link="fulltext">2115762</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>