<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-9-228</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Sequence features of <it>HLA-DRB1 </it>locus define putative basis for gene conversion and point mutations</p>
         </title>
         <aug>
            <au id="A1">
               <snm>von Salom&#233;</snm>
               <fnm>Jenny</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <insr iid="I4"/>
               <email>jenny.vonsalome@karolinska.se</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Kukkonen</snm>
               <mi>P</mi>
               <fnm>Jyrki</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <insr iid="I4"/>
               <email>jyrki.kukkonen@helsinki.fi</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>University of Helsinki, Department of Basic Veterinary Sciences, Helsinki, Finland</p>
            </ins>
            <ins id="I2">
               <p>Karolinska University Hospital, Department of Clinical Genetics, Stockholm, Sweden</p>
            </ins>
            <ins id="I3">
               <p>&#197;bo Akademi University, Department of Biology, Turku, Finland</p>
            </ins>
            <ins id="I4">
               <p>Uppsala University, Department of Neuroscience, Physiology, Uppsala, Sweden</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>228</fpage>
         <url>http://www.biomedcentral.com/1471-2164/9/228</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18489735</pubid>
               <pubid idtype="doi">10.1186/1471-2164-9-228</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>07</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>19</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>19</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>von Salom&#233; and Kukkonen; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>HLA/MHC class II molecules show high degree of polymorphism in the human population. The individual polymorphic motifs have been suggested to be propagated and mixed by transfer of genetic material (recombination, gene conversion) between alleles, but no clear molecular basis for this has been identified as yet. A large number of MHC class II allele sequences is publicly available and could be used to analyze the sequence features behind the recombination, revealing possible basis for such recombination processes both in HLA class II genes and other genes, which recombination acts upon.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>In this study we analyzed the vast dataset of human allelic variants (49 full coding sequences, 374 full exon 2 sequences) of the most polymorphic MHC class II locus, <it>HLA-DRB1</it>, and identified many previously unknown sequence features possibly contributing to the recombination. The CpG-dinucleotide content of exon 2 (containing the antigen-binding sites and subsequently a high degree of polymorphism) was much elevated as compared to the other exons despite similar overall G+C content. Furthermore, the CpG pattern was highly conserved. We also identified more complex, highly conserved sequence motifs in exon 2. Some of these can be identified as putative recombination motifs previously found in other genes, but most are previously unidentified.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The identified sequence features could putatively act in recombination allowing either less (CpG dinucleotides) or more specific DNA cleavage (complex sequences) or homologous recombination (complex sequences).</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Over the last few years our knowledge of the mechanism of recombination has increased substantially. Still, the knowledge is to a large extent based on simple organisms such as E. coli and yeasts, as the vertebrate genome is not equally readily or rapidly monitored or manipulated. It is well known that homologous pairing and strand exchange involved in recombination in the eukaryotic cell is promoted by specific recombination proteins <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, and that recombination is tightly linked to DNA replication and repair. For example, double strand breaks are repaired by recombination using information from homologous DNA molecules. Moreover, stalled replication can be re-started by forming a recombination intermediate with assistance from recombination proteins at the replication fork <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Recombination also generates diversity essential for, e.g., the vertebrate adaptive immune system (immunoglobulins and T-cell receptor genes) and long-term genome evolution. The term illegitimate recombination was coined to describe one type of "novel" recombination, which, in contrast to the classical (homologous) recombination, requires no or only short stretches of sequence homology [reviewed in <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>]. Despite recent advances in the investigation of eukaryotic recombination, little is known about the mechanisms of illegitimate recombination, except for some specific cases like the immunoglobulin gene rearrangements.</p>
         <p>The major histocompatibility complex (MHC) class II loci encode heterodimeric cell surface receptors that present peptide antigens to helper T-cells so that an appropriate immune response can be induced. In man, the by-far most polymorphic MHC class II locus is <it>HLA-DRB1</it>; as of march 2008 the <it>HLA-DRB1 </it>locus had over 540 alleles <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp> and is thus one of the most polymorphic loci in the human genome. A large number of low-frequency alleles is apparently maintained in the human population by balancing selection. The peptide fragments are bound by interactions with the peptide backbone and amino acid side chains in the second exon-coded part of <it>HLA-DRB1 </it>(<it>DRB1</it>-e2), termed antigen recognition sites (ARS). Each individual carries a maximum number of two different inherited alleles per locus (assuming heterozygocity), while the greater allelic diversity is present in the population, putatively allowing population adaptation to pathogens.</p>
         <p>ARS polymorphisms are thought to be created by point mutations, which are propagated by some recombination events, e.g. gene conversion. This view is based on the observed patchwork pattern of apparently exchanged motifs and the fact that synonymous substitutions are also much elevated in the <it>DRB1</it>-e2 (hitch-hiking with the non-synonymous substitutions) <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. However, there is little direct evidence for any recombination in MHC class II ARS, and no clear recombinogenic motifs or mechanisms have as yet been identified. Since the multiple ARS of <it>DRB1</it>-e2 are spread over a small region of 200 bp only, exchange of very small blocks of DNA is needed to create the pattern of polymorphism seen. This, again, is in sharp contrast to the classical (homologous) recombination, which requires significant stretches of sequence homology and exchanges relatively large blocks of generic material. Therefore, due to the apparent high activity of illegitimate recombination in <it>DRB1</it>-e2 and the large number of allelic sequences known, <it>DRB1</it>-e2 seems to be a uniquely suitable target for investigations of mechanisms behind illegitimate recombination. As it is known that specific DNA sequences can enhance or mediate recombination, we have in this study targeted the vast database of known human <it>HLA-DRB1 </it>alleles in the quest for possible sequence motifs that would enable recombination. The analyses identify strongly conserved sequence features as well as recombinogenic motifs previously recognized in other genes, which may thus lie at the basis of recombination events creating new alleles.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Diversity in the antigen-binding exon</p>
            </st>
            <p><it>DRB1</it>-e2 displays much higher degree of sequence diversity than the other exons in <it>DRB1 </it>(Fig. <figr fid="F1">1A</figr>), independently of whether gross-diversity or non-synonymous (aminoacid-changing) diversity is analyzed <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. As seen in earlier studies [see e.g. <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>], the synonymous diversity is also elevated in exon 2 (Fig. <figr fid="F1">1B</figr>), supporting the view that recombination, e.g. gene conversion is involved in creating polymorphism in this exon <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B10">10</abbr></abbrgrp>. Consequently, synonymous substitutions would "hitch-hike" within the same exchanged DNA blocks as the non-synonymous substitutions and remain conserved due to selection forces acting on the non-synonymous substitutions. Indeed, synonymous substitutions were mainly found either in the same codons as the non-synonymous ones (Fig. <figr fid="F1">1C</figr>, the "complex" trace in its entity and the overlap of the non-synonymous and synonymous traces) or in their close vicinity (Fig. <figr fid="F1">1C</figr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The <it>HLA-DRB1 </it>exon diversity</p>
               </caption>
               <text>
                  <p>The <it>HLA-DRB1 </it>exon diversity. A, <it>DRB1 </it>exon 2 diversity compared to the rest of the coding region (fused exons 1, 3, 4, 5 and 6) in the dataset including the entire <it>DRB1 </it>coding region (49 sequences). Mean &#177; sem is shown. B, synonymous and non-synonymous diversity in the <it>DRB1 </it>coding region in the dataset including the entire <it>DRB1 </it>coding region. In the short exon 5 (24 bp) half of the alleles have G instead of C at the nucleotide position 22, resulting in high apparent diversity for the whole exon. Mean &#177; sem is shown. C, sliding window analysis of non-synonymous, synonymous and complex substitutions in the <it>DRB1</it>-e2 in the dataset including the complete <it>DRB1</it>-e2. Complex stands for complex combinations of non-synonymous and synonymous substitutions in the same codon. The graph illustrates the contribution of these different components in <it>d</it>, which is not equal to <it>d</it><sub><it>synonymous </it></sub>and <it>d</it><sub><it>non-synonymous </it></sub>(<it>d</it>, as calculated here does not take into consideration the capability of the codon to mutate in synonymous and non-synonymous manner).</p>
               </text>
               <graphic file="1471-2164-9-228-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Frequency of transitions and transversions in the coding region</p>
            </st>
            <p>The higher diversity of <it>DRB1</it>-e2 is also reflected by a higher frequency of both transitions (T&#8596;C, A&#8596;G) and transversions (T&#8596;A, T&#8596;G, C&#8596;A, C&#8596;G), as compared to the rest of the <it>DRB1 </it>exons (Fig. <figr fid="F2">2</figr>). In general, transitions occur at higher frequencies than transversions in our genome <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. However, while the transition/transversion-ratio was about 2 in the fused other exons (between 2 and 3 in the separate exons 1, 3 and 4, which have a length comparable to exon 2), it was 0.8 in exon 2. The results using the full dataset of 374 complete <it>DRB1</it>-e2 (transitions = 3.4 &#177; 0.0%, transversions = 4.2 &#177; 0.1%, ratio = 0.8) were similar to the 49 complete coding sequences (Fig. <figr fid="F2">2</figr>). The transition/transversion ratio near unity in <it>DRB1</it>-e2 is logical in the light of previous studies, which show that when sequences diverge and mutations accumulate, the transition/transversion-ratio decreases finally approaching 1 due to transition saturation <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Transitions and transversions in <it>HLA-DRB1</it>, based on the dataset including the entire <it>DRB1 </it>coding region (49 sequences)</p>
               </caption>
               <text>
                  <p>Transitions and transversions in <it>HLA-DRB1</it>, based on the dataset including the entire <it>DRB1 </it>coding region (49 sequences). Mean &#177; sem is shown.</p>
               </text>
               <graphic file="1471-2164-9-228-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>CpG dinucleotide enrichment and conservation in DRB1-e2</p>
            </st>
            <p>The G+C level was similar across all <it>DRB1 </it>exons (Fig. <figr fid="F3">3</figr>). At the determined G+C content of the whole of <it>DRB1 </it>of 60%, the theoretical level of CpG should under neutral conditions be 9% (see below). When present in the CpG-dinucleotide, cytosine is often methylated. Methyl-cytosine can then deaminate to uracil, and thus lead to the transition C&#8594;T or, if occurring in the complementary strand, a G&#8594;A transition. Thus, CG may mutate to TG or CA, and genes regularly have a lower than mathematically expected level of CpG dinucleotides. The high propensity of CpG to mutate may also effectively engage DNA repair. DNA repair induces double strand breaks and may support recombination events, which may explain why CpG-rich sequences have been identified to display high recombination activity.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>G+C content in <it>HLA-DRB1 </it>exons 1&#8211;6, based on the dataset including the entire <it>DRB1 </it>coding region (49 sequences)</p>
               </caption>
               <text>
                  <p>G+C content in <it>HLA-DRB1 </it>exons 1&#8211;6, based on the dataset including the entire <it>DRB1 </it>coding region (49 sequences). The dotted line indicates the overall average G+C. Mean &#177; sem is shown.</p>
               </text>
               <graphic file="1471-2164-9-228-3"/>
            </fig>
            <p>Theoretically, CpG content = ((G+C content/100%)/2)<sup>2 </sup>* 100% (e.g. for G+C content of 50%, CpG content = 6.25%). The actual CpG content depends on the age of the gene, but in average the content would be below one third of the theoretical (e.g. for G+C content of 50%, CpG content &lt; 2%) <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. As expected, the determined CpG content of <it>DRB1</it>-e1, and -3-6 was well below the theoretical level of 9% (Fig. <figr fid="F4">4A</figr>; in average ~1/6 of the theoretical level [dotted line in Fig. <figr fid="F4">4B</figr>]). In contrast, CpG content of <it>DRB1</it>-e2 was surprisingly high, about 8% (Fig. <figr fid="F4">4A</figr>), which suggests that the CpG level is almost fully preserved in <it>DRB1</it>-e2 (see also Fig. <figr fid="F4">4B</figr>). In addition, the distribution of CpG dinucleotides in <it>DRB1</it>-e2 was to a very high extent conserved in all alleles (Fig. <figr fid="F5">5AB</figr>).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>CpG-dinucleotide content in <it>HLA-DRB1 </it>exons 1&#8211;6, based on the dataset including the entire <it>DRB1 </it>coding region (49 sequences)</p>
               </caption>
               <text>
                  <p>CpG-dinucleotide content in <it>HLA-DRB1 </it>exons 1&#8211;6, based on the dataset including the entire <it>DRB1 </it>coding region (49 sequences). A, the observed CpG-dinucleotide content. B, the observed CpG level (as in A) divided by the mathematically estimated CpG content (based on the total G+C level). Mean &#177; sem is shown. The ratios were separately calculated for each allele and then averaged.</p>
               </text>
               <graphic file="1471-2164-9-228-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>CpG distribution in <it>DRB1</it>-e2, based on the dataset including the 374 complete <it>DRB1</it>-e2 sequences</p>
               </caption>
               <text>
                  <p>CpG distribution in <it>DRB1</it>-e2, based on the dataset including the 374 complete <it>DRB1</it>-e2 sequences. A, each individual sequence lined under each other in the consensus numbering order starting from <it>DRB1</it>*010101. Black boxes indicate CpG dinucleotides and gray boxes other dinucleotides. B, CpG frequency for each nucleotide position. The dotted line indicates 100%.</p>
               </text>
               <graphic file="1471-2164-9-228-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Motifs potentially involved in site-specific recombination</p>
            </st>
            <p>There is a number of sequence features or motifs proposed to be recognized by specific nuclease complexes, resulting in double strand breaks and increased recombination rate <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. We further analyzed the sequence data sets to explore the possibility that specific recombination motifs are involved in creating polymorphism in <it>DRB1</it>-e2 (Table <tblr tid="T1">1</tblr>).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Motifs used in the screening of <it>DRB1</it>-e2. </p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Motif description</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Motif sequence</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Reference</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Polypurine/-pyrimidine tract</p>
                     </c>
                     <c ca="left">
                        <p>5'-RRRRR-3'/5'-YYYYY-3'</p>
                     </c>
                     <c ca="left">
                        <p>[47, 19, 48, 49]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Alternating purine-pyrimidine tract</p>
                     </c>
                     <c ca="left">
                        <p>5'-RYRYR-3'/5'-YRYRY-3'</p>
                     </c>
                     <c ca="left">
                        <p>[19, 50]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Immunoglobulin heavy chain class switch repeats</p>
                     </c>
                     <c ca="left">
                        <p>5'-GAGCT-3'/5'-AGCTC-3'</p>
                     </c>
                     <c ca="left">
                        <p>[51, 49]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-GGGCT-3'/5'-AGCCC-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-GGGGT-3'/5'-ACCCC-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-TGGGG-3'/5'-CCCCA-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-TGAGC-3'/5'-GCTCA-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DNA polymerase arrest site</p>
                     </c>
                     <c ca="left">
                        <p>5'-WGGAG-3'/5'-CTCCW-3'</p>
                     </c>
                     <c ca="left">
                        <p>[49]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Deletion hotspot consensus</p>
                     </c>
                     <c ca="left">
                        <p>5'-TGRRKM-3'/5'-KMYYCA-3'</p>
                     </c>
                     <c ca="left">
                        <p>[28, 49]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Heptamer recombination signal</p>
                     </c>
                     <c ca="left">
                        <p>5'-CACAGTG-3'/5'-CACTGTG-3'</p>
                     </c>
                     <c ca="left">
                        <p>[22]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Nonamer recombination signal</p>
                     </c>
                     <c ca="left">
                        <p>5'-ACAAAAACC-3'/5'-GGTTTTTGT-3'</p>
                     </c>
                     <c ca="left">
                        <p>[22]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chi-like sequence</p>
                     </c>
                     <c ca="left">
                        <p>5'-GCTGGGG-3'/5'-CCCCAGC-3'</p>
                     </c>
                     <c ca="left">
                        <p>[40, 52]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chi-like sequence</p>
                     </c>
                     <c ca="left">
                        <p>5'-CCAG-3'/5'-CTGG-3'</p>
                     </c>
                     <c ca="left">
                        <p>[53, 54]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chi-like sequence</p>
                     </c>
                     <c ca="left">
                        <p>5'-GCWGGWGG-3'/5'-CCWCCWGC-3'</p>
                     </c>
                     <c ca="left">
                        <p>[55]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Topoisomerase I consensus cleavage sites</p>
                     </c>
                     <c ca="left">
                        <p>5'-CAT-3'/5'-ATG-3'</p>
                     </c>
                     <c ca="left">
                        <p>[56]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-CTY-3'/5'-RAG-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-GTY-3'/5'-RAC-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DNA polymerase A pause site core sequence</p>
                     </c>
                     <c ca="left">
                        <p>5'-GAG-3'/5'-CTC-3'</p>
                     </c>
                     <c ca="left">
                        <p>[57]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-ACG-3'/5'-CGT-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DNA polymerase A/B frameshift hotspots</p>
                     </c>
                     <c ca="left">
                        <p>5'-TGGNGT-3'/5'-ACNCCA-3'</p>
                     </c>
                     <c ca="left">
                        <p>[58, 59]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Vertebrate topoisomerase II consensus cleavage site</p>
                     </c>
                     <c ca="left">
                        <p>5'-RNYNNCNNGYNGKTNYNY-3'/</p>
                     </c>
                     <c ca="left">
                        <p>[60, 61]</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-RNRNAMCNRCNNGNNTNY-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Human hypervariable minisatellite core sequence</p>
                     </c>
                     <c ca="left">
                        <p>5'-GGGCAGGANG-3'/5'-CNTCCTGCCC-3'</p>
                     </c>
                     <c ca="left">
                        <p>[62]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DNA polymerase A frameshift hotspots</p>
                     </c>
                     <c ca="left">
                        <p>5'-TCCCCC-3'/5'-GGGGGA-3'</p>
                     </c>
                     <c ca="left">
                        <p>[59, 63]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DNA polymerase B frameshift hotspots</p>
                     </c>
                     <c ca="left">
                        <p>5'-TTTT-3'/5'-AAAA-3'</p>
                     </c>
                     <c ca="left">
                        <p>[58]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Indel hotspot</p>
                     </c>
                     <c ca="left">
                        <p>5'-GTAAGT-3'/5'-ACTTAC-3'</p>
                     </c>
                     <c ca="left">
                        <p>[64]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Hotspot motif</p>
                     </c>
                     <c ca="left">
                        <p>5'-CCTCCCT-3'/5'-AGGGAGG-3'</p>
                     </c>
                     <c ca="left">
                        <p>[63]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Repeat element motif</p>
                     </c>
                     <c ca="left">
                        <p>5'-CCCCACCCC-3'/5'-GGGGTGGGG-3'</p>
                     </c>
                     <c ca="left">
                        <p>[63]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Double strand break-generating motif</p>
                     </c>
                     <c ca="left">
                        <p>5'-TGGGGG-3'/5'-CCCCCA-3'</p>
                     </c>
                     <c ca="left">
                        <p>[63]</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The sequences of the complementary strands are separated by "/". The ambiguity code symbols are: R = A/G, Y = C/T, K = G/T, M = A/C, S = G/C, W = A/T, N = A/C/G/T.</p>
               </tblfn>
            </tbl>
            <p>Recombination signal sequences (RSSs) are involved in the diversification of antibody genes, initiated by DNA double-strand breaks introduced in the vicinity of RSSs. The RSSs are composed of conserved heptamer and nonamer motifs separated by a spacer of 12 or 23 bp <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The heptamer motifs, especially the first three bases (CAC), are the most influential on recombination efficiency and are usually the most conserved <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. We found a heptamer-like motif (5'-CAC<b>G</b>GTG-3', the bold letter is replaced in 7% of the alleles) at position 254&#8211;260 in exon 2.</p>
            <p>Class switch recombination (CSR) refers to the event when a lymphocyte changes the type of immunoglobulin it produces <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Also CSR involves recombination via DNA double strand breaks at switch regions containing repetitive elements (predominantly, 5'-GAGCT-3' and 5'-GGGGT-3'). The immunoglobulin heavy chain class switch repeat GAGCT was present at position 145&#8211;149 and 248&#8211;252 in all but one (<it>DRB1</it>*0423) of 374 <it>DRB1 </it>alleles (Table <tblr tid="T2">2</tblr>). Another immunoglobulin heavy chain class switch repeat (5'-TGGGG-3') was present in all alleles except for alleles in the lineage <it>DRB1</it>*07 (Table <tblr tid="T2">2</tblr>). This immunoglobulin heavy chain class switch repeats was also present in the exon 2 sequences excluded from the analysis due to missing bases in either the 3' or 5' end (see Additional file <supplr sid="S1">1</supplr>).</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>The 98 incomplete <it>HLA-DRB1 </it>exon 2 sequences excluded from the analyses.</p>
               </text>
               <file name="1471-2164-9-228-S1.txt">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Motifs previously identified in other genes found in the <it>DRB1</it>-e2</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Motif description</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Motif sequence</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Position</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Not present in</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Polypurine tract</p>
                     </c>
                     <c ca="left">
                        <p>5'-RRRRR(RRR)-3'</p>
                     </c>
                     <c ca="left">
                        <p>88&#8211;95</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>121&#8211;125<sup>a</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>178&#8211;184</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>246&#8211;250</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Polypyrimidine tract</p>
                     </c>
                     <c ca="left">
                        <p>5'-YYYYY-3'</p>
                     </c>
                     <c ca="left">
                        <p>5&#8211;10</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>36&#8211;41</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Immunoglobulin heavy chain class switch repeat</p>
                     </c>
                     <c ca="left">
                        <p>5'-GAGCT-3'</p>
                     </c>
                     <c ca="left">
                        <p>141&#8211;145</p>
                     </c>
                     <c ca="left">
                        <p><it>DRB1</it>*0328</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>248&#8211;252</p>
                     </c>
                     <c ca="left">
                        <p><it>DRB1</it>*0452</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-TGGGG-3'</p>
                     </c>
                     <c ca="left">
                        <p>145&#8211;149</p>
                     </c>
                     <c ca="left">
                        <p><it>DRB1</it>*07</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Deletion hotspot consensus</p>
                     </c>
                     <c ca="left">
                        <p>5'-TGAAGA-3'</p>
                     </c>
                     <c ca="left">
                        <p>37&#8211;42<sup>b</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-TGRRKM-3'</p>
                     </c>
                     <c ca="left">
                        <p>145&#8211;150</p>
                     </c>
                     <c ca="left">
                        <p><it>DRB1</it>*07</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>250&#8211;255<sup>b</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chi-like sequence</p>
                     </c>
                     <c ca="left">
                        <p>5'-GCTGGGG-3'</p>
                     </c>
                     <c ca="left">
                        <p>143&#8211;149</p>
                     </c>
                     <c ca="left">
                        <p><it>DRB1</it>*07</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-CTGG-3'</p>
                     </c>
                     <c ca="left">
                        <p>144&#8211;147<sup>b</sup></p>
                     </c>
                     <c ca="left">
                        <p><it>DRB1</it>*07</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>167&#8211;170<sup>b</sup></p>
                     </c>
                     <c ca="left">
                        <p><it>DRB1</it>*0705</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>176&#8211;179</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Topoisomerase I consensus cleavage site</p>
                     </c>
                     <c ca="left">
                        <p>5'-CTY-3'</p>
                     </c>
                     <c ca="left">
                        <p>38&#8211;40</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>251&#8211;253</p>
                     </c>
                     <c ca="left">
                        <p><it>DRB1</it>*0423, *0452</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-GTY-3'</p>
                     </c>
                     <c ca="left">
                        <p>4&#8211;6</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>31&#8211;33</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>47&#8211;49<sup>b</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>108&#8211;110<sup>b</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>114&#8211;116<sup>b</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>171&#8211;173<sup>b</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>183&#8211;185<sup>b</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>213&#8211;215<sup>b</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>231&#8211;233<sup>b</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DNA polymerase a pause site core sequence</p>
                     </c>
                     <c ca="left">
                        <p>5'-GAG-3'</p>
                     </c>
                     <c ca="left">
                        <p>51&#8211;53</p>
                     </c>
                     <c ca="left">
                        <p><it>DRB1</it>*0439</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>121&#8211;123</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>246&#8211;251 (2&#215;)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>268&#8211;270</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-ACG-3'</p>
                     </c>
                     <c ca="left">
                        <p>2&#8211;4</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>115&#8211;117</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>116&#8211;118<sup>b</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Deletion hotspot</p>
                     </c>
                     <c ca="left">
                        <p>5'-YYYTG-3'</p>
                     </c>
                     <c ca="left">
                        <p>7&#8211;11</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>177&#8211;181<sup>b</sup></p>
                     </c>
                     <c ca="left">
                        <p><it>DRB1</it>*0705</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>187&#8211;191</p>
                     </c>
                     <c ca="left">
                        <p><it>DRB1</it>*1374</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Only the motifs present in at least an (almost) entire allelic family are presented; some less common motifs are presented in the text. The ambiguity code symbols are: R = A/G, Y = C/T, K = G/T, M = A/C, S = G/C, W = A/T, N = A/C/G/T.</p>
                  <p><sup>a</sup>motif &#177; 1 bp</p>
                  <p><sup>b </sup>motif in non-coding strand corresponding to these bases in the coding strand.</p>
               </tblfn>
            </tbl>
            <p>Chi (crossover hotspot instigator, &#967;) is an octamer recombination hotspot (5'-GCTGGTGG-3') of the major recombination pathway in <it>E. coli </it>[reviewed in <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>]. Recombination by this pathway is initiated by double-strand breaks occurring at chi sequences. Variants of this motif are suggested to have partial recombinogenic activity, and chi-like sequences have been speculated to be involved in both deletion and translocation events in man <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. A chi-like sequences at nucleotide position 143&#8211;149 in exon 2 was found in all alleles except for the <it>DRB1</it>*07 allelic lineage (Table <tblr tid="T2">2</tblr>). The chi-like sequence in <it>DRB1</it>-e2 was overlapping with a motif reported to be a deletion hotspot consensus sequence (5'-TGRRKM-3'), suggested to be involved in illegitimate recombination <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>. We located this hotspot sequence at positions 145&#8211;150 in all alleles except for the allelic lineage <it>DRB1</it>*07 (Table <tblr tid="T2">2</tblr>). Moreover, this sequence was present in the non-coding strand of all alleles at coding strand position 37&#8211;42 (Table <tblr tid="T2">2</tblr>).</p>
            <p>Several types of the recombination motifs screened for were also found in the other exons of <it>DRB1 </it>(the dataset of 49 complete coding sequences) (not shown).</p>
         </sec>
         <sec>
            <st>
               <p>Conserved sequence stretches and motifs in DRB1-e2</p>
            </st>
            <p>Despite the high degree of variability in <it>DRB1</it>-e2 we could, to our surprise, find 19 stretches of a length 3&#8211;13 bp that have no variation at all among different <it>DRB1</it>-e2 (Table <tblr tid="T3">3</tblr> and Fig. <figr fid="F6">6</figr>). Some of these fully conserved bases corresponded to the known motifs as identified above (Table <tblr tid="T3">3</tblr>).</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Fully conserved stretches of a minimum of 3 bp in all <it>DRB1</it>-e2 sequences</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Position</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Sequence (underlined letters corresponding to motif in table 2)</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Corresponding motif in Table 2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>4&#8211;8</p>
                     </c>
                     <c ca="left">
                        <p>5'-<ul>GTTTC</ul>-3'</p>
                     </c>
                     <c ca="left">
                        <p>Polypyrimidine tract</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>30&#8211;32</p>
                     </c>
                     <c ca="left">
                        <p>5'-TGT-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>36&#8211;43</p>
                     </c>
                     <c ca="left">
                        <p>5'-T<ul>TCTTCA</ul>A-3'</p>
                        <p>5'-<ul>TTCTTC</ul>AA-3'</p>
                     </c>
                     <c ca="left">
                        <p>Deletion hotspot consensus sequence (5'-TGAAGA-3') in non-coding strand</p>
                        <p>Polypyrimidine tract (5'-TTCTTC-3')</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>47&#8211;49</p>
                     </c>
                     <c ca="left">
                        <p>5'-GAC-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>51&#8211;54</p>
                     </c>
                     <c ca="left">
                        <p>5'-GAGC-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>56&#8211;60</p>
                     </c>
                     <c ca="left">
                        <p>5'-GGTGC-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>107&#8211;119</p>
                     </c>
                     <c ca="left">
                        <p>5'-CGACAGCGACGTG-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>121&#8211;124</p>
                     </c>
                     <c ca="left">
                        <p>5'-<ul>GGGA</ul>-3'</p>
                     </c>
                     <c ca="left">
                        <p>Polypurine tract</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>142&#8211;145</p>
                     </c>
                     <c ca="left">
                        <p>5'-<ul>AGCT</ul>-3'</p>
                     </c>
                     <c ca="left">
                        <p>Part of the immunoglobulin heavy chain class switch repeat (5'-G<ul>AGCT</ul>-3')<sup>a</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-A<ul>GCT</ul>-3'</p>
                     </c>
                     <c ca="left">
                        <p>Part of the chi-like sequence (5'-<ul>GCT</ul>GGGG-3')<sup>a</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>147&#8211;149</p>
                     </c>
                     <c ca="left">
                        <p>5'-<ul>GGG</ul>-3'</p>
                     </c>
                     <c ca="left">
                        <p>Part of deletion hotspot consensus sequence (5'-TG<ul>RRK</ul>M-3')<sup>a</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>5'-<ul>GGG</ul>-3'</p>
                     </c>
                     <c ca="left">
                        <p>Part of the chi-like sequence (5'-GCTG<ul>GGG</ul>-3')<sup>a</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>167&#8211;174</p>
                     </c>
                     <c ca="left">
                        <p>5'-CTGGAACA-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>179&#8211;182</p>
                     </c>
                     <c ca="left">
                        <p>5'-GAAG-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>210&#8211;215</p>
                     </c>
                     <c ca="left">
                        <p>5'-GTGGAC-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>222&#8211;227</p>
                     </c>
                     <c ca="left">
                        <p>5'-TGCAGA-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>229&#8211;235</p>
                     </c>
                     <c ca="left">
                        <p>5'-ACAACTA-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>246&#8211;250</p>
                     </c>
                     <c ca="left">
                        <p>5'-<ul>GAGAG</ul>-3'</p>
                     </c>
                     <c ca="left">
                        <p>Polypurine trac</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>248&#8211;250</p>
                     </c>
                     <c ca="left">
                        <p>5'-<ul>GAG</ul>-3'</p>
                     </c>
                     <c ca="left">
                        <p>Part of the immunoglobulin heavy chain class switch repeat (5'-<ul>GAG</ul>CT-3')<sup>a</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>252&#8211;256</p>
                     </c>
                     <c ca="left">
                        <p>5'-<ul>TTCA</ul>C-3'</p>
                     </c>
                     <c ca="left">
                        <p>Deletion hotspot consensus sequence</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>261&#8211;263</p>
                     </c>
                     <c ca="left">
                        <p>5'-CAG-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>267&#8211;269</p>
                     </c>
                     <c ca="left">
                        <p>5'-CGA-3'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>the full motif as in Table 2</p>
               </tblfn>
            </tbl>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Sliding window analysis of nucleotide diversity in <it>HLA-DRB1 </it>exon 2, displaying stretches of totally conserved bases in the 374 <it>DRB1</it>-e2 sequences (of the length &#8805; 3 bp; thick grey lines below the abscissa)</p>
               </caption>
               <text>
                  <p>Sliding window analysis of nucleotide diversity in <it>HLA-DRB1 </it>exon 2, displaying stretches of totally conserved bases in the 374 <it>DRB1</it>-e2 sequences (of the length &#8805; 3 bp; thick grey lines below the abscissa). Also indicated are the previously identified ARS-coding codons (thick black lines above the diversity graph).</p>
               </text>
               <graphic file="1471-2164-9-228-6"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>In this study we identify several distinct features of exon 2 of <it>DRB1</it>. One of these is its high CpG content, possibly leading to a high degree of a) point mutations and b) DNA repair. However, not only is the CpG level high in <it>DRB1</it>-e2, but also the CpG pattern is highly conserved in <it>DRB</it>-e2. It therefore appears unlikely that CpG-dinucleotides would support ARS polymorphism by point mutations. More likely is that the conserved CpG pattern is explained by frequent DNA repair, which, by introducing double-strand DNA cleavage followed by non-homologous end-joining, is one of the suggested mechanisms of gene conversion [reviewed in <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B5">5</abbr><abbr bid="B30">30</abbr></abbrgrp>]. Earlier studies of MHC class I nucleotide sequences in mice have proposed that regions with high levels of CpG dinucleotides are involved in non-reciprocal recombination (gene conversion) <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Analyzes of human MHC class I and II sequences also have reported increased CpG dinucleotide levels in regions suggested to be involved in gene conversion <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. CpG nucleotide could be preserved if the repair system had a bias towards G:C pairs instead of A:T pairs <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> as suggested for regions with high recombination activity <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. However, it should be born in mind that unmethylated CpG dinucleotides, in contrast to the cytosine-methylated CpG:s, mutate at normal rates and regions with high CpG contents may have low levels of methylation <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Conservation of CpG dinucleotides may therefore be a result of either low germ-line methylation or a specific selection against the loss of CpGs <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. However, the highly significant pattern of conserved CpG in <it>HLA-DRB1-e2 </it>can be considered unlikely even if the CpG dinucleotides were unmethylated and mutated at the rate of other bases.</p>
         <p>A few eukaryotic endonucleases with specific DNA recognition sequences involved in DNA recombination, such as topoisomerase I <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, Endo.SceI <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> and homing endonucleases [reviewed in <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>], have been identified. The enzymes have in common that they recognize a more or less strictly defined DNA sequence and cleave at it or some distance from it. In addition, a number of other conserved sequence motifs associated with high recombination activity (such as the chi-like sequences) but without a pinpointed endonuclease/recombinase have been recognized <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B19">19</abbr><abbr bid="B41">41</abbr></abbrgrp>. In this study, we screened <it>DRB1</it>-e2 for "known" recombination, translocation and deletion motifs. A heptamer-like motif was found in all investigated <it>DRB1 </it>alleles and an immunoglobulin heavy chain class switch repeat was found in all but one of the <it>DRB1 </it>alleles. Moreover, a chi-like sequence and a deletion hotspot consensus sequence were present in all alleles except for the *07 lineage. It is thinkable that the <it>DRB1</it>*07 allelic lineage, which contains least alleles of all the lineages, may have lost one of these motifs and therefore gained a limited ability to recombine. This may also be true for other, even less frequent motifs, also including conserved CpGs. Currently, not enough is known about the function of the specific motifs found in order to speculate further on their function. Interspersed among the highly polymorphic areas of <it>DRB1</it>-e2, we found multiple short stretches of bases that have no variation at all between the <it>DRB1 </it>alleles in the dataset. Conserved amino acid motifs can be important for the maintenance of the overall structure of the antigen-binding groove, but as these stretches also lack synonymous substitutions they may have a function in allowing recombination between alleles via illegitimate recombination. This could occur either by offering homology for recombination, by allowing cleavage by some specific enzymes or by stabilization of DNA's secondary structure. Comparison of these sequences to known sequence motifs associated with recombination (see above) produced no hits, which is by no means surprising as indeed only few motifs are known and even fewer verified.</p>
         <p>It should be born in mind that the specific sequence motifs screened for are for the most part very short and may thus appear in a random fashion in any sequence analyzed. Indeed, some motifs were found in the other exons of <it>HLA-DRB1</it>, not subject to recombination. However, the fact that exon 2 is subject to high rate of recombination &#8211; in contrast to the other exons which are highly conserved &#8211; makes random conservation of such stretches unlikely, especially in such a large pool of allelic sequences. Yet the most remarkable features of <it>DRB1</it>-e2 are, rather than the known recombinogenic motifs found, a) the fully conserved sequence stretches and b) high CpG content and the conserved CpG pattern.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We have identified in <it>DRB1</it>-e2 both some known recombination motifs and multiple putative motifs. The latter include both the conserved CpG pattern and other fully conserved sequence motifs. Although the role of these sequence features in the recombination processes in <it>DRB1 </it>is speculative, it is obvious that the known recombination motifs identified here cannot be enough to support the full spectrum of recombination. 22 variable and 15 conserved <it>DRB1</it>-e2 ARS-coding codons, spread over 245 bp (Fig. <figr fid="F6">6</figr>), are known, and each of the variable ARS codons should probably be able to recombine separately from the others, theoretically requiring 23 recombination breakpoints. Whether this indeed is the case, will be deduced from full mapping of the <it>DRB1</it>-e2 recombination profile, which is currently in progress. If the conserved sequence motifs identified here indeed are important in recombination, they would likely be present in other regions of the genome with high recombination activity. This will also be addressed in future studies.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Nucleotide sequences used</p>
            </st>
            <p>For the analysis, sequences from the IMGT/HLA database <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp> were used. The datasets analyzed were the 374 complete exon 2 sequences and 49 complete coding sequences (exons 1&#8211;6). Full descriptions of the datasets can be found in the Additional files <supplr sid="S2">2</supplr> and <supplr sid="S3">3</supplr>.</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>The 374 complete <it>HLA-DRB1 </it>exon 2 sequences used in the analyses.</p>
               </text>
               <file name="1471-2164-9-228-S2.txt">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p>The 49 complete <it>HLA-DRB1 </it>coding sequences used in the analyses.</p>
               </text>
               <file name="1471-2164-9-228-S3.txt">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Analysis of diversity, transition/transversion-ratios, G+C and CpG contents</p>
            </st>
            <p>The sequences were aligned using ClustalW <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. The mean synonymous and non-synonymous diversities (<it>d</it>) were estimated by pairwise comparison of the number of nucleotide substitutions using the Jukes-Cantor method <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> with the MEGA3.1 software <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Sliding-window analyses of the nucleotide diversities were performed using DnaSP 4.10.9 <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> Analyses of the transition/transversion-ratios and the G+C and CpG contents were done with SWAAP 1.0.2 <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> and MEGA3.1. The sliding window analyses of the CpG content were performed using Microsoft Excel.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of motifs potentially involved in site-specific recombination</p>
            </st>
            <p>Recombination has been suggested to be promoted by common sequence features or motifs <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, known or postulated to be recognized by specific nuclease complexes, leading to double strand break and increased recombination rate. We screened <it>DRB1</it>-e2 (coding and non-coding strands) in MEGA3.1 for sequence motifs previously shown to be involved in recombination, to explore the possibility that specific motifs are involved in creating new polymorphisms. The motifs screened for are listed in Table <tblr tid="T1">1</tblr>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>List of abbreviations</p>
         </st>
         <p>ARS: antigen-recognition site(s); bp: basepairs; Chi: crossover hotspot instigator; CpG: CG-dinucleotide (in DNA); CSR: class switch recombination; <it>d</it>: nucleotide diversity; G+C content: content of guanine and cytosine nucleotides (in DNA), <it>DRB1</it>-e2: exon 2 of the <it>HLA-DRB1 </it>gene; HLA: human leukocyte antigen; MHC: major histocompatibility complex; RSSs: recombination signal sequences.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>JPK designed the study, JvS and JPK performed the analyses and wrote the manuscript. Both authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This study was supported by grants from the Novo Nordisk Foundation, the Sigrid Jus&#233;lius Foundation, the Magnus Ehrnrooth Foundation, the K. Albin Johansson Foundation, the Swedish Research Council, Uppsala University, &#197;bo Akademi University and the University of Helsinki Research Funds.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The replication-recombination connection: insights from genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Gruss</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Michel</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>4</volume>
            <issue>5</issue>
            <fpage>595</fpage>
            <lpage>601</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1369-5274(00)00256-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">11587938</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Molecular views of recombination proteins and their control</p>
            </title>
            <aug>
               <au>
                  <snm>West</snm>
                  <fnm>SC</fnm>
               </au>
            </aug>
            <source>Nat Rev Mol Cell Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>6</issue>
            <fpage>435</fpage>
            <lpage>445</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrm1127</pubid>
                  <pubid idtype="pmpid" link="fulltext">12778123</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Recombination: a frank view of exchanges and vice versa</p>
            </title>
            <aug>
               <au>
                  <snm>Haber</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>Curr Opin Cell Biol</source>
            <pubdate>2000</pubdate>
            <volume>12</volume>
            <issue>3</issue>
            <fpage>286</fpage>
            <lpage>292</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0955-0674(00)00090-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">10801454</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Molecular mechanisms of exon shuffling: illegitimate recombination</p>
            </title>
            <aug>
               <au>
                  <snm>van Rijk</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bloemendal</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genetica</source>
            <pubdate>2003</pubdate>
            <volume>118</volume>
            <issue>2-3</issue>
            <fpage>245</fpage>
            <lpage>249</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1024138600624</pubid>
                  <pubid idtype="pmpid" link="fulltext">12868613</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Illegitimate DNA integration in mammalian cells</p>
            </title>
            <aug>
               <au>
                  <snm>Wurtele</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Little</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Chartrand</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Gene Ther</source>
            <pubdate>2003</pubdate>
            <volume>10</volume>
            <issue>21</issue>
            <fpage>1791</fpage>
            <lpage>1799</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.gt.3302074</pubid>
                  <pubid idtype="pmpid" link="fulltext">12960968</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>IMGT/HLA and IMGT/MHC: sequence databases for the study of the major histocompatibility complex</p>
            </title>
            <aug>
               <au>
                  <snm>Robinson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Waller</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Parham</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>de Groot</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bontrop</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Stoehr</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Marsh</snm>
                  <fnm>SG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>311</fpage>
            <lpage>314</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165517</pubid>
                  <pubid idtype="pmpid" link="fulltext">12520010</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg070</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>IMGT/HLA Database</p>
            </title>
            <url>http://www.ebi.ac.uk/imgt/hla</url>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A determination of the frequency of gene conversion in unmanipulated mouse sperm</p>
            </title>
            <aug>
               <au>
                  <snm>Hogstrand</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bohme</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1994</pubdate>
            <volume>91</volume>
            <issue>21</issue>
            <fpage>9921</fpage>
            <lpage>9925</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">44929</pubid>
                  <pubid idtype="pmpid" link="fulltext">7937918</pubid>
                  <pubid idtype="doi">10.1073/pnas.91.21.9921</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Analysis of HLA class II haplotypes in the Cayapa Indians of Ecuador: a novel DRB1 allele reveals evidence for convergent evolution and balancing selection at position 86</p>
            </title>
            <aug>
               <au>
                  <snm>Titus-Trachtenberg</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Rickards</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>De Stefano</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Erlich</snm>
                  <fnm>HA</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1994</pubdate>
            <volume>55</volume>
            <issue>1</issue>
            <fpage>160</fpage>
            <lpage>167</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1918223</pubid>
                  <pubid idtype="pmpid">8023844</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Gene conversion vs point mutation in generating variability at the antigen recognition site of major histocompatibility complex loci</p>
            </title>
            <aug>
               <au>
                  <snm>Ohta</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1995</pubdate>
            <volume>41</volume>
            <issue>2</issue>
            <fpage>115</fpage>
            <lpage>119</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00170662</pubid>
                  <pubid idtype="pmpid">7666440</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>New HLA-DPB1 alleles generated by interallelic gene conversion detected by analysis of sperm</p>
            </title>
            <aug>
               <au>
                  <snm>Zangenberg</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Arnheim</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Erlich</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1995</pubdate>
            <volume>10</volume>
            <issue>4</issue>
            <fpage>407</fpage>
            <lpage>414</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng0895-407</pubid>
                  <pubid idtype="pmpid" link="fulltext">7670490</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Full-length sequence analysis of the HLA-DRB1 locus suggests a recent origin of alleles</p>
            </title>
            <aug>
               <au>
                  <snm>von Salome</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gyllensten</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Bergstrom</snm>
                  <fnm>TF</fnm>
               </au>
            </aug>
            <source>Immunogenetics</source>
            <pubdate>2007</pubdate>
            <volume>59</volume>
            <issue>4</issue>
            <fpage>261</fpage>
            <lpage>271</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00251-007-0196-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">17345114</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Polymorphism of human Ia antigens: gene conversion between two DR beta loci results in a new HLA-D/DR specificity</p>
            </title>
            <aug>
               <au>
                  <snm>Gorski</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mach</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1986</pubdate>
            <volume>322</volume>
            <issue>6074</issue>
            <fpage>67</fpage>
            <lpage>70</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/322067a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">3459965</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Allelic diversity is generated by intraexon sequence exchange at the DRB1 locus of primates</p>
            </title>
            <aug>
               <au>
                  <snm>Gyllensten</snm>
                  <fnm>UB</fnm>
               </au>
               <au>
                  <snm>Sundvall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Erlich</snm>
                  <fnm>HA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1991</pubdate>
            <volume>88</volume>
            <issue>9</issue>
            <fpage>3686</fpage>
            <lpage>3690</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">51517</pubid>
                  <pubid idtype="pmpid" link="fulltext">2023919</pubid>
                  <pubid idtype="doi">10.1073/pnas.88.9.3686</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Substitution-rate variation among sites and the estimation of transition bias</p>
            </title>
            <aug>
               <au>
                  <snm>Wakeley</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1994</pubdate>
            <volume>11</volume>
            <issue>3</issue>
            <fpage>436</fpage>
            <lpage>442</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8015437</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Estimating the transition/transversion ratio from independent pairwise comparisons with an assumed phylogeny</p>
            </title>
            <aug>
               <au>
                  <snm>Purvis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bromham</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1997</pubdate>
            <volume>44</volume>
            <issue>1</issue>
            <fpage>112</fpage>
            <lpage>119</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/PL00006117</pubid>
                  <pubid idtype="pmpid" link="fulltext">9010143</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Estimation of the transition/transversion rate bias and species sampling</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Yoder</snm>
                  <fnm>AD</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1999</pubdate>
            <volume>48</volume>
            <issue>3</issue>
            <fpage>274</fpage>
            <lpage>283</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/PL00006470</pubid>
                  <pubid idtype="pmpid" link="fulltext">10093216</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The expected equilibrium of the CpG dinucleotide in vertebrate genomes under a mutation model</p>
            </title>
            <aug>
               <au>
                  <snm>Sved</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bird</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1990</pubdate>
            <volume>87</volume>
            <issue>12</issue>
            <fpage>4692</fpage>
            <lpage>4696</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">54183</pubid>
                  <pubid idtype="pmpid" link="fulltext">2352943</pubid>
                  <pubid idtype="doi">10.1073/pnas.87.12.4692</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Translocation and gross deletion breakpoints in human inherited disease and cancer I: Nucleotide composition and recombination-associated motifs</p>
            </title>
            <aug>
               <au>
                  <snm>Abeysinghe</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Chuzhanova</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Krawczak</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>DN</fnm>
               </au>
            </aug>
            <source>Hum Mutat</source>
            <pubdate>2003</pubdate>
            <volume>22</volume>
            <issue>3</issue>
            <fpage>229</fpage>
            <lpage>244</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/humu.10254</pubid>
                  <pubid idtype="pmpid" link="fulltext">12938088</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Microdeletions and microinsertions causing human genetic disease: common mechanisms of mutagenesis and the role of local DNA sequence complexity</p>
            </title>
            <aug>
               <au>
                  <snm>Ball</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Stenson</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Abeysinghe</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Krawczak</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>DN</fnm>
               </au>
               <au>
                  <snm>Chuzhanova</snm>
                  <fnm>NA</fnm>
               </au>
            </aug>
            <source>Hum Mutat</source>
            <pubdate>2005</pubdate>
            <volume>26</volume>
            <issue>3</issue>
            <fpage>205</fpage>
            <lpage>213</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/humu.20212</pubid>
                  <pubid idtype="pmpid" link="fulltext">16086312</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Distinct DNA sequence and structure requirements for the two steps of V(D)J recombination signal cleavage</p>
            </title>
            <aug>
               <au>
                  <snm>Ramsden</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>McBlane</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>van Gent</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Gellert</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>1996</pubdate>
            <volume>15</volume>
            <issue>12</issue>
            <fpage>3197</fpage>
            <lpage>3206</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">450263</pubid>
                  <pubid idtype="pmpid">8670820</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>DNA sequence and structure requirements for cleavage of V(D)J recombination signal sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Cuomo</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Mundy</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Oettinger</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>1996</pubdate>
            <volume>16</volume>
            <issue>10</issue>
            <fpage>5683</fpage>
            <lpage>5690</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">231568</pubid>
                  <pubid idtype="pmpid" link="fulltext">8816481</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>DNA sequences at immunoglobulin switch region recombination sites</p>
            </title>
            <aug>
               <au>
                  <snm>Dunnick</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hertz</snm>
                  <fnm>GZ</fnm>
               </au>
               <au>
                  <snm>Scappino</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Gritzmacher</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1993</pubdate>
            <volume>21</volume>
            <issue>3</issue>
            <fpage>365</fpage>
            <lpage>372</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">309126</pubid>
                  <pubid idtype="pmpid" link="fulltext">8441648</pubid>
                  <pubid idtype="doi">10.1093/nar/21.3.365</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Homologous recombination near and far from DNA breaks: alternative roles and contrasting views</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>GR</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>2001</pubdate>
            <volume>35</volume>
            <fpage>243</fpage>
            <lpage>274</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genet.35.102401.090509</pubid>
                  <pubid idtype="pmpid" link="fulltext">11700284</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>BCL2 oncogene translocation is mediated by a chi-like consensus</p>
            </title>
            <aug>
               <au>
                  <snm>Wyatt</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Rudders</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Zelenetz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Delellis</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Krontiris</snm>
                  <fnm>TG</fnm>
               </au>
            </aug>
            <source>J Exp Med</source>
            <pubdate>1992</pubdate>
            <volume>175</volume>
            <issue>6</issue>
            <fpage>1575</fpage>
            <lpage>1588</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2119260</pubid>
                  <pubid idtype="pmpid">1588282</pubid>
                  <pubid idtype="doi">10.1084/jem.175.6.1575</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Sequence variant in the intron 10 of the RET oncogene in a patient with microfollicular thyroid carcinoma with medullar differentiation: implications for newly generated chi-like sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Veljkovic</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dzodic</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Neskovic</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Stanojevic</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Milovanovic</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Opric</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dimitrijevic</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Med Oncol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <issue>4</issue>
            <fpage>319</fpage>
            <lpage>324</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1385/MO:21:4:319</pubid>
                  <pubid idtype="pmpid" link="fulltext">15579915</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>A novel Alu-mediated 61-kb deletion of the von Willebrand factor (VWF) gene whose breakpoints co-locate with putative matrix attachment regions</p>
            </title>
            <aug>
               <au>
                  <snm>Xie</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>DN</fnm>
               </au>
               <au>
                  <snm>Chuzhanova</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Cai</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Blood Cells Mol Dis</source>
            <pubdate>2006</pubdate>
            <volume>36</volume>
            <issue>3</issue>
            <fpage>385</fpage>
            <lpage>391</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.bcmd.2006.03.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">16690331</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Gene deletions causing human genetic disease: mechanisms of mutagenesis and the role of the local DNA sequence environment</p>
            </title>
            <aug>
               <au>
                  <snm>Krawczak</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>DN</fnm>
               </au>
            </aug>
            <source>Hum Genet</source>
            <pubdate>1991</pubdate>
            <volume>86</volume>
            <issue>5</issue>
            <fpage>425</fpage>
            <lpage>441</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00194629</pubid>
                  <pubid idtype="pmpid">2016084</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Meiotic microdeletion breakpoints in the BRCA1 gene are significantly associated with symmetric DNA-sequence elements</p>
            </title>
            <aug>
               <au>
                  <snm>Schmucker</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Krawczak</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1997</pubdate>
            <volume>61</volume>
            <issue>6</issue>
            <fpage>1454</fpage>
            <lpage>1456</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1716079</pubid>
                  <pubid idtype="pmpid" link="fulltext">9399889</pubid>
                  <pubid idtype="doi">10.1086/301631</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>DNA double-strand break repair: From mechanistic understanding to cancer treatment</p>
            </title>
            <aug>
               <au>
                  <snm>Helleday</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>van Gent</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Engelward</snm>
                  <fnm>BP</fnm>
               </au>
            </aug>
            <source>DNA Repair (Amst)</source>
            <pubdate>2007</pubdate>
            <volume>6</volume>
            <issue>7</issue>
            <fpage>923</fpage>
            <lpage>935</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.dnarep.2007.02.006</pubid>
                  <pubid idtype="pmpid" link="fulltext">17363343</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Polymorphism in mouse and human class I H-2 and HLA genes is not the result of random independent point mutations</p>
            </title>
            <aug>
               <au>
                  <snm>Jaulin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Perrin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Abastado</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Dumas</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Papamatheakis</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kourilsky</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Immunogenetics</source>
            <pubdate>1985</pubdate>
            <volume>22</volume>
            <issue>5</issue>
            <fpage>453</fpage>
            <lpage>470</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00418091</pubid>
                  <pubid idtype="pmpid">3934067</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Gene conversion of major histocompatibility complex genes is associated with CpG-rich regions</p>
            </title>
            <aug>
               <au>
                  <snm>Hogstrand</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bohme</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Immunogenetics</source>
            <pubdate>1999</pubdate>
            <volume>49</volume>
            <issue>5</issue>
            <fpage>446</fpage>
            <lpage>455</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s002510050518</pubid>
                  <pubid idtype="pmpid">10199921</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Biased gene conversion: implications for genome and sex evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Marais</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>6</issue>
            <fpage>330</fpage>
            <lpage>338</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(03)00116-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">12801726</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>A new perspective on isochore evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Duret</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Eyre-Walker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Galtier</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2006</pubdate>
            <volume>385</volume>
            <fpage>71</fpage>
            <lpage>74</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gene.2006.04.030</pubid>
                  <pubid idtype="pmpid" link="fulltext">16971063</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Comprehensive analysis of CpG islands in human chromosomes 21 and 22</p>
            </title>
            <aug>
               <au>
                  <snm>Takai</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <issue>6</issue>
            <fpage>3740</fpage>
            <lpage>3745</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">122594</pubid>
                  <pubid idtype="pmpid" link="fulltext">11891299</pubid>
                  <pubid idtype="doi">10.1073/pnas.052410099</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Hyperconserved CpG domains underlie Polycomb-binding sites</p>
            </title>
            <aug>
               <au>
                  <snm>Tanay</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>O'Donnell</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Damelin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bestor</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <issue>13</issue>
            <fpage>5521</fpage>
            <lpage>5526</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1838490</pubid>
                  <pubid idtype="pmpid" link="fulltext">17376869</pubid>
                  <pubid idtype="doi">10.1073/pnas.0609746104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Topoisomerase I involvement in illegitimate recombination in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schiestl</snm>
                  <fnm>RH</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>1996</pubdate>
            <volume>16</volume>
            <issue>4</issue>
            <fpage>1805</fpage>
            <lpage>1812</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">231167</pubid>
                  <pubid idtype="pmpid" link="fulltext">8657156</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>An endonuclease with multiple cutting sites, Endo.SceI, initiates genetic recombination at its cutting site in yeast mitochondria</p>
            </title>
            <aug>
               <au>
                  <snm>Nakagawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Morishima</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>1992</pubdate>
            <volume>11</volume>
            <issue>7</issue>
            <fpage>2707</fpage>
            <lpage>2715</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">556746</pubid>
                  <pubid idtype="pmpid">1628629</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Homing endonuclease structure and function</p>
            </title>
            <aug>
               <au>
                  <snm>Stoddard</snm>
                  <fnm>BL</fnm>
               </au>
            </aug>
            <source>Q Rev Biophys</source>
            <pubdate>2005</pubdate>
            <volume>38</volume>
            <issue>1</issue>
            <fpage>49</fpage>
            <lpage>95</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1017/S0033583505004063</pubid>
                  <pubid idtype="pmpid" link="fulltext">16336743</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Mutation in the CYP21B gene (Ile-172----Asn) causes steroid 21-hydroxylase deficiency</p>
            </title>
            <aug>
               <au>
                  <snm>Amor</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Globerman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>New</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>PC</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1988</pubdate>
            <volume>85</volume>
            <issue>5</issue>
            <fpage>1600</fpage>
            <lpage>1604</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">279821</pubid>
                  <pubid idtype="pmpid" link="fulltext">3257825</pubid>
                  <pubid idtype="doi">10.1073/pnas.85.5.1600</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Generation of DNA double-strand breaks by two independent enzymatic activities in nuclear extracts</p>
            </title>
            <aug>
               <au>
                  <snm>Blanco</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Boan</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Barros</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Castano</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Gomez-Marquez</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>351</volume>
            <issue>5</issue>
            <fpage>995</fpage>
            <lpage>1006</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2005.06.065</pubid>
                  <pubid idtype="pmpid" link="fulltext">16051267</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <issue>22</issue>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308517</pubid>
                  <pubid idtype="pmpid" link="fulltext">7984417</pubid>
                  <pubid idtype="doi">10.1093/nar/22.22.4673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Evolution of protein molecules</p>
            </title>
            <aug>
               <au>
                  <snm>Jukes</snm>
                  <fnm>TH</fnm>
               </au>
               <au>
                  <snm>Cantor</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Mammalian Protein Metabolism</source>
            <publisher>New York , Academic Press</publisher>
            <editor>Munro, H.N</editor>
            <pubdate>1969</pubdate>
            <fpage>21</fpage>
            <lpage>32</lpage>
         </bibl>
         <bibl id="B44">
            <title>
               <p>MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Brief Bioinform</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>2</issue>
            <fpage>150</fpage>
            <lpage>163</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bib/5.2.150</pubid>
                  <pubid idtype="pmpid" link="fulltext">15260895</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>DnaSP, DNA polymorphism analyses by the coalescent and other methods</p>
            </title>
            <aug>
               <au>
                  <snm>Rozas</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sanchez-DelBarrio</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Messeguer</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Rozas</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>18</issue>
            <fpage>2496</fpage>
            <lpage>2497</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg359</pubid>
                  <pubid idtype="pmpid" link="fulltext">14668244</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>SWAAP - A tool for analyzing substitutions and similarity in multiple alignments</p>
            </title>
            <aug>
               <au>
                  <snm>Pride</snm>
                  <fnm>DT</fnm>
               </au>
            </aug>
            <url>http://www.bacteriamuseum.org/SWAAP/SwaapPage.htm</url>
         </bibl>
         <bibl id="B47">
            <title>
               <p>G-quadruplex DNA structures--variations on a theme</p>
            </title>
            <aug>
               <au>
                  <snm>Simonsson</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Biol Chem</source>
            <pubdate>2001</pubdate>
            <volume>382</volume>
            <issue>4</issue>
            <fpage>621</fpage>
            <lpage>628</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1515/BC.2001.073</pubid>
                  <pubid idtype="pmpid">11405224</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Quadruplex DNA: sequence, topology and structure</p>
            </title>
            <aug>
               <au>
                  <snm>Burge</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Parkinson</snm>
                  <fnm>GN</fnm>
               </au>
               <au>
                  <snm>Hazel</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Todd</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Neidle</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>19</issue>
            <fpage>5402</fpage>
            <lpage>5415</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1636468</pubid>
                  <pubid idtype="pmpid" link="fulltext">17012276</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl655</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Gross genomic rearrangements involving deletions in the CFTR gene: characterization of six new events from a large cohort of hitherto unidentified cystic fibrosis chromosomes and meta-analysis of the underlying mechanisms</p>
            </title>
            <aug>
               <au>
                  <snm>Ferec</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Casals</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Chuzhanova</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Macek</snm>
                  <fnm>M</fnm>
                  <suf>Jr.</suf>
               </au>
               <au>
                  <snm>Bienvenu</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Holubova</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>McDevitt</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Castellani</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Farrell</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Sheridan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pantaleo</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Loumi</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Messaoud</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cuppens</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Torricelli</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cutting</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Williamson</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ramos</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Pignatti</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Raguenes</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>DN</fnm>
               </au>
               <au>
                  <snm>Audrezet</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Eur J Hum Genet</source>
            <pubdate>2006</pubdate>
            <volume>14</volume>
            <issue>5</issue>
            <fpage>567</fpage>
            <lpage>576</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.ejhg.5201590</pubid>
                  <pubid idtype="pmpid" link="fulltext">16493442</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>DNA mismatches and GC-rich motifs target transposition by the RAG1/RAG2 transposase</p>
            </title>
            <aug>
               <au>
                  <snm>Tsai</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Chatterji</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schatz</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>21</issue>
            <fpage>6180</fpage>
            <lpage>6190</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">275461</pubid>
                  <pubid idtype="pmpid" link="fulltext">14576304</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg819</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Chromosomal stability and the DNA double-stranded break connection</p>
            </title>
            <aug>
               <au>
                  <snm>van Gent</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Hoeijmakers</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Kanaar</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <issue>3</issue>
            <fpage>196</fpage>
            <lpage>206</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35056049</pubid>
                  <pubid idtype="pmpid" link="fulltext">11256071</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Structural analysis of the chimeric CYP21P/CYP21 gene in steroid 21-hydroxylase deficiency</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Niu</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>CY</fnm>
               </au>
            </aug>
            <source>J Hum Genet</source>
            <pubdate>2002</pubdate>
            <volume>47</volume>
            <issue>10</issue>
            <fpage>517</fpage>
            <lpage>522</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s100380200077</pubid>
                  <pubid idtype="pmpid" link="fulltext">12376740</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>A common sequence motif near nonhomologous recombination breakpoints involving Ig sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Chou</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Morrison</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>J Immunol</source>
            <pubdate>1993</pubdate>
            <volume>150</volume>
            <issue>12</issue>
            <fpage>5350</fpage>
            <lpage>5360</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8515063</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>A 1.1-kb duplication in the p67-phox gene causes chronic granulomatous disease</p>
            </title>
            <aug>
               <au>
                  <snm>Borgato</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bonizzato</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lunardi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dusi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Andrioli</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Scarperi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Corrocher</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Hum Genet</source>
            <pubdate>2001</pubdate>
            <volume>108</volume>
            <issue>6</issue>
            <fpage>504</fpage>
            <lpage>510</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004390100526</pubid>
                  <pubid idtype="pmpid" link="fulltext">11499676</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>The human minisatellite consensus at breakpoints of oncogene translocations</p>
            </title>
            <aug>
               <au>
                  <snm>Krowczynska</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Rudders</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Krontiris</snm>
                  <fnm>TG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1990</pubdate>
            <volume>18</volume>
            <issue>5</issue>
            <fpage>1121</fpage>
            <lpage>1127</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">330424</pubid>
                  <pubid idtype="pmpid" link="fulltext">1969618</pubid>
                  <pubid idtype="doi">10.1093/nar/18.5.1121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Association of crossover points with topoisomerase I cleavage sites: a model for nonhomologous recombination</p>
            </title>
            <aug>
               <au>
                  <snm>Bullock</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Champoux</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Botchan</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1985</pubdate>
            <volume>230</volume>
            <issue>4728</issue>
            <fpage>954</fpage>
            <lpage>958</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.2997924</pubid>
                  <pubid idtype="pmpid" link="fulltext">2997924</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>A DNA polymerase alpha pause site is a hot spot for nucleotide misinsertion</p>
            </title>
            <aug>
               <au>
                  <snm>Fry</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Loeb</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1992</pubdate>
            <volume>89</volume>
            <issue>2</issue>
            <fpage>763</fpage>
            <lpage>767</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">48319</pubid>
                  <pubid idtype="pmpid" link="fulltext">1731352</pubid>
                  <pubid idtype="doi">10.1073/pnas.89.2.763</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>The mutational specificity of DNA polymerase-beta during in vitro DNA synthesis. Production of frameshift, base substitution, and deletion mutations</p>
            </title>
            <aug>
               <au>
                  <snm>Kunkel</snm>
                  <fnm>TA</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1985</pubdate>
            <volume>260</volume>
            <issue>9</issue>
            <fpage>5787</fpage>
            <lpage>5796</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3988773</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>The mutational specificity of DNA polymerases-alpha and -gamma during in vitro DNA synthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Kunkel</snm>
                  <fnm>TA</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1985</pubdate>
            <volume>260</volume>
            <issue>23</issue>
            <fpage>12866</fpage>
            <lpage>12874</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3930505</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Drosophila topoisomerase II double-strand DNA cleavage: analysis of DNA sequence homology at the cleavage site</p>
            </title>
            <aug>
               <au>
                  <snm>Sander</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hsieh</snm>
                  <fnm>TS</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1985</pubdate>
            <volume>13</volume>
            <issue>4</issue>
            <fpage>1057</fpage>
            <lpage>1072</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">341056</pubid>
                  <pubid idtype="pmpid" link="fulltext">2987816</pubid>
                  <pubid idtype="doi">10.1093/nar/13.4.1057</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Intrinsic intermolecular DNA ligation activity of eukaryotic topoisomerase II. Potential roles in recombination</p>
            </title>
            <aug>
               <au>
                  <snm>Gale</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Osheroff</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1992</pubdate>
            <volume>267</volume>
            <issue>17</issue>
            <fpage>12090</fpage>
            <lpage>12097</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">1318309</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Hypervariable 'minisatellite' regions in human DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Jeffreys</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Thein</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1985</pubdate>
            <volume>314</volume>
            <issue>6006</issue>
            <fpage>67</fpage>
            <lpage>73</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/314067a0</pubid>
                  <pubid idtype="pmpid">3856104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>A fine-scale map of recombination rates and hotspots across the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Myers</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bottolo</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Freeman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>McVean</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>310</volume>
            <issue>5746</issue>
            <fpage>321</fpage>
            <lpage>324</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1117196</pubid>
                  <pubid idtype="pmpid" link="fulltext">16224025</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Meta-analysis of indels causing human genetic disease: mechanisms of mutagenesis and the role of local DNA sequence complexity</p>
            </title>
            <aug>
               <au>
                  <snm>Chuzhanova</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Anassis</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Krawczak</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>DN</fnm>
               </au>
            </aug>
            <source>Hum Mutat</source>
            <pubdate>2003</pubdate>
            <volume>21</volume>
            <issue>1</issue>
            <fpage>28</fpage>
            <lpage>44</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/humu.10146</pubid>
                  <pubid idtype="pmpid" link="fulltext">12497629</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
