<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-11-284</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Copy number variation in the bovine genome</p>
         </title>
         <aug>
            <au id="A1"><snm>Fadista</snm><fnm>Jo&#227;o</fnm><insr iid="I1"/><email>joao.fadista@agrsci.dk</email></au>
            <au id="A2"><snm>Thomsen</snm><fnm>Bo</fnm><insr iid="I1"/><email>bo.thomsen@agrsci.dk</email></au>
            <au id="A3"><snm>Holm</snm><fnm>Lars-Erik</fnm><insr iid="I1"/><email>larserik.holm@agrsci.dk</email></au>
            <au ca="yes" id="A4"><snm>Bendixen</snm><fnm>Christian</fnm><insr iid="I1"/><email>christian.bendixen@agrsci.dk</email></au>
         </aug>
         <insg>
            <ins id="I1"><p>Group of Molecular Genetics and Systems Biology, Department of Genetics and Biotechnology, Faculty of Agricultural Sciences, Aarhus University, Blichers All&#233; 20, DK-8830 Tjele, Denmark</p></ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2010</pubdate>
         <volume>11</volume>
         <issue>1</issue>
         <fpage>284</fpage>
         <url>http://www.biomedcentral.com/1471-2164/11/284</url>
         <xrefbib><pubidlist><pubid idtype="pmpid">20459598</pubid><pubid idtype="doi">10.1186/1471-2164-11-284</pubid></pubidlist></xrefbib>
      </bibl>
      <history><rec><date><day>11</day><month>12</month><year>2009</year></date></rec><acc><date><day>6</day><month>5</month><year>2010</year></date></acc><pub><date><day>6</day><month>5</month><year>2010</year></date></pub></history>
      <cpyrt><year>2010</year><collab>Fadista et al; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (&lt;url&gt;http://creativecommons.org/licenses/by/2.0&lt;/url&gt;), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Copy number variations (CNVs), which represent a significant source of genetic diversity in mammals, have been shown to be associated with phenotypes of clinical relevance and to be causative of disease. Notwithstanding, little is known about the extent to which CNV contributes to genetic variation in cattle.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We designed and used a set of NimbleGen CGH arrays that tile across the assayable portion of the cattle genome with approximately 6.3 million probes, at a median probe spacing of 301 bp. This study reports the highest resolution map of copy number variation in the cattle genome, with 304 CNV regions (CNVRs) being identified among the genomes of 20 bovine samples from 4 dairy and beef breeds. The CNVRs identified covered 0.68% (22 Mb) of the genome, and ranged in size from 1.7 to 2,031 kb (median size 16.7 kb). About 20% of the CNVs co-localized with segmental duplications, while 30% encompass genes, of which the majority is involved in environmental response. About 10% of the human orthologous of these genes are associated with human disease susceptibility and, hence, may have important phenotypic consequences.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>Together, this analysis provides a useful resource for assessment of the impact of CNVs regarding variation in bovine health and production traits.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Cattle, part of the Cetartiodactyl order of eutherian mammals <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, is an important source of human nutrition worldwide as well as the most studied ruminant model of metabolism, reproduction, and disease <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Following the milestone publication of the cattle genome assembly along with annotation of functional elements and variation <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>, we are now enabled to search for genomic regions that impact the genetic variation of important phenotypic traits.</p>
         <p>Genomic structural variation, including insertions, duplications, deletions, inversions and translocations of DNA, has long been known to be present in animal genomes <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp> but had predominantly been assumed to be to rare events and often associated with disease. This notion changed in 2004 when two groups of researchers published the first genome-wide maps of copy number variation in seemingly healthy individuals <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. Copy number variant (CNV) is described as a segment of DNA &gt; = 1 kb that is copy number variable when compared with a reference genome <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Before these landmark studies, it was thought that SNPs were the major source of genetic variation between individuals <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> but genomic structural genetic variation is now known to cover more base pairs <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>, and to have a higher per-locus mutation rate than SNPs do <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
         <p>There are indications that CNVs appear throughout the genome not only in humans, but also in other primates <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>, rodents <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>, flies <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>, dogs <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, chickens <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> and cattle <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Nevertheless, other than humans and mice <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>, little is known about how CNVs contribute to normal phenotypic variation and disease susceptibility. Up until now, relatively few studies have confirmed the presence of CNVs in cattle <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>, of which only one study focused on genome-wide detection of CNVs <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, but at low resolution using version 3 of bovine genome assembly <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>.</p>
         <p>Here we report the use of high-resolution oligonucleotide array comparative genomic hybridization (array CGH) to identify 304 CNV regions in 20 animals (14 Holsteins, 2 Red Danish, 3 Simmental and 1 Hereford). With an average probe spacing of 420 bp relative to the latest bovine genome assembly (BT4, 2007) <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, this analysis provides the highest-resolution map of copy number variation in the cattle genome to date.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Experiment design</p>
            </st>
            <p>The goal of our study was to characterize levels and patterns of copy number variation among bovine animals. Therefore, to assess the bovine CNV landscape, the genomic DNA of 20 bovine samples from two dairy (14 Holsteins, 2 Red Danish) and two beef breeds (3 Simmental, 1 Hereford) were analyzed. Assessment of copy number variation between samples was done using a set of Nimblegen HD2 CGH arrays that tile across the genome with approximately 6.3 million unique oligo probes with a mean probe spacing of 420 bp, using the latest genome assembly (BT4) <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>.</p>
            <p>We opted for a dye-swap loop array design, rather than a common reference design, so that each sample was hybridized to two different samples in two different dye orientations. Dye swap is used to compensate for dye bias, while the loop design (known to be more efficient than the reference design <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>) is applied to help assign the CNV gain and loss status more accurately for each sample based on the number of samples with the CNV.</p>
         </sec>
         <sec>
            <st>
               <p>Array CGH evaluation</p>
            </st>
            <p>For evaluation of our array CGH platform, four sex-mismatched arrays and one self-self hybridization (all in dye swaps) were used to assess the false positive rate (see Methods). Probes were interpreted as revealing a copy-number difference if the standard error of the log-intensity ratio was beyond an intensity-ratio threshold. The adequacy of this threshold in detecting copy-number differences was confirmed by conducting sex-mismatched hybridizations, comparing the number of X-linked probes beyond the threshold. From the 88.52 Mb length of chromosome X, 3.21 Mb were greater than the threshold, yielding an estimate of 3.62% for the rate of false positives (FP). The false positive rate (FPR) is conservatively overestimated due to: (1) the assumption that there are no CNVs in the chromosome X of sex-mismatched arrays; (2) calling for FP was done at individual arrays rather than if they were detected in both dye swaps; (3) and because the self-self hybridization array yielded much lower FPR at individual array calling (0.0085%), and zero FPR when calling CNVs detected in both dye swap.</p>
         </sec>
         <sec>
            <st>
               <p>Pattern and frequency of CNV regions</p>
            </st>
            <p>Since copy-number changes are relative for array CGH data, unambiguous ascertainment of the ancestral state of a CNV and (subsequent) identification of duplications and deletions is challenging. We have therefore chosen a design where a dye swap is coupled with a loop design, with each animal sample hybridized with two other animal samples, enabling us to distinguish between a deletion and duplication as well as the animal origin of the CNV (Figure <figr fid="F1">1</figr>). Since identical CNVs, when called in different animals, might be assigned different boundaries due to technical and/or biological sources of variability, overlapping CNVs were handled as a whole and named copy number variable regions (CNVRs) <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B45">45</abbr></abbrgrp>.</p>
            <fig id="F1"><title><p>Figure 1</p></title><caption><p>Example of an identification procedure for CNV gain or loss status</p></caption><text>
   <p><b>Example of an identification procedure for CNV gain or loss status</b>. Y axis represents log2ratios and X axis represents genomic positions along chromosome 7. (A) animals <it>2 vs. 19</it>; (B) <it>19 vs. 6 </it>(gain in 19 or loss in 6); (C) <it>6 vs. 20 </it>(loss in 6 or gain in 20) and (D) <it>20 vs. 17</it>. The only plots that show a CNV are B and C, and since the only animal common to those hybridizations is animal 6, we classify this CNV as a deletion in animal 6.</p>
</text><graphic file="1471-2164-11-284-1"/></fig>
            <p>After applying a stringent CNV calling pipeline with a theoretical 1.5 kb resolution for CNV detection (Figure <figr fid="F2">2</figr> and Methods), 304 putative CNVRs were identified, averaging 47 CNVs per animal (Additional file <supplr sid="S1">1</supplr>, Table S1), with 70% (212) of the CNVRs observed in more than one animal. Although CNVRs detected in more than one animal of different families and/or breeds could be defined as frequent, the relationship between some of the animals precludes such classification (Additional file <supplr sid="S2">2</supplr>, Figure S1). The relatively poor breakpoint estimation also prevents information regarding whether these CNVRs are identical-by-descent (arisen before the divergence of these cattle breeds), or separate events occurred independently in different breeds (in putative structurally fragile genomic regions).</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>Excel file includes Table S1</b>. CNV regions detected and the number of CNVs per sample.</p>
               </text>
               <file name="1471-2164-11-284-S1.XLS">
   <p>Click here for file</p>
</file>
            </suppl>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>Powerpoint file includes Figure S1</b>. Pedigree for the related animals. The numbers correspond to the sample ID, described in Table S1.</p>
               </text>
               <file name="1471-2164-11-284-S2.PPT">
   <p>Click here for file</p>
</file>
            </suppl>
            <fig id="F2"><title><p>Figure 2</p></title><caption><p>CNV calling pipeline (details in Methods section)</p></caption><text>
   <p><b>CNV calling pipeline (details in Methods section)</b>. Data analysis procedure to discover copy number variations in array CGH data.</p>
</text><graphic file="1471-2164-11-284-2"/></fig>
            <p>CNVRs were detected on all chromosomes, but were distributed throughout the genome in a non-random manner (Figure <figr fid="F3">3</figr>) with little correlation between CNVRs occurrence and chromosome length (Additional file <supplr sid="S3">3</supplr>, Figure S2). This is coherent with previous studies on heterogeneous distribution of CNVs in primates <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B21">21</abbr></abbrgrp>, but the low number of samples used in this study prevents us from drawing any conclusions regarding putative genomic CNV hotspots.</p>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p><b>Excel file includes Figure S2</b>. Correlation between chromosome length and number of CNVRs.</p>
               </text>
               <file name="1471-2164-11-284-S3.XLS">
   <p>Click here for file</p>
</file>
            </suppl>
            <fig id="F3"><title><p>Figure 3</p></title><caption><p>Bovine karyotype, with CNVR distribution</p></caption><text>
   <p><b>Bovine karyotype, with CNVR distribution</b>. Green arrows correspond to gains and red arrows correspond to losses.</p>
</text><graphic file="1471-2164-11-284-3"/></fig>
            <p>Out of the total number of CNVRs detected, 49.7% (151) partially overlap gaps in the assembly (BT4), which indicates that the CNVRs have a high probability of being linked with gaps within the reference cattle genome assembly (permutation test, p-value &lt; 0.001). This stresses the need for unraveling of these genomic regions of high structural complexity. The CNVRs detected vary in size from 1.7 kb to 2 Mb with a median size of 16.7 kb, and encompass approximately 23 Mb or 0.68% of the bovine genome (Table <tblr tid="T1">1</tblr> and Figure <figr fid="F4">4</figr>). The biggest region showing copy loss is 2.03 Mb on chromosome 13 in animal 16, while the biggest region showing copy number gain was detected in animals 6 and 7 showing a 417 kb amplification without overlapping any gene nor SD (Additional file <supplr sid="S1">1</supplr>, Table S1).</p>
            <tbl id="T1"><title><p>Table 1</p></title><caption><p>Characteristics of the CNV regions, with sizes in base pairs (bp).</p></caption><tblbdy cols="8">
      <r>
         <c ca="right">
            <p>
               <b>Type</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>CNVRs</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Mean size</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Median size</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Size range</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>CNVR Content</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Sequence covered</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>% CNVR</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="8">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="right">
            <p>Loss</p>
         </c>
         <c ca="right">
            <p>202</p>
         </c>
         <c ca="right">
            <p>77,420</p>
         </c>
         <c ca="right">
            <p>16,543</p>
         </c>
         <c ca="right">
            <p>1,716 - 2,031,343</p>
         </c>
         <c ca="right">
            <p>15,638,770</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="right">
            <p>0.482</p>
         </c>
      </r>
      <r>
         <c ca="right">
            <p>Gain</p>
         </c>
         <c ca="right">
            <p>102</p>
         </c>
         <c ca="right">
            <p>62,147</p>
         </c>
         <c ca="right">
            <p>19,580</p>
         </c>
         <c ca="right">
            <p>1,737 - 416,858</p>
         </c>
         <c ca="right">
            <p>6,339,041</p>
         </c>
         <c ca="right">
            <p>3,247,516,410</p>
         </c>
         <c ca="right">
            <p>0.195</p>
         </c>
      </r>
      <r>
         <c ca="right">
            <p>All</p>
         </c>
         <c ca="right">
            <p>304</p>
         </c>
         <c ca="right">
            <p>72,295</p>
         </c>
         <c ca="right">
            <p>16,678</p>
         </c>
         <c ca="right">
            <p>1,716 - 2,031,343</p>
         </c>
         <c ca="right">
            <p>21,977,811</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="right">
            <p>0.677</p>
         </c>
      </r>
   </tblbdy></tbl>
            <fig id="F4"><title><p>Figure 4</p></title><caption><p>Size range distribution of the CNVRs detected</p></caption><text>
   <p><b>Size range distribution of the CNVRs detected</b>.</p>
</text><graphic file="1471-2164-11-284-4"/></fig>
            <p>When comparing the size distribution between the 202 losses and the 102 gains, no significant difference was found (Wilcoxon rank sum test, p-value &gt; 0.05), although we detected significantly more losses than gains (exact binomial test, p-value = 1.01e-08). This bias in detecting more deletions could be due to both biological and technical reasons. One of the main mechanisms responsible for the CNV formation, non-allelic homologous recombination (NAHR), has been shown to generate more deletions than duplications <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. As also noted by others <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B31">31</abbr><abbr bid="B47">47</abbr></abbrgrp>, a technical bias favoring the detection of deletions may be responsible since our CNV detection pipeline have more power to detect a loss (log2(1/2) = -1) than a gain (log2(3/2) = 0.58). With 69% of the CNVRs described within 50 kb in size (figure <figr fid="F4">4</figr>), it should be noted that a significant proportion of the CNVs are near our effective resolution of 1.5 kb. This indicates that the experimental detection of 304 CNVRs may greatly underestimate the actual number of CNVs in the cattle genome, and that a substantial proportion of CNVs could be smaller than 1.5 kb in size. CNVs &gt;2 Mb in size were not detected, which may be a consequence of the number and size of sequence gaps in the current outline of the cattle genome sequence assembly (75 654 gaps spanning 5.8% of the assembly).</p>
            <p>When assessing hybridization signals in the unassembled chromosome (ChrUn), it was verified that only the male vs. female hybridizations were detecting CNVs in some regions. Although the number of females in this study is small (n = 2), the findings suggest that these regions may be from the bovine chr Y (Additional file <supplr sid="S11">11</supplr>, Table S9). It is known that the <it>Bos taurus </it>genome assembly was not only composed by a female animal, but also had a BAC library sequenced from a male animal, from which the corresponding Y chromosome scaffolds were unlabeled and placed in the ChrUn <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. Consequently, this study highlights regions for future genome assembly improvements.</p>
            <suppl id="S11">
               <title>
                  <p>Additional file 11</p>
               </title>
               <text>
                  <p><b>Excel file includes Table S9</b>. Sequences in ChrUn that putatively belong to chromosome Y.</p>
               </text>
               <file name="1471-2164-11-284-S11.XLS">
   <p>Click here for file</p>
</file>
            </suppl>
            <p>In accordance with analyses conducted in humans <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>, we detected that the GC content of the CNVRs (43.6%) are slightly larger than of the whole genome (41.8%), which supports the notion that CNVs arise more often in GC rich regions.</p>
         </sec>
         <sec>
            <st>
               <p>SDs are associated with CNVs</p>
            </st>
            <p>Segmental duplications (SDs), defined as regions of length &gt; = 1 kb with at least 90% sequence identity <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, are important elements in the formation of CNVs via non-allelic homologous recombination (NAHR) throughout the mammalian lineage <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>. To test whether the non-random association between CNVs and SDs was preserved in our high-resolution data, the overlap of CNVs with segmental duplications was determined. Segmental duplications were overlapped by 20% (61) of the CNVRs, which implies that CNVs are enriched near segmental duplication (permutation test, p-value &lt; 0.001). It should be noted that the enrichment is increased when testing only the CNVRs bigger than 20 kb, with segmental duplications overlapping 47% of those CNVRs. This is also consistent with previous CNV studies reporting a stronger association between segmental duplications and long CNVRs <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B45">45</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Functional analysis</p>
            </st>
            <p>Nearly 30% (90) of the CNVRs encompassed 348 full-length genes as annotated in Ensembl <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> (Additional file <supplr sid="S4">4</supplr>, Table S2), but contrary to segmental duplications, the enrichment of CNVRs in genic areas is not significant. This indicates that the gene content of the CNVRs does not significantly differ from the whole bovine genome. The fact that none of the 481 ultraconserved elements <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, nor the 611 new long conserved noncoding sequences in vertebrates <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, were found in the CNV regions (whereas six of them would be expected by chance, permutation p-value &lt; 0.001), further supports the notion that CNVs are significantly depleted in highly conserved functional elements.</p>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p><b>Excel file includes Table S2</b>. CNVRs and ensembl genes within CNVRs.</p>
               </text>
               <file name="1471-2164-11-284-S4.XLS">
   <p>Click here for file</p>
</file>
            </suppl>
            <p>In order to determine the likely biological effects of the 348 copy number variant genes, a gene ontology (GO) analysis was performed with the EasyGO tool <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. Genes that were not completely included within the CNVRs were excluded from the GO analysis, since the breakpoint definition of CNVRs can be equivocal <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>. Table <tblr tid="T2">2</tblr> shows that genes involved in environmental response are over-represented in the bovine CNVRs, as also seen in other studies of mammalian genomes <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B21">21</abbr><abbr bid="B25">25</abbr><abbr bid="B33">33</abbr></abbrgrp>.</p>
            <tbl id="T2"><title><p>Table 2</p></title><caption><p>Enriched GO terms associated with the CNV regions (FDR p-value &#8804; 0.01).</p></caption><tblbdy cols="3">
      <r>
         <c ca="center" cspan="3">
            <p>
               <b>Cellular_Component</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO term</p>
         </c>
         <c ca="left">
            <p>GO name</p>
         </c>
         <c ca="left">
            <p>p-value</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0031224</p>
         </c>
         <c ca="left">
            <p>intrinsic to membrane</p>
         </c>
         <c ca="left">
            <p>3.08e-10</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0016021</p>
         </c>
         <c ca="left">
            <p>integral to membrane</p>
         </c>
         <c ca="left">
            <p>3.08e-10</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0016020</p>
         </c>
         <c ca="left">
            <p>membrane</p>
         </c>
         <c ca="left">
            <p>2.73e-10</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0044425</p>
         </c>
         <c ca="left">
            <p>membrane part</p>
         </c>
         <c ca="left">
            <p>2.16e-10</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center" cspan="3">
            <p>
               <b>Molecular_Function</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO term</p>
         </c>
         <c ca="left">
            <p>GO name</p>
         </c>
         <c ca="left">
            <p>p-value</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0060089</p>
         </c>
         <c ca="left">
            <p>molecular transducer activity</p>
         </c>
         <c ca="left">
            <p>1.36e-29</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0004867</p>
         </c>
         <c ca="left">
            <p>serine-type endopeptidase inhibitor activity</p>
         </c>
         <c ca="left">
            <p>0.00448</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0004984</p>
         </c>
         <c ca="left">
            <p>olfactory receptor activity</p>
         </c>
         <c ca="left">
            <p>1.6e-26</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0004872</p>
         </c>
         <c ca="left">
            <p>receptor activity</p>
         </c>
         <c ca="left">
            <p>8.73e-31</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0004871</p>
         </c>
         <c ca="left">
            <p>signal transducer activity</p>
         </c>
         <c ca="left">
            <p>1.36e-29</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0004930</p>
         </c>
         <c ca="left">
            <p>G-protein coupled receptor activity</p>
         </c>
         <c ca="left">
            <p>1.6e-26</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0005044</p>
         </c>
         <c ca="left">
            <p>scavenger receptor activity</p>
         </c>
         <c ca="left">
            <p>1.53e-06</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0004888</p>
         </c>
         <c ca="left">
            <p>transmembrane receptor activity</p>
         </c>
         <c ca="left">
            <p>9.94e-32</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center" cspan="3">
            <p>
               <b>Biological Process</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO term</p>
         </c>
         <c ca="left">
            <p>GO name</p>
         </c>
         <c ca="left">
            <p>p-value</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0007165</p>
         </c>
         <c ca="left">
            <p>signal transduction</p>
         </c>
         <c ca="left">
            <p>7.62e-16</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0007166</p>
         </c>
         <c ca="left">
            <p>cell surface receptor linked signal transduction</p>
         </c>
         <c ca="left">
            <p>3.74e-21</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0050789</p>
         </c>
         <c ca="left">
            <p>regulation of biological process</p>
         </c>
         <c ca="left">
            <p>3.84e-10</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0065007</p>
         </c>
         <c ca="left">
            <p>biological regulation</p>
         </c>
         <c ca="left">
            <p>9.61e-10</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0007186</p>
         </c>
         <c ca="left">
            <p>G-protein coupled receptor protein signaling pathway</p>
         </c>
         <c ca="left">
            <p>1.13e-21</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0050794</p>
         </c>
         <c ca="left">
            <p>regulation of cellular process</p>
         </c>
         <c ca="left">
            <p>5.01e-10</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0006952</p>
         </c>
         <c ca="left">
            <p>defense response</p>
         </c>
         <c ca="left">
            <p>0.0107</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GO:0007154</p>
         </c>
         <c ca="left">
            <p>cell communication</p>
         </c>
         <c ca="left">
            <p>1.82e-15</p>
         </c>
      </r>
   </tblbdy></tbl>
            <p>The following step was to test if genes unaffected by CNVs exhibited a different selective constraint than the ones affected. To test this, the dN/dS ratios for orthologous genes between the cow and human species were compared (Table <tblr tid="T3">3</tblr> and Additional file <supplr sid="S5">5</supplr>, Table S3). Knowing that dN is the number of nucleotide differences per non synonymous site and dS the number of nucleotide differences per synonymous site, dN/dS &lt; 1 suggests that amino acid change is selectively constrained (purifying selection), while dN/dS &#8805; 1 suggests a relaxation of that same selection (positive selection). It was determined that both deleted and duplicated genes have dN/dS ratios significantly higher than those for non-polymorphic genes. This result, as for the over represented set of 'environmental response' genes, might indicate a relaxation of constrains due to the redundancy expected from the variable number of gene copies <abbrgrp><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>.</p>
            <tbl id="T3"><title><p>Table 3</p></title><caption><p>Evolutionary rates for monomorphic and CNV genes.</p></caption><tblbdy cols="4">
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <b>Duplication CNV</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Deletion CNV</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>No CNV</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>dN/dS</b>
            </p>
         </c>
         <c ca="center">
            <p>0.339</p>
         </c>
         <c ca="center">
            <p>0.392</p>
         </c>
         <c ca="center">
            <p>0.165</p>
         </c>
      </r>
      <r>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>Wilcoxon rank-sum test, P value</b>
            </p>
         </c>
         <c ca="center">
            <p>1.39E-06</p>
         </c>
         <c ca="center">
            <p>&lt; 2.2e-16</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>dN - nonsynonymous rate; dS - synonymous rate; p-value compares CNV genes with monomorphic genes.</p>
   </tblfn></tbl>
            <suppl id="S5">
               <title>
                  <p>Additional file 5</p>
               </title>
               <text>
                  <p><b>Excel file includes Table S3</b>. Cow-human ortholog genes with the respective dN and dS values (data from Ensembl database <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>).</p>
               </text>
               <file name="1471-2164-11-284-S5.XLS">
   <p>Click here for file</p>
</file>
            </suppl>
            <p>When examining the human orthologs for the cattle genes <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> affected by CNV, we studied 167 human ortholog genes of which 84 overlap with the genomic coordinates of previously reported human structural variation, as seen in the Database of Genomic Variants (DGV) <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Since it is unlikely that CNVs in the human-cow common ancestor would have been conserved, the overlapping CNVs most probably reflect the existence of orthologous genomic regions of structural instability that are prone to recurrently generate polymorphisms in both species. However, this may also indicate that the CNVs annotated in DGV, being derived using different technological and analytical platforms, have a large variance in CNV resolution which may overestimate CNV sizes <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Consequently, even if the annotated CNVs represent true structural variation it is difficult to estimate the actual boundaries of the CNV and subsequently the overlap of DGV CNVs with the CNVs identified here.</p>
         </sec>
         <sec>
            <st>
               <p>CNV affecting genes associated with disease</p>
            </st>
            <p>Querying for copy number variant genes that had an orthologous human gene with OMIM morbid ID reference <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>, revealed that 19 of these genes have been associated in human disease (susceptibility to sarcoidoisis and Alzheimer's disease, myopathy, encephalopathy, ataxia, etc - Additional file <supplr sid="S6">6</supplr>, Table S4). Likewise, when probing orthologous human genes involved in genome-wide association studies (GWAS) <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>, 12 genes associated with human disease traits were found (Additional file <supplr sid="S7">7</supplr>, Table S5). We also queried the Animal QTL database <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> that holds publicly available QTL data on livestock species. Retrieving all the bovine QTLs within 2 Mb of our CNVRs resulted in 110 QTLs, which can hold putative valuable information for some important traits of interest (Additional file <supplr sid="S8">8</supplr>, Table S6). The database of Online Mendelian Inheritance in Animals (OMIA) <abbrgrp><abbr bid="B65">65</abbr></abbrgrp> was also queried, and 21 cow phenotypes within 2 Mb of CNVRs were retrieved (Additional file <supplr sid="S9">9</supplr>, Table S7).</p>
            <suppl id="S6">
               <title>
                  <p>Additional file 6</p>
               </title>
               <text>
                  <p><b>Text file includes Table S4</b>. Genes within CNV regions for which cow-human orthology exist and have a OMIM morbid ID <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>.</p>
               </text>
               <file name="1471-2164-11-284-S6.TXT">
   <p>Click here for file</p>
</file>
            </suppl>
            <suppl id="S7">
               <title>
                  <p>Additional file 7</p>
               </title>
               <text>
                  <p><b>Excel file includes Table S5</b>. Genes within CNV regions for which cow-human orthology exist and have been associated with a phenotype by a GWAS study.</p>
               </text>
               <file name="1471-2164-11-284-S7.XLS">
   <p>Click here for file</p>
</file>
            </suppl>
            <suppl id="S8">
               <title>
                  <p>Additional file 8</p>
               </title>
               <text>
                  <p><b>Excel file includes Table S6</b>. All CNVRs and QTLs within 2 Mb of a CNVR.</p>
               </text>
               <file name="1471-2164-11-284-S8.XLS">
   <p>Click here for file</p>
</file>
            </suppl>
            <suppl id="S9">
               <title>
                  <p>Additional file 9</p>
               </title>
               <text>
                  <p><b>Excel file includes Table S7</b>. All CNVRs and OMIA genes within 2 Mb of a CNVR.</p>
               </text>
               <file name="1471-2164-11-284-S9.XLS">
   <p>Click here for file</p>
</file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Comparison with other mammalian CNV studies</p>
            </st>
            <p>Next, we compared the number of CNVRs detected here with CNVRs from other studies (Table <tblr tid="T4">4</tblr>). To minimize technical CNV detection biases we: (1) used data only from the same platform (when available), (2) used data from the highest resolution genome-wide survey published on each species queried and (3) required that the study was non biased to any particular genomic region. The main finding of this comparison is that an increased resolution of the array platform increases the number of detected CNVs. This supports our finding that the bulk of CNVs in mammalian genomes are small events, implying that the characterization of the mammalian CNV landscape is far from complete. A comparison with other non-human CNV studies shows that the number of CNVRs/sample does not follow the same trend. This is expected since we have employed a number of related animals as well as the overall genetic variation in cattle is known to be reduced relative to mouse and man. Another contributing factor might be that our stringent CNV calling criteria hampers the detection of putative true CNVs. Concerning the platform used to assess CNVs in humans <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>; with a resolution to find CNVs 3 times bigger than ours, a similar difference when detecting CNVR/sample would be expected. This is not the case because CNV counts are known to be inversely proportional to their size, as seen here and elsewhere (DGV <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>).</p>
            <tbl id="T4"><title><p>Table 4</p></title><caption><p>Comparison between this and other mammalian CNV studies using array CGH.</p></caption><tblbdy cols="9">
      <r>
         <c ca="right">
            <p>
               <b>Species</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Samples</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>CNVRs</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>CNVR/sample</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Mean size</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Platform</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Mean spacing</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Resolution</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Ref</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="9">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="right">
            <p>Cow</p>
         </c>
         <c ca="right">
            <p>20</p>
         </c>
         <c ca="right">
            <p>304</p>
         </c>
         <c ca="right">
            <p>15</p>
         </c>
         <c ca="right">
            <p>72</p>
         </c>
         <c ca="right">
            <p>3*2.1 M</p>
         </c>
         <c ca="right">
            <p>0.4</p>
         </c>
         <c ca="right">
            <p>2</p>
         </c>
         <c ca="right">
            <p>This</p>
         </c>
      </r>
      <r>
         <c ca="right">
            <p>Cow</p>
         </c>
         <c ca="right">
            <p>5</p>
         </c>
         <c ca="right">
            <p>25</p>
         </c>
         <c ca="right">
            <p>5</p>
         </c>
         <c ca="right">
            <p>128</p>
         </c>
         <c ca="right">
            <p>385 k</p>
         </c>
         <c ca="right">
            <p>6</p>
         </c>
         <c ca="right">
            <p>28.8</p>
         </c>
         <c ca="right">
            <p>
               <abbrgrp>
                  <abbr bid="B35">35</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="right">
            <p>Dog</p>
         </c>
         <c ca="right">
            <p>9</p>
         </c>
         <c ca="right">
            <p>60</p>
         </c>
         <c ca="right">
            <p>7</p>
         </c>
         <c ca="right">
            <p>310</p>
         </c>
         <c ca="right">
            <p>385 k</p>
         </c>
         <c ca="right">
            <p>4.7</p>
         </c>
         <c ca="right">
            <p>23.5</p>
         </c>
         <c ca="right">
            <p>
               <abbrgrp>
                  <abbr bid="B33">33</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="right">
            <p>Mouse</p>
         </c>
         <c ca="right">
            <p>20</p>
         </c>
         <c ca="right">
            <p>1333</p>
         </c>
         <c ca="right">
            <p>66</p>
         </c>
         <c ca="right">
            <p>64</p>
         </c>
         <c ca="right">
            <p>2.1 M</p>
         </c>
         <c ca="right">
            <p>1</p>
         </c>
         <c ca="right">
            <p>5</p>
         </c>
         <c ca="right">
            <p>
               <abbrgrp>
                  <abbr bid="B45">45</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="right">
            <p>Rat</p>
         </c>
         <c ca="right">
            <p>3</p>
         </c>
         <c ca="right">
            <p>33</p>
         </c>
         <c ca="right">
            <p>11</p>
         </c>
         <c ca="right">
            <p>256</p>
         </c>
         <c ca="right">
            <p>385 k</p>
         </c>
         <c ca="right">
            <p>5</p>
         </c>
         <c ca="right">
            <p>25</p>
         </c>
         <c ca="right">
            <p>
               <abbrgrp>
                  <abbr bid="B30">30</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="right">
            <p>Macaque</p>
         </c>
         <c ca="right">
            <p>10</p>
         </c>
         <c ca="right">
            <p>123</p>
         </c>
         <c ca="right">
            <p>12</p>
         </c>
         <c ca="right">
            <p>102</p>
         </c>
         <c ca="right">
            <p>385 k</p>
         </c>
         <c ca="right">
            <p>6.5</p>
         </c>
         <c ca="right">
            <p>32.5</p>
         </c>
         <c ca="right">
            <p>
               <abbrgrp>
                  <abbr bid="B21">21</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="right">
            <p>Human</p>
         </c>
         <c ca="right">
            <p>40</p>
         </c>
         <c ca="right">
            <p>8599</p>
         </c>
         <c ca="right">
            <p>215</p>
         </c>
         <c ca="right">
            <p>20</p>
         </c>
         <c ca="right">
            <p>20*2.1 M</p>
         </c>
         <c ca="right">
            <p>0.06</p>
         </c>
         <c ca="right">
            <p>0.5</p>
         </c>
         <c ca="right">
            <p>
               <abbrgrp>
                  <abbr bid="B17">17</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>Mean size, Mean spacing and Resolution are given in kilobases (kb).</p>
   </tblfn></tbl>
         </sec>
         <sec>
            <st>
               <p>Validation of the CNVRs</p>
            </st>
            <p>To evaluate the accuracy of the copy number assignments, quantitative real time-PCR was used as described previously <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. Briefly, two control regions, one site on the X chromosome and one site on an autosome, plus ten potential CNV regions were selected. Six quantitative PCRs immediately confirmed the existence of copy number variation in these regions (Additional file <supplr sid="S10">10</supplr>, Table S8), whereas primer sets in four regions did not work satisfactorily (see Methods). Primer sets were therefore re-designed within these four CNV regions, and using these new primers the PCR reactions were performed successfully. Thus, the existence of copy number variation in all ten regions was confirmed by quantitative PCR.</p>
            <suppl id="S10">
               <title>
                  <p>Additional file 10</p>
               </title>
               <text>
                  <p><b>Excel file includes Table S8</b>. Primers and result plots for the RT-PCR experiments.</p>
               </text>
               <file name="1471-2164-11-284-S10.XLS">
   <p>Click here for file</p>
</file>
            </suppl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The study outlined in this paper yields the highest-resolution analysis of bovine CNVs to date. Using a genome-wide tiling oligo array CGH, the largest number of CNV regions yet reported in cattle (304 CNVRs; average of 47 per genome) have been identified. Almost all (98%) of the CNV regions discovered here are novel relative to previous reports <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B42">42</abbr></abbrgrp>, thereby vastly expanding our insight of genome structural variation in cattle. With an effective resolution of 1.5 kb in detecting CNVs, resulting in a median CNV size of 16.7 kb, our data shows that at least 0.68% of the cattle genome can vary in copy number in seemingly healthy animals. This is most probably an underestimate of the true genomic fraction that is tolerant to copy number due to the low number of animals sampled and their close relatedness.</p>
         <p>As detected previously, not only in cattle <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> but also in other species <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B25">25</abbr><abbr bid="B27">27</abbr><abbr bid="B33">33</abbr></abbrgrp>, CNVs are strongly associated with segmental duplications (SDs). This SD relation creates a lack of probe coverage in and around duplicated sequences <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>, which significantly hampers the applicability of genome-wide association studies using SNP arrays to tag SDs-driven CNVs.</p>
         <p>In addition, our data suggests that smaller CNVs (&lt;50 kb) are much more frequent than larger ones, which is in agreement with other high resolution studies <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B17">17</abbr><abbr bid="B45">45</abbr></abbrgrp>. If this can be extrapolated for the whole cattle genome, the commercially available Illumina 50 k SNP panel (with an average probe spacing of 54 kb <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>) would not be sufficient to detect the bulk of existing CNVs. Consequently, further characterization of cattle CNVs should be done with similar high-density array CGH or using next-generation sequencing technologies. The latter identifies a more complete size and class ranges of structural variation <abbrgrp><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr><abbr bid="B70">70</abbr><abbr bid="B71">71</abbr><abbr bid="B72">72</abbr><abbr bid="B73">73</abbr><abbr bid="B74">74</abbr><abbr bid="B75">75</abbr><abbr bid="B76">76</abbr><abbr bid="B77">77</abbr><abbr bid="B78">78</abbr><abbr bid="B79">79</abbr></abbrgrp>.</p>
         <p>As previously shown, copy number variants can have an impact on phenotypic variation mainly due to gene-dosage effects <abbrgrp><abbr bid="B80">80</abbr></abbrgrp>, and are often associated with disease susceptibility <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B81">81</abbr></abbrgrp>. In this analysis, CNVRs were found to be enriched for genes with functions related to environmental response, such as immune and sensory functions previously noticed in other species <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B25">25</abbr><abbr bid="B27">27</abbr><abbr bid="B33">33</abbr><abbr bid="B35">35</abbr></abbrgrp>. The enrichment aspect is an interesting finding, since variation in immunity related genes have been associated with disease. In particular, genes of the major histocompatibility complex (MHC) <url>http://www.ebi.ac.uk/ipd/mhc/</url>, of which some are included in our dataset, are reported to be responsible for differences in predisposition to diseases like mastitis, dermatophilosis and other tick infections <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Concerning the genetics of milk production and lactation, we found none of the 197 unique milk protein genes and the over 6000 mammary-related genes within our CNV regions. This is expected since these genes are known to be highly conserved and evolving more slowly than other genes in the bovine genome <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>.</p>
         <p>Many genes and QTLs associated with human and cow diseases were found to be copy number variable or located nearby CNV regions. The fact that some bovine CNVs occurred in regions orthologous to human CNVs, reflect most likely recurrent CNV formation, rather than ancestral CNVs maintained in both species. These regions could be hotspots of CNV genesis due to their fragile structural architecture that prompts frequent rearrangements.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>In summary, the data presented here extends and establish the fact that a significant part of cattle genome is copy number variable within and between breeds and that our high-resolution array CGH is a valid method to detect bovine CNVs in a genome-wide manner. With a limited amount of sampled animals and breeds, and a stringent CNV calling criteria, the CNV regions reported here are believed to be highly reliable, but the number might greatly underestimate true number of CNVs in cattle populations. Consequently, future studies are required to assess the functional significance of CNVs and their impact on health and productive efficiency in cattle.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Sample preparation</p>
            </st>
            <p>The genomic DNA of 20 bovine samples was obtained from 4 dairy and beef breeds (14 Holsteins, 3 Simmental 2 Red danish and 1 Hereford). The pedigree scheme for the related animals is in Additional file <supplr sid="S2">2</supplr>, Figure S1. DNA was extracted and purified from blood as described elsewhere <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, in order to pass Nimblegen quality control requirements. We adhered to our national and institutional guidelines for the ethical use and treatment of animals in experiments.</p>
         </sec>
         <sec>
            <st>
               <p>Array CGH</p>
            </st>
            <p>DNA fragmentation, labeling, hybridization, washing and array imaging were carried out according to the manufacturer's protocol and done as previously described <abbrgrp><abbr bid="B83">83</abbr></abbrgrp>. Briefly, the genomic DNA samples were fragmented by sonication and labeled with fluorescent dyes Cy3 and Cy5. According to the dye swap loop design (Additional file <supplr sid="S12">12</supplr>, Table S10), samples were co-hybridized with a MAUI hybridization system (BioMicro Systems) to custom-made cattle CGH 2.1 M (HD2) arrays (Roche NimbleGen, Madison, WI). In order to cover the latest bovine genome assembly (bt4) with high density, the custom CGH arrays were planned in 3 designs. Each design covered a specific set of chromosomes with 2.1 million probes, which yielded 420 bp of average probe spacing (301 bp median probe spacing). The probe design fundamentals are described by the array manufacturer and elsewhere <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>.</p>
            <suppl id="S12">
               <title>
                  <p>Additional file 12</p>
               </title>
               <text>
                  <p><b>Excel file includes Table S10</b>. Array CGH hybridizations. Extra information on the samples and experiment design is in GEO <abbrgrp><abbr bid="B92">92</abbr></abbrgrp>.</p>
               </text>
               <file name="1471-2164-11-284-S12.XLS">
   <p>Click here for file</p>
</file>
            </suppl>
            <p>The arrays were scanned using a 5 &#956;m scanner, and Nimblescan software (Roche Nimblegen, Madison, WI) was used to retrieve fluorescent intensity raw data from the scanned images of the oligonucleotide tiling arrays. For each spot on the array, log2-ratios of the Cy3-labeled test sample versus Cy-5 reference sample were computed. Before normalization and segmentation analysis, spatial correction was applied. Spatial correction reduces some artifacts observed in CGH data from 2.1 M arrays, adjusting position-dependent non-uniformity of signals across the array. Specifically, locally weighted polynomial regression (loess) was used to adjust signal intensities based on X, Y feature position <abbrgrp><abbr bid="B84">84</abbr></abbrgrp>. Normalization was then performed using the q-spline method <abbrgrp><abbr bid="B85">85</abbr></abbrgrp>, followed by segmentation using the CNV calling algorithm segMNT <abbrgrp><abbr bid="B86">86</abbr></abbrgrp>. This algorithm is shown to outperform both DNACopy <abbrgrp><abbr bid="B87">87</abbr></abbrgrp>, which is one of the most widely used CNV calling algorithm in the literature, and StepGram <abbrgrp><abbr bid="B88">88</abbr></abbrgrp>, the algorithm used by Agilent for CGH arrays. The segments with mean log2ratio &#8805; |0.4| and at least 5 consecutive probes were retained. From these, a CNV was called if it was detected in both dye swap arrays and detected at least in two different dye swap hybridizations (i.e. in two hybridizations with an animal in common). Since the CNV calling pipeline requires at least 5 consecutive probes before calling a region copy number variant, our theoretical resolution for CNV detection is 1465 bp (median spacing*4 + median oligo length*5).</p>
         </sec>
         <sec>
            <st>
               <p>False positive rate</p>
            </st>
            <p>The false positive rate was calculated based on the 8 sex-mismatched arrays in this study: the length of chromosome &#215; (from all the 8 hybs.) having a log2-ratio with a different signal than it should (given the sex-mismatched hybridization), dividing by the length of chrX multiplied by the number of sex-mismatched arrays (25,694,212/(88,516,663*8) = 3.62%). Since the FPR can be overestimated from sex-mismatched arrays, due to the assumption that no CNV exist in the chrX of sex-mismatched arrays, the FPR was also calculated from the self-self experiment and was determined as the length of sequence that would normally be called a CNV with our CNV calling pipeline. The FPR was determined as being 0.0085%.</p>
         </sec>
         <sec>
            <st>
               <p>Enrichment analysis</p>
            </st>
            <p>Bovine segmental duplication (SD) data was retrieved from <abbrgrp><abbr bid="B89">89</abbr></abbrgrp>. They used two independent approaches to detect segmental duplications: WGAC (whole-genome assembly comparison), which is a BLAST-based analysis of all assembled sequence that detects self alignments (&gt;90% and 1 kb); and WSSD (whole-genome shotgun sequence detection), which is an assembly-independent approach that examines the reference sequence for an increase in WGS read depth-of-coverage. This strategy has been used previously to map SDs in the human <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> and mouse <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> genomes. From their global data we choose to filter out those SDs bigger than 94% identity using WGAC if they were not also confirmed by WSSD. The reason for this relates to the fact that the assembly of highly similar duplicated sequences will often be missed, collapsed or mis-assigned <abbrgrp><abbr bid="B90">90</abbr><abbr bid="B91">91</abbr></abbrgrp>.</p>
            <p>The association of CNVRs with genomic features (SDs, assembly gaps, genes and conserved elements) was tested by randomly permuting the genomic position of each CNVR 10 000 times and determining the sequence content of the resulting region or flanking regions.</p>
         </sec>
         <sec>
            <st>
               <p>Real-Time PCR</p>
            </st>
            <p>Validation with RT-PCR was executed as previously described <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, with the Applied Biosystems 7900HT Sequence Detection System used for the Taqman assays, and downstream analysis performed with SDS 2.2 software. The full sequence of the CNVR was BLASTN-searched against the bovine genome sequence in order to identify a subregion that was unique and specific to the chromosomal location of the CNVR. PCR primers and probes were designed in this subregion of the CNVR using the ProbeFinder software from Roche Applied Science (Additional file <supplr sid="S10">10</supplr>, Table S8). Criteria for classifying a "not working" primer involved two parameters: reaction efficiency below 85%, and Pearson correlation of each standard curve below 0.95. Only ten of the original twenty bovine samples were used due to lack of DNA availability. For each target, the relative quantification analysis with a reference female sample was done to calculate estimated copy numbers of each sample.</p>
         </sec>
         <sec>
            <st>
               <p>Data availability</p>
            </st>
            <p>The full data set and designs from the oligo array CGH experiments have been submitted to GEO <abbrgrp><abbr bid="B92">92</abbr></abbrgrp> under the accession ID GSE18174.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>JF, BT, LEH and CB designed the project. JF performed data analysis and drafted the manuscript. BT planned the RT-PCR validation experiments. CB was the principal investigator of the project. All the authors have contributed to writing this manuscript and have read and approved the contents of the final submitted version.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank the Bovine Genome Sequencing and Analysis Consortium for providing access to the <it>Bos Taurus </it>genome sequence assembly. We thank S&#248;ren Svendsen and Hanne J&#248;rgensen for technical assistance. This work was conducted as part of the SABRETRAIN Project, funded by the Marie Curie Host Fellowships for Early Stage Research Training, as part of the 6th Framework Programme of the European Commission. We also acknowledge the support from The Danish Food Industry Agency, Danish Ministry of Food, Agriculture and Fisheries as well as support from Viking Genetics.</p>
         </sec>
      </ack>
      <refgrp><bibl id="B1"><title><p>Unlocking the Bovine Genome</p></title><aug><au><snm>Tellam</snm><fnm>RL</fnm></au><au><snm>Lemay</snm><fnm>DG</fnm></au><au><snm>Van Tassell</snm><fnm>CP</fnm></au><au><snm>Lewin</snm><fnm>HA</fnm></au><au><snm>Worley</snm><fnm>KC</fnm></au><au><snm>Elsik</snm><fnm>CG</fnm></au></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>10</volume><fpage>193</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-10-193</pubid><pubid idtype="pmcid">2680899</pubid><pubid idtype="pmpid">19393070</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>The genome sequence of taurine cattle: A window to ruminant biology and evolution</p></title><aug><au><cnm>Bovine Genome Sequencing and Analysis Consortium</cnm></au></aug><source>Science</source><pubdate>2009</pubdate><volume>324</volume><fpage>522</fpage><lpage>528</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1169588</pubid><pubid idtype="pmcid">2943200</pubid><pubid idtype="pmpid" link="fulltext">19390049</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds</p></title><aug><au><cnm>Bovine HapMap Consortium</cnm></au></aug><source>Science</source><pubdate>2009</pubdate><volume>324</volume><fpage>528</fpage><lpage>32</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1167936</pubid><pubid idtype="pmcid">2735092</pubid><pubid idtype="pmpid">19390050</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>The Bar "gene" duplication</p></title><aug><au><snm>Bridges</snm><fnm>CB</fnm></au></aug><source>Science</source><pubdate>1936</pubdate><volume>83</volume><fpage>210</fpage><lpage>11</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.83.2148.210</pubid><pubid idtype="pmpid" link="fulltext">17796454</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Study of somatic chromosomes from 9 mongoloid children</p></title><aug><au><snm>Lejeune</snm><fnm>J</fnm></au><au><snm>Gautier</snm><fnm>M</fnm></au><au><snm>Turpin</snm><fnm>R</fnm></au></aug><source>C R Hebd Seances Acad Sci</source><pubdate>1959</pubdate><volume>248</volume><fpage>1721</fpage><lpage>1722</lpage><xrefbib><pubid idtype="pmpid">13639368</pubid></xrefbib></bibl><bibl id="B6"><title><p>Large-scale copy number polymorphism in the human genome</p></title><aug><au><snm>Sebat</snm><fnm>J</fnm></au><au><snm>Lakshmi</snm><fnm>B</fnm></au><au><snm>Troge</snm><fnm>J</fnm></au><au><snm>Alexander</snm><fnm>J</fnm></au><au><snm>Young</snm><fnm>J</fnm></au><au><snm>Lundin</snm><fnm>P</fnm></au><au><snm>M&#229;n&#233;r</snm><fnm>S</fnm></au><au><snm>Massa</snm><fnm>H</fnm></au><au><snm>Walker</snm><fnm>M</fnm></au><au><snm>Chi</snm><fnm>M</fnm></au><au><snm>Navin</snm><fnm>N</fnm></au><au><snm>Lucito</snm><fnm>R</fnm></au><au><snm>Healy</snm><fnm>J</fnm></au><au><snm>Hicks</snm><fnm>J</fnm></au><au><snm>Ye</snm><fnm>K</fnm></au><au><snm>Reiner</snm><fnm>A</fnm></au><au><snm>Gilliam</snm><fnm>TC</fnm></au><au><snm>Trask</snm><fnm>B</fnm></au><au><snm>Patterson</snm><fnm>N</fnm></au><au><snm>Zetterberg</snm><fnm>A</fnm></au><au><snm>Wigler</snm><fnm>M</fnm></au></aug><source>Science</source><pubdate>2004</pubdate><volume>305</volume><fpage>525</fpage><lpage>528</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1098918</pubid><pubid idtype="pmpid" link="fulltext">15273396</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Detection of large-scale variation in the human genome</p></title><aug><au><snm>Iafrate</snm><fnm>AJ</fnm></au><au><snm>Feuk</snm><fnm>L</fnm></au><au><snm>Rivera</snm><fnm>MN</fnm></au><au><snm>Listewnik</snm><fnm>ML</fnm></au><au><snm>Donahoe</snm><fnm>PK</fnm></au><au><snm>Qi</snm><fnm>Y</fnm></au><au><snm>Scherer</snm><fnm>SW</fnm></au><au><snm>Lee</snm><fnm>C</fnm></au></aug><source>Nat Genet</source><pubdate>2004</pubdate><volume>36</volume><fpage>949</fpage><lpage>951</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1416</pubid><pubid idtype="pmpid" link="fulltext">15286789</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Structural variation in the human genome</p></title><aug><au><snm>Feuk</snm><fnm>L</fnm></au><au><snm>Carson</snm><fnm>AR</fnm></au><au><snm>Scherer</snm><fnm>SW</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2006</pubdate><volume>7</volume><fpage>85</fpage><lpage>97</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg1767</pubid><pubid idtype="pmpid" link="fulltext">16418744</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>A haplotype map of the human genome</p></title><aug><au><cnm>International HapMap Consortium</cnm></au></aug><source>Nature</source><pubdate>2005</pubdate><volume>437</volume><fpage>1299</fpage><lpage>1320</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature04226</pubid><pubid idtype="pmcid">1880871</pubid><pubid idtype="pmpid">16255080</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Global variation in copy number in the human genome</p></title><aug><au><snm>Redon</snm><fnm>R</fnm></au><au><snm>Ishikawa</snm><fnm>S</fnm></au><au><snm>Fitch</snm><fnm>KR</fnm></au><au><snm>Feuk</snm><fnm>L</fnm></au><au><snm>Perry</snm><fnm>GH</fnm></au><au><snm>Andrews</snm><fnm>TD</fnm></au><au><snm>Fiegler</snm><fnm>H</fnm></au><au><snm>Shapero</snm><fnm>MH</fnm></au><au><snm>Carson</snm><fnm>AR</fnm></au><au><snm>Chen</snm><fnm>W</fnm></au><etal/></aug><source>Nature</source><pubdate>2006</pubdate><volume>444</volume><fpage>444</fpage><lpage>454</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature05329</pubid><pubid idtype="pmcid">2669898</pubid><pubid idtype="pmpid">17122850</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>A comprehensive analysis of common copy-number variations in the human genome</p></title><aug><au><snm>Wong</snm><fnm>KK</fnm></au><au><snm>deLeeuw</snm><fnm>RJ</fnm></au><au><snm>Dosanjh</snm><fnm>NS</fnm></au><au><snm>Kimm</snm><fnm>LR</fnm></au><au><snm>Cheng</snm><fnm>Z</fnm></au><au><snm>Horsman</snm><fnm>DE</fnm></au><au><snm>MacAulay</snm><fnm>C</fnm></au><au><snm>Ng</snm><fnm>RT</fnm></au><au><snm>Brown</snm><fnm>CJ</fnm></au><au><snm>Eichler</snm><fnm>EE</fnm></au><au><snm>Lam</snm><fnm>WL</fnm></au></aug><source>Am J Hum Genet</source><pubdate>2007</pubdate><volume>80</volume><fpage>91</fpage><lpage>104</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1086/510560</pubid><pubid idtype="pmcid">1785303</pubid><pubid idtype="pmpid">17160897</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Germ-line dna copy number variation frequencies in a large north american population</p></title><aug><au><snm>Zogopoulos</snm><fnm>G</fnm></au><au><snm>Ha</snm><fnm>KC</fnm></au><au><snm>Naqib</snm><fnm>F</fnm></au><au><snm>Moore</snm><fnm>S</fnm></au><au><snm>Kim</snm><fnm>H</fnm></au><au><snm>Montpetit</snm><fnm>A</fnm></au><au><snm>Robidoux</snm><fnm>F</fnm></au><au><snm>Laflamme</snm><fnm>P</fnm></au><au><snm>Cotterchio</snm><fnm>M</fnm></au><au><snm>Greenwood</snm><fnm>C</fnm></au><au><snm>Scherer</snm><fnm>SW</fnm></au><au><snm>Zanke</snm><fnm>B</fnm></au><au><snm>Hudson</snm><fnm>TJ</fnm></au><au><snm>Bader</snm><fnm>GD</fnm></au><au><snm>Gallinger</snm><fnm>S</fnm></au></aug><source>Hum Genet</source><pubdate>2007</pubdate><volume>122</volume><fpage>345</fpage><lpage>353</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00439-007-0404-5</pubid><pubid idtype="pmpid" link="fulltext">17638019</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Array CGH analysis of copy number variation identifies 1284 new genes variant in healthy white males: implications for association studies of complex diseases</p></title><aug><au><snm>de Smith</snm><fnm>AJ</fnm></au><au><snm>Tsalenko</snm><fnm>A</fnm></au><au><snm>Sampas</snm><fnm>N</fnm></au><au><snm>Scheffer</snm><fnm>A</fnm></au><au><snm>Yamada</snm><fnm>NA</fnm></au><au><snm>Tsang</snm><fnm>P</fnm></au><au><snm>Ben-Dor</snm><fnm>A</fnm></au><au><snm>Yakhini</snm><fnm>Z</fnm></au><au><snm>Ellis</snm><fnm>RJ</fnm></au><au><snm>Bruhn</snm><fnm>L</fnm></au><au><snm>Laderman</snm><fnm>S</fnm></au><au><snm>Froguel</snm><fnm>P</fnm></au><au><snm>Blakemore</snm><fnm>AIF</fnm></au></aug><source>Hum Mol Genet</source><pubdate>2007</pubdate><volume>16</volume><fpage>2783</fpage><lpage>2794</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/hmg/ddm208</pubid><pubid idtype="pmpid" link="fulltext">17666407</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>The Fine-Scale and Complex Architecture of Human Copy-Number Variation</p></title><aug><au><snm>Perry</snm><fnm>GH</fnm></au><au><snm>Ben-Dor</snm><fnm>A</fnm></au><au><snm>Tsalenko</snm><fnm>A</fnm></au><au><snm>Sampas</snm><fnm>N</fnm></au><au><snm>Rodriguez-Revenga</snm><fnm>L</fnm></au><au><snm>Tran</snm><fnm>CW</fnm></au><au><snm>Scheffer</snm><fnm>A</fnm></au><au><snm>Steinfeld</snm><fnm>I</fnm></au><au><snm>Tsang</snm><fnm>P</fnm></au><au><snm>Yamada</snm><fnm>NA</fnm></au><au><snm>Park</snm><fnm>HS</fnm></au><au><snm>Kim</snm><fnm>JI</fnm></au><au><snm>Seo</snm><fnm>JS</fnm></au><au><snm>Yakhini</snm><fnm>Z</fnm></au><au><snm>Laderman</snm><fnm>S</fnm></au><au><snm>Bruhn</snm><fnm>L</fnm></au><au><snm>Lee</snm><fnm>C</fnm></au></aug><source>Am J Hum Genet</source><pubdate>2008</pubdate><volume>82</volume><fpage>685</fpage><lpage>695</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ajhg.2007.12.010</pubid><pubid idtype="pmcid">2661628</pubid><pubid idtype="pmpid">18304495</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>High-resolution mapping and analysis of copy number variations in the human genome: A data resource for clinical and research applications</p></title><aug><au><snm>Shaikh</snm><fnm>TH</fnm></au><au><snm>Gai</snm><fnm>X</fnm></au><au><snm>Perin</snm><fnm>JC</fnm></au><au><snm>Glessner</snm><fnm>JT</fnm></au><au><snm>Xie</snm><fnm>H</fnm></au><au><snm>Murphy</snm><fnm>K</fnm></au><au><snm>O&apos;Hara</snm><fnm>R</fnm></au><au><snm>Casalunovo</snm><fnm>T</fnm></au><au><snm>Conlin</snm><fnm>LK</fnm></au><au><snm>D&apos;Arcy</snm><fnm>M</fnm></au><etal/></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>1682</fpage><lpage>1690</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.083501.108</pubid><pubid idtype="pmcid">2752118</pubid><pubid idtype="pmpid">19592680</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>The diploid genome sequence of an individual human</p></title><aug><au><snm>Levy</snm><fnm>S</fnm></au><au><snm>Sutton</snm><fnm>G</fnm></au><au><snm>Ng</snm><fnm>PC</fnm></au><au><snm>Feuk</snm><fnm>L</fnm></au><au><snm>Halpern</snm><fnm>AL</fnm></au><au><snm>Walenz</snm><fnm>BP</fnm></au><au><snm>Axelrod</snm><fnm>N</fnm></au><au><snm>Huang</snm><fnm>J</fnm></au><au><snm>Kirkness</snm><fnm>EF</fnm></au><au><snm>Denisov</snm><fnm>G</fnm></au><etal/></aug><source>PLoS Biol</source><pubdate>2007</pubdate><volume>5</volume><fpage>e254</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pbio.0050254</pubid><pubid idtype="pmcid">1964779,1964779</pubid><pubid idtype="pmpid">17803354</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Origins and functional impact of copy number variation in the human genome</p></title><aug><au><snm>Conrad</snm><fnm>DF</fnm></au><au><snm>Pinto</snm><fnm>D</fnm></au><au><snm>Redon</snm><fnm>R</fnm></au><au><snm>Feuk</snm><fnm>L</fnm></au><au><snm>Gokcumen</snm><fnm>O</fnm></au><au><snm>Zhang</snm><fnm>Y</fnm></au><au><snm>Aerts</snm><fnm>J</fnm></au><au><snm>Andrews</snm><fnm>TD</fnm></au><au><snm>Barnes</snm><fnm>C</fnm></au><au><snm>Campbell</snm><fnm>P</fnm></au><au><snm>Fitzgerald</snm><fnm>T</fnm></au><au><snm>Hu</snm><fnm>M</fnm></au><au><snm>Ihm</snm><fnm>CH</fnm></au><au><snm>Kristiansson</snm><fnm>K</fnm></au><au><snm>Macarthur</snm><fnm>DG</fnm></au><au><snm>Macdonald</snm><fnm>JR</fnm></au><au><snm>Onyiah</snm><fnm>I</fnm></au><au><snm>Pang</snm><fnm>AW</fnm></au><au><snm>Robson</snm><fnm>S</fnm></au><au><snm>Stirrups</snm><fnm>K</fnm></au><au><snm>Valsesia</snm><fnm>A</fnm></au><au><snm>Walter</snm><fnm>K</fnm></au><au><snm>Wei</snm><fnm>J</fnm></au><au><cnm>Wellcome Trust Case Control Consortium</cnm></au><au><snm>Tyler-Smith</snm><fnm>C</fnm></au><au><snm>Carter</snm><fnm>NP</fnm></au><au><snm>Lee</snm><fnm>C</fnm></au><au><snm>Scherer</snm><fnm>SW</fnm></au><au><snm>Hurles</snm><fnm>ME</fnm></au></aug><source>Nature</source><pubdate>2010</pubdate><volume>464</volume><fpage>704</fpage><lpage>712</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08516</pubid><pubid idtype="pmpid" link="fulltext">19812545</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Genomic rearrangements and sporadic disease</p></title><aug><au><snm>Lupski</snm><fnm>JR</fnm></au></aug><source>Nat Genet</source><pubdate>2007</pubdate><volume>39</volume><issue>Supplement</issue><fpage>S43</fpage><lpage>S47</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng2084</pubid><pubid idtype="pmpid" link="fulltext">17597781</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Hotspots for copy number variation in chimpanzees and humans</p></title><aug><au><snm>Perry</snm><fnm>GH</fnm></au><au><snm>Tchinda</snm><fnm>J</fnm></au><au><snm>McGrath</snm><fnm>SD</fnm></au><au><snm>Zhang</snm><fnm>J</fnm></au><au><snm>Picker</snm><fnm>SR</fnm></au><au><snm>Caceres</snm><fnm>AM</fnm></au><au><snm>Iafrate</snm><fnm>AJ</fnm></au><au><snm>Tyler-Smith</snm><fnm>C</fnm></au><au><snm>Scherer</snm><fnm>SW</fnm></au><au><snm>Eichler</snm><fnm>EE</fnm></au><au><snm>Stone</snm><fnm>AC</fnm></au><au><snm>Lee</snm><fnm>C</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2006</pubdate><volume>103</volume><fpage>8006</fpage><lpage>8011</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0602318103</pubid><pubid idtype="pmcid">1472420</pubid><pubid idtype="pmpid">16702545</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Structural divergence between the human and chimpanzee genomes</p></title><aug><au><snm>Kehrer-Sawatzki</snm><fnm>H</fnm></au><au><snm>Cooper</snm><fnm>DN</fnm></au></aug><source>Hum Genet</source><pubdate>2007</pubdate><volume>120</volume><fpage>759</fpage><lpage>778</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00439-006-0270-6</pubid><pubid idtype="pmpid" link="fulltext">17066299</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Analysis of copy number variation in the rhesus macaque genome identifies candidate loci for evolutionary and human disease studies</p></title><aug><au><snm>Lee</snm><fnm>AS</fnm></au><au><snm>Gutierrez-Arcelus</snm><fnm>M</fnm></au><au><snm>Perry</snm><fnm>GH</fnm></au><au><snm>Vallender</snm><fnm>EJ</fnm></au><au><snm>Johnson</snm><fnm>WE</fnm></au><au><snm>Miller</snm><fnm>GM</fnm></au><au><snm>Korbel</snm><fnm>JO</fnm></au><au><snm>Lee</snm><fnm>C</fnm></au></aug><source>Hum Mol Genet</source><pubdate>2008</pubdate><volume>17</volume><fpage>1127</fpage><lpage>1136</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/hmg/ddn002</pubid><pubid idtype="pmpid" link="fulltext">18180252</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Significant gene content variation characterizes the genomes of inbred mouse strains</p></title><aug><au><snm>Cutler</snm><fnm>G</fnm></au><au><snm>Marshall</snm><fnm>LA</fnm></au><au><snm>Chin</snm><fnm>N</fnm></au><au><snm>Baribault</snm><fnm>H</fnm></au><au><snm>Kassner</snm><fnm>PD</fnm></au></aug><source>Genome Res</source><pubdate>2007</pubdate><volume>17</volume><fpage>1743</fpage><lpage>1754</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.6754607</pubid><pubid idtype="pmcid">2099583</pubid><pubid idtype="pmpid">17989247</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Genomic segmental polymorphisms in inbred mouse strains</p></title><aug><au><snm>Li</snm><fnm>J</fnm></au><au><snm>Jiang</snm><fnm>T</fnm></au><au><snm>Mao</snm><fnm>JH</fnm></au><au><snm>Balmain</snm><fnm>A</fnm></au><au><snm>Peterson</snm><fnm>L</fnm></au><au><snm>Harris</snm><fnm>C</fnm></au><au><snm>Rao</snm><fnm>PH</fnm></au><au><snm>Havlak</snm><fnm>P</fnm></au><au><snm>Gibbs</snm><fnm>R</fnm></au><au><snm>Cai</snm><fnm>WW</fnm></au></aug><source>Nat Genet</source><pubdate>2004</pubdate><volume>36</volume><fpage>952</fpage><lpage>954</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1417</pubid><pubid idtype="pmpid" link="fulltext">15322544</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Complex haplotypes, copy number polymorphisms and coding variation in two recently divergent mouse strains</p></title><aug><au><snm>Adams</snm><fnm>DJ</fnm></au><au><snm>Dermitzakis</snm><fnm>ET</fnm></au><au><snm>Cox</snm><fnm>T</fnm></au><au><snm>Smith</snm><fnm>J</fnm></au><au><snm>Davies</snm><fnm>R</fnm></au><au><snm>Banerjee</snm><fnm>R</fnm></au><au><snm>Bonfield</snm><fnm>J</fnm></au><au><snm>Mullikin</snm><fnm>JC</fnm></au><au><snm>Chung</snm><fnm>YJ</fnm></au><au><snm>Rogers</snm><fnm>J</fnm></au><au><snm>Bradley</snm><fnm>A</fnm></au></aug><source>Nat Genet</source><pubdate>2005</pubdate><volume>37</volume><fpage>532</fpage><lpage>536</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1551</pubid><pubid idtype="pmpid" link="fulltext">15852006</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>A high-resolution map of segmental DNA copy number variation in the mouse genome</p></title><aug><au><snm>Graubert</snm><fnm>TA</fnm></au><au><snm>Cahan</snm><fnm>P</fnm></au><au><snm>Edwin</snm><fnm>D</fnm></au><au><snm>Selzer</snm><fnm>RR</fnm></au><au><snm>Richmond</snm><fnm>TA</fnm></au><au><snm>Eis</snm><fnm>PS</fnm></au><au><snm>Shannon</snm><fnm>WD</fnm></au><au><snm>Li</snm><fnm>X</fnm></au><au><snm>McLeod</snm><fnm>HL</fnm></au><au><snm>Cheverud</snm><fnm>JM</fnm></au><au><snm>Ley</snm><fnm>TJ</fnm></au></aug><source>PLoS Genet</source><pubdate>2007</pubdate><volume>3</volume><fpage>e3</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.0030003</pubid><pubid idtype="pmcid">1761046,1761046</pubid><pubid idtype="pmpid">17206864</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Recurrent DNA copy number variation in the laboratory mouse</p></title><aug><au><snm>Egan</snm><fnm>CM</fnm></au><au><snm>Sridhar</snm><fnm>S</fnm></au><au><snm>Wigler</snm><fnm>M</fnm></au><au><snm>Hall</snm><fnm>IM</fnm></au></aug><source>Nat Genet</source><pubdate>2007</pubdate><volume>39</volume><fpage>1384</fpage><lpage>1389</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.2007.19</pubid><pubid idtype="pmpid" link="fulltext">17965714</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Mouse segmental duplication and copy number variation</p></title><aug><au><snm>She</snm><fnm>X</fnm></au><au><snm>Cheng</snm><fnm>Z</fnm></au><au><snm>Zollner</snm><fnm>S</fnm></au><au><snm>Church</snm><fnm>DM</fnm></au><au><snm>Eichler</snm><fnm>EE</fnm></au></aug><source>Nat Genet</source><pubdate>2008</pubdate><volume>40</volume><fpage>909</fpage><lpage>914</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.172</pubid><pubid idtype="pmcid">2574762</pubid><pubid idtype="pmpid">18500340</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Mapping segmental and sequence variations among laboratory mice using BAC array CGH</p></title><aug><au><snm>Snijders</snm><fnm>AM</fnm></au><au><snm>Nowak</snm><fnm>NJ</fnm></au><au><snm>Huey</snm><fnm>B</fnm></au><au><snm>Fridlyand</snm><fnm>J</fnm></au><au><snm>Law</snm><fnm>S</fnm></au><au><snm>Conroy</snm><fnm>J</fnm></au><au><snm>Tokuyasu</snm><fnm>T</fnm></au><au><snm>Demir</snm><fnm>K</fnm></au><au><snm>Chiu</snm><fnm>R</fnm></au><au><snm>Mao</snm><fnm>JH</fnm></au><au><snm>Jain</snm><fnm>AN</fnm></au><au><snm>Jones</snm><fnm>SJ</fnm></au><au><snm>Balmain</snm><fnm>A</fnm></au><au><snm>Pinkel</snm><fnm>D</fnm></au><au><snm>Albertson</snm><fnm>DG</fnm></au></aug><source>Genome Res</source><pubdate>2005</pubdate><volume>15</volume><fpage>302</fpage><lpage>311</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.2902505</pubid><pubid idtype="pmcid">546532</pubid><pubid idtype="pmpid">15687294</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Segmental copy number variation shapes tissue transcriptome</p></title><aug><au><snm>Henrichsen</snm><fnm>CN</fnm></au><au><snm>Vinckenbosch</snm><fnm>N</fnm></au><au><snm>Z&#246;llner</snm><fnm>S</fnm></au><au><snm>Chaignat</snm><fnm>E</fnm></au><au><snm>Pradervand</snm><fnm>S</fnm></au><au><snm>Sch&#252;tz</snm><fnm>F</fnm></au><au><snm>Ruedi</snm><fnm>M</fnm></au><au><snm>Kaessmann</snm><fnm>H</fnm></au><au><snm>Reymond</snm><fnm>A</fnm></au></aug><source>Nat Genet</source><pubdate>2009</pubdate><volume>41</volume><fpage>424</fpage><lpage>429</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.345</pubid><pubid idtype="pmpid" link="fulltext">19270705</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Distribution and functional impact of DNA copy number variation in the rat</p></title><aug><au><snm>Guryev</snm><fnm>V</fnm></au><au><snm>Saar</snm><fnm>K</fnm></au><au><snm>Adamovic</snm><fnm>T</fnm></au><au><snm>Verheul</snm><fnm>M</fnm></au><au><snm>van Heesch</snm><fnm>SA</fnm></au><au><snm>Cook</snm><fnm>S</fnm></au><au><snm>Pravenec</snm><fnm>M</fnm></au><au><snm>Aitman</snm><fnm>T</fnm></au><au><snm>Jacob</snm><fnm>H</fnm></au><au><snm>Shull</snm><fnm>JD</fnm></au><au><snm>Hubner</snm><fnm>N</fnm></au><au><snm>Cuppen</snm><fnm>E</fnm></au></aug><source>Nat Genet</source><pubdate>2008</pubdate><volume>40</volume><fpage>538</fpage><lpage>545</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.141</pubid><pubid idtype="pmpid" link="fulltext">18443591</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>A portrait of copy-number polymorphism in Drosophila melanogaster</p></title><aug><au><snm>Dopman</snm><fnm>EB</fnm></au><au><snm>Hartl</snm><fnm>DL</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2007</pubdate><volume>104</volume><fpage>19920</fpage><lpage>19925</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0709888104</pubid><pubid idtype="pmcid">2148398</pubid><pubid idtype="pmpid">18056801</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Natural selection shapes genome-wide patterns of copy-number polymorphism in Drosophila melanogaster</p></title><aug><au><snm>Emerson</snm><fnm>JJ</fnm></au><au><snm>Cardoso-Moreira</snm><fnm>M</fnm></au><au><snm>Borevitz</snm><fnm>JO</fnm></au><au><snm>Long</snm><fnm>M</fnm></au></aug><source>Science</source><pubdate>2008</pubdate><volume>320</volume><fpage>1629</fpage><lpage>1631</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1158078</pubid><pubid idtype="pmpid" link="fulltext">18535209</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Mapping dna structural variation in dogs</p></title><aug><au><snm>Chen</snm><fnm>WK</fnm></au><au><snm>Swartz</snm><fnm>JD</fnm></au><au><snm>Rush</snm><fnm>LJ</fnm></au><au><snm>Alvarez</snm><fnm>CE</fnm></au></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>500</fpage><lpage>509</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.083741.108</pubid><pubid idtype="pmcid">2661804</pubid><pubid idtype="pmpid">19015322</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Copy Number Variation Analysis in Chicken</p></title><aug><au><snm>Crooijmans</snm><fnm>RPMA</fnm></au><au><snm>Groenen</snm><fnm>MAM</fnm></au></aug><source>Proceedings of Plant &amp; Animal Genomes XVII Conference; 10-14 January 2009; San Diego</source><pubdate>2009</pubdate><fpage>W397</fpage></bibl><bibl id="B35"><title><p>Detection of germline and somatic copy number variations in cattle</p></title><aug><au><snm>Liu</snm><fnm>GE</fnm></au><au><snm>Van Tassel</snm><fnm>CP</fnm></au><au><snm>Sonstegard</snm><fnm>TS</fnm></au><au><snm>Li</snm><fnm>RW</fnm></au><au><snm>Alexander</snm><fnm>LJ</fnm></au><au><snm>Keele</snm><fnm>JW</fnm></au><au><snm>Matukumalli</snm><fnm>LK</fnm></au><au><snm>Smith</snm><fnm>TP</fnm></au><au><snm>Gasbarre</snm><fnm>LC</fnm></au></aug><source>Developments in Biologicals</source><pubdate>2008</pubdate><volume>132</volume><fpage>231</fpage><lpage>237</lpage><xrefbib><pubidlist><pubid idtype="doi">full_text</pubid><pubid idtype="pmpid">18817307</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Relative impact of nucleotide and copy number variation on gene expression phenotypes</p></title><aug><au><snm>Stranger</snm><fnm>BE</fnm></au><au><snm>Forrest</snm><fnm>MS</fnm></au><au><snm>Dunning</snm><fnm>M</fnm></au><au><snm>Ingle</snm><fnm>CE</fnm></au><au><snm>Beazley</snm><fnm>C</fnm></au><au><snm>Thorne</snm><fnm>N</fnm></au><au><snm>Redon</snm><fnm>R</fnm></au><au><snm>Bird</snm><fnm>CP</fnm></au><au><snm>de Grassi</snm><fnm>A</fnm></au><au><snm>Lee</snm><fnm>C</fnm></au><au><snm>Tyler-Smith</snm><fnm>C</fnm></au><au><snm>Carter</snm><fnm>N</fnm></au><au><snm>Scherer</snm><fnm>SW</fnm></au><au><snm>Tavar&#233;</snm><fnm>S</fnm></au><au><snm>Deloukas</snm><fnm>P</fnm></au><au><snm>Hurles</snm><fnm>ME</fnm></au><au><snm>Dermitzakis</snm><fnm>ET</fnm></au></aug><source>Science</source><pubdate>2007</pubdate><volume>315</volume><fpage>848</fpage><lpage>853</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1136678</pubid><pubid idtype="pmcid">2665772</pubid><pubid idtype="pmpid">17289997</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Diet and the evolution of human amylase gene copy number variation</p></title><aug><au><snm>Perry</snm><fnm>GH</fnm></au><au><snm>Dominy</snm><fnm>NJ</fnm></au><au><snm>Claw</snm><fnm>KG</fnm></au><au><snm>Lee</snm><fnm>AS</fnm></au><au><snm>Fiegler</snm><fnm>H</fnm></au><au><snm>Redon</snm><fnm>R</fnm></au><au><snm>Werner</snm><fnm>J</fnm></au><au><snm>Villanea</snm><fnm>FA</fnm></au><au><snm>Mountain</snm><fnm>JL</fnm></au><au><snm>Misra</snm><fnm>R</fnm></au><au><snm>Carter</snm><fnm>NP</fnm></au><au><snm>Lee</snm><fnm>C</fnm></au><au><snm>Stone</snm><fnm>AC</fnm></au></aug><source>Nat Genet</source><pubdate>2007</pubdate><volume>39</volume><fpage>1256</fpage><lpage>1260</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng2123</pubid><pubid idtype="pmcid">2377015</pubid><pubid idtype="pmpid">17828263</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Copy number variation and susceptibility to human disorders (review)</p></title><aug><au><snm>Shastry</snm><fnm>BS</fnm></au></aug><source>Molecular Medicine reports</source><pubdate>2009</pubdate><volume>2</volume><fpage>143</fpage><lpage>147</lpage></bibl><bibl id="B39"><title><p>Genomic disorders: molecular mechanisms for rearrangements and conveyed phenotypes</p></title><aug><au><snm>Lupski</snm><fnm>JR</fnm></au><au><snm>Stankiewicz</snm><fnm>P</fnm></au></aug><source>PLoS Genet</source><pubdate>2005</pubdate><volume>1</volume><fpage>e49</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.0010049</pubid><pubid idtype="pmcid">1352149</pubid><pubid idtype="pmpid">16444292</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Side effects of genome structural changes</p></title><aug><au><snm>Reymond</snm><fnm>A</fnm></au><au><snm>Henrichsen</snm><fnm>CN</fnm></au><au><snm>Harewood</snm><fnm>L</fnm></au><au><snm>Merla</snm><fnm>G</fnm></au></aug><source>Curr Opin Genet Dev</source><pubdate>2007</pubdate><volume>17</volume><fpage>381</fpage><lpage>386</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gde.2007.08.009</pubid><pubid idtype="pmpid" link="fulltext">17913489</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>A critical analysis of disease-associated DNA polymorphisms in the genes of cattle, goat, sheep, and pig</p></title><aug><au><snm>Ibeagha-Awemu</snm><fnm>EM</fnm></au><au><snm>Kgwatalala</snm><fnm>P</fnm></au><au><snm>Ibeagha</snm><fnm>AE</fnm></au><au><snm>Zhao</snm><fnm>X</fnm></au></aug><source>Mamm Genome</source><pubdate>2008</pubdate><volume>19</volume><fpage>226</fpage><lpage>245</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00335-008-9101-5</pubid><pubid idtype="pmcid">2323435</pubid><pubid idtype="pmpid">18350334</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Development and Characterization of a High Density SNP Genotyping Assay for Cattle</p></title><aug><au><snm>Matukumalli</snm><fnm>LK</fnm></au><au><snm>Lawley</snm><fnm>CT</fnm></au><au><snm>Schnabel</snm><fnm>RD</fnm></au><au><snm>Taylor</snm><fnm>JF</fnm></au><au><snm>Allan</snm><fnm>MF</fnm></au><au><snm>Heaton</snm><fnm>MP</fnm></au><au><snm>O&apos;Connell</snm><fnm>J</fnm></au><au><snm>Moore</snm><fnm>SS</fnm></au><au><snm>Smith</snm><fnm>TP</fnm></au><au><snm>Sonstegard</snm><fnm>TS</fnm></au><au><snm>Van Tassell</snm><fnm>CP</fnm></au></aug><source>PLoS ONE</source><pubdate>2009</pubdate><volume>4</volume><fpage>e5350</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0005350</pubid><pubid idtype="pmcid">2669730</pubid><pubid idtype="pmpid">19390634</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>Review of microarray experimental design strategies for genetical genomics studies</p></title><aug><au><snm>Rosa</snm><fnm>GJ</fnm></au><au><snm>de Leon</snm><fnm>N</fnm></au><au><snm>Rosa</snm><fnm>AJ</fnm></au></aug><source>Physiol Genomics</source><pubdate>2006</pubdate><volume>28</volume><fpage>15</fpage><lpage>23</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1152/physiolgenomics.00106.2006</pubid><pubid idtype="pmpid" link="fulltext">16985008</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Design of microarray experiments for genetical genomics studies</p></title><aug><au><snm>Bueno Filho</snm><fnm>JS</fnm></au><au><snm>Gilmour</snm><fnm>SG</fnm></au><au><snm>Rosa</snm><fnm>GJ</fnm></au></aug><source>Genetics</source><pubdate>2006</pubdate><volume>174</volume><fpage>945</fpage><lpage>957</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1534/genetics.106.057281</pubid><pubid idtype="pmcid">1602072</pubid><pubid idtype="pmpid">16888340</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>The impact of copy number variation on local gene expression in mouse hematopoietic stem and progenitor cells</p></title><aug><au><snm>Cahan</snm><fnm>P</fnm></au><au><snm>Li</snm><fnm>Y</fnm></au><au><snm>Izumi</snm><fnm>M</fnm></au><au><snm>Graubert</snm><fnm>TA</fnm></au></aug><source>Nature Genet</source><pubdate>2009</pubdate><volume>41</volume><fpage>430</fpage><lpage>437</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.350</pubid><pubid idtype="pmcid">2728431</pubid><pubid idtype="pmpid">19270704</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>Germline rates of de novo meiotic deletions and duplications causing several genomic disorders</p></title><aug><au><snm>Turner</snm><fnm>DJ</fnm></au><au><snm>Miretti</snm><fnm>M</fnm></au><au><snm>Rajan</snm><fnm>D</fnm></au><au><snm>Fiegler</snm><fnm>H</fnm></au><au><snm>Carter</snm><fnm>NP</fnm></au><au><snm>Blayney</snm><fnm>ML</fnm></au><au><snm>Beck</snm><fnm>S</fnm></au><au><snm>Hurles</snm><fnm>ME</fnm></au></aug><source>Nature Genet</source><pubdate>2008</pubdate><volume>40</volume><fpage>90</fpage><lpage>95</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.2007.40</pubid><pubid idtype="pmcid">2669897</pubid><pubid idtype="pmpid">18059269</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>A Snapshot of CNVs in the Pig Genome</p></title><aug><au><snm>Fadista</snm><fnm>J</fnm></au><au><snm>Nygaard</snm><fnm>M</fnm></au><au><snm>Holm</snm><fnm>LE</fnm></au><au><snm>Thomsen</snm><fnm>B</fnm></au><au><snm>Bendixen</snm><fnm>C</fnm></au></aug><source>PLoS ONE</source><pubdate>2008</pubdate><volume>3</volume><fpage>e3916</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0003916</pubid><pubid idtype="pmcid">2596487</pubid><pubid idtype="pmpid">19079605</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>Bos taurus genome assembly</p></title><aug><au><snm>Liu</snm><fnm>Y</fnm></au><au><snm>Qin</snm><fnm>X</fnm></au><au><snm>Song</snm><fnm>XZH</fnm></au><au><snm>Jiang</snm><fnm>H</fnm></au><au><snm>Shen</snm><fnm>Y</fnm></au><au><snm>Durbin</snm><fnm>KJ</fnm></au><au><snm>Lien</snm><fnm>S</fnm></au><au><snm>Kent</snm><fnm>MP</fnm></au><au><snm>Sodeland</snm><fnm>M</fnm></au><au><snm>Ren</snm><fnm>Y</fnm></au><au><snm>Zhang</snm><fnm>L</fnm></au><au><snm>Sodergren</snm><fnm>E</fnm></au><au><snm>Havlak</snm><fnm>P</fnm></au><au><snm>Worley</snm><fnm>KC</fnm></au><au><snm>Weinstock</snm><fnm>GM</fnm></au><au><snm>Gibbs</snm><fnm>RA</fnm></au></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>10</volume><fpage>180</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-10-180</pubid><pubid idtype="pmcid">2686734</pubid><pubid idtype="pmpid">19393050</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Recent segmental duplications in the human genome</p></title><aug><au><snm>Bailey</snm><fnm>JA</fnm></au><au><snm>Gu</snm><fnm>Z</fnm></au><au><snm>Clark</snm><fnm>RA</fnm></au><au><snm>Reinert</snm><fnm>K</fnm></au><au><snm>Samonte</snm><fnm>RV</fnm></au><au><snm>Schwartz</snm><fnm>S</fnm></au><au><snm>Adams</snm><fnm>MD</fnm></au><au><snm>Myers</snm><fnm>EW</fnm></au><au><snm>Li</snm><fnm>PW</fnm></au><au><snm>Evan</snm><fnm>EE</fnm></au></aug><source>Science</source><pubdate>2002</pubdate><volume>297</volume><fpage>1003</fpage><lpage>1007</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1072047</pubid><pubid idtype="pmpid" link="fulltext">12169732</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Implications of human genome architecture for rearrangement-based disorders: the genomic basis of disease</p></title><aug><au><snm>Shaw</snm><fnm>CJ</fnm></au><au><snm>Lupski</snm><fnm>JR</fnm></au></aug><source>Hum Mol Genet</source><pubdate>2004</pubdate><volume>13</volume><fpage>R57</fpage><lpage>R64</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/hmg/ddh073</pubid><pubid idtype="pmpid" link="fulltext">14764619</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Mechanisms of change in gene copy number</p></title><aug><au><snm>Hastings</snm><fnm>PJ</fnm></au><au><snm>Lupski</snm><fnm>JR</fnm></au><au><snm>Rosenberg</snm><fnm>SM</fnm></au><au><snm>Ira</snm><fnm>G</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2009</pubdate><volume>10</volume><fpage>551</fpage><lpage>564</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg2593</pubid><pubid idtype="pmcid">2864001</pubid><pubid idtype="pmpid">19597530</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Ensembl 2009</p></title><aug><au><snm>Hubbard</snm><fnm>TJB</fnm></au><au><snm>Aken</snm><fnm>BL</fnm></au><au><snm>Ayling</snm><fnm>S</fnm></au><au><snm>Ballester</snm><fnm>B</fnm></au><au><snm>Beal</snm><fnm>K</fnm></au><au><snm>Bragin</snm><fnm>E</fnm></au><au><snm>Brent</snm><fnm>S</fnm></au><au><snm>Chen</snm><fnm>Y</fnm></au><au><snm>Clapham</snm><fnm>P</fnm></au><au><snm>Clarke</snm><fnm>L</fnm></au><etal/></aug><source>Nucl Acids Res</source><pubdate>2009</pubdate><volume>37</volume><issue>Suppl 1</issue><fpage>D690</fpage><lpage>D697</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn828</pubid><pubid idtype="pmcid">2686571</pubid><pubid idtype="pmpid">19033362</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>Ultraconserved elements in the human genome</p></title><aug><au><snm>Bejerano</snm><fnm>G</fnm></au><au><snm>Pheasant</snm><fnm>M</fnm></au><au><snm>Makunin</snm><fnm>I</fnm></au><au><snm>Stephen</snm><fnm>S</fnm></au><au><snm>Kent</snm><fnm>WJ</fnm></au><au><snm>Mattick</snm><fnm>JS</fnm></au><au><snm>Haussler</snm><fnm>D</fnm></au></aug><source>Science</source><pubdate>2004</pubdate><volume>304</volume><fpage>1321</fpage><lpage>1325</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1098119</pubid><pubid idtype="pmpid" link="fulltext">15131266</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>Identification and characterization of new long conserved noncoding sequences in vertebrates</p></title><aug><au><snm>Sakuraba</snm><fnm>Y</fnm></au><au><snm>Kimura</snm><fnm>T</fnm></au><au><snm>Masuya</snm><fnm>H</fnm></au><au><snm>Noguchi</snm><fnm>H</fnm></au><au><snm>Sezutsu</snm><fnm>H</fnm></au><au><snm>Takahasi</snm><fnm>KR</fnm></au><au><snm>Toyoda</snm><fnm>A</fnm></au><au><snm>Fukumura</snm><fnm>R</fnm></au><au><snm>Murata</snm><fnm>T</fnm></au><au><snm>Sakaki</snm><fnm>Y</fnm></au><au><snm>Yamamura</snm><fnm>M</fnm></au><au><snm>Wakana</snm><fnm>S</fnm></au><au><snm>Noda</snm><fnm>T</fnm></au><au><snm>Shiroishi</snm><fnm>T</fnm></au><au><snm>Gondo</snm><fnm>Y</fnm></au></aug><source>Mamm Genome</source><pubdate>2008</pubdate><volume>19</volume><fpage>703</fpage><lpage>712</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00335-008-9152-7</pubid><pubid idtype="pmpid" link="fulltext">19015917</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>EasyGO: Gene Ontology-based annotation and functional enrichment analysis tool for agronomical species</p></title><aug><au><snm>Xin</snm><fnm>Z</fnm></au><au><snm>Zhen</snm><fnm>S</fnm></au></aug><source>BMC Genomics</source><pubdate>2007</pubdate><volume>8</volume><fpage>246</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-8-246</pubid><pubid idtype="pmcid">1940007</pubid><pubid idtype="pmpid">17645808</pubid></pubidlist></xrefbib></bibl><bibl id="B56"><title><p>Paired-End Mapping Reveals Extensive Structural Variation in the Human Genome</p></title><aug><au><snm>Korbel</snm><fnm>JO</fnm></au><au><snm>Urban</snm><fnm>AE</fnm></au><au><snm>Affourtit</snm><fnm>JP</fnm></au><au><snm>Godwin</snm><fnm>B</fnm></au><au><snm>Grubert</snm><fnm>F</fnm></au><au><snm>Simons</snm><fnm>JF</fnm></au><au><snm>Kim</snm><fnm>PM</fnm></au><au><snm>Palejev</snm><fnm>D</fnm></au><au><snm>Carriero</snm><fnm>NJ</fnm></au><au><snm>Du</snm><fnm>L</fnm></au><au><snm>Taillon</snm><fnm>BE</fnm></au><au><snm>Chen</snm><fnm>Z</fnm></au><au><snm>Tanzer</snm><fnm>A</fnm></au><au><snm>Saunders</snm><fnm>AC</fnm></au><au><snm>Chi</snm><fnm>J</fnm></au><au><snm>Yang</snm><fnm>F</fnm></au><au><snm>Carter</snm><fnm>NP</fnm></au><au><snm>Hurles</snm><fnm>ME</fnm></au><au><snm>Weissman</snm><fnm>SM</fnm></au><au><snm>Harkins</snm><fnm>TT</fnm></au><au><snm>Gerstein</snm><fnm>MB</fnm></au><au><snm>Egholm</snm><fnm>M</fnm></au><au><snm>Snyder</snm><fnm>M</fnm></au></aug><source>Science</source><pubdate>2007</pubdate><volume>318</volume><fpage>420</fpage><lpage>426</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1149504</pubid><pubid idtype="pmcid">2674581</pubid><pubid idtype="pmpid">17901297</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>Genomic drift and copy number variation of sensory receptor genes in humans</p></title><aug><au><snm>Nozawa</snm><fnm>M</fnm></au><au><snm>Kawahara</snm><fnm>Y</fnm></au><au><snm>Nei</snm><fnm>M</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2007</pubdate><volume>104</volume><fpage>20421</fpage><lpage>20426</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0709956104</pubid><pubid idtype="pmcid">2154446</pubid><pubid idtype="pmpid">18077390</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>Role of selection in fixation of gene duplications</p></title><aug><au><snm>Kondrashov</snm><fnm>FA</fnm></au><au><snm>Kondrashov</snm><fnm>AS</fnm></au></aug><source>J Theor Biol</source><pubdate>2006</pubdate><volume>239</volume><fpage>141</fpage><lpage>151</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.jtbi.2005.08.033</pubid><pubid idtype="pmpid" link="fulltext">16242725</pubid></pubidlist></xrefbib></bibl><bibl id="B59"><title><p>Bias of Selection on Human Copy-Number Variants</p></title><aug><au><snm>Nguyen</snm><fnm>DQ</fnm></au><au><snm>Webber</snm><fnm>C</fnm></au><au><snm>Ponting</snm><fnm>CP</fnm></au></aug><source>PLoS Genet</source><pubdate>2006</pubdate><volume>2</volume><fpage>e20</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.0020020</pubid><pubid idtype="pmcid">1366494,1366494</pubid><pubid idtype="pmpid">16482228</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><aug><au><snm>Ohno</snm><fnm>S</fnm></au></aug><source>Evolution by gene duplication</source><publisher>New York: Springer-Verlag</publisher><pubdate>1970</pubdate></bibl><bibl id="B61"><title><p>Reduced purifying selection prevails over positive selection in human copy number variant evolution</p></title><aug><au><snm>Nguyen</snm><fnm>DQ</fnm></au><au><snm>Webber</snm><fnm>C</fnm></au><au><snm>Hehir-Kwa</snm><fnm>J</fnm></au><au><snm>Pfundt</snm><fnm>R</fnm></au><au><snm>Veltman</snm><fnm>J</fnm></au><au><snm>Ponting</snm><fnm>CP</fnm></au></aug><source>Genome Res</source><pubdate>2008</pubdate><volume>18</volume><fpage>1711</fpage><lpage>1723</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.077289.108</pubid><pubid idtype="pmcid">2577867</pubid><pubid idtype="pmpid">18687881</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>Online Mendelian Inheritance in Man (OMIM)</p></title><url>http://www.ncbi.nlm.nih.gov/omim/</url></bibl><bibl id="B63"><title><p>Potential etiologic and functional implications of genome-wide association loci for human diseases and traits</p></title><aug><au><snm>Hindorff</snm><fnm>LA</fnm></au><au><snm>Sethupathy</snm><fnm>P</fnm></au><au><snm>Junkins</snm><fnm>HA</fnm></au><au><snm>Ramos</snm><fnm>EM</fnm></au><au><snm>Mehta</snm><fnm>JP</fnm></au><au><snm>Collins</snm><fnm>FS</fnm></au><au><snm>Manolio</snm><fnm>TA</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2009</pubdate><volume>106</volume><fpage>9362</fpage><lpage>9367</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0903103106</pubid><pubid idtype="pmcid">2687147</pubid><pubid idtype="pmpid">19474294</pubid></pubidlist></xrefbib></bibl><bibl id="B64"><title><p>Animal QTLdb: Beyond a Repository - A Public Platform for QTL Comparisons and Integration with Diverse Types of Structural Genomic Information</p></title><aug><au><snm>Hu</snm><fnm>Z</fnm></au><au><snm>Reecy</snm><fnm>JM</fnm></au></aug><source>Mamm Genome</source><pubdate>2007</pubdate><volume>18</volume><fpage>1</fpage><lpage>4</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00335-006-0105-8</pubid><pubid idtype="pmpid" link="fulltext">17245610</pubid></pubidlist></xrefbib></bibl><bibl id="B65"><title><p>Online Mendelian Inheritance in Animals (OMIA)</p></title><url>http://omia.angis.org.au/</url></bibl><bibl id="B66"><title><p>Systematic assessment of copy number variant detection via genome-wide SNP genotyping</p></title><aug><au><snm>Cooper</snm><fnm>GM</fnm></au><au><snm>Zerr</snm><fnm>T</fnm></au><au><snm>Kidd</snm><fnm>JM</fnm></au><au><snm>Eichler</snm><fnm>EE</fnm></au><au><snm>Nickerson</snm><fnm>DA</fnm></au></aug><source>Nat Genet</source><pubdate>2008</pubdate><volume>40</volume><fpage>1199</fpage><lpage>1203</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.236</pubid><pubid idtype="pmcid">2759751</pubid><pubid idtype="pmpid">18776910</pubid></pubidlist></xrefbib></bibl><bibl id="B67"><title><p>High-resolution haplotype block structure in the cattle genome</p></title><aug><au><snm>Villa-Angulo</snm><fnm>R</fnm></au><au><snm>Matukumalli</snm><fnm>LK</fnm></au><au><snm>Gill</snm><fnm>CA</fnm></au><au><snm>Choi</snm><fnm>J</fnm></au><au><snm>Van Tassell</snm><fnm>CP</fnm></au><au><snm>Grefenstette</snm><fnm>JJ</fnm></au></aug><source>BMC Genetics</source><pubdate>2009</pubdate><volume>10</volume><fpage>19</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2156-10-19</pubid><pubid idtype="pmcid">2684545</pubid><pubid idtype="pmpid">19393054</pubid></pubidlist></xrefbib></bibl><bibl id="B68"><title><p>The complete genome of an individual by massively parallel DNA sequencing</p></title><aug><au><snm>Wheeler</snm><fnm>DA</fnm></au><au><snm>Srinivasan</snm><fnm>M</fnm></au><au><snm>Egholm</snm><fnm>M</fnm></au><au><snm>Shen</snm><fnm>Y</fnm></au><au><snm>Chen</snm><fnm>L</fnm></au><au><snm>McGuire</snm><fnm>A</fnm></au><au><snm>He</snm><fnm>W</fnm></au><au><snm>Chen</snm><fnm>YJ</fnm></au><au><snm>Makhijani</snm><fnm>V</fnm></au><au><snm>Roth</snm><fnm>GT</fnm></au><au><snm>Gomes</snm><fnm>X</fnm></au><au><snm>Tartaro</snm><fnm>K</fnm></au><au><snm>Niazi</snm><fnm>F</fnm></au><au><snm>Turcotte</snm><fnm>CL</fnm></au><au><snm>Irzyk</snm><fnm>GP</fnm></au><au><snm>Lupski</snm><fnm>JR</fnm></au><au><snm>Chinault</snm><fnm>C</fnm></au><au><snm>Song</snm><fnm>XZ</fnm></au><au><snm>Liu</snm><fnm>Y</fnm></au><au><snm>Yuan</snm><fnm>Y</fnm></au><au><snm>Nazareth</snm><fnm>L</fnm></au><au><snm>Qin</snm><fnm>X</fnm></au><au><snm>Muzny</snm><fnm>DM</fnm></au><au><snm>Margulies</snm><fnm>M</fnm></au><au><snm>Weinstock</snm><fnm>GM</fnm></au><au><snm>Gibbs</snm><fnm>RA</fnm></au><au><snm>Rothberg</snm><fnm>JM</fnm></au></aug><source>Nature</source><pubdate>2008</pubdate><volume>452</volume><fpage>872</fpage><lpage>876</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature06884</pubid><pubid idtype="pmpid" link="fulltext">18421352</pubid></pubidlist></xrefbib></bibl><bibl id="B69"><title><p>Accurate whole human genome sequencing using reversible terminator chemistry</p></title><aug><au><snm>Bentley</snm><fnm>D</fnm></au><au><snm>Balasubramanian</snm><fnm>S</fnm></au><au><snm>Swerdlow</snm><fnm>H</fnm></au><au><snm>Smith</snm><fnm>G</fnm></au><au><snm>Milton</snm><fnm>J</fnm></au><au><snm>Brown</snm><fnm>C</fnm></au><au><snm>Hall</snm><fnm>K</fnm></au><au><snm>Evers</snm><fnm>D</fnm></au><au><snm>Barnes</snm><fnm>C</fnm></au><au><snm>Bignell</snm><fnm>H</fnm></au><etal/></aug><source>Nature</source><pubdate>2008</pubdate><volume>456</volume><fpage>53</fpage><lpage>59</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature07517</pubid><pubid idtype="pmcid">2581791</pubid><pubid idtype="pmpid">18987734</pubid></pubidlist></xrefbib></bibl><bibl id="B70"><title><p>The diploid genome sequence of an Asian individual</p></title><aug><au><snm>Wang</snm><fnm>J</fnm></au><au><snm>Wang</snm><fnm>W</fnm></au><au><snm>Li</snm><fnm>R</fnm></au><au><snm>Li</snm><fnm>Y</fnm></au><au><snm>Tian</snm><fnm>G</fnm></au><au><snm>Goodman</snm><fnm>L</fnm></au><au><snm>Fan</snm><fnm>W</fnm></au><au><snm>Zhang</snm><fnm>J</fnm></au><au><snm>Li</snm><fnm>J</fnm></au><au><snm>Guo</snm><fnm>Y</fnm></au><etal/></aug><source>Nature</source><pubdate>2008</pubdate><volume>456</volume><fpage>60</fpage><lpage>65</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature07484</pubid><pubid idtype="pmcid">2716080</pubid><pubid idtype="pmpid">18987735</pubid></pubidlist></xrefbib></bibl><bibl id="B71"><title><p>The first Korean genome sequence and analysis: Full genome sequencing for a socio-ethnic group</p></title><aug><au><snm>Ahn</snm><fnm>SM</fnm></au><au><snm>Kim</snm><fnm>TH</fnm></au><au><snm>Lee</snm><fnm>S</fnm></au><au><snm>Kim</snm><fnm>D</fnm></au><au><snm>Ghang</snm><fnm>H</fnm></au><au><snm>Kim</snm><fnm>DS</fnm></au><au><snm>Kim</snm><fnm>BC</fnm></au><au><snm>Kim</snm><fnm>SY</fnm></au><au><snm>Kim</snm><fnm>WY</fnm></au><au><snm>Kim</snm><fnm>C</fnm></au><au><snm>Park</snm><fnm>D</fnm></au><au><snm>Lee</snm><fnm>YS</fnm></au><au><snm>Kim</snm><fnm>S</fnm></au><au><snm>Reja</snm><fnm>R</fnm></au><au><snm>Jho</snm><fnm>S</fnm></au><au><snm>Kim</snm><fnm>CG</fnm></au><au><snm>Cha</snm><fnm>JY</fnm></au><au><snm>Kim</snm><fnm>KH</fnm></au><au><snm>Lee</snm><fnm>B</fnm></au><au><snm>Bhak</snm><fnm>J</fnm></au><au><snm>Kim</snm><fnm>SJ</fnm></au></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>1622</fpage><lpage>1629</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.092197.109</pubid><pubid idtype="pmcid">2752128</pubid><pubid idtype="pmpid">19470904</pubid></pubidlist></xrefbib></bibl><bibl id="B72"><title><p>Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two base encoding</p></title><aug><au><snm>McKernan</snm><fnm>KJ</fnm></au><au><snm>Peckham</snm><fnm>HE</fnm></au><au><snm>Costa</snm><fnm>GL</fnm></au><au><snm>McLaughlin</snm><fnm>SF</fnm></au><au><snm>Fu</snm><fnm>Y</fnm></au><au><snm>Tsung</snm><fnm>EF</fnm></au><au><snm>Clouser</snm><fnm>CR</fnm></au><au><snm>Duncan</snm><fnm>C</fnm></au><au><snm>Ichikawa</snm><fnm>JK</fnm></au><au><snm>Lee</snm><fnm>CC</fnm></au><etal/></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>1527</fpage><lpage>1541</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.091868.109</pubid><pubid idtype="pmcid">2752135</pubid><pubid idtype="pmpid">19546169</pubid></pubidlist></xrefbib></bibl><bibl id="B73"><title><p>A highly annotated whole-genome sequence of a Korean individual</p></title><aug><au><snm>Kim</snm><fnm>JI</fnm></au><au><snm>Ju</snm><fnm>YS</fnm></au><au><snm>Park</snm><fnm>H</fnm></au><au><snm>Kim</snm><fnm>S</fnm></au><au><snm>Lee</snm><fnm>S</fnm></au><au><snm>Yi</snm><fnm>JH</fnm></au><au><snm>Mudge</snm><fnm>J</fnm></au><au><snm>Miller</snm><fnm>NA</fnm></au><au><snm>Hong</snm><fnm>D</fnm></au><au><snm>Bell</snm><fnm>CJ</fnm></au><etal/></aug><source>Nature</source><pubdate>2009</pubdate><volume>460</volume><fpage>1011</fpage><lpage>1015</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2860965</pubid><pubid idtype="pmpid">19587683</pubid></pubidlist></xrefbib></bibl><bibl id="B74"><title><p>High-resolution mapping of copy-number alterations with massively parallel sequencing</p></title><aug><au><snm>Chiang</snm><fnm>DY</fnm></au><au><snm>Getz</snm><fnm>G</fnm></au><au><snm>Jaffe</snm><fnm>DB</fnm></au><au><snm>O&apos;Kelly</snm><fnm>MJ</fnm></au><au><snm>Zhao</snm><fnm>X</fnm></au><au><snm>Carter</snm><fnm>SL</fnm></au><au><snm>Russ</snm><fnm>C</fnm></au><au><snm>Nusbaum</snm><fnm>C</fnm></au><au><snm>Meyerson</snm><fnm>M</fnm></au><au><snm>Lander</snm><fnm>ES</fnm></au></aug><source>Nat Methods</source><pubdate>2008</pubdate><volume>6</volume><fpage>99</fpage><lpage>103</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nmeth.1276</pubid><pubid idtype="pmcid">2630795</pubid><pubid idtype="pmpid">19043412</pubid></pubidlist></xrefbib></bibl><bibl id="B75"><title><p>CNV-seq, a new method to detect copy number variation using high-throughput sequencing</p></title><aug><au><snm>Xie</snm><fnm>C</fnm></au><au><snm>Tammi</snm><fnm>M</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2009</pubdate><volume>10</volume><fpage>80</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-10-80</pubid><pubid idtype="pmcid">2667514</pubid><pubid idtype="pmpid">19267900</pubid></pubidlist></xrefbib></bibl><bibl id="B76"><title><p>Sensitive and accurate detection of copy number variants using read depth of coverage</p></title><aug><au><snm>Yoon</snm><fnm>S</fnm></au><au><snm>Xuan</snm><fnm>Z</fnm></au><au><snm>Makarov</snm><fnm>V</fnm></au><au><snm>Ye</snm><fnm>K</fnm></au><au><snm>Sebat</snm><fnm>J</fnm></au></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>1586</fpage><lpage>1592</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.092981.109</pubid><pubid idtype="pmcid">2752127</pubid><pubid idtype="pmpid">19657104</pubid></pubidlist></xrefbib></bibl><bibl id="B77"><title><p>Personalized copy number and segmental duplication maps using next-generation sequencing</p></title><aug><au><snm>Alkan</snm><fnm>C</fnm></au><au><snm>Kidd</snm><fnm>JM</fnm></au><au><snm>Marques-Bonet</snm><fnm>T</fnm></au><au><snm>Aksay</snm><fnm>G</fnm></au><au><snm>Antonacci</snm><fnm>F</fnm></au><au><snm>Hormozdiari</snm><fnm>F</fnm></au><au><snm>Kitzman</snm><fnm>JO</fnm></au><au><snm>Baker</snm><fnm>C</fnm></au><au><snm>Malig</snm><fnm>M</fnm></au><au><snm>Mutlu</snm><fnm>O</fnm></au><au><snm>Sahinalp</snm><fnm>SC</fnm></au><au><snm>Gibbs</snm><fnm>RA</fnm></au><au><snm>Eichler</snm><fnm>EE</fnm></au></aug><source>Nat Genet</source><pubdate>2009</pubdate><volume>41</volume><fpage>1061</fpage><lpage>1067</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.437</pubid><pubid idtype="pmcid">2875196</pubid><pubid idtype="pmpid">19718026</pubid></pubidlist></xrefbib></bibl><bibl id="B78"><title><p>Breakdancer: an algorithm for high-resolution mapping of genomic structural variation</p></title><aug><au><snm>Chen</snm><fnm>K</fnm></au><au><snm>Wallis</snm><fnm>JW</fnm></au><au><snm>McLellan</snm><fnm>MD</fnm></au><au><snm>Larson</snm><fnm>DE</fnm></au><au><snm>Kalicki</snm><fnm>JM</fnm></au><au><snm>Pohl</snm><fnm>CS</fnm></au><au><snm>McGrath</snm><fnm>SD</fnm></au><au><snm>Wendl</snm><fnm>MC</fnm></au><au><snm>Zhang</snm><fnm>Q</fnm></au><au><snm>Locke</snm><fnm>DP</fnm></au><au><snm>Shi</snm><fnm>X</fnm></au><au><snm>Fulton</snm><fnm>RS</fnm></au><au><snm>Ley</snm><fnm>TJ</fnm></au><au><snm>Wilson</snm><fnm>RK</fnm></au><au><snm>Ding</snm><fnm>L</fnm></au><au><snm>Mardis</snm><fnm>ER</fnm></au></aug><source>Nat Methods</source><pubdate>2009</pubdate><volume>6</volume><fpage>677</fpage><lpage>681</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nmeth.1363</pubid><pubid idtype="pmpid" link="fulltext">19668202</pubid></pubidlist></xrefbib></bibl><bibl id="B79"><title><p>Modil: detecting small indels from clone-end sequencing with mixtures of distributions</p></title><aug><au><snm>Lee</snm><fnm>S</fnm></au><au><snm>Hormozdiari</snm><fnm>F</fnm></au><au><snm>Alkan</snm><fnm>C</fnm></au><au><snm>Brudno</snm><fnm>M</fnm></au></aug><source>Nat Methods</source><pubdate>2009</pubdate><volume>6</volume><fpage>473</fpage><lpage>474</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nmeth.f.256</pubid><pubid idtype="pmpid" link="fulltext">19483690</pubid></pubidlist></xrefbib></bibl><bibl id="B80"><title><p>The current excitement about copy-number variation: how it relates to gene duplications and protein families</p></title><aug><au><snm>Korbel</snm><fnm>JO</fnm></au><au><snm>Kim</snm><fnm>PM</fnm></au><au><snm>Chen</snm><fnm>X</fnm></au><au><snm>Urban</snm><fnm>AE</fnm></au><au><snm>Weissman</snm><fnm>S</fnm></au><au><snm>Snyder</snm><fnm>M</fnm></au><au><snm>Gerstein</snm><fnm>MB</fnm></au></aug><source>Curr Opin Struct Biol</source><pubdate>2008</pubdate><volume>18</volume><fpage>366</fpage><lpage>374</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.sbi.2008.02.005</pubid><pubid idtype="pmcid">2577873</pubid><pubid idtype="pmpid">18511261</pubid></pubidlist></xrefbib></bibl><bibl id="B81"><title><p>Copy-number variations associated with neuropsychiatric conditions</p></title><aug><au><snm>Cook</snm><fnm>H</fnm></au><au><snm>Scherer</snm><fnm>SW</fnm></au></aug><source>Nature</source><pubdate>2008</pubdate><volume>455</volume><fpage>919</fpage><lpage>923</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature07458</pubid><pubid idtype="pmpid" link="fulltext">18923514</pubid></pubidlist></xrefbib></bibl><bibl id="B82"><title><p>The bovine lactation genome: insights into the evolution of mammalian milk</p></title><aug><au><snm>Lemay</snm><fnm>DG</fnm></au><au><snm>Lynn</snm><fnm>DJ</fnm></au><au><snm>Martin</snm><fnm>WF</fnm></au><au><snm>Neville</snm><fnm>MC</fnm></au><au><snm>Casey</snm><fnm>TM</fnm></au><au><snm>Rincon</snm><fnm>G</fnm></au><au><snm>Kriventseva</snm><fnm>EV</fnm></au><au><snm>Barris</snm><fnm>WC</fnm></au><au><snm>Hinrichs</snm><fnm>AS</fnm></au><au><snm>Molenaar</snm><fnm>AJ</fnm></au><au><snm>Pollard</snm><fnm>KS</fnm></au><au><snm>Maqbool</snm><fnm>NJ</fnm></au><au><snm>Singh</snm><fnm>K</fnm></au><au><snm>Murney</snm><fnm>R</fnm></au><au><snm>Zdobnov</snm><fnm>EM</fnm></au><au><snm>Tellam</snm><fnm>RL</fnm></au><au><snm>Medrano</snm><fnm>JF</fnm></au><au><snm>German</snm><fnm>JB</fnm></au><au><snm>Rijnkels</snm><fnm>M</fnm></au></aug><source>Genome Biol</source><pubdate>2009</pubdate><volume>10</volume><fpage>R43</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2009-10-4-r43</pubid><pubid idtype="pmcid">2688934</pubid><pubid idtype="pmpid">19393040</pubid></pubidlist></xrefbib></bibl><bibl id="B83"><title><p>Analysis of chromosome breakpoints in neuroblastoma at sub-kilobase resolution using fine-tiling oligonucleotide array CGH</p></title><aug><au><snm>Selzer</snm><fnm>RR</fnm></au><au><snm>Richmond</snm><fnm>TA</fnm></au><au><snm>Pofahl</snm><fnm>NJ</fnm></au><au><snm>Green</snm><fnm>RD</fnm></au><au><snm>Eis</snm><fnm>PS</fnm></au><au><snm>Nair</snm><fnm>P</fnm></au><au><snm>Brothman</snm><fnm>AR</fnm></au><au><snm>Stallings</snm><fnm>RL</fnm></au></aug><source>Genes Chrom Cancer</source><pubdate>2005</pubdate><volume>44</volume><fpage>305</fpage><lpage>319</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/gcc.20243</pubid><pubid idtype="pmpid" link="fulltext">16075461</pubid></pubidlist></xrefbib></bibl><bibl id="B84"><title><p>Normalization of cDNA microarray data</p></title><aug><au><snm>Smyth</snm><fnm>GK</fnm></au><au><snm>Speed</snm><fnm>TP</fnm></au></aug><source>Methods</source><pubdate>2003</pubdate><volume>31</volume><fpage>265</fpage><lpage>273</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1046-2023(03)00155-5</pubid><pubid idtype="pmpid" link="fulltext">14597310</pubid></pubidlist></xrefbib></bibl><bibl id="B85"><title><p>A new non-linear normalization method for reducing variability in DNA microarray experiments</p></title><aug><au><snm>Workman</snm><fnm>C</fnm></au><au><snm>Jensen</snm><fnm>LJ</fnm></au><au><snm>Jarmer</snm><fnm>H</fnm></au><au><snm>Berka</snm><fnm>R</fnm></au><au><snm>Gautier</snm><fnm>L</fnm></au><au><snm>Nielser</snm><fnm>HB</fnm></au><au><snm>Saxild</snm><fnm>HH</fnm></au><au><snm>Nielsen</snm><fnm>C</fnm></au><au><snm>Brunak</snm><fnm>S</fnm></au><au><snm>Knudsen</snm><fnm>S</fnm></au></aug><source>Genome Biol</source><pubdate>2002</pubdate><volume>3</volume><fpage>1</fpage><lpage>16</lpage><xrefbib><pubid idtype="doi">10.1186/gb-2002-3-9-research0048</pubid></xrefbib></bibl><bibl id="B86"><title><p>Novel Uses for Machine Learning and Other Computational Methods for the Design and Interpretation of Genetic Microarrays</p></title><aug><au><snm>Molla</snm><fnm>M</fnm></au></aug><source>PhD thesis</source><publisher>University of Wisconsin-Madison Department of Computer Sciences</publisher><pubdate>2007</pubdate></bibl><bibl id="B87"><title><p>Circular binary segmentation for the analysis of array-based DNA copy number data</p></title><aug><au><snm>Olshen</snm><fnm>A</fnm></au><au><snm>Venkatraman</snm><fnm>E</fnm></au><au><snm>Lucito</snm><fnm>R</fnm></au><au><snm>Wigler</snm><fnm>M</fnm></au></aug><source>Biostatistics</source><pubdate>2004</pubdate><volume>5</volume><fpage>557</fpage><lpage>572</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/biostatistics/kxh008</pubid><pubid idtype="pmpid" link="fulltext">15475419</pubid></pubidlist></xrefbib></bibl><bibl id="B88"><title><p>Efficient calculation of interval scores for DNA copy number data analysis</p></title><aug><au><snm>Lipson</snm><fnm>D</fnm></au><au><snm>Aumann</snm><fnm>Y</fnm></au><au><snm>Ben-Dor</snm><fnm>A</fnm></au><au><snm>Linial</snm><fnm>N</fnm></au><au><snm>Yakhini</snm><fnm>Z</fnm></au></aug><source>J Comput Biol</source><pubdate>2006</pubdate><volume>13</volume><fpage>215</fpage><lpage>228</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1089/cmb.2006.13.215</pubid><pubid idtype="pmpid" link="fulltext">16597236</pubid></pubidlist></xrefbib></bibl><bibl id="B89"><title><p>Analysis of recent segmental duplications in the bovine genome</p></title><aug><au><snm>Liu</snm><fnm>GE</fnm></au><au><snm>Ventura</snm><fnm>M</fnm></au><au><snm>Cellamare</snm><fnm>A</fnm></au><au><snm>Chen</snm><fnm>Li</fnm></au><au><snm>Cheng</snm><fnm>Z</fnm></au><au><snm>Bin</snm><fnm>Zhu</fnm></au><au><snm>Li</snm><fnm>C</fnm></au><au><snm>Song</snm><fnm>J</fnm></au><au><snm>Eichler</snm><fnm>EE</fnm></au></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>10</volume><fpage>571</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-10-571</pubid><pubid idtype="pmcid">2796684</pubid><pubid idtype="pmpid">19951423</pubid></pubidlist></xrefbib></bibl><bibl id="B90"><title><p>Shotgun sequence assembly and recent segmental duplications within the human genome</p></title><aug><au><snm>She</snm><fnm>X</fnm></au><au><snm>Jiang</snm><fnm>Z</fnm></au><au><snm>Clark</snm><fnm>RA</fnm></au><au><snm>Liu</snm><fnm>G</fnm></au><au><snm>Cheng</snm><fnm>Z</fnm></au><au><snm>Tuzun</snm><fnm>E</fnm></au><au><snm>Church</snm><fnm>DM</fnm></au><au><snm>Sutton</snm><fnm>G</fnm></au><au><snm>Halpern</snm><fnm>AL</fnm></au><au><snm>Eichler</snm><fnm>EE</fnm></au></aug><source>Nature</source><pubdate>2004</pubdate><volume>431</volume><fpage>927</fpage><lpage>930</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature03062</pubid><pubid idtype="pmpid" link="fulltext">15496912</pubid></pubidlist></xrefbib></bibl><bibl id="B91"><title><p>Segmental duplications: organization and impact within the current human genome project assembly</p></title><aug><au><snm>Bailey</snm><fnm>JA</fnm></au><au><snm>Yavor</snm><fnm>AM</fnm></au><au><snm>Massa</snm><fnm>HF</fnm></au><au><snm>Trask</snm><fnm>BJ</fnm></au><au><snm>Eichler</snm><fnm>EE</fnm></au></aug><source>Genome Res</source><pubdate>2001</pubdate><volume>11</volume><fpage>1005</fpage><lpage>1017</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.GR-1871R</pubid><pubid idtype="pmcid">311093</pubid><pubid idtype="pmpid">11381028</pubid></pubidlist></xrefbib></bibl><bibl id="B92"><title><p>NCBI GEO: archive for high-throughput functional genomic data</p></title><aug><au><snm>Barrett</snm><fnm>T</fnm></au><au><snm>Troup</snm><fnm>DB</fnm></au><au><snm>Wilhite</snm><fnm>SE</fnm></au><au><snm>Ledoux</snm><fnm>P</fnm></au><au><snm>Rudnev</snm><fnm>D</fnm></au><au><snm>Evangelista</snm><fnm>C</fnm></au><au><snm>Kim</snm><fnm>IF</fnm></au><au><snm>Soboleva</snm><fnm>A</fnm></au><au><snm>Tomashevsky</snm><fnm>M</fnm></au><au><snm>Marshall</snm><fnm>KA</fnm></au><au><snm>Phillippy</snm><fnm>KH</fnm></au><au><snm>Sherman</snm><fnm>PM</fnm></au><au><snm>Muertter</snm><fnm>RN</fnm></au><au><snm>Edgar</snm><fnm>R</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><issue>37 Database</issue><fpage>D885</fpage><lpage>D890</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn764</pubid><pubid idtype="pmcid">2686538</pubid><pubid idtype="pmpid">18940857</pubid></pubidlist></xrefbib></bibl></refgrp>
   </bm>
</art>