Email updates

Keep up to date with the latest news and content from BMC Genetics and BioMed Central.

Open Access Highly Accessed Research article

Genomic scan of selective sweeps in thin and fat tail sheep breeds for identifying of candidate regions associated with fat deposition

Mohammad Hossein Moradi1*, Ardeshir Nejati-Javaremi1, Mohammad Moradi-Shahrbabak1, Ken G Dodds2 and John C McEwan2

Author Affiliations

1 Department of Animal Science-Excellent centre for improving sheep carcass quality and quantity, University of Tehran, PO Box 3158711167-4111, Karaj, Iran

2 Centre for Reproduction and Genomics, AgResearch, Invermay, Mosgiel, New Zealand

For all author emails, please log on.

BMC Genetics 2012, 13:10  doi:10.1186/1471-2156-13-10


The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2156/13/10


Received:28 August 2011
Accepted:26 February 2012
Published:26 February 2012

© 2012 Moradi et al; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Identification of genomic regions that have been targets of selection for phenotypic traits is one of the most important and challenging areas of research in animal genetics. However, currently there are relatively few genomic regions identified that have been subject to positive selection. In this study, a genome-wide scan using ~50,000 Single Nucleotide Polymorphisms (SNPs) was performed in an attempt to identify genomic regions associated with fat deposition in fat-tail breeds. This trait and its modification are very important in those countries grazing these breeds.

Results

Two independent experiments using either Iranian or Ovine HapMap genotyping data contrasted thin and fat tail breeds. Population differentiation using FST in Iranian thin and fat tail breeds revealed seven genomic regions. Almost all of these regions overlapped with QTLs that had previously been identified as affecting fat and carcass yield traits in beef and dairy cattle. Study of selection sweep signatures using FST in thin and fat tail breeds sampled from the Ovine HapMap project confirmed three of these regions located on Chromosomes 5, 7 and X. We found increased homozygosity in these regions in favour of fat tail breeds on chromosome 5 and X and in favour of thin tail breeds on chromosome 7.

Conclusions

In this study, we were able to identify three novel regions associated with fat deposition in thin and fat tail sheep breeds. Two of these were associated with an increase of homozygosity in the fat tail breeds which would be consistent with selection for mutations affecting fat tail size several thousand years after domestication.

Background

The domestication of livestock represents a crucial step in human history. The rise of civilizations could not happen without domestication of plants and animals. Sheep (Ovis aries) is the first grazing animal known to have been domesticated [1]. Multiple mitochondrial lineages suggest that domestication occurred several times, as in other livestock species such as cattle, goat and pig [2]. Recognition of the origin of domestication is difficult by the fact that the first domestic animals were no different from their wild counterparts [3]. In spite of this, the archaeozoological evidence suggests that the domestication of sheep occurred during the Neolithic revolution approximately 9000 years ago [4] in a region in northern Iraq and nearby regions in Iran [5]. Since domestication, sheep have established in a wide geographical range due to their adaptability to poor nutrition diets, tolerance to extreme climatic conditions and their manageable size [6].

Fat tail breeds are an important class of sheep breeds that are first documented as being present 5000 years ago. The earliest depiction of a fat tail sheep is on an Uruk III stone vessel of 3000 BC and fat and thin tail sheep appear together on a mosaic standard from Ur dated around 2400 BC [3,7]. The fact that the fat tail breeds are now prevalent in the Fertile Crescent, where sheep were originally domesticated, while thin tail sheep breeds are predominant in peripheral areas [7] and that the wild ancestor of sheep is thin tail suggest that the first domesticated sheep were thin tail and fat tail was developed later. The evidence shows that sheep were being farmed throughout Europe 5000 years ago [8].

The fat tail is considered as an adaptive response of animals to a hazardous environment and is a valuable energy reserve for the animal during migration and winter. Until recently it had additional value to the herder because it was used to preserve cooked meat for longer periods of time and also as an energy reserve during times of drought and famine. Therefore the climatic variation as well as the associated requirements of humans led to artificial selection for higher fat tail weight across generations [9-11]. Nowadays in intensive and semi-intensive systems most of the advantages of a large fat tail have reduced their importance and therefore, a decrease in fat tail size is often desirable for producers and consumers. Fat deposition requires more energy than the deposition of lean tissue, animal fat has lost much of its market demand and monetary value and sheep producers have easy access to other forms of auxiliary feeding [12]. These breeds are commonly found in a wide range of countries in Asia especially the Middle East and North Africa [7]. To date, several investigations into the inheritance of fat tails have been undertaken [10,11,13-18], nevertheless the genes affecting fat deposition in fat tail breeds are still unknown.

The study of genes underlying phenotypic variation can be performed in two different ways, firstly from phenotype to genome, which is performed by LD based association mapping or by targeting particular candidate genes identified based on homology to known genes, and secondly from genome to phenotype, which involves the statistical evaluation of genomic data to identify likely targets of past selection using selective sweep analysis [19,20]. The elimination of standing variation in regions linked to a recently fixed beneficial mutation is known as a "selective sweep" and has recently been the focus of much theoretical and empirical attention [21-23].

In contrast to natural populations, domesticated species provide an exciting opportunity to understand how artificial selection promotes rapid phenotypic evolution [22]. With an hypothesis that different selection pressures operated in thin and fat tail breeds over the history of time and somehow the selection acts on a variant that is advantageous only in one breed, it is expected that the frequency of that variant may differ across populations to a greater extent than predicted for variants evolving neutrally in all populations [21]. Identifying of these genome regions, which have been subject to such selective sweeps could reveal the mutations which are responsible for the fat deposition in these breeds. The examination of variation in SNP allele frequencies between populations, which can be quantified by the statistic FST, is a promising strategy for detecting signatures of selection [21,24].

To date a relatively small number of examples have successfully identified genomic regions subject to positive selection in different domestic animals [20,25-38]. The constraint to identifying selection signatures in sheep has been the limited density of markers. This limitation has recently been reduced with the availability of tens of thousands of single nucleotide polymorphisms (SNPs) using the Ovine SNP50k BeadChip (http://www.sheephapmap.org webcite.) In this research two independent resources have been investigated to identify regions associated with fat deposition in these breeds. The first data set was comprised of one thin and one fat tail Iranian sheep breed while the second was comprised of several other fat and thin tail breeds selected from the Ovine HapMap project.

Sheep production constitutes the most important component of the Iranian livestock industry with a total of approximately 50 million head. Twenty seven breeds and ecotypes have been documented in Iran, the majority derived in situ. The Lori-Bakhtiari sheep breed is one of the most common indigenous breeds. It is well adapted to the hilly and mountainous Bakhtiari region stretched out to southern Zagros Mountains (Figure 1), with a population of more than 1.7 million. The animals are kept mostly in villages under semi-intensive systems. Relative to other Iranian fat tail breeds the Lori-Bakhtiari is a large breed, having the largest fat tail by girth and weight [39]. Sheep breeds vary in tail length and this breed, perhaps surprisingly, has a rather short tail length. Zel is the only thin-tail Iranian breed and it is present largely on the northern slopes of the Elburz mountain range near the Caspian Sea [40] representing around 3% of Iranian sheep.

thumbnailFigure 1. Traditional distributions of the two Iranian breeds used in this study are shown in (a) with breed examples of Lori Bakhtiari (b) and Zel (c).

The aim of this study was to find selective sweeps between the Iranian thin and fat tail breeds using dense SNP markers and to compare with those from similarly divergent breeds extracted from the Ovine HapMap dataset (http://www.sheephapmap.org webcite.) Our work provides the first genome wide characterization of selective sweeps in thin and fat tail breeds. Each animal was genotyped with approximately 50,000 SNPs and a variety of selection sweep tests were utilized.

Results

Genotyping and data mining

A total of 94 animals consisting of 47 samples per breed were genotyped on the Illumina OvineSNP50K Beadchip assay in the Zel-Lori Bakhtiari data set. One animal in each breed had greater than 5% missing data and these were excluded from further analysis. The second data set used in this study was SNP genotyping data from similarly divergent breeds in the Ovine HapMap project (Table 1). After data cleaning (see methods) a total of 45,611 and 48,053 SNPs passed the filtering criteria in the Zel-Lori Bakhtiari and HapMap data sets respectively and were included in the final analyses. The overall average distances between 2 adjacent SNPs were relatively consistent among the chromosomes, being about 60 kb in the Zel-Lori Bakhtiari and 58 kb in the HapMap data sets. In all cases the locations used were obtained from OAR true chromosomes (ver.1.0, as at 5/2008) from CSIRO [41]. The principal component analysis (PCA) in Zel-Lori Bakhtiari data set, using the individual SNPs as the data, resulted in the first two principal components (PC1 and PC2) explaining 18.6% and 2.9% of the variance respectively. We found that PC1 separated out the two breeds from each other while one animal in each breed was separated from all other animals for PC2 (Figure 2). These two animals were excluded from further analysis. Finally, 45 animals (36 females and 9 males) per breed passed the data cleaning steps and 95% and 95.1% of all remaining SNPs were in Hardy-Weinberg equilibrium at the 5% level in Zel and Lori-Bakhtiari breeds respectively. For the SNPs analyzed in this study, the average MAF over all samples was 0.29 (SD = 0.13) in the Zel-Lori Bakhtiari data set and 0.30 (SD = 0.12) in the HapMap data set.

Table 1. Origin and sample size of different breeds included in this study

thumbnailFigure 2. Animals clustered on the basis of principal components analysis using individual genotypes. Zel and Lori Bakhtiari breeds are shown by green and blue circles respectively.

Genomic distribution of FST in Zel-Lori Bakhtiari data set

The plot of windowed FST against location is shown in Figure 3. The average of differentiation between Zel and Lori Bakhtiari breeds was 0.024 (SD = 0.035). As shown in this figure, in several instances outlier SNPs tended to cluster to similar regions. Specifically, we found evidence of selection in seven regions with windowed FST value > 0.20 on chromosomes 2 (between 55,861-56,300 kb), 2 (between 73,631-73,784 kb), 3 (between 146,615-146,676 kb), 5 (between 47,149-47,263 kb), 7 (between 30,512-30,585 kb), 7 (between 46,642-46,843 kb) and X (between 58,621-61,452 kb).

thumbnailFigure 3. Distribution of windowed FST values for Zel versus Lori-Bakhtiari breeds by chromosome. SNP position in the genome is shown on the X-axis, and windowed FST is plotted on the Y-axis. Regions with arrows above had windowed FST value > 0.20 and were later examined for further analysis.

The average FST for autosomal and X-linked SNPs was significantly different (0.024 and 0.035, respectively; t test, t = 6.2, P < 10-10). A higher average FST for X-chromosome SNPs could occur because of its smaller effective population size compared with that of the autosomes.

Another genetic distance measurement including unbiased estimates of FST as described by Weir and Cockerham [42] was also calculated. Because the results were highly correlated (r = 0.995) with the above results, so have not been presented.

Study of Bovine genes and published QTLs in regions showing evidence of selection

Seven regions showing the largest signals of selection in Zel-Lori Bakhtiari data set were chosen for further analysis. As the current annotation of the sheep genome is not as comprehensive as cattle, the regions of interest in O. aries were compared to the corresponding areas in B. taurus. Dot plots for corresponding areas of Ovine and Bovine genomes showed strong co-linear relationships between the two considered sequences in all regions and rearrangements were not observed (Additional file 1: Figure S1).

Additional file 1. Figure S1: Dot plots comparing ovine sequences (y axis), and their corresponding area on bovine genome (x axis) for different regions.

Format: PDF Size: 986KB Download file

This file can be viewed with: Adobe Acrobat ReaderOpen Data

A summary of orthologous area in both species and published bovine genes and QTLs is presented in table 2. Orthologus genes in the bovine genome were identified using the BLAT genome search with UCSC Genome Browser [43].

Table 2. Bovine genes and published QTLs in regions showing evidence of selection in Zel vs Lori Bakhtiari data set

Online databases of published QTL in beef and dairy cattle, show that the regions identified here were previously been found to be in regions harboring QTL affecting fat and also carcass yield traits (Table 2). For example, the regions on chromosomes 2 overlapped with QTLs previously suggested as being related with fat depth, and also chromosomes 2 (second region) and 5 with fat thickness and both regions on chromosomes 7 with milk fat yield.

Genomic distribution of FST in Ovine HapMap data set and comparison with Zel-Lori Bakhtiari data set results

An independent resource comprised of similarly divergent breeds for thin and fat tail breeds in the Ovine HapMap project were selected to determine whether the regions with large allele differentiation in Zel-Lori Bakhtiari data set could be confirmed. The pattern of FST across the genome was also calculated in the Ovine HapMap data set (fat and thin tail breeds were pooled) and values were averaged in a sliding window. The windowed FST was then plotted against location in the genome. The windowed FST values, at each common SNP, were correlated in the two data sets with r = 0.413, N = 45,238 and the average of differentiation between thin and fat tail breeds in the Ovine HapMap data set was 0.027 (SD = 0.038). The sliding window average FST revealed good agreement between both data sets for regions on chromosomes 5, 7 (second region) and X (Figure 4).

thumbnailFigure 4. Distribution of windowed FST values for Zel-Lori Bakhtiari and HapMap data sets for candidate regions by chromosome. SNP position in the genome is shown on the X-axis, and windowed FST is plotted on the Y-axis. All arrows show regions with largest allele differentiation in Zel-Lori Bakhtiari data set and the red arrows are highlighting the peaks that confirmed by HapMap dataset. Zel-Lori Bakhtiari and HapMap data sets are shown by blue and red circles respectively.

Calculation of empirical p-values for windowed FST values in each data set (Table 3) shows that the 3 regions with the largest differentiation in Zel-Lori Bakhtiari data set are also significant in the Ovine HapMap data set (P < 0.01). On the other hand, combining p-values from independent tests of significance in Zel-Lori Bakhtiari and Ovine Hapmap data sets using Fisher's combined test revealed the first region on chromosome 2 (P < 0.0001) in addition to these same significant regions.

Table 3. The table of significant test for Zel-Lori Bakhtiri and ovine HapMap data set.

The Weir and Cockerham method of estimating FST [42] was also performed for the ovine HapMap data and a high correlation (r = 0.992) was observed. Previous reports [e.g. [24,52]] also indicated these methods led to similar results; therefore, a strong correlation between these two measures is not surprising.

Study of median homozygosity in candidate regions

Median run of homozygosity (Figure 5) was increased at the candidate regions on chromosome 5 and X for Lori Bakhtiari (fat tail) and at the candidate region on chromosome 7 for Zel (thin tail). The largest differences of median homozygosity were located on chromosome X and this was present for a longer distance as well, whereas these statistics were lower on Chromosome 5. A study of median homozygosity in the HapMap data set for thin and fat tail breeds (data not shown) revealed similar results.

thumbnailFigure 5. Median homozygosity run length (n = 50 SNPs) surrounding candidate regions in Zel (blue triangular) and Lori Bakhtiari (red square) breeds. Regions with largest FST (> 0.20) value are shown on the X-axis.

Estimation of effective population size and haplotype age

The average extent of LD in the genome was used to estimate the effective population size at various times in the past. The estimated effective population size show a persistent decline from 2000 down to 20 generations ago, declining from 4900 (both breeds) to 840 in Zel and 532 in Lori Bakhtiari.

The graph of effective population sizes suggests a distinctive time point when the breeds separated ~1,100 generations ago (Additional file 2: Figure S2). Assuming an average generation interval in sheep of around 5 years or 5500 before present, this is congruent with the first archaeological evidence for the fat tail sheep breeds (~5,000 years ago).

Additional file 2. Figure S2: Change in effective population size across generations as estimated from linkage disequilibrium data.

Format: PDF Size: 37KB Download file

This file can be viewed with: Adobe Acrobat ReaderOpen Data

The average effective population size over the period was calculated using reciprocals as described by Falconer [53]. Using this average estimated effective population size and based on the current frequency of sweeps in our regions of interest [54], the age of sweeps under the assumption of selection neutrality and genetic drift has been inferred for chromosomes 5, 7 and X to be approximately 7100, 9600, and 6900 generations before present respectively.

These estimated ages should not be taken to imply a burst of selection at a particular time; however, these ages might represent areas in the parameter space in which we have good power.

Discussion

Approximately 25% of the world sheep population comprises fat tail breeds that are grazed in a wide range of countries especially in Asia and northern parts of Africa. In these typically arid countries sheep breeding has an important place in the local economy and as a source of protein. Due to the overt and easily defined nature of a fat tail, the gene variants affecting the phenotypic expression of this trait are a topic of both theoretical and economic curiosity. The former from the perspective of the impact and nature of domestication and the latter because the trait is now markedly less important commercially due to improved forage availability and decreased price for the product.

In this research we have developed an initial selection map for the fat tail trait in sheep. The objective was to localize genomic regions potentially affecting the trait. This was undertaken by comparing fat and thin tail breeds using the newly available 50K SNPChip. The chip has already proven its ability to map causal mutations for traits showing Mendelian inheritance such as yellow fat [55] and microphthalmia [56].

It has been previously shown in cattle, that domestication, subsequent breed formation and artificial selection, leave detectable signatures of selection in several regions of the genome [36]. Given the similarities in domestication time and selection between cattle and sheep, it is likely that the ovine genome contains similar signatures.

The Iranian breeds were chosen for this study, because they originated both near the centre of domestication [5] and near the first recorded archaeological evidence of fat tail sheep [3,7]. It was felt this geographic proximity would reduce false positives due to bottlenecks and selection pressure due to alternative factors such as climate, disease and pasture types. However, we expected that some signals would still be spurious so we then independently validated the results using a different set of breeds obtained from the Ovine HapMap project. The latter project did not formally phenotype the individuals concerned so these breeds were classified by us solely on publicly available breed descriptions. It is important to note that the selection sweep approach is one of the few viable approaches for investigating the genomics of this trait given the poor level of historical recording and DNA sampling in the relevant breeds. It is also one of the most cost effective and powerful techniques albeit as previously stated also subject to potential false positive signals.

Several statistical approaches have been devised to detect evidence of selective sweeps. These different statistics detect different aspects of the pattern of variation left by selection of a beneficial mutation. Generally, the power of the tests also depends on the strength, duration and age of the selective sweep [21]. Unlike FST, tests based on linkage disequilibrium like iHS [57] and XP-EHH [58] which are used primarily on human SNP data, where there are now millions of SNPs available, are dependent on SNP spacing and frequency as they are multi-marker tests. The 50,000 SNPs available for sheep may not provide enough information for these tests; it has been suggested in cattle that 150,000 evenly spaced SNPs would be required to study selective signatures in all parts of the bovine genome [29]. Additionally, the power of these methods also depends on the ancestral allele information which is still available for only a portion of the SNPs on the ovine chip.

However, it should be considered that robust inference of recent positive selection using FST is complicated by the fact that the distribution of genetic variation effect due to selection can be difficult to distinguish from that which arises after certain demographic events. Some previous studies that have attempted to identify signatures of selection based on patterns of population differentiation have used simulations to obtain the expected distribution of FST under the assumption of no selection [24,33,34]. However, the simulated distribution of FST is strongly dependent on its underlying assumptions about population demographic history, which is rarely known with any degree of certainty [24,37]. Despite this problem, examining several thousand loci using newly available SNP chips provides an exciting opportunity to distinguish between the effects of population structure, positive selection and ascertainment bias. Demographic events and ascertainment bias would be expected to alter patterns of FST across the whole genome in a similar way, while selection events would be expected to alter FST values only in selected and nearby loci [29-34].

In general, we found a low differentiation between populations, with a mean of 0.024 (SD = 0.035) for the Zel-Lori Bakhtiari data set and 0.027 ± 0.038 for the HapMap data set. This agrees with Kijas et al. [6] who, using 23 domestic breeds and two wild sheep species, revealed that sheep breeds showed generally low differentiation which is consistent with their short evolutionary history.

The general pattern of the signals across chromosomes was different in the HapMap data set and there were differences in the locations of the major peaks (data not shown). However, Barendse et al. [29] observed similar results when comparing Bovine Australian and HapMap data. Their windowed FST values between two data sets were correlated with r = 0.346, whereas this correlation in our data sets was r = 0.413. In general this correlation with different breeds may suggest that each study had identified signals of divergence particular to the genetic history of those breeds, and only some of which may be due to selection (as described by Barendse et al. [29]). This correlation for our candidate regions was improved to r = 0.963.

We found few genes in the regions of interest and no particular candidate genes related to fat deposition were identified. However, the current annotation of the cattle genome is not as comprehensive as in humans, so these areas cannot be dismissed as not containing any genes or regulatory elements. It has been observed that while some proposed candidates for selection have strong support in the form of a functional mutation with an identified phenotypic effect, often the functional target is unknown [21]. As the cattle genome becomes more comprehensively annotated in these regions likely targets of selection may be identified.

A variety of alternative comparisons, with the various thin and fat tail breeds using the combined HapMap and Zel-Lori Bakhtiari data set, can be examined in addition to the comparisons presented here. In a comparison, when we compared some fat tail breeds together (Afshari and Lori Bakhtiari breeds), we found the FST peak in the same location on chromosome 2 region 1 which was significant based on Fisher's combined test. The orthologous area for this region (using Blat) on the cattle genome would be around 8:62,600,000-63,000,000 (Table 2). Several bovine studies have also detected signatures of selection in this region. MacEachern et al. [34] compared differences in allelic frequencies of Australian Angus and Holstein cattle at 7,500 SNPs. They reported a region with large differences among breeds at 61,300,000 to 62,500,000 bp on BTA8. Stella et al. [37] observed a significantly large signal of selection in the same location for the Holstein and Jersey breeds. Although neither study has reported any particular candidate gene for this observation, previously identified QTLs (Table 2) could suggest a candidate region for body composition and carcass traits in this chromosomal region.

Using median homozygosity plot in these regions, we identified that homozygosity has been increased on chromosome 5 and X in favour of fat tail breeds. This is consistent with a recessive mode of inheritance. Therefore future studies should track both the phenotype of this trait and the genotype status in these regions in F2 crosses to provide independent and causal evidence that these regions do in fact affect the fat tail trait and to verify its mode of inheritance.

Given that one of the most differentiated regions was located on chromosome X, this suggests sex linked differences could be present. In several unpublished experiments when a research institute in central Iran crossed wild sheep (male) with a couple of fat tail ewes, according to the reports all resulting lambs were thin tail. This suggests dominance of the thin tail phenotype and does not support a simple single gene located on sex chromosome alone. Similarly, when two European thin tail breeds were crossed with Iranian fat tail ewes, all of the more than 350 crossbred lambs of both sexes subsequently born were thin tail and no fat tail lambs were observed. This supports the previous observations and again does not support sex linked effects, unless some epistatic effects exist or any relevant single gene located on chromosome X should be located in a pseudo-autosomal regions.

In general, there is little published information related to tail fatness in crosses between native fat and thin tail sheep breeds and also the relatively small number of animals used in these studies makes it difficult to obtain reliable estimates pertaining to this question [10,11,13-18]. However, in several experiments measurements suggest crosses were intermediate between fat and thin tail sheep [11,17]. This conflicts with the unpublished Iranian results and does raise the possibility that the results observed may depend on the fat and thin tail breeds used in the cross. In our opinion, more detail and larger scale experiments are needed to confirm the results on growth and carcass characteristics of crossbred lambs of different sexes. If such studies are undertaken the nature of inheritance on the X chromosome can be rather difficult to elucidate, especially if imprinting is also suspected [59].

A result which is irrelevant to the inheritance of the trait, but provides an insight into a possible mechanism of fat deposition in this organ, are the results of Gökdal et al. [10] who examined the effects of docking in fat tail breeds. The carcasses of the docked group contained more kidney, pelvic and internal fat than the intact lambs as well as a higher percentage of subcutaneous and intramuscular fat. The weights of the different carcass cuts of the docked lambs were also heavier that those of the intact group. However, there was little change in overall carcass composition suggesting that the genes affecting the fat tail phenotype are associated with the localization of fat stores to a regional depot rather than control of the overall level of fat deposition. This observation also may provide support to the suggestion that this trait is a result of human mediated selection as it is difficult to postulate any natural selection criterion favoring deposition in this area.

Finally, a search for comparable depots in other species especially grazing mammals identified Bacterian humps in camels and humps in bos indicus cattle breeds as possible analogous features that have been selected for in animals grazing in arid and tropical regions subjected to wide fluctuations in food supply. The closest equivalent structure present in all mammals is possibly the mammary gland with its associated mammary fad pad. The current work, while not able to answer these hypotheses about the origin of the fat tail and analogous structures in other species directly, does offer hope that these questions will soon be able to be answered by further experimentation.

Conclusions

In this paper, we present the first genome wide characterization of selective sweeps in thin and fat tail breeds with the aim of identifying regions associated with fat deposition. This trait is one of the most challenging areas of research in most countries grazing fat tail breeds. The analysis was performed on two independent resources using either Iranian or Ovine HapMap genotyping data for thin and fat tail breeds. Population differentiation using FST in Iranian thin and fat tail breeds revealed seven genomic regions. Almost all of these regions overlapped with QTLs that had previously been identified as affecting fat and carcass yield traits in beef and dairy cattle. Study of selection sweep signatures using FST in thin and fat tail breeds sampled from the Ovine HapMap project confirmed three of these regions located on chromosomes 5, 7 and X. Using a median run of homozygosity plot in these regions, we identified that homozygosity has been increased on chromosome 5 and X in favour of fat tail breeds and on chromosome 7 in favour of thin tail breeds. Those associated with an increase of homozygosity in the fat tail breed would be consistent with selection for mutations affecting fat tail size several thousand years after domestication.

Methods

Animal samples

In this research two independent data sets were used to compare frequencies of SNP alleles in thin and fat tail breeds.

Zel-Lori Bakhtiari data set

Blood samples were collected from flocks which have recently utilized the registration and recording system of the ABCI (Animal Breeding Centre of Iran). Two factors were considered for selecting the samples: selection of unrelated animals and sampling those that spanned the diversity of the breed. When sampling in pedigreed animals we ensured that they had no common grandparents and in non pedigreed animals we typically selected 4-5 animals from each flock each from different age classes. All animals had the following phenotypic records measured: herd no., location, GPS coordinates, sex, horn status, parity, approximate age, wool and skin colour, fat tail dimensions, height and weight. Animal sampling for the Zel breed was performed in the northern province of Mazandaran, south of the Caspian Sea and for the Lori-Bakhtiari breed in the Chaharmahal and Bakhtiari provinces, located in the south western part of Iran close to the Zagros Mountain ranges on farms distributed across the traditional rearing area of each breed (Figure 1). In general, more than 100 blood samples per each breed were collected and finally 47 samples were selected to be genotyped in each breed.

Ovine HapMap data set

The second data set used in this study were genotypes of similarly divergent breeds from the Ovine HapMap project of the ISGC (International Sheep Genomics Consortium) and are available to the public at http://www.sheephapmap.org webcite. There are many thin tail breeds in the HapMap project, so four breeds from geographically diverse locations and similar tail shape to Zel breed were selected. Since fat tail breeds in the HapMap project were mostly from a relatively narrow geographic region and their number was limited, three breeds with similar histories and fat tail shape to Lori Bakhtiari breed was selected (Table 1).

Genotyping and data quality control

The Ovine HapMap project genotype data were downloaded from http://www.sheephapmap.org/download.php webcite. Genomic DNA in the Zel-Lori Bakhtiari data set was extracted from whole blood by applying a modified salting out protocol [60] and DNA samples diluted to 50 ng/ul for genotyping. A total of 94 animals consisting of 47 samples per breed were genotyped on the Illumina OvineSNP50K Beadchip assay at the Centre for Reproduction and Genomics (CRG), Invermay, New Zealand using standard procedures (http://www.illumina.com webcite) The same set of 49,018 SNPs randomly distributed across the genome which passed the HapMap criteria [61] were used in both data sets.

To ensure the overall quality of samples and a consistent set of genotypes, quality control filters were applied to the initial data. All samples with more than 5% missing data were excluded from further analysis. The rationale for this is that DNA which is not of high quality will be more likely to have more missing data and also have incorrect genotype calls [29]. Then for each SNP, minor allele frequency (MAF) and percentage of calls (how many sheep the marker worked for) was calculated. The SNPs that had a call rate less than 95% and a MAF (over all animals) less than 2% were discarded [62]. For the remaining SNPs outlier departure from Hardy-Weinberg equilibrium (p < 10-6) over all animals of a breed were used for identifying genotyping errors. Although departure from HWE might result from selection, it is most likely that technical problems explain this result and they were therefore excluded from subsequent analyses [63]. The Bonferroni correction (β = α/n) was used to address the problem of multiple comparisons [64]. The number of tests was taken to be the number of SNPs (n = 50,000) giving β = 10-6 corresponding to α = 0.05 experiment-wise error. A principal component analysis (PCA) was performed using the prcomp function in the R package and the samples which were located outside of their expected breed cluster were excluded from further analysis. Finally markers were filtered to exclude loci assigned to unmapped contigs. The HapMap genotyping data had already passed the quality control process and only MAF and call rate were considered (Additional file 3: Table S3 and Additional file 4: Table S4).

Additional file 3. Table S3: Summary table for data cleaning in the Zel-Lori Bakhtiari data set.

Format: PDF Size: 68KB Download file

This file can be viewed with: Adobe Acrobat ReaderOpen Data

Additional file 4. Table S4: Summary table for data cleaning in the Ovine HapMap data set.

Format: PDF Size: 63KB Download file

This file can be viewed with: Adobe Acrobat ReaderOpen Data

Analysis

Estimates of Wright's FST and Weir & Cockerham's Theta

To determine a genome wide pattern of positive selection, and to compare this with the Ovine HapMap project samples, the basic form of Wright's fixation index (FST), which measures the degree of genetic differentiation between subpopulations, based on genetic polymorphism data, was calculated as described by MacEachern et al. [34]:

F S T = H T - H S H T

Where HT denotes the expected heterozygosities for overall total populations,

H T = 1 - ( p - 2 - q - 2 )

And HS denotes the expected heterozygosities in subpopulations,

H S = i = 1 n H exp i × n i N T o t a l

In the above formula, p ̄ and q ̄ stand for the frequency of allele A1 and A2 over the total population and Hexpi and ni denote expected heterozygosity and sample size in subpopulation i.

All FST values in this study are for a single locus. The value of FST can theoretically range from 0 (no differentiation) to 1 (complete differentiation, in which populations are fixed for different alleles). Signatures of positive selection can be recognized when adjacent SNPs all show high FST [24], due to the hitch-hiking effect [65], implying divergent selection between breeds.

Smoothing, where a moving average of a certain number of markers is taken, is a method of looking for regions where selection is apparent over multiple markers, rather than one-off high values. The optimal size of the window depends on how long ago the selection sweep occurred, as LD breaks down with time. An arbitrary window of 5 markers (~300 Kbp) was chosen as it appeared to provide the better signal. Smoothing raises the issue of an appropriate adjustment for multiple testing, but we use it only for graphic illustration, with the singular SNP values used when looking at specific regions. The windowed FST values were plotted against genome location.

One of the main problems with Wright's measure of FST is that it does not account for sampling error. This was corrected for by Weir and Cockerham [42] who developed the unbiased estimator. We used this method in both data sets. As originally defined [66], the range of FST is between 0 and 1. However, it is possible for the above unbiased estimate of FST to assume negative values as well [24].

By ranking the sliding window FST in the each data set, the empirical P-value was calculated for each SNP as the proportion of SNPs with a sliding window FST value at least as extreme as the value for that SNP.

We applied Fisher's [67] method to combine the results from the k = 2 independent resources used, using the following statistic which follows a chi-square distribution with 2k degrees of freedom, under the null hypothesis:

X 2 = - 2 i = 1 K log e ( p i )

Where, pi is the p-value for the ith hypothesis test. The significance level considered for each experiment was 10-2 and the chosen level for combined experiment was 10-4.

Study of genes and QTLs in our candidate regions

It is useful to compare regions of interest in O. aries to the corresponding areas in B. taurus, as the taurine genome is better annotated. Before comparisons of Ovine and Bovine sequences, dot plots were created using Gepard [68], with Ovine (OAR true chromosomes v 1.0) sequences obtained from Ovine Gbrowse [69], and Bovine (btau4) sequences obtained from the UCSC Genome Browser [43]. A dot plot is a graphical method for comparing two sequences, and identifies regions of similarity between them by organizing one sequence on the x-axis, and another one on the y-axis of a plot [70]. When the residues of both sequences match at the same location, a dot is drawn. The straightness of the line is indicative of the relationship between the two sequences.

In order to discover if any of the regions implicated by the whole genome analysis contained genes of interest we used OAR true chromosomes (ver.1.0, as at 5/2008) from CSIRO [41]. After obtaining the Ovine sequence of our candidate regions using this browser, the genes in cow genome were identified by BLAT search using the UCSC Genome Browser on Cow Oct. 2007 (Baylor 4.0/bosTau4) Assembly. BLAT on DNA is designed to quickly find sequences of 95% and greater similarity of length 25 bases or more [43]. Gene function was determined using Online Mendelian Inheritance in Man (OMIM) at http://www.ncbi.nlm.nih.gov/omim/ webcite and Uniprot at http://biogps.gnf.org/ webcite.

We also explored two QTL databases available online http://genomes.sapac.edu.au/bovineqtl/index.html webcite, http://www.animalgenome.org/QTLdb/cattle.html webcite to identify any overlapping of the candidate regions with published QTL in dairy and beef cattle.

Median run of homozygosity

One method for looking for selection between breeds and also finding the breed that has been under selection is a comparison of homozygosity in a region. When a causal mutation is selected for in one breed and not the other, one breed would show high homozygosity in a genomic interval while the other does not. For each SNP in each animal, the length of a run of homozygosity (number of consecutive homozygous SNPs including the one being considered) was calculated (this would be zero if the SNP being considered was heterozygous). For each marker the median length, over the breed, of the run of homozygosity was calculated and plotted against genomic position in the candidate regions (25 SNPs on each side) for each breed. This is similar to looking at LD in those areas where a run of homozygosity (and therefore a high median homozygosity) indicates selection. Median homozygosity on chromosome X was calculated using only females (36 animals per each breed).

Estimation of Ne and haplotype age

A pair of haplotypes was estimated for each animal in the sample using fastPHASE Version 1.2 [71]. The estimated pairwise haplotype frequencies were used to calculate the squared correlation coefficient between the 2 loci (r2) following Hill and Robertson [72] as:

r 2 = ( f r e q ( A 1 - B 1 ) × f r e q ( A 2 - B 2 ) - f r e q ( A 1 - B 2 ) × f r e q ( A 2 - B 1 ) ) 2 f r e q ( A 1 ) × f r e q ( A 2 ) × f r e q ( B 1 ) × f r e q ( B 2 )

Where for example freq (A1-B1) is the frequency of haplotype marker 1 allele 1, marker 2 allele 1, freq (A1) is the frequency of marker 1 allele 1, and freq (B1) is the frequency of marker 2 allele 1 [27].

The effective population size was then estimated using the approximate expectation of r2 as:

E ( r 2 ) = 1 1 + 4 N t c

Where Nt is the effective population size 1/(2c) generations in the past, E(r2) is estimated by the average of r2 values for all pairs of SNPs, and c is the median of distances in Morgans [27,73-75].

To obtain a crude estimate of the ages of sweeps, we used the age estimation based on current frequency as -4N [p(log p)/(1 - p)], where N is effective population size and p is the current frequency of the derived allele (selected sweep) [53].

Authors' contributions

MHM planned, performed the analyses and drafted the manuscript, ANJ contributed to the Iranian data set collection and supervised the analysis, MMS coordinated the study and sample collection, KGD provided statistical and analysis support and JCM supervised the analysis and provided the Ovine HapMap Data set. All authors have contributed to the editing of the article, and approved the final manuscript.

Acknowledgements

The authors gratefully acknowledge the International Sheep Genomics Consortium for access to the Ovine HapMap genotypes prior to publication and also Animal Breeding Center of Iran (ABCI) for access to the records and animals of the Iranian breeds. Thanks to the staff from university of Tehran, Animal Science Research Institute of Iran and AgResearch, especially Dr. Abbasi, Dr. Amirinia, Dr. Rudiger Brauning, Ms. Rayna Anderson, Ms. Kathryn McRae and Ms. Natalie Pickering who helped and supported this research. The authors also acknowledge the financial contributions of Animal Science Research Institute of Iran, Mobarakandish Institute and AgResearch, New Zealand.

References

  1. Chessa B, Pereira F, Arnaud F, Amorim A, Goyache F, Mainland I, Kao RR, Pemberton JM, Beraldi D, Stear MJ, et al.: Revealing the history of Sheep domestication using retrovirus integrations.

    Science 2009, 324(5926):532-536. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  2. Meadows JRS, Cemal I, Karaca O, Gootwine E, Kijas JW: Five Ovine mitochondrial lineages identified from Sheep breeds of the Near East.

    Genetics 2007, 175(3):1371-1379. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  3. Ryder ML: Sheep and Man. London: Duckworth press; 1983:17-92. OpenURL

  4. Handley LJL, Byrne K, Santucci F, Townsend S, Taylor M, Bruford MW, Hewitt GM: Genetic structure of European sheep breeds.

    Heredity 2007, 99:620-631. PubMed Abstract | Publisher Full Text OpenURL

  5. Zeder MA: Animal domestication in the Zagros: a review of past and current research.

    Pale'orient 1999, 25:11-26. PubMed Abstract OpenURL

  6. Kijas JW, Townley D, Dalrymple BP, Heaton MP, Maddox JF, McGrath A, Wilson P, Ingersoll RG, McCulloch R, McWilliam S, Tang D, McEwan J, et al.: A Genome Wide Survey of SNP Variation Reveals the Genetic Structure of Sheep Breeds.

    PLoS ONE 2009, 4(3):e4668. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  7. Davidson A: The Oxford Companion to Food. 2nd edition. Edited by Tom Jaine. Oxford University Press; 2006:293-294.

  8. Ryder M: Evolution of domesticated animals. Longman: London and New York; 1981:63-85. OpenURL

  9. Ermias E, Yami A, Rege JEO: Fat deposition in tropical sheep as adaptive attribute to periodic feed fluctuation.

    J Anim Breed Genet 2002, 119:235-246. Publisher Full Text OpenURL

  10. Gokdal O, Ulker H, Karakus F, Cengiz F, Temur C, Handil H: Growth, feedlot performance and carcass characteristics of Karakas and crossbred lambs (F1) (Ile de France × Akkaraman (G1) × Karakas) under rural farm conditions in Turkey.

    South African Journal of Animal Science 2004, 34(4):223-232. OpenURL

  11. Kashan NEJ, Manafi-Azar GH, Afzalzadeh A, Salehi A: Growth performance and carcass quality of fattening lambs from fat-tailed and tailed sheep breeds.

    Small Ruminant Research 2005, 60:267-271. Publisher Full Text OpenURL

  12. Nejati-Javaremi A, Izadi F, Rahmati Gh, Moradi M: Selection in fat-tailed sheep based on two traits of fat-tail and body weight versus single-trait total body weight.

    Int J Agri Biol 2007, 9(4):645-648. OpenURL

  13. Güney O: Commercial crossbreeding between Ile-de-France, Rambouillet, Chios and local fat-tail Awassi for market lamb production.

    Small Ruminant Research 1990, 3(6):449-456. OpenURL

  14. Farid A: Slaughter and carcass characteristics of three fat-tailed sheep breeds and their crosses with Corriedale and Targhee rams.

    Small Ruminant Research 1991, 5(3):255-271. Publisher Full Text OpenURL

  15. Momani-Shaker M, Abdullah AY, Kridli RT, Blaha J, Sada I, Sovjak R: Fattening performance and carcass value of Awassi ram lambs, F1 crossbreds of Romanov × Awassi and Charollais × Awassi in Jordan.

    Czech J Anim Sci 2002, 47(10):429-438. OpenURL

  16. Unal N, Akcapinar H, Aytac M, Atasoy F: Fattening performance and carcass traits in crossbred ram lambs.

    Medycyna Wet 2006, 62(2):401-404. OpenURL

  17. Khaldari M, Kashan NEJ, Afzalzadeh A, Salehi A: Growth and carcass characteristics of crossbred progeny from lean tailed and fat tailed sheep breeds.

    South African Journal of Animal Science 2007, 37(1):51-56. OpenURL

  18. Marai IFM, Daader AH, Bahgat LB: Performance traits of purebred Ossimi and Rahmani lambs and their crosses with Finnsheep born under two accelerated mating systems.

    Arch Tierz 2009, 52:497-51. OpenURL

  19. Akey JM: Constructing genomic maps of positive selection in humans: Where do we go from here.

    Genome Res 2009, 19(5):711-722. PubMed Abstract | Publisher Full Text OpenURL

  20. Qanbari S, Pimentel ECG, Tetens J, Thaller G, Lichtner P, Sharifi AR, Simianer H: A genome-wide scan for signatures of recent selection in Holstein cattle.

    Anim Genet 2010, 41(4):377-89. PubMed Abstract | Publisher Full Text OpenURL

  21. Sabeti PC, Schaffner SF, Fry B, Lohmueller J, Varilly P, Shamovsky O, Palma A, Mikkelsen TS, Altshuler D, Lande ES: Positive natural selection in the human lineage.

    Science 2006, 3129(5780):1614-1620. OpenURL

  22. Biswas S, Akey JM: Genomic insights into positive selection.

    TRENDS in Genetics 2006, 22(8):437-436. PubMed Abstract | Publisher Full Text OpenURL

  23. McRae K: Signatures of selective sweeps in parasite selection flocks. University of Otago, Zoology department: MSc thesis; 2011. OpenURL

  24. Akey JM, Zhang G, Zhang K, Jin L, Shriver MD: Interrogating a high density SNP map for signatures of natural selection.

    Genome Res 2002, 12(12):1805-14. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  25. Pollinger JP, Bustamante CD, Fledel-Alon A, Schmutz S, Gray MM, Wayne RK: Selective sweep mapping of genes with large phenotypic effects.

    Genome Research 2005, 15:1809-1819. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  26. Worley K, Carey J, Veitch A, Coltman DW: Detecting the signature of selection on immune genes in highly structured populations of wild sheep (Ovis dalli).

    Mol Ecol 2006, 15(3):623-637. PubMed Abstract | Publisher Full Text OpenURL

  27. Hayes BJ, Lien S, Nilsen H, Olsen HG, Berg P, MacEachern S, Potter S, Meuwissen THE: The origin of selection signatures on bovine chromosome six.

    Anim Genet 2008, 39(2):105-111. PubMed Abstract | Publisher Full Text OpenURL

  28. Prasad A, Schnabel RD, McKay SD, Murdoch B, Stothard P, Kolbehdari D, Wang Z, Taylor JF, Moore SS: Linkage disequilibrium and signatures of selection on chromosomes 19 and 29 in beef and dairy cattle.

    Anim Genet 2008, 39:597-605. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  29. Barendse W, Harrison BE, Bunch RJ, Thomas MB, Turner LB: Genome wide signatures of positive selection: The comparison of independent samples and the identification of regions associated to traits.

    BMC Genomics 2009, 10:178. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  30. Dayo GK, Thevenon S, Berthier D, Moazami-Goudarzi K, Denis C, Cuny G, Eggen A, Gautier M: Blackwell Publishing Ltd Detection of selection signatures within candidate regions underlying trypanotolerance in outbred cattle populations.

    Molecular Ecology 2009, 18:1801-1813. PubMed Abstract | Publisher Full Text OpenURL

  31. Flori L, Fritz S, Jaffre zic F, Boussaha M, Gut I, Heath S, Foulley JL, Gautier M: The genome response to artificial selection: a case study in dairy cattle.

    PLoS ONE 2009, 4(8):e6595. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  32. Gu J, Orr N, Park SD, Katz LM, Sulimova G, MacHugh DE, Hill EW: A genome scan for positive selection in thoroughbred horses.

    PLoS ONE 2009, 4(6):e5767. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  33. Hayes BJ, Chamberlain AJ, Maceachern S, Savin K, McPartlan H, MacLeod I, Sethuraman L, Goddard ME: A genome map of divergent artificial selection between Bos taurus dairy cattle and Bos taurus beef cattle.

    Anim Genet 2009, 40(2):176-184. PubMed Abstract | Publisher Full Text OpenURL

  34. MacEachern S, Hayes B, McEwan J, Goddard M: An examination of positive selection and changing effective population size in Angus and Holstein cattle populations (Bos taurus) using a high density SNP genotyping platform and the contribution of ancient polymorphism to genomic diversity in Domestic cattle.

    BMC Genomics 2009, 10:181. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  35. Pariset L, Joost S, Ajmone-Marsan P, Valentini A: Econogene Consortium(EC): Landscape genomics and biased FST approaches reveal single nucleotide polymorphisms under selection in goat breeds of North-East Mediterranean.

    BMC Genetics 2009, 10:7. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  36. The Bovine HapMap Consortium: Genome wide survey of SNP variation uncovers the genetic structure of cattle breeds.

    Science 2009, 324(5926):528-532. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  37. Stella A, Ajmone-Marsan P, Lazzari B: Boettcher Paul: Identification of selection signatures in cattle breeds selected for dairy production.

    Genetics 2010, 185:1451-1461. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  38. Qanbari S, Gianola D, Hayes B, Schenkel F, Miller S, Moore S, Thaller G, Simianer H: Application of site and haplotype-frequency based approaches for detecting selection signatures in cattle.

    BMC Genomics 2011, 12:318. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  39. Vatankhah M, Talebi MA: Heritability estimates and correlations between production and reproductive traits in Lori-Bakhtiari sheep in Iran.

    South African Journal of Animal Science 2008, 38(2):110-118. OpenURL

  40. Kamalzadeh A, Rajabaigy M, Kiasat A: Livestock production systems and trends in livestock industry in Iran.

    J Agri Soc Sci 2008, 04:183-88. OpenURL

  41. Dalrymple BP, Kirkness EF, Nefedov M, McWilliam S, Ratnakumar A, Barris W, Zhao S, Shetty J, Maddox JF, O'Grady M, et al.: Using comparative genomics to reorder the human genome sequence into a virtual sheep genome.

    Genome Biology 2007, 8(7):R152. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  42. Weir BS, Cockerham CC: Estimating F-statistics for the analysis of population structure.

    Evolution 1984, 38(6):1358-1370. Publisher Full Text OpenURL

  43. The UCSC Genome Browser [http://genome.ucsc.edu/] webcite

  44. Casas E, Stone RT, Keele JW, Shackelford SD, Kappes SM, Koohmaraie M: A comprehensive search for quantitative trait loci affecting growth and carcass composition of cattle segregating alternative forms of the myostatin gene.

    J Anim Sci 2001, 79:854-860. PubMed Abstract | Publisher Full Text OpenURL

  45. McClure MC, Morsci NS, Schnabel RD, Kim JW, Yao P, Rolf MM, McKay SD, Gregg SJ, Chapple RH, Northcutt SL, Taylor JF: A genome scan for quantitative trait loci influencing carcass, post-natal growth and reproductive traits in commercial Angus cattle.

    Animal Genetics 2010, 41:597-607. PubMed Abstract | Publisher Full Text OpenURL

  46. Casas E, Shackelford SD, Keele JW, Stone RT, Kappes SM, Koohmaraie M: Quantitative trait loci affecting growth and carcass composition of cattle segregating alternate forms of myostatin.

    J Anim Sci 2000, 78:560-569. PubMed Abstract | Publisher Full Text OpenURL

  47. MacNeil MD, Grosz MD: Genome-wide scans for QTL affecting carcass traits in Hereford × composite double.

    J Anim Sci 2002, 80:2316-2324. PubMed Abstract | Publisher Full Text OpenURL

  48. Casas E, Shackelford SD, Keele JW, Koohmaraie M, Smith TPL, Stone RT: Detection of quantitative trait loci for growth and carcass composition in cattle.

    J Anim Sci 2003, 81:2976-2983. PubMed Abstract | Publisher Full Text OpenURL

  49. Li C, Basarab J, Snelling WM, Benkel B, Murdoch B, Hansen C, Moore SS: Assessment of positional candidate genes MYF5 and IGF1 for growth on bovine chromosome 5 in commercial lines of Bos taurus.

    J Anim Sci 2004, 82:1-7. PubMed Abstract OpenURL

  50. Schrooten C, Bink MCAM, Bovenhuis H: Whole genome scans to detect chromosomal regions affecting multiple traits in dairy cattle.

    J Dairy Sci 2004, 87:3550-3560. PubMed Abstract | Publisher Full Text OpenURL

  51. Ashwell MS, Heyen DW, Weller JI, Ron M, Sonstegard TS, Van Tassell CP, Lewin HA: Detection of quantitative trait loci influencing conformation traits and calving ease in Holstein-Friesian cattle.

    J Dairy Sci 2005, 88:4111-4119. PubMed Abstract | Publisher Full Text OpenURL

  52. Weicker JJ, Brumfield RT, et al.: Estimating the unbiased estimator θ for population genetic survey data.

    Evolution 2001, 55(12):2601-2605. PubMed Abstract OpenURL

  53. Falconer DS, Mackay TFC: Introduction to quantitative genetics. 3rd edition. Addison Wesley Longman, Harlow, Essex, UK; 1996:72-73. OpenURL

  54. Stephens JC, Reich DE, Goldstein DB, Shin HD, Smith MW, Carrington M, Winkler C, Huttley GA, Allikmets R, Schriml L, et al.: Dating the origin of the CCR5-D32 AID-resistance allele by the coalescence of haplotypes.

    Am J Hum Genet 1998, 62:1507-1515. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  55. Tecofsky RTJ, Gemmell NJ, Johnson PL, McEwan J: Searching for genetic markers for yellow fat in New Zealand sheep [abstract].

    In proceedings of the ComBio Conference 2009. OpenURL

  56. Becker D, Tetens J, Brunner A, Burstel D, Ganter M, Kijas J, Drogemuller C: Microphthalmia in Texel sheep is associated with a missense mutation in the paired-like homeodomain 3 PITX gene.

    PLoS ONE 2010, 5(1):e8689. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  57. Voight BF, Kudaravalli S, Wen XQ, Pritchard JK: A map of recent positive selection in the human genome.

    PLoS Biology 2006, 4(3):e72. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  58. Sabeti PC, Varilly P, Fry B, Lohmueller J, Hostetter E, Cotsapas C, Xie XH, Byrne EH, McCarroll SA, Gaudet R, et al.: Genome-wide detection and characterization of positive selection in human populations.

    Nature 2007, 449(7164):913-U912. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  59. Davies W, Isles AR, Wilkinson LS: Imprinted genes and mental dysfunction.

    Ann Med 2001, 33(6):428-436. PubMed Abstract | Publisher Full Text OpenURL

  60. Helms C: Salting out Procedure for Human DNA extraction. [http://humgen.wustl.edu/hdk_lab_manual/dna/dna2.html] webcite

    The Donis-Keller Lab - Lab Manual Homepage OpenURL

  61. Kijas J, Lenstra JA, Hayes B, Boitard S, Neto LP, Cristobal MS, Servin B, McCulloch R, Whan V, Gietzen K, et al.: Genome wide analysis of the world's sheep breeds reveals high levels of historic mixture and strong recent selection.

    PLoS Biology, in press. OpenURL

  62. The Wellcome Trust Case Control Consortium: Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.

    Nature 2007, 447(7145):661-78. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  63. Teo YY, Fry AE, Clark TG, Tai ES, Seielstad M: On the usage of HWE for identifying genotyping errors.

    Annals of Human Genetics 2007, 71:701-703. PubMed Abstract | Publisher Full Text OpenURL

  64. Abdi H: Bonferroni and Šidàk corrections for multiple comparisons. In Encyclopedia of Measurement and Statistics. Edited by Salkind N. Sage: Thousand Oaks; 2007:103-107. OpenURL

  65. Maynard-Smith J, Haigh J: The hitch-hiking effect of a favourable gene.

    Genet Res 1974, 23:23-35. PubMed Abstract | Publisher Full Text OpenURL

  66. Wright S: The genetical structure of populations.

    Ann Eugen 1951, 15:323-354. OpenURL

  67. Fisher RA: Statistical Methods for Research Workers. London: Oliver and Boyd; 1932. OpenURL

  68. Krumsiek J, Arnold R, Rattei T: Gepard: A rapid and sensitive tool for creating dotplots on genome scale.

    Bioinformatics 2007, 23(8):1026-1028. PubMed Abstract | Publisher Full Text OpenURL

  69. The Ovine Genome Browser [http://www.livestockgenomics.csiro.au/sheep/oar1.0.php] webcite

  70. Maizel JV, Lenk RP: Enhanced graphic matrix analysis of nucleic acid and protein sequences.

    Proceedings of the National Academy of Sciences of the United States of America 1981, 78(12):7665-7669. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  71. Scheet P, Stephens M: A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase.

    Am J Hum Genet 2006, 78(4):629-644. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  72. Hill WG, Robertson A: Linkage disequilibrium in finite populations.

    Theoretical and Applied Genetics 1968, 38:226-231. Publisher Full Text OpenURL

  73. Hayes BJ, Visscher PM, McPartlan HC, Goddard ME: Novel multilocus measure of linkage disequilibrium to estimate past effective population size.

    Genome Res 2003, 13(4):635-643. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  74. Tenesa A, Navarro P, Hayes BJ, Duffy DL, Clarke GM, Goddard ME, Visscher PM: Recent human effective population size estimated from linkage disequilibrium.

    Genome Res 2007, 17(4):520-526. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  75. Villa-Angulo R, Matukumalli LM, Gill CA, Choi G, Van Tassle CP, Grefenstette JJ: High-resolution haplotype block structure in the cattle genome.

    BMC Genetics 2009, 10:19. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL