Genomic scan of selective sweeps in thin and fat tail sheep breeds for identifying of candidate regions associated with fat deposition

Moradi, Mohammad Hossein; Nejati-Javaremi, Ardeshir; Moradi-Shahrbabak, Mohammad; Dodds, Ken G; McEwan, John C

doi:10.1186/1471-2156-13-10

Research article
Open access
Published: 26 February 2012

Genomic scan of selective sweeps in thin and fat tail sheep breeds for identifying of candidate regions associated with fat deposition

Mohammad Hossein Moradi¹,
Ardeshir Nejati-Javaremi¹,
Mohammad Moradi-Shahrbabak¹,
Ken G Dodds² &
…
John C McEwan²

BMC Genetics volume 13, Article number: 10 (2012) Cite this article

15k Accesses
159 Citations
7 Altmetric
Metrics details

Abstract

Background

Identification of genomic regions that have been targets of selection for phenotypic traits is one of the most important and challenging areas of research in animal genetics. However, currently there are relatively few genomic regions identified that have been subject to positive selection. In this study, a genome-wide scan using ~50,000 Single Nucleotide Polymorphisms (SNPs) was performed in an attempt to identify genomic regions associated with fat deposition in fat-tail breeds. This trait and its modification are very important in those countries grazing these breeds.

Results

Two independent experiments using either Iranian or Ovine HapMap genotyping data contrasted thin and fat tail breeds. Population differentiation using F_ST in Iranian thin and fat tail breeds revealed seven genomic regions. Almost all of these regions overlapped with QTLs that had previously been identified as affecting fat and carcass yield traits in beef and dairy cattle. Study of selection sweep signatures using F_ST in thin and fat tail breeds sampled from the Ovine HapMap project confirmed three of these regions located on Chromosomes 5, 7 and X. We found increased homozygosity in these regions in favour of fat tail breeds on chromosome 5 and X and in favour of thin tail breeds on chromosome 7.

Conclusions

In this study, we were able to identify three novel regions associated with fat deposition in thin and fat tail sheep breeds. Two of these were associated with an increase of homozygosity in the fat tail breeds which would be consistent with selection for mutations affecting fat tail size several thousand years after domestication.

Background

The domestication of livestock represents a crucial step in human history. The rise of civilizations could not happen without domestication of plants and animals. Sheep (Ovis aries) is the first grazing animal known to have been domesticated [1]. Multiple mitochondrial lineages suggest that domestication occurred several times, as in other livestock species such as cattle, goat and pig [2]. Recognition of the origin of domestication is difficult by the fact that the first domestic animals were no different from their wild counterparts [3]. In spite of this, the archaeozoological evidence suggests that the domestication of sheep occurred during the Neolithic revolution approximately 9000 years ago [4] in a region in northern Iraq and nearby regions in Iran [5]. Since domestication, sheep have established in a wide geographical range due to their adaptability to poor nutrition diets, tolerance to extreme climatic conditions and their manageable size [6].

Fat tail breeds are an important class of sheep breeds that are first documented as being present 5000 years ago. The earliest depiction of a fat tail sheep is on an Uruk III stone vessel of 3000 BC and fat and thin tail sheep appear together on a mosaic standard from Ur dated around 2400 BC [3, 7]. The fact that the fat tail breeds are now prevalent in the Fertile Crescent, where sheep were originally domesticated, while thin tail sheep breeds are predominant in peripheral areas [7] and that the wild ancestor of sheep is thin tail suggest that the first domesticated sheep were thin tail and fat tail was developed later. The evidence shows that sheep were being farmed throughout Europe 5000 years ago [8].

The fat tail is considered as an adaptive response of animals to a hazardous environment and is a valuable energy reserve for the animal during migration and winter. Until recently it had additional value to the herder because it was used to preserve cooked meat for longer periods of time and also as an energy reserve during times of drought and famine. Therefore the climatic variation as well as the associated requirements of humans led to artificial selection for higher fat tail weight across generations [9–11]. Nowadays in intensive and semi-intensive systems most of the advantages of a large fat tail have reduced their importance and therefore, a decrease in fat tail size is often desirable for producers and consumers. Fat deposition requires more energy than the deposition of lean tissue, animal fat has lost much of its market demand and monetary value and sheep producers have easy access to other forms of auxiliary feeding [12]. These breeds are commonly found in a wide range of countries in Asia especially the Middle East and North Africa [7]. To date, several investigations into the inheritance of fat tails have been undertaken [10, 11, 13–18], nevertheless the genes affecting fat deposition in fat tail breeds are still unknown.

The study of genes underlying phenotypic variation can be performed in two different ways, firstly from phenotype to genome, which is performed by LD based association mapping or by targeting particular candidate genes identified based on homology to known genes, and secondly from genome to phenotype, which involves the statistical evaluation of genomic data to identify likely targets of past selection using selective sweep analysis [19, 20]. The elimination of standing variation in regions linked to a recently fixed beneficial mutation is known as a "selective sweep" and has recently been the focus of much theoretical and empirical attention [21–23].

In contrast to natural populations, domesticated species provide an exciting opportunity to understand how artificial selection promotes rapid phenotypic evolution [22]. With an hypothesis that different selection pressures operated in thin and fat tail breeds over the history of time and somehow the selection acts on a variant that is advantageous only in one breed, it is expected that the frequency of that variant may differ across populations to a greater extent than predicted for variants evolving neutrally in all populations [21]. Identifying of these genome regions, which have been subject to such selective sweeps could reveal the mutations which are responsible for the fat deposition in these breeds. The examination of variation in SNP allele frequencies between populations, which can be quantified by the statistic F_ST, is a promising strategy for detecting signatures of selection [21, 24].

To date a relatively small number of examples have successfully identified genomic regions subject to positive selection in different domestic animals [20, 25–38]. The constraint to identifying selection signatures in sheep has been the limited density of markers. This limitation has recently been reduced with the availability of tens of thousands of single nucleotide polymorphisms (SNPs) using the Ovine SNP50k BeadChip (http://www.sheephapmap.org.) In this research two independent resources have been investigated to identify regions associated with fat deposition in these breeds. The first data set was comprised of one thin and one fat tail Iranian sheep breed while the second was comprised of several other fat and thin tail breeds selected from the Ovine HapMap project.

Sheep production constitutes the most important component of the Iranian livestock industry with a total of approximately 50 million head. Twenty seven breeds and ecotypes have been documented in Iran, the majority derived in situ. The Lori-Bakhtiari sheep breed is one of the most common indigenous breeds. It is well adapted to the hilly and mountainous Bakhtiari region stretched out to southern Zagros Mountains (Figure 1), with a population of more than 1.7 million. The animals are kept mostly in villages under semi-intensive systems. Relative to other Iranian fat tail breeds the Lori-Bakhtiari is a large breed, having the largest fat tail by girth and weight [39]. Sheep breeds vary in tail length and this breed, perhaps surprisingly, has a rather short tail length. Zel is the only thin-tail Iranian breed and it is present largely on the northern slopes of the Elburz mountain range near the Caspian Sea [40] representing around 3% of Iranian sheep.

The aim of this study was to find selective sweeps between the Iranian thin and fat tail breeds using dense SNP markers and to compare with those from similarly divergent breeds extracted from the Ovine HapMap dataset (http://www.sheephapmap.org.) Our work provides the first genome wide characterization of selective sweeps in thin and fat tail breeds. Each animal was genotyped with approximately 50,000 SNPs and a variety of selection sweep tests were utilized.

Results

Genotyping and data mining

A total of 94 animals consisting of 47 samples per breed were genotyped on the Illumina OvineSNP50K Beadchip assay in the Zel-Lori Bakhtiari data set. One animal in each breed had greater than 5% missing data and these were excluded from further analysis. The second data set used in this study was SNP genotyping data from similarly divergent breeds in the Ovine HapMap project (Table 1). After data cleaning (see methods) a total of 45,611 and 48,053 SNPs passed the filtering criteria in the Zel-Lori Bakhtiari and HapMap data sets respectively and were included in the final analyses. The overall average distances between 2 adjacent SNPs were relatively consistent among the chromosomes, being about 60 kb in the Zel-Lori Bakhtiari and 58 kb in the HapMap data sets. In all cases the locations used were obtained from OAR true chromosomes (ver.1.0, as at 5/2008) from CSIRO [41]. The principal component analysis (PCA) in Zel-Lori Bakhtiari data set, using the individual SNPs as the data, resulted in the first two principal components (PC1 and PC2) explaining 18.6% and 2.9% of the variance respectively. We found that PC1 separated out the two breeds from each other while one animal in each breed was separated from all other animals for PC2 (Figure 2). These two animals were excluded from further analysis. Finally, 45 animals (36 females and 9 males) per breed passed the data cleaning steps and 95% and 95.1% of all remaining SNPs were in Hardy-Weinberg equilibrium at the 5% level in Zel and Lori-Bakhtiari breeds respectively. For the SNPs analyzed in this study, the average MAF over all samples was 0.29 (SD = 0.13) in the Zel-Lori Bakhtiari data set and 0.30 (SD = 0.12) in the HapMap data set.

Table 1 Origin and sample size of different breeds included in this study

Full size table

Genomic distribution of F_ST in Zel-Lori Bakhtiari data set

The plot of windowed F_ST against location is shown in Figure 3. The average of differentiation between Zel and Lori Bakhtiari breeds was 0.024 (SD = 0.035). As shown in this figure, in several instances outlier SNPs tended to cluster to similar regions. Specifically, we found evidence of selection in seven regions with windowed F_ST value > 0.20 on chromosomes 2 (between 55,861-56,300 kb), 2 (between 73,631-73,784 kb), 3 (between 146,615-146,676 kb), 5 (between 47,149-47,263 kb), 7 (between 30,512-30,585 kb), 7 (between 46,642-46,843 kb) and X (between 58,621-61,452 kb).

The average F_ST for autosomal and X-linked SNPs was significantly different (0.024 and 0.035, respectively; t test, t = 6.2, P < 10^-10). A higher average F_ST for X-chromosome SNPs could occur because of its smaller effective population size compared with that of the autosomes.

Another genetic distance measurement including unbiased estimates of F_ST as described by Weir and Cockerham [42] was also calculated. Because the results were highly correlated (r = 0.995) with the above results, so have not been presented.

Study of Bovine genes and published QTLs in regions showing evidence of selection

Seven regions showing the largest signals of selection in Zel-Lori Bakhtiari data set were chosen for further analysis. As the current annotation of the sheep genome is not as comprehensive as cattle, the regions of interest in O. aries were compared to the corresponding areas in B. taurus. Dot plots for corresponding areas of Ovine and Bovine genomes showed strong co-linear relationships between the two considered sequences in all regions and rearrangements were not observed (Additional file 1: Figure S1).

A summary of orthologous area in both species and published bovine genes and QTLs is presented in table 2. Orthologus genes in the bovine genome were identified using the BLAT genome search with UCSC Genome Browser [43].

Table 2 Bovine genes and published QTLs in regions showing evidence of selection in Zel vs Lori Bakhtiari data set

Full size table

Online databases of published QTL in beef and dairy cattle, show that the regions identified here were previously been found to be in regions harboring QTL affecting fat and also carcass yield traits (Table 2). For example, the regions on chromosomes 2 overlapped with QTLs previously suggested as being related with fat depth, and also chromosomes 2 (second region) and 5 with fat thickness and both regions on chromosomes 7 with milk fat yield.

Genomic distribution of F_ST in Ovine HapMap data set and comparison with Zel-Lori Bakhtiari data set results

An independent resource comprised of similarly divergent breeds for thin and fat tail breeds in the Ovine HapMap project were selected to determine whether the regions with large allele differentiation in Zel-Lori Bakhtiari data set could be confirmed. The pattern of F_ST across the genome was also calculated in the Ovine HapMap data set (fat and thin tail breeds were pooled) and values were averaged in a sliding window. The windowed F_ST was then plotted against location in the genome. The windowed F_ST values, at each common SNP, were correlated in the two data sets with r = 0.413, N = 45,238 and the average of differentiation between thin and fat tail breeds in the Ovine HapMap data set was 0.027 (SD = 0.038). The sliding window average F_ST revealed good agreement between both data sets for regions on chromosomes 5, 7 (second region) and X (Figure 4).

Calculation of empirical p-values for windowed F_ST values in each data set (Table 3) shows that the 3 regions with the largest differentiation in Zel-Lori Bakhtiari data set are also significant in the Ovine HapMap data set (P < 0.01). On the other hand, combining p-values from independent tests of significance in Zel-Lori Bakhtiari and Ovine Hapmap data sets using Fisher's combined test revealed the first region on chromosome 2 (P < 0.0001) in addition to these same significant regions.

Table 3 The table of significant test for Zel-Lori Bakhtiri and ovine HapMap data set.

Full size table

The Weir and Cockerham method of estimating F_ST[42] was also performed for the ovine HapMap data and a high correlation (r = 0.992) was observed. Previous reports [e.g. [24, 52]] also indicated these methods led to similar results; therefore, a strong correlation between these two measures is not surprising.

Study of median homozygosity in candidate regions

Median run of homozygosity (Figure 5) was increased at the candidate regions on chromosome 5 and X for Lori Bakhtiari (fat tail) and at the candidate region on chromosome 7 for Zel (thin tail). The largest differences of median homozygosity were located on chromosome X and this was present for a longer distance as well, whereas these statistics were lower on Chromosome 5. A study of median homozygosity in the HapMap data set for thin and fat tail breeds (data not shown) revealed similar results.

Estimation of effective population size and haplotype age

The average extent of LD in the genome was used to estimate the effective population size at various times in the past. The estimated effective population size show a persistent decline from 2000 down to 20 generations ago, declining from 4900 (both breeds) to 840 in Zel and 532 in Lori Bakhtiari.

The graph of effective population sizes suggests a distinctive time point when the breeds separated ~1,100 generations ago (Additional file 2: Figure S2). Assuming an average generation interval in sheep of around 5 years or 5500 before present, this is congruent with the first archaeological evidence for the fat tail sheep breeds (~5,000 years ago).

The average effective population size over the period was calculated using reciprocals as described by Falconer [53]. Using this average estimated effective population size and based on the current frequency of sweeps in our regions of interest [54], the age of sweeps under the assumption of selection neutrality and genetic drift has been inferred for chromosomes 5, 7 and X to be approximately 7100, 9600, and 6900 generations before present respectively.

These estimated ages should not be taken to imply a burst of selection at a particular time; however, these ages might represent areas in the parameter space in which we have good power.

Discussion

Approximately 25% of the world sheep population comprises fat tail breeds that are grazed in a wide range of countries especially in Asia and northern parts of Africa. In these typically arid countries sheep breeding has an important place in the local economy and as a source of protein. Due to the overt and easily defined nature of a fat tail, the gene variants affecting the phenotypic expression of this trait are a topic of both theoretical and economic curiosity. The former from the perspective of the impact and nature of domestication and the latter because the trait is now markedly less important commercially due to improved forage availability and decreased price for the product.

In this research we have developed an initial selection map for the fat tail trait in sheep. The objective was to localize genomic regions potentially affecting the trait. This was undertaken by comparing fat and thin tail breeds using the newly available 50K SNPChip. The chip has already proven its ability to map causal mutations for traits showing Mendelian inheritance such as yellow fat [55] and microphthalmia [56].

It has been previously shown in cattle, that domestication, subsequent breed formation and artificial selection, leave detectable signatures of selection in several regions of the genome [36]. Given the similarities in domestication time and selection between cattle and sheep, it is likely that the ovine genome contains similar signatures.

The Iranian breeds were chosen for this study, because they originated both near the centre of domestication [5] and near the first recorded archaeological evidence of fat tail sheep [3, 7]. It was felt this geographic proximity would reduce false positives due to bottlenecks and selection pressure due to alternative factors such as climate, disease and pasture types. However, we expected that some signals would still be spurious so we then independently validated the results using a different set of breeds obtained from the Ovine HapMap project. The latter project did not formally phenotype the individuals concerned so these breeds were classified by us solely on publicly available breed descriptions. It is important to note that the selection sweep approach is one of the few viable approaches for investigating the genomics of this trait given the poor level of historical recording and DNA sampling in the relevant breeds. It is also one of the most cost effective and powerful techniques albeit as previously stated also subject to potential false positive signals.

Several statistical approaches have been devised to detect evidence of selective sweeps. These different statistics detect different aspects of the pattern of variation left by selection of a beneficial mutation. Generally, the power of the tests also depends on the strength, duration and age of the selective sweep [21]. Unlike F_ST, tests based on linkage disequilibrium like iHS [57] and XP-EHH [58] which are used primarily on human SNP data, where there are now millions of SNPs available, are dependent on SNP spacing and frequency as they are multi-marker tests. The 50,000 SNPs available for sheep may not provide enough information for these tests; it has been suggested in cattle that 150,000 evenly spaced SNPs would be required to study selective signatures in all parts of the bovine genome [29]. Additionally, the power of these methods also depends on the ancestral allele information which is still available for only a portion of the SNPs on the ovine chip.

However, it should be considered that robust inference of recent positive selection using F_ST is complicated by the fact that the distribution of genetic variation effect due to selection can be difficult to distinguish from that which arises after certain demographic events. Some previous studies that have attempted to identify signatures of selection based on patterns of population differentiation have used simulations to obtain the expected distribution of F_ST under the assumption of no selection [24, 33, 34]. However, the simulated distribution of F_ST is strongly dependent on its underlying assumptions about population demographic history, which is rarely known with any degree of certainty [24, 37]. Despite this problem, examining several thousand loci using newly available SNP chips provides an exciting opportunity to distinguish between the effects of population structure, positive selection and ascertainment bias. Demographic events and ascertainment bias would be expected to alter patterns of F_ST across the whole genome in a similar way, while selection events would be expected to alter F_ST values only in selected and nearby loci [29–34].

In general, we found a low differentiation between populations, with a mean of 0.024 (SD = 0.035) for the Zel-Lori Bakhtiari data set and 0.027 ± 0.038 for the HapMap data set. This agrees with Kijas et al. [6] who, using 23 domestic breeds and two wild sheep species, revealed that sheep breeds showed generally low differentiation which is consistent with their short evolutionary history.

The general pattern of the signals across chromosomes was different in the HapMap data set and there were differences in the locations of the major peaks (data not shown). However, Barendse et al. [29] observed similar results when comparing Bovine Australian and HapMap data. Their windowed F_ST values between two data sets were correlated with r = 0.346, whereas this correlation in our data sets was r = 0.413. In general this correlation with different breeds may suggest that each study had identified signals of divergence particular to the genetic history of those breeds, and only some of which may be due to selection (as described by Barendse et al. [29]). This correlation for our candidate regions was improved to r = 0.963.

We found few genes in the regions of interest and no particular candidate genes related to fat deposition were identified. However, the current annotation of the cattle genome is not as comprehensive as in humans, so these areas cannot be dismissed as not containing any genes or regulatory elements. It has been observed that while some proposed candidates for selection have strong support in the form of a functional mutation with an identified phenotypic effect, often the functional target is unknown [21]. As the cattle genome becomes more comprehensively annotated in these regions likely targets of selection may be identified.

A variety of alternative comparisons, with the various thin and fat tail breeds using the combined HapMap and Zel-Lori Bakhtiari data set, can be examined in addition to the comparisons presented here. In a comparison, when we compared some fat tail breeds together (Afshari and Lori Bakhtiari breeds), we found the F_ST peak in the same location on chromosome 2 region 1 which was significant based on Fisher's combined test. The orthologous area for this region (using Blat) on the cattle genome would be around 8:62,600,000-63,000,000 (Table 2). Several bovine studies have also detected signatures of selection in this region. MacEachern et al. [34] compared differences in allelic frequencies of Australian Angus and Holstein cattle at 7,500 SNPs. They reported a region with large differences among breeds at 61,300,000 to 62,500,000 bp on BTA8. Stella et al. [37] observed a significantly large signal of selection in the same location for the Holstein and Jersey breeds. Although neither study has reported any particular candidate gene for this observation, previously identified QTLs (Table 2) could suggest a candidate region for body composition and carcass traits in this chromosomal region.

Using median homozygosity plot in these regions, we identified that homozygosity has been increased on chromosome 5 and X in favour of fat tail breeds. This is consistent with a recessive mode of inheritance. Therefore future studies should track both the phenotype of this trait and the genotype status in these regions in F₂ crosses to provide independent and causal evidence that these regions do in fact affect the fat tail trait and to verify its mode of inheritance.

Given that one of the most differentiated regions was located on chromosome X, this suggests sex linked differences could be present. In several unpublished experiments when a research institute in central Iran crossed wild sheep (male) with a couple of fat tail ewes, according to the reports all resulting lambs were thin tail. This suggests dominance of the thin tail phenotype and does not support a simple single gene located on sex chromosome alone. Similarly, when two European thin tail breeds were crossed with Iranian fat tail ewes, all of the more than 350 crossbred lambs of both sexes subsequently born were thin tail and no fat tail lambs were observed. This supports the previous observations and again does not support sex linked effects, unless some epistatic effects exist or any relevant single gene located on chromosome X should be located in a pseudo-autosomal regions.

In general, there is little published information related to tail fatness in crosses between native fat and thin tail sheep breeds and also the relatively small number of animals used in these studies makes it difficult to obtain reliable estimates pertaining to this question [10, 11, 13–18]. However, in several experiments measurements suggest crosses were intermediate between fat and thin tail sheep [11, 17]. This conflicts with the unpublished Iranian results and does raise the possibility that the results observed may depend on the fat and thin tail breeds used in the cross. In our opinion, more detail and larger scale experiments are needed to confirm the results on growth and carcass characteristics of crossbred lambs of different sexes. If such studies are undertaken the nature of inheritance on the X chromosome can be rather difficult to elucidate, especially if imprinting is also suspected [59].

A result which is irrelevant to the inheritance of the trait, but provides an insight into a possible mechanism of fat deposition in this organ, are the results of Gökdal et al. [10] who examined the effects of docking in fat tail breeds. The carcasses of the docked group contained more kidney, pelvic and internal fat than the intact lambs as well as a higher percentage of subcutaneous and intramuscular fat. The weights of the different carcass cuts of the docked lambs were also heavier that those of the intact group. However, there was little change in overall carcass composition suggesting that the genes affecting the fat tail phenotype are associated with the localization of fat stores to a regional depot rather than control of the overall level of fat deposition. This observation also may provide support to the suggestion that this trait is a result of human mediated selection as it is difficult to postulate any natural selection criterion favoring deposition in this area.

Finally, a search for comparable depots in other species especially grazing mammals identified Bacterian humps in camels and humps in bos indicus cattle breeds as possible analogous features that have been selected for in animals grazing in arid and tropical regions subjected to wide fluctuations in food supply. The closest equivalent structure present in all mammals is possibly the mammary gland with its associated mammary fad pad. The current work, while not able to answer these hypotheses about the origin of the fat tail and analogous structures in other species directly, does offer hope that these questions will soon be able to be answered by further experimentation.

Conclusions

In this paper, we present the first genome wide characterization of selective sweeps in thin and fat tail breeds with the aim of identifying regions associated with fat deposition. This trait is one of the most challenging areas of research in most countries grazing fat tail breeds. The analysis was performed on two independent resources using either Iranian or Ovine HapMap genotyping data for thin and fat tail breeds. Population differentiation using F_ST in Iranian thin and fat tail breeds revealed seven genomic regions. Almost all of these regions overlapped with QTLs that had previously been identified as affecting fat and carcass yield traits in beef and dairy cattle. Study of selection sweep signatures using F_ST in thin and fat tail breeds sampled from the Ovine HapMap project confirmed three of these regions located on chromosomes 5, 7 and X. Using a median run of homozygosity plot in these regions, we identified that homozygosity has been increased on chromosome 5 and X in favour of fat tail breeds and on chromosome 7 in favour of thin tail breeds. Those associated with an increase of homozygosity in the fat tail breed would be consistent with selection for mutations affecting fat tail size several thousand years after domestication.

Methods

Animal samples

In this research two independent data sets were used to compare frequencies of SNP alleles in thin and fat tail breeds.

Zel-Lori Bakhtiari data set

Blood samples were collected from flocks which have recently utilized the registration and recording system of the ABCI (Animal Breeding Centre of Iran). Two factors were considered for selecting the samples: selection of unrelated animals and sampling those that spanned the diversity of the breed. When sampling in pedigreed animals we ensured that they had no common grandparents and in non pedigreed animals we typically selected 4-5 animals from each flock each from different age classes. All animals had the following phenotypic records measured: herd no., location, GPS coordinates, sex, horn status, parity, approximate age, wool and skin colour, fat tail dimensions, height and weight. Animal sampling for the Zel breed was performed in the northern province of Mazandaran, south of the Caspian Sea and for the Lori-Bakhtiari breed in the Chaharmahal and Bakhtiari provinces, located in the south western part of Iran close to the Zagros Mountain ranges on farms distributed across the traditional rearing area of each breed (Figure 1). In general, more than 100 blood samples per each breed were collected and finally 47 samples were selected to be genotyped in each breed.

Ovine HapMap data set

The second data set used in this study were genotypes of similarly divergent breeds from the Ovine HapMap project of the ISGC (International Sheep Genomics Consortium) and are available to the public at http://www.sheephapmap.org. There are many thin tail breeds in the HapMap project, so four breeds from geographically diverse locations and similar tail shape to Zel breed were selected. Since fat tail breeds in the HapMap project were mostly from a relatively narrow geographic region and their number was limited, three breeds with similar histories and fat tail shape to Lori Bakhtiari breed was selected (Table 1).

Genotyping and data quality control

The Ovine HapMap project genotype data were downloaded from http://www.sheephapmap.org/download.php. Genomic DNA in the Zel-Lori Bakhtiari data set was extracted from whole blood by applying a modified salting out protocol [60] and DNA samples diluted to 50 ng/ul for genotyping. A total of 94 animals consisting of 47 samples per breed were genotyped on the Illumina OvineSNP50K Beadchip assay at the Centre for Reproduction and Genomics (CRG), Invermay, New Zealand using standard procedures (http://www.illumina.com) The same set of 49,018 SNPs randomly distributed across the genome which passed the HapMap criteria [61] were used in both data sets.

To ensure the overall quality of samples and a consistent set of genotypes, quality control filters were applied to the initial data. All samples with more than 5% missing data were excluded from further analysis. The rationale for this is that DNA which is not of high quality will be more likely to have more missing data and also have incorrect genotype calls [29]. Then for each SNP, minor allele frequency (MAF) and percentage of calls (how many sheep the marker worked for) was calculated. The SNPs that had a call rate less than 95% and a MAF (over all animals) less than 2% were discarded [62]. For the remaining SNPs outlier departure from Hardy-Weinberg equilibrium (p < 10^-6) over all animals of a breed were used for identifying genotyping errors. Although departure from HWE might result from selection, it is most likely that technical problems explain this result and they were therefore excluded from subsequent analyses [63]. The Bonferroni correction (β = α/n) was used to address the problem of multiple comparisons [64]. The number of tests was taken to be the number of SNPs (n = 50,000) giving β = 10^-6 corresponding to α = 0.05 experiment-wise error. A principal component analysis (PCA) was performed using the prcomp function in the R package and the samples which were located outside of their expected breed cluster were excluded from further analysis. Finally markers were filtered to exclude loci assigned to unmapped contigs. The HapMap genotyping data had already passed the quality control process and only MAF and call rate were considered (Additional file 3: Table S3 and Additional file 4: Table S4).

Analysis

Estimates of Wright's F_ST and Weir & Cockerham's Theta

To determine a genome wide pattern of positive selection, and to compare this with the Ovine HapMap project samples, the basic form of Wright's fixation index (F_ST), which measures the degree of genetic differentiation between subpopulations, based on genetic polymorphism data, was calculated as described by MacEachern et al. [34]:

F_{S T} = \frac{H_{T} - H_{S}}{H_{T}}

Where H_T denotes the expected heterozygosities for overall total populations,

H_{T} = 1 - \sum (\overset{- 2}{p} - \overset{- 2}{q})

And H_S denotes the expected heterozygosities in subpopulations,

H_{S} = \frac{\sum_{i = 1}^{n} H_{exp i} \times n_{i}}{N_{T o t a l}}

In the above formula, $\bar{p}$ and $\bar{q}$ stand for the frequency of allele A₁ and A₂ over the total population and H_expi and n_i denote expected heterozygosity and sample size in subpopulation i.

All F_ST values in this study are for a single locus. The value of F_ST can theoretically range from 0 (no differentiation) to 1 (complete differentiation, in which populations are fixed for different alleles). Signatures of positive selection can be recognized when adjacent SNPs all show high F_ST[24], due to the hitch-hiking effect [65], implying divergent selection between breeds.

Smoothing, where a moving average of a certain number of markers is taken, is a method of looking for regions where selection is apparent over multiple markers, rather than one-off high values. The optimal size of the window depends on how long ago the selection sweep occurred, as LD breaks down with time. An arbitrary window of 5 markers (~300 Kbp) was chosen as it appeared to provide the better signal. Smoothing raises the issue of an appropriate adjustment for multiple testing, but we use it only for graphic illustration, with the singular SNP values used when looking at specific regions. The windowed F_ST values were plotted against genome location.

One of the main problems with Wright's measure of F_ST is that it does not account for sampling error. This was corrected for by Weir and Cockerham [42] who developed the unbiased estimator. We used this method in both data sets. As originally defined [66], the range of F_ST is between 0 and 1. However, it is possible for the above unbiased estimate of F_ST to assume negative values as well [24].

By ranking the sliding window F_ST in the each data set, the empirical P-value was calculated for each SNP as the proportion of SNPs with a sliding window F_ST value at least as extreme as the value for that SNP.

We applied Fisher's [67] method to combine the results from the k = 2 independent resources used, using the following statistic which follows a chi-square distribution with 2k degrees of freedom, under the null hypothesis:

X^{2} = - 2 \sum_{i = 1}^{K} {log}_{e} (p_{i})

Where, p_i is the p-value for the i^th hypothesis test. The significance level considered for each experiment was 10^-2 and the chosen level for combined experiment was 10^-4.

Study of genes and QTLs in our candidate regions

It is useful to compare regions of interest in O. aries to the corresponding areas in B. taurus, as the taurine genome is better annotated. Before comparisons of Ovine and Bovine sequences, dot plots were created using Gepard [68], with Ovine (OAR true chromosomes v 1.0) sequences obtained from Ovine Gbrowse [69], and Bovine (btau4) sequences obtained from the UCSC Genome Browser [43]. A dot plot is a graphical method for comparing two sequences, and identifies regions of similarity between them by organizing one sequence on the x-axis, and another one on the y-axis of a plot [70]. When the residues of both sequences match at the same location, a dot is drawn. The straightness of the line is indicative of the relationship between the two sequences.

In order to discover if any of the regions implicated by the whole genome analysis contained genes of interest we used OAR true chromosomes (ver.1.0, as at 5/2008) from CSIRO [41]. After obtaining the Ovine sequence of our candidate regions using this browser, the genes in cow genome were identified by BLAT search using the UCSC Genome Browser on Cow Oct. 2007 (Baylor 4.0/bosTau4) Assembly. BLAT on DNA is designed to quickly find sequences of 95% and greater similarity of length 25 bases or more [43]. Gene function was determined using Online Mendelian Inheritance in Man (OMIM) at http://www.ncbi.nlm.nih.gov/omim/ and Uniprot at http://biogps.gnf.org/.

We also explored two QTL databases available online http://genomes.sapac.edu.au/bovineqtl/index.html, http://www.animalgenome.org/QTLdb/cattle.html to identify any overlapping of the candidate regions with published QTL in dairy and beef cattle.

Median run of homozygosity

One method for looking for selection between breeds and also finding the breed that has been under selection is a comparison of homozygosity in a region. When a causal mutation is selected for in one breed and not the other, one breed would show high homozygosity in a genomic interval while the other does not. For each SNP in each animal, the length of a run of homozygosity (number of consecutive homozygous SNPs including the one being considered) was calculated (this would be zero if the SNP being considered was heterozygous). For each marker the median length, over the breed, of the run of homozygosity was calculated and plotted against genomic position in the candidate regions (25 SNPs on each side) for each breed. This is similar to looking at LD in those areas where a run of homozygosity (and therefore a high median homozygosity) indicates selection. Median homozygosity on chromosome X was calculated using only females (36 animals per each breed).

Estimation of Ne and haplotype age

A pair of haplotypes was estimated for each animal in the sample using fastPHASE Version 1.2 [71]. The estimated pairwise haplotype frequencies were used to calculate the squared correlation coefficient between the 2 loci (r²) following Hill and Robertson [72] as:

r^{2} = \frac{{(f r e q (A 1 - B 1) \times f r e q (A 2 - B 2) - f r e q (A 1 - B 2) \times f r e q (A 2 - B 1))}^{2}}{f r e q (A 1) \times f r e q (A 2) \times f r e q (B 1) \times f r e q (B 2)}

Where for example freq (A1-B1) is the frequency of haplotype marker 1 allele 1, marker 2 allele 1, freq (A1) is the frequency of marker 1 allele 1, and freq (B1) is the frequency of marker 2 allele 1 [27].

The effective population size was then estimated using the approximate expectation of r² as:

E (r^{2}) = \frac{1}{1 + 4 N_{t} c}

Where N_t is the effective population size 1/(2c) generations in the past, E(r²) is estimated by the average of r² values for all pairs of SNPs, and c is the median of distances in Morgans [27, 73–75].

To obtain a crude estimate of the ages of sweeps, we used the age estimation based on current frequency as -4N [p(log p)/(1 - p)], where N is effective population size and p is the current frequency of the derived allele (selected sweep) [53].

References

Chessa B, Pereira F, Arnaud F, Amorim A, Goyache F, Mainland I, Kao RR, Pemberton JM, Beraldi D, Stear MJ: Revealing the history of Sheep domestication using retrovirus integrations. Science. 2009, 324 (5926): 532-536. 10.1126/science.1170587.
Article PubMed Central CAS PubMed Google Scholar
Meadows JRS, Cemal I, Karaca O, Gootwine E, Kijas JW: Five Ovine mitochondrial lineages identified from Sheep breeds of the Near East. Genetics. 2007, 175 (3): 1371-1379.
Article PubMed Central CAS PubMed Google Scholar
Ryder ML: Sheep and Man. 1983, London: Duckworth press, 17-92.
Google Scholar
Handley LJL, Byrne K, Santucci F, Townsend S, Taylor M, Bruford MW, Hewitt GM: Genetic structure of European sheep breeds. Heredity. 2007, 99: 620-631. 10.1038/sj.hdy.6801039.
Article Google Scholar
Zeder MA: Animal domestication in the Zagros: a review of past and current research. Pale'orient. 1999, 25: 11-26.
Google Scholar
Kijas JW, Townley D, Dalrymple BP, Heaton MP, Maddox JF, McGrath A, Wilson P, Ingersoll RG, McCulloch R, McWilliam S, Tang D, McEwan J: A Genome Wide Survey of SNP Variation Reveals the Genetic Structure of Sheep Breeds. PLoS ONE. 2009, 4 (3): e4668-10.1371/journal.pone.0004668.
Article PubMed Central PubMed Google Scholar
Davidson A: The Oxford Companion to Food. Edited by: Tom Jaine. 2006, Oxford University Press, 293-294. 2
Chapter Google Scholar
Ryder M: Evolution of domesticated animals. 1981, Longman: London and New York, 63-85.
Google Scholar
Ermias E, Yami A, Rege JEO: Fat deposition in tropical sheep as adaptive attribute to periodic feed fluctuation. J Anim Breed Genet. 2002, 119: 235-246. 10.1046/j.1439-0388.2002.00344.x.
Article Google Scholar
Gokdal O, Ulker H, Karakus F, Cengiz F, Temur C, Handil H: Growth, feedlot performance and carcass characteristics of Karakas and crossbred lambs (F1) (Ile de France × Akkaraman (G1) × Karakas) under rural farm conditions in Turkey. South African Journal of Animal Science. 2004, 34 (4): 223-232.
Google Scholar
Kashan NEJ, Manafi-Azar GH, Afzalzadeh A, Salehi A: Growth performance and carcass quality of fattening lambs from fat-tailed and tailed sheep breeds. Small Ruminant Research. 2005, 60: 267-271. 10.1016/j.smallrumres.2005.01.001.
Article Google Scholar
Nejati-Javaremi A, Izadi F, Rahmati Gh, Moradi M: Selection in fat-tailed sheep based on two traits of fat-tail and body weight versus single-trait total body weight. Int J Agri Biol. 2007, 9 (4): 645-648.
Google Scholar
Güney O: Commercial crossbreeding between Ile-de-France, Rambouillet, Chios and local fat-tail Awassi for market lamb production. Small Ruminant Research. 1990, 3 (6): 449-456.
Article Google Scholar
Farid A: Slaughter and carcass characteristics of three fat-tailed sheep breeds and their crosses with Corriedale and Targhee rams. Small Ruminant Research. 1991, 5 (3): 255-271. 10.1016/0921-4488(91)90130-I.
Article Google Scholar
Momani-Shaker M, Abdullah AY, Kridli RT, Blaha J, Sada I, Sovjak R: Fattening performance and carcass value of Awassi ram lambs, F1 crossbreds of Romanov × Awassi and Charollais × Awassi in Jordan. Czech J Anim Sci. 2002, 47 (10): 429-438.
Google Scholar
Unal N, Akcapinar H, Aytac M, Atasoy F: Fattening performance and carcass traits in crossbred ram lambs. Medycyna Wet. 2006, 62 (2): 401-404.
Google Scholar
Khaldari M, Kashan NEJ, Afzalzadeh A, Salehi A: Growth and carcass characteristics of crossbred progeny from lean tailed and fat tailed sheep breeds. South African Journal of Animal Science. 2007, 37 (1): 51-56.
Google Scholar
Marai IFM, Daader AH, Bahgat LB: Performance traits of purebred Ossimi and Rahmani lambs and their crosses with Finnsheep born under two accelerated mating systems. Arch Tierz. 2009, 52: 497-51.
Google Scholar
Akey JM: Constructing genomic maps of positive selection in humans: Where do we go from here. Genome Res. 2009, 19 (5): 711-722. 10.1101/gr.086652.108.
Article PubMed Central CAS PubMed Google Scholar
Qanbari S, Pimentel ECG, Tetens J, Thaller G, Lichtner P, Sharifi AR, Simianer H: A genome-wide scan for signatures of recent selection in Holstein cattle. Anim Genet. 2010, 41 (4): 377-89.
CAS PubMed Google Scholar
Sabeti PC, Schaffner SF, Fry B, Lohmueller J, Varilly P, Shamovsky O, Palma A, Mikkelsen TS, Altshuler D, Lande ES: Positive natural selection in the human lineage. Science. 2006, 3129 (5780): 1614-1620.
Article Google Scholar
Biswas S, Akey JM: Genomic insights into positive selection. TRENDS in Genetics. 2006, 22 (8): 437-436. 10.1016/j.tig.2006.06.005.
Article CAS PubMed Google Scholar
McRae K: Signatures of selective sweeps in parasite selection flocks. 2011, University of Otago, Zoology department: MSc thesis
Google Scholar
Akey JM, Zhang G, Zhang K, Jin L, Shriver MD: Interrogating a high density SNP map for signatures of natural selection. Genome Res. 2002, 12 (12): 1805-14. 10.1101/gr.631202.
Article PubMed Central CAS PubMed Google Scholar
Pollinger JP, Bustamante CD, Fledel-Alon A, Schmutz S, Gray MM, Wayne RK: Selective sweep mapping of genes with large phenotypic effects. Genome Research. 2005, 15: 1809-1819. 10.1101/gr.4374505.
Article PubMed Central CAS PubMed Google Scholar
Worley K, Carey J, Veitch A, Coltman DW: Detecting the signature of selection on immune genes in highly structured populations of wild sheep (Ovis dalli). Mol Ecol. 2006, 15 (3): 623-637. 10.1111/j.1365-294X.2006.02829.x.
Article CAS PubMed Google Scholar
Hayes BJ, Lien S, Nilsen H, Olsen HG, Berg P, MacEachern S, Potter S, Meuwissen THE: The origin of selection signatures on bovine chromosome six. Anim Genet. 2008, 39 (2): 105-111. 10.1111/j.1365-2052.2007.01683.x.
Article CAS PubMed Google Scholar
Prasad A, Schnabel RD, McKay SD, Murdoch B, Stothard P, Kolbehdari D, Wang Z, Taylor JF, Moore SS: Linkage disequilibrium and signatures of selection on chromosomes 19 and 29 in beef and dairy cattle. Anim Genet. 2008, 39: 597-605. 10.1111/j.1365-2052.2008.01772.x.
Article PubMed Central CAS PubMed Google Scholar
Barendse W, Harrison BE, Bunch RJ, Thomas MB, Turner LB: Genome wide signatures of positive selection: The comparison of independent samples and the identification of regions associated to traits. BMC Genomics. 2009, 10: 178-10.1186/1471-2164-10-178.
Article PubMed Central PubMed Google Scholar
Dayo GK, Thevenon S, Berthier D, Moazami-Goudarzi K, Denis C, Cuny G, Eggen A, Gautier M: Blackwell Publishing Ltd Detection of selection signatures within candidate regions underlying trypanotolerance in outbred cattle populations. Molecular Ecology. 2009, 18: 1801-1813. 10.1111/j.1365-294X.2009.04141.x.
Article CAS PubMed Google Scholar
Flori L, Fritz S, Jaffre zic F, Boussaha M, Gut I, Heath S, Foulley JL, Gautier M: The genome response to artificial selection: a case study in dairy cattle. PLoS ONE. 2009, 4 (8): e6595-10.1371/journal.pone.0006595.
Article PubMed Central PubMed Google Scholar
Gu J, Orr N, Park SD, Katz LM, Sulimova G, MacHugh DE, Hill EW: A genome scan for positive selection in thoroughbred horses. PLoS ONE. 2009, 4 (6): e5767-10.1371/journal.pone.0005767.
Article PubMed Central PubMed Google Scholar
Hayes BJ, Chamberlain AJ, Maceachern S, Savin K, McPartlan H, MacLeod I, Sethuraman L, Goddard ME: A genome map of divergent artificial selection between Bos taurus dairy cattle and Bos taurus beef cattle. Anim Genet. 2009, 40 (2): 176-184. 10.1111/j.1365-2052.2008.01815.x.
Article CAS PubMed Google Scholar
MacEachern S, Hayes B, McEwan J, Goddard M: An examination of positive selection and changing effective population size in Angus and Holstein cattle populations (Bos taurus) using a high density SNP genotyping platform and the contribution of ancient polymorphism to genomic diversity in Domestic cattle. BMC Genomics. 2009, 10: 181-10.1186/1471-2164-10-181.
Article PubMed Central PubMed Google Scholar
Pariset L, Joost S, Ajmone-Marsan P, Valentini A: Econogene Consortium(EC): Landscape genomics and biased F_ST approaches reveal single nucleotide polymorphisms under selection in goat breeds of North-East Mediterranean. BMC Genetics. 2009, 10: 7-
Article PubMed Central PubMed Google Scholar
The Bovine HapMap Consortium: Genome wide survey of SNP variation uncovers the genetic structure of cattle breeds. Science. 2009, 324 (5926): 528-532.
Article PubMed Central Google Scholar
Stella A, Ajmone-Marsan P, Lazzari B: Boettcher Paul: Identification of selection signatures in cattle breeds selected for dairy production. Genetics. 2010, 185: 1451-1461. 10.1534/genetics.110.116111.
Article PubMed Central CAS PubMed Google Scholar
Qanbari S, Gianola D, Hayes B, Schenkel F, Miller S, Moore S, Thaller G, Simianer H: Application of site and haplotype-frequency based approaches for detecting selection signatures in cattle. BMC Genomics. 2011, 12: 318-10.1186/1471-2164-12-318.
Article PubMed Central PubMed Google Scholar
Vatankhah M, Talebi MA: Heritability estimates and correlations between production and reproductive traits in Lori-Bakhtiari sheep in Iran. South African Journal of Animal Science. 2008, 38 (2): 110-118.
Google Scholar
Kamalzadeh A, Rajabaigy M, Kiasat A: Livestock production systems and trends in livestock industry in Iran. J Agri Soc Sci. 2008, 04: 183-88.
Google Scholar
Dalrymple BP, Kirkness EF, Nefedov M, McWilliam S, Ratnakumar A, Barris W, Zhao S, Shetty J, Maddox JF, O'Grady M: Using comparative genomics to reorder the human genome sequence into a virtual sheep genome. Genome Biology. 2007, 8 (7): R152-10.1186/gb-2007-8-7-r152.
Article PubMed Central PubMed Google Scholar
Weir BS, Cockerham CC: Estimating F-statistics for the analysis of population structure. Evolution. 1984, 38 (6): 1358-1370. 10.2307/2408641.
Article Google Scholar
The UCSC Genome Browser. [http://genome.ucsc.edu/]
Casas E, Stone RT, Keele JW, Shackelford SD, Kappes SM, Koohmaraie M: A comprehensive search for quantitative trait loci affecting growth and carcass composition of cattle segregating alternative forms of the myostatin gene. J Anim Sci. 2001, 79: 854-860.
CAS PubMed Google Scholar
McClure MC, Morsci NS, Schnabel RD, Kim JW, Yao P, Rolf MM, McKay SD, Gregg SJ, Chapple RH, Northcutt SL, Taylor JF: A genome scan for quantitative trait loci influencing carcass, post-natal growth and reproductive traits in commercial Angus cattle. Animal Genetics. 2010, 41: 597-607. 10.1111/j.1365-2052.2010.02063.x.
Article CAS PubMed Google Scholar
Casas E, Shackelford SD, Keele JW, Stone RT, Kappes SM, Koohmaraie M: Quantitative trait loci affecting growth and carcass composition of cattle segregating alternate forms of myostatin. J Anim Sci. 2000, 78: 560-569.
CAS PubMed Google Scholar
MacNeil MD, Grosz MD: Genome-wide scans for QTL affecting carcass traits in Hereford × composite double. J Anim Sci. 2002, 80: 2316-2324.
CAS PubMed Google Scholar
Casas E, Shackelford SD, Keele JW, Koohmaraie M, Smith TPL, Stone RT: Detection of quantitative trait loci for growth and carcass composition in cattle. J Anim Sci. 2003, 81: 2976-2983.
CAS PubMed Google Scholar
Li C, Basarab J, Snelling WM, Benkel B, Murdoch B, Hansen C, Moore SS: Assessment of positional candidate genes MYF5 and IGF1 for growth on bovine chromosome 5 in commercial lines of Bos taurus. J Anim Sci. 2004, 82: 1-7.
CAS PubMed Google Scholar
Schrooten C, Bink MCAM, Bovenhuis H: Whole genome scans to detect chromosomal regions affecting multiple traits in dairy cattle. J Dairy Sci. 2004, 87: 3550-3560. 10.3168/jds.S0022-0302(04)73492-X.
Article CAS PubMed Google Scholar
Ashwell MS, Heyen DW, Weller JI, Ron M, Sonstegard TS, Van Tassell CP, Lewin HA: Detection of quantitative trait loci influencing conformation traits and calving ease in Holstein-Friesian cattle. J Dairy Sci. 2005, 88: 4111-4119. 10.3168/jds.S0022-0302(05)73095-2.
Article CAS PubMed Google Scholar
Weicker JJ, Brumfield RT: Estimating the unbiased estimator θ for population genetic survey data. Evolution. 2001, 55 (12): 2601-2605.
CAS PubMed Google Scholar
Falconer DS, Mackay TFC: Introduction to quantitative genetics. 1996, Addison Wesley Longman, Harlow, Essex, UK, 72-73. 3
Google Scholar
Stephens JC, Reich DE, Goldstein DB, Shin HD, Smith MW, Carrington M, Winkler C, Huttley GA, Allikmets R, Schriml L: Dating the origin of the CCR5-D32 AID-resistance allele by the coalescence of haplotypes. Am J Hum Genet. 1998, 62: 1507-1515. 10.1086/301867.
Article PubMed Central CAS PubMed Google Scholar
Tecofsky RTJ, Gemmell NJ, Johnson PL, McEwan J: Searching for genetic markers for yellow fat in New Zealand sheep [abstract]. In proceedings of the ComBio Conference. 2009
Google Scholar
Becker D, Tetens J, Brunner A, Burstel D, Ganter M, Kijas J, Drogemuller C: Microphthalmia in Texel sheep is associated with a missense mutation in the paired-like homeodomain 3 PITX gene. PLoS ONE. 2010, 5 (1): e8689-10.1371/journal.pone.0008689.
Article PubMed Central PubMed Google Scholar
Voight BF, Kudaravalli S, Wen XQ, Pritchard JK: A map of recent positive selection in the human genome. PLoS Biology. 2006, 4 (3): e72-10.1371/journal.pbio.0040072.
Article PubMed Central PubMed Google Scholar
Sabeti PC, Varilly P, Fry B, Lohmueller J, Hostetter E, Cotsapas C, Xie XH, Byrne EH, McCarroll SA, Gaudet R: Genome-wide detection and characterization of positive selection in human populations. Nature. 2007, 449 (7164): 913-U912. 10.1038/nature06250.
Article PubMed Central CAS PubMed Google Scholar
Davies W, Isles AR, Wilkinson LS: Imprinted genes and mental dysfunction. Ann Med. 2001, 33 (6): 428-436. 10.3109/07853890108995956.
Article CAS PubMed Google Scholar
Helms C: Salting out Procedure for Human DNA extraction. The Donis-Keller Lab - Lab Manual Homepage. [http://humgen.wustl.edu/hdk_lab_manual/dna/dna2.html]
Kijas J, Lenstra JA, Hayes B, Boitard S, Neto LP, Cristobal MS, Servin B, McCulloch R, Whan V, Gietzen K: Genome wide analysis of the world's sheep breeds reveals high levels of historic mixture and strong recent selection. PLoS Biology.
The Wellcome Trust Case Control Consortium: Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007, 447 (7145): 661-78. 10.1038/nature05911.
Article PubMed Central Google Scholar
Teo YY, Fry AE, Clark TG, Tai ES, Seielstad M: On the usage of HWE for identifying genotyping errors. Annals of Human Genetics. 2007, 71: 701-703. 10.1111/j.1469-1809.2007.00356.x.
Article CAS PubMed Google Scholar
Abdi H: Bonferroni and Šidàk corrections for multiple comparisons. Encyclopedia of Measurement and Statistics. Edited by: Salkind N. 2007, Sage: Thousand Oaks, 103-107.
Google Scholar
Maynard-Smith J, Haigh J: The hitch-hiking effect of a favourable gene. Genet Res. 1974, 23: 23-35. 10.1017/S0016672300014634.
Article Google Scholar
Wright S: The genetical structure of populations. Ann Eugen. 1951, 15: 323-354.
Article CAS PubMed Google Scholar
Fisher RA: Statistical Methods for Research Workers. 1932, London: Oliver and Boyd
Google Scholar
Krumsiek J, Arnold R, Rattei T: Gepard: A rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics. 2007, 23 (8): 1026-1028. 10.1093/bioinformatics/btm039.
Article CAS PubMed Google Scholar
The Ovine Genome Browser. [http://www.livestockgenomics.csiro.au/sheep/oar1.0.php]
Maizel JV, Lenk RP: Enhanced graphic matrix analysis of nucleic acid and protein sequences. Proceedings of the National Academy of Sciences of the United States of America. 1981, 78 (12): 7665-7669. 10.1073/pnas.78.12.7665.
Article PubMed Central CAS PubMed Google Scholar
Scheet P, Stephens M: A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. Am J Hum Genet. 2006, 78 (4): 629-644. 10.1086/502802.
Article PubMed Central CAS PubMed Google Scholar
Hill WG, Robertson A: Linkage disequilibrium in finite populations. Theoretical and Applied Genetics. 1968, 38: 226-231. 10.1007/BF01245622.
Article CAS PubMed Google Scholar
Hayes BJ, Visscher PM, McPartlan HC, Goddard ME: Novel multilocus measure of linkage disequilibrium to estimate past effective population size. Genome Res. 2003, 13 (4): 635-643. 10.1101/gr.387103.
Article PubMed Central CAS PubMed Google Scholar
Tenesa A, Navarro P, Hayes BJ, Duffy DL, Clarke GM, Goddard ME, Visscher PM: Recent human effective population size estimated from linkage disequilibrium. Genome Res. 2007, 17 (4): 520-526. 10.1101/gr.6023607.
Article PubMed Central CAS PubMed Google Scholar
Villa-Angulo R, Matukumalli LM, Gill CA, Choi G, Van Tassle CP, Grefenstette JJ: High-resolution haplotype block structure in the cattle genome. BMC Genetics. 2009, 10: 19-
Article PubMed Central PubMed Google Scholar

Download references

Acknowledgements

The authors gratefully acknowledge the International Sheep Genomics Consortium for access to the Ovine HapMap genotypes prior to publication and also Animal Breeding Center of Iran (ABCI) for access to the records and animals of the Iranian breeds. Thanks to the staff from university of Tehran, Animal Science Research Institute of Iran and AgResearch, especially Dr. Abbasi, Dr. Amirinia, Dr. Rudiger Brauning, Ms. Rayna Anderson, Ms. Kathryn McRae and Ms. Natalie Pickering who helped and supported this research. The authors also acknowledge the financial contributions of Animal Science Research Institute of Iran, Mobarakandish Institute and AgResearch, New Zealand.

Author information

Authors and Affiliations

Department of Animal Science-Excellent centre for improving sheep carcass quality and quantity, University of Tehran, PO Box 3158711167-4111, Karaj, Iran
Mohammad Hossein Moradi, Ardeshir Nejati-Javaremi & Mohammad Moradi-Shahrbabak
Centre for Reproduction and Genomics, AgResearch, Invermay, Mosgiel, New Zealand
Ken G Dodds & John C McEwan

Authors

Mohammad Hossein Moradi
View author publications
You can also search for this author in PubMed Google Scholar
Ardeshir Nejati-Javaremi
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Moradi-Shahrbabak
View author publications
You can also search for this author in PubMed Google Scholar
Ken G Dodds
View author publications
You can also search for this author in PubMed Google Scholar
John C McEwan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mohammad Hossein Moradi.

Additional information

Authors' contributions

MHM planned, performed the analyses and drafted the manuscript, ANJ contributed to the Iranian data set collection and supervised the analysis, MMS coordinated the study and sample collection, KGD provided statistical and analysis support and JCM supervised the analysis and provided the Ovine HapMap Data set. All authors have contributed to the editing of the article, and approved the final manuscript.

Electronic supplementary material

12863_2011_985_MOESM1_ESM.PDF

Additional file 1:Figure S1: Dot plots comparing ovine sequences (y axis), and their corresponding area on bovine genome (x axis) for different regions. (PDF 986 KB)

12863_2011_985_MOESM2_ESM.PDF

Additional file 2:Figure S2: Change in effective population size across generations as estimated from linkage disequilibrium data. (PDF 37 KB)

Additional file 3:Table S3: Summary table for data cleaning in the Zel-Lori Bakhtiari data set. (PDF 68 KB)

Additional file 4:Table S4: Summary table for data cleaning in the Ovine HapMap data set. (PDF 63 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Moradi, M.H., Nejati-Javaremi, A., Moradi-Shahrbabak, M. et al. Genomic scan of selective sweeps in thin and fat tail sheep breeds for identifying of candidate regions associated with fat deposition. BMC Genet 13, 10 (2012). https://doi.org/10.1186/1471-2156-13-10

Download citation

Received: 28 August 2011
Accepted: 26 February 2012
Published: 26 February 2012
DOI: https://doi.org/10.1186/1471-2156-13-10

Genomic scan of selective sweeps in thin and fat tail sheep breeds for identifying of candidate regions associated with fat deposition

Abstract

Background

Results

Conclusions

Background

Results

Genotyping and data mining

Genomic distribution of FST in Zel-Lori Bakhtiari data set

Study of Bovine genes and published QTLs in regions showing evidence of selection

Genomic distribution of FST in Ovine HapMap data set and comparison with Zel-Lori Bakhtiari data set results

Study of median homozygosity in candidate regions

Estimation of effective population size and haplotype age

Discussion

Conclusions

Methods

Animal samples

Zel-Lori Bakhtiari data set

Ovine HapMap data set

Genotyping and data quality control

Analysis

Estimates of Wright's FST and Weir & Cockerham's Theta

Study of genes and QTLs in our candidate regions

Median run of homozygosity

Estimation of Ne and haplotype age

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Authors' contributions

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Genomic Data

Contact us

Genomic distribution of F_ST in Zel-Lori Bakhtiari data set

Genomic distribution of F_ST in Ovine HapMap data set and comparison with Zel-Lori Bakhtiari data set results

Estimates of Wright's F_ST and Weir & Cockerham's Theta