<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-10-422</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>Allele-specific expression assays using Solexa</p>
         </title>
         <aug>
            <au ca="yes" id="A1">
               <snm>Main</snm>
               <mi>J</mi>
               <fnm>Bradley</fnm>
               <insr iid="I1"/>
               <email>bmain@usc.edu</email>
            </au>
            <au id="A2">
               <snm>Bickel</snm>
               <mi>D</mi>
               <fnm>Ryan</fnm>
               <insr iid="I1"/>
               <email>rbickel@usc.edu</email>
            </au>
            <au id="A3">
               <snm>McIntyre</snm>
               <mi>M</mi>
               <fnm>Lauren</fnm>
               <insr iid="I2"/>
               <email>mcintyre@mgm.ufl.edu</email>
            </au>
            <au id="A4">
               <snm>Graze</snm>
               <mi>M</mi>
               <fnm>Rita</fnm>
               <insr iid="I2"/>
               <email>rgraze@ufl.edu</email>
            </au>
            <au id="A5">
               <snm>Calabrese</snm>
               <mi>P</mi>
               <fnm>Peter</fnm>
               <insr iid="I1"/>
               <email>petercal@usc.edu</email>
            </au>
            <au id="A6">
               <snm>Nuzhdin</snm>
               <mi>V</mi>
               <fnm>Sergey</fnm>
               <insr iid="I1"/>
               <email>snuzhdin@usc.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Section of Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, USA</p>
            </ins>
            <ins id="I2">
               <p>Departments of Molecular Genetics and Microbiology and Statistics, University of Florida, Gainesville, Florida 32610-1399, USA</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>1</issue>
         <fpage>422</fpage>
         <url>http://www.biomedcentral.com/1471-2164/10/422</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19740431</pubid>
               <pubid idtype="doi">10.1186/1471-2164-10-422</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>21</day>
               <month>4</month>
               <year>2009</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>9</day>
               <month>9</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>9</day>
               <month>9</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Main et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Allele-specific expression (ASE) assays can be used to identify <it>cis</it>, <it>trans</it>, and <it>cis</it>-by-<it>trans </it>regulatory variation. Understanding the source of expression variation has important implications for disease susceptibility, phenotypic diversity, and adaptation. While ASE is commonly measured via relative fluorescence at a SNP, next generation sequencing provides an opportunity to measure ASE in an accurate and high-throughput manner using read counts.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We introduce a Solexa-based method to perform large numbers of ASE assays using only a single lane of a Solexa flowcell. In brief, transcripts of interest, which contain a known SNP, are PCR enriched and barcoded to enable multiplexing. Then high-throughput sequencing is used to estimate allele-specific expression using sequencing counts. To validate this method, we measured the allelic bias in a dilution series and found high correlations between measured and expected values (r>0.9, p &lt; 0.001). We applied this method to a set of 5 genes in a <it>Drosophila simulans </it>parental mix, F1 and introgression and found that for these genes the majority of expression divergence can be explained by <it>cis</it>-regulatory variation.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We present a new method with the capacity to measure ASE for large numbers of assays using as little as one lane of a Solexa flowcell. This will be a valuable technique for molecular and population genetic studies, as well as for verification of genome-wide data sets.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Genotype-phenotype mapping is a fundamental aim of biological science. This is critical for many goals such as understanding of how genetic architecture shapes phenotypic variation and adaptation <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>, and more specific aims such as deciphering how genetic variation in humans may affect response to treatment <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Many genetic variants resulting in phenotypic differences are mediated through changes in gene expression. Thus, analyzing gene expression allows us to better understand genotypic variation. Variation in gene expression can be due to polymorphisms both at the gene locus (<it>cis</it>) and in other genes that influence its expression (<it>trans</it>), as well as the non-additive interactions between the two (<it>cis</it>-by<it>-trans</it>) <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Furthermore, epigenetic mechanisms <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, chromatin conformation <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, copy number variation <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp> and microRNA <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> all play important roles in the transcription of a given gene. By partitioning regulatory variation into <it>cis, trans</it>, and <it>cis</it>-by-<it>trans</it>, we can identify their respective contributions to changes in gene expression and potentially how expression levels evolve within the genomes of complex organisms <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>.</p>
         <p>Allele-specific expression (ASE) studies have introduced a creative method to uncover the respective contributions of <it>cis</it>- and <it>trans</it>-regulatory variation <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. First, total expression differences are measured from a pooled sample of two homozygous lines. Then, <it>cis</it>-regulatory variation is estimated from the allelic imbalance (unequal expression of alleles) in the corresponding F1 heterozygote, where each allele is regulated by the same <it>trans</it>-factors <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. <it>Trans </it>can then be inferred from the total expression differences that are not explained by <it>cis</it>. Of course, these inferences about <it>cis</it>- and <it>trans</it>-regulatory variation can be complicated if <it>cis</it>-elements and <it>trans</it>-factors interact non-additively <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. Allelic imbalance in non-imprinted genes has been shown to be common in mice, maize and humans <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Also, a few studies have investigated <it>cis</it>- and <it>trans</it>-regulatory variation within and between species of <it>Drosophila</it>. Measuring ASE, Wittkopp <it>et al</it>. reported that <it>cis</it>-regulatory variation plays a predominant role in divergent gene expression between <it>D. melanogaster </it>and <it>D. simulans </it><abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <p>Allele-specific expression has been measured using various targeted approaches including reverse transcription-PCR (asRT-PCR; <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>), and pyrosequencing <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B24">24</abbr></abbrgrp>. Genome-wide approaches have also been used including serial analysis of gene expression (SAGE) <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>, massively parallel signature sequence (MPSS)<abbrgrp><abbr bid="B27">27</abbr><abbr bid="B21">21</abbr></abbrgrp>, and microarray-based methods <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B28">28</abbr></abbrgrp>. Here, we introduce a simple approach to ASE assays that combines a targeted approach to gene expression assays with the power of high-throughput sequencing. In brief, transcripts of interest (containing a known SNP) are PCR enriched and barcoded to enable large-scale multiplexing. Using this approach, we sequence only regions of interest and allele-specific read counts are used to estimate ASE for large numbers of samples using a single lane of a Solexa flowcell (Figure <figr fid="F1">1</figr> and <figr fid="F2">2</figr>). To demonstrate its application, we investigated variation in gene expression in a set of five <it>Drosophila simulans </it>genes. Using a common tester line, we measured ASE in an equal mix of homozygotes (parental mix), a heterozygote, and an introgression (Figure <figr fid="F3">3</figr>). By analyzing changes in ASE under these different regulatory conditions, we elucidate the respective contributions of <it>cis, trans</it>, and <it>cis</it>-by-<it>trans </it>interactions on variation in gene expression. Furthermore, we tested for within-species variation in <it>cis </it>and <it>trans </it>by comparing trends in ASE between six highly inbred lines collected from a single population.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>ASE using Solexa</p>
            </caption>
            <text>
               <p><b>ASE using Solexa</b>. Summary of sample preparation: (a) The forward primer is designed to anneal 1 bp upstream of the known SNP. The 5' tail contains a unique barcode and adapter sequences necessary for hybridization to the Solexa sequencing platform. (b) Model of target enrichment and allele-specific expression represented as sequencing coverage per allele.</p>
            </text>
            <graphic file="1471-2164-10-422-1"/>
         </fig>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Count-based ASE assays</p>
            </caption>
            <text>
               <p><b>Count-based ASE assays</b>. Allele-specific expression represented as sequencing reads per W-allele (black) and tester allele (white). The sequencing coverage per allele is shown on each bar and the corrected RNA (*) is represented as a proportion of each allele. The y-axis is the proportion of the difference (allele1- allele2)/(allele1 + allele2).</p>
            </text>
            <graphic file="1471-2164-10-422-2"/>
         </fig>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Nucleic acid samples</p>
            </caption>
            <text>
               <p><b>Nucleic acid samples</b>. The three nucleic acid sample types used in this study: the parental mix, F1 heterozygote, and introgression lines. (a) Representation of all chromosomes for each genetic background assayed. Red represents the W line and black represents the tester line. (b) Zoom-in of chromosome 3 for each genotype. All genes analyzed are between the genetic markers <it>scarlet </it>(<it>st</it>) and <it>ebony </it>(<it>e</it>).</p>
            </text>
            <graphic file="1471-2164-10-422-3"/>
         </fig>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Digital ASE assay</p>
            </st>
            <p>In this study, we introduce an accurate approach to allele-specific expression assays that relies on read counts generated from Solexa sequencing. For each gene of interest, a single nucleotide polymorphism (SNP) within the mRNA transcript was identified that differs between the lines of interest and our tester line. We then designed a PCR primer that annealed immediately flanking the 5' end of the SNP and another primer that annealed 200-300 base pairs downstream (Figure <figr fid="F1">1</figr>). In the primer design, we included adapter sequences, provided by Illumina, as 5' tails to allow these PCR products to be sequenced on the Solexa platform without additional steps. When we planned on analyzing more than one sample per gene, a unique forward primer was ordered for each sample that contained the common elements described above, plus a unique one to three base pair barcode sequence in the 5' tail that will allow for individual sample identification (Figure <figr fid="F1">1a</figr>). We used a touchdown PCR cycling program to enrich each sample for our target region and then purified the amplicons using gel extraction. We then pooled the purified samples in large numbers (in our case, 300) and sequenced them in parallel on a single lane of a Solexa flow cell. The resulting reads were analyzed by assigning each read to a specific gene based on homology and sample based on the unique barcode (see methods). We can then compare the number of reads containing each SNP to have a digital representation of ASE (Figure <figr fid="F1">1b</figr> and <figr fid="F2">2</figr>).</p>
            <p>In all comparisons, both alleles were amplified in the same reaction and thus utilized the same reagents. As a result, each allele should theoretically maintain the same relative abundance throughout amplification. However, this may not be the case if small differences in primer hybridization or polymerase efficiency exist between alleles. We can control for this error in amplification by analyzing heterozygous DNA samples, where the actual allele frequency is 50:50, and then correcting each RNA sample by the difference observed in the heterozygote (<abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and Additional file <supplr sid="S1">1</supplr>). DNA samples from each mixed sample were also analyzed in order to correct for both allele-specific amplification error and differences in cell number between the individuals extracted from each parental line <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. We report allelic imbalance as: the proportion of reads that are differentially expressed ((allele1 - allele2)/(allele1 + allele2)). This approach allows us to easily estimate the binomial sampling error. If there is no difference between alleles, the bias is zero, while the bias is positive if the first allele is favored and negative if the second allele is favored.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>Supplemental materials</b>. "Supplemental material"</p>
               </text>
               <file name="1471-2164-10-422-S1.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>To verify the accuracy of Solexa and the normalization procedure, we created three replicate dilution series using genomic DNA from the tester line and an experimental line (W line 84). Then we estimated the allelic bias at each step of the dilution (in multiplex) using a separate sequencing lane. All four genes demonstrated a strong correlation (r >0.9 p &lt; 0.001) between the expected and observed allelic bias (Figure <figr fid="F4">4</figr>). The gene <it>dsx </it>had a relatively low correlation, because there was very limited sequencing coverage for all samples within this gene. Thus, the Solexa sequencing output can be used to accurately measure the relative abundance of alleles in a given sample.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Verification of ASE assays using Solexa</p>
               </caption>
               <text>
                  <p><b>Verification of ASE assays using Solexa</b>. Each data point represents one of three replicate dilutions analyzed at each step of the dilution series (9:1, 8:2, 7:3, 5:5, 3:7, 2:8, 1:9). The distribution of sequencing reads within each gene is demonstrated as follows: mean &#177; SE (n = 21). DSX = 11.3 &#177; 3.8, CG2604 = 2,478.3 &#177; 288.8, CG10824 = 2,756.4 &#177; 333.5, CG11459 = 11,578.6 &#177; 2515.1.</p>
               </text>
               <graphic file="1471-2164-10-422-4"/>
            </fig>
            <p>By analyzing technical variation and correlation coefficients in the verification experiment, it appears that coverage on the order of a few hundred reads is sufficient to yield reproducible results (Figure <figr fid="F4">4</figr> and Additional file <supplr sid="S1">1</supplr>). Thus, increased biological replication should be favored over sequencing coverage beyond a few hundred reads (Additional file <supplr sid="S1">1</supplr>: Table S3). In this study, coverage varied widely between genes (Additional file <supplr sid="S1">1</supplr>: Table S1), while coverage of individual assays within each gene was similar (Additional file <supplr sid="S1">1</supplr>: Table S3). This is most likely due to variation in initial transcript abundance, variation in amplification efficiency between genes and the resulting uneven pooling of DNA between genes. Therefore, we suggest that if the pooling of DNA is done carefully based on the concentration of each purified sample or PCR amplicon band intensity, the sequencing coverage can be more evenly distributed. This will also increase the number of high-quality ASE assays that can be performed on a single lane of Solexa.</p>
            <p>The cost per base of sequence using Solexa is very low, so for large-scale projects, the preparation costs such as barcoded primers become the limiting economic factor. To address these concerns, we suggest one of the following approaches depending on the type of project: For studies where many assays are performed with a few select genes, barcode costs can be significantly reduced if paired-end sequencing reads are combined with multiplicative barcoding. For example, instead of ordering 900 barcoded adapters for a given gene, we can create 900 unique barcode combinations using only 30 barcoded forward primers and 30 barcoded reverse primers. For studies involving large gene sets, barcoded adapter sequences can be added to typical gene-specific annealing primers before PCR using single-strand ligation. Using this approach, barcoded adapter sequences can be used in multiple experiments. We have shown that Solexa can effectively estimate ASE using this targeted approach and we mention these additional modifications to allow easier adaptation for future researchers.</p>
         </sec>
         <sec>
            <st>
               <p>Error analysis</p>
            </st>
            <p>To understand the error in quantifying ASE, we looked at sources of variation at multiple levels. First, we estimated sequencing error at the SNP position by identifying the proportion of reads with an incorrect base-call. Sequencing error was well below 2% for most of the genes. Furthermore, this error rate did not seem to change when the position of the SNP within the read shifted due to barcode lengths changing from one to three base pairs (Additional file <supplr sid="S1">1</supplr>: Table S2). Second, we estimated the binomial sampling error and found that this error quickly becomes negligible with the high coverage obtained with this method (Additional file <supplr sid="S1">1</supplr>: Table S3). Third, we assessed the error introduced by the method, such as polymerase and reverse transcription error by comparing allelic imbalance between technical replicates (cDNA templates created separately from the same RNA pool). This technical variation was considerably more than binomial sampling once coverage was over a few hundred reads (Additional file <supplr sid="S1">1</supplr>: Table S3). And finally, to analyze overall variation in expression estimates, including dynamic gene expression <it>in vivo</it>, we compared ASE estimates between biological replicates (separately collected material from the same genetic cross). The biological variation was greater than technical variation in this study, but an accurate assessment of the relative magnitude of technical and biological variance is beyond the scope of this study and thus, both should be considered when designing future experiments (Additional file <supplr sid="S1">1</supplr>: Table S3).</p>
         </sec>
         <sec>
            <st>
               <p><it>Cis</it>- and <it>trans</it>-regulatory variation within species</p>
            </st>
            <p>We used six highly inbred lines of <it>D. simulans </it>from Winters, CA (W Lines) and a <it>scarlet ebony </it>(<it>st e</it>) tester line in this study. For each W line, we compared expression to the tester line in a parental mix (<it>i.e</it>. an equal mix of homozygous tester and W line flies), the related F1 heterozygote, and a corresponding introgression (see methods for details) (Figure <figr fid="F3">3</figr>). This experimental design allows us to assess intraspecific regulatory variation in <it>cis</it>, <it>trans </it>and <it>cis-by-trans</it>. To do this, we estimated the overall expression differences in four genes in the aforementioned parental mix. Then, we determined <it>cis</it>-regulatory variation from the allelic imbalance present in the related F1 heterozygote, where <it>trans</it>-factors affect each allele equally. We can then infer <it>trans </it>from the overall expression difference that is not explained by <it>cis</it>. In the four genes analyzed, <it>cis</it>-regulatory variation explains the majority of the overall divergence in gene expression between the tester line and the W lines (Figure <figr fid="F5">5</figr>, regression coefficient = 0.91 &#177; 0.13, <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>). It should be noted that this is a small gene set and most of these genes have been shown to be variable within-species (see methods). Thus, we do not consider this result to be a reflection of the genome as a whole. One explanation for the small contribution of <it>trans </it>in these genes is that <it>trans</it>-variation has an increased potential for pleiotropic effects, some of which may be deleterious and removed by purifying selection in the W lines tested.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Intraspecific regulatory variation due to <it>cis </it>and <it>trans</it></p>
               </caption>
               <text>
                  <p><b>Intraspecific regulatory variation due to <it>cis </it>and <it>trans</it></b>. Expression bias is the proportion of the difference between alleles: (allele1- allele2)/(allele1 + allele2). The parental mix is plotted on the x-axis and the F1 heterozygote is plotted on the y-axis. A 1:1 relationship would indicate 100% <it>cis </it>and a slope of zero would indicate 100% <it>trans</it>. A 1:1 line is shown for reference. The vast majority of the expression differences between the W lines and the tester line are attributed to <it>cis</it>, for the four genes and six lines tested (regression coefficient = 0.91 &#177; 0.13). Color code: line 84 - blue, line 181 - red, line 147 - purple, line 192 - orange, and line 68 - green.</p>
               </text>
               <graphic file="1471-2164-10-422-5"/>
            </fig>
            <p>Gene expression is a result of <it>cis</it>-regulatory elements interacting with <it>trans</it>-regulatory proteins. If there is variation in both <it>cis</it>-elements and <it>trans</it>-proteins, there is the potential for non-additive interactions (<it>cis</it>-by-<it>trans</it>) <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. To test for <it>cis</it>-by-<it>trans</it>, we compared allelic imbalance in the heterozygote and the introgression within each W line (Figure <figr fid="F3">3</figr>). If all genes act additively, allelic imbalance in the heterozygote and introgression should be equal because the <it>cis</it>-regulatory elements for each allele and the trans-factors within each genotype are identical. If allelic imbalance between these genotypes is not equal, that difference is due to <it>cis</it>-by-<it>trans</it>. We lacked the statistical power to individually detect these <it>cis</it>-by-<it>trans </it>interactions, but we were able to test for a systematic shift in expression between these genotypes across all genes and W lines. We hypothesized that if the <it>cis</it>-elements and <it>trans</it>-factors within the tester line and the W lines co-evolved, non-additive interactions would be relatively common and therefore we might detect significant <it>cis</it>-by-<it>trans </it>effects across all genes and lines. To test for this, we analyzed the relationship between heterozygous and introgression ASE values for the 6 W lines and 4 genes using a linear model. The regression coefficient is not significantly different from one (p = 0.5) and thus, there is no systematic <it>cis</it>-by-<it>trans </it>interactions detected within our sample population and selected genes. This result, together with recent findings seems to be indicating that at least within <it>Drosophila</it>, epistasis may be rare or is small in magnitude <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. This may also be due to the lack of population structure in <it>Drosophila</it>, which would hinder the co-evolution of divergent <it>cis</it>-elements and <it>trans</it>-factors.</p>
            <p>To compliment our previous estimates of <it>cis </it>and <it>trans </it>between the tester line and the W lines, we measured variation in <it>cis </it>between W lines in order to give additional insight into the type and magnitude of regulatory variation that may be segregating in natural populations. To identify this variation we tested for a significant effect of a given W line on the overall estimate of <it>cis </it>and <it>trans</it>. Using this approach we were unable to detect significant variation between lines (p = 0.31). We did however find evidence for variation between the W lines and the tester line suggesting that genetic variation may be more common between populations (Figure <figr fid="F2">2</figr>).</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We have presented a new method that uses Solexa to accurately measure ASE for hundreds of samples using as little as one lane of a Solexa flowcell. This will be a valuable technique for analyzing a few genes for many individuals or under many conditions, measuring a large selection of genes for a few individuals, and verifying ASE estimates from genome-wide expression assays.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Fly strains and genotypes</p>
            </st>
            <p><it>D. simulans </it>lines used in this study were collected from the Wolfskill orchard in Winters, CA and inbred for a minimum of 20 generations to create highly inbred lines (W lines) <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. A <it>D. simulans </it>tester line containing the visible mutations <it>scarlet (st) </it>and <it>ebony (e) </it>(<it>st </it>((chr3L: 15833418-15836165), <it>e </it>(chr3R: 4397293-4404648)) was obtained from the Drosophila stock center at UCSD (stock number 14021-0251.041) and inbred for over 20 generations. We analyzed ASE in a parental mix, heterozygote, and introgression for each W line against the common tester line (lines: 84, 181, 147, 201, 68, 192) (Figure <figr fid="F3">3</figr>). To create the parental mix, homozygous male flies from a given W line were homogenized together with homozygous male flies from the tester line. The heterozygous genotype was the F1 male progeny from a cross between a given W line male and two virgin tester line females. And finally, to construct introgression lines, F1 heterozygous females were created from an initial cross between a given W line female and <it>st e </it>male. Then they were backcrossed to the <it>st e </it>line. After each generation, wild-type females were selected (heterozygous genotype at both markers) for subsequent backcrossing. This process was repeated for a minimum of 20 generations to create an introgression line. Thus the introgression line is heterozygous for approximately 12 Mb between the genetic markers <it>st </it>and <it>e </it>on the third chromosome, while the remainder of the genome is homozygous for the <it>st e </it>line (Figure <figr fid="F3">3</figr>). Two replicate introgression lines were created in parallel for each W line (introgression A and B) to control for expression differences due to different introgression cutoff points flanking the visible markers. Reciprocal crosses were not analyzed, but the effects of genomic imprinting, the only type of parent-of-origin effects that would impact ASE, are likely rare or do not exist in adult Drosophila <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. All flies were reared at ~25&#176;C with a 12 h:12 h light cycle.</p>
         </sec>
         <sec>
            <st>
               <p>Gene selection</p>
            </st>
            <p>We chose a set of genes within the introgressed region on chromosome three for further analysis (Figure <figr fid="F3">3b</figr>). Genes were chosen based on current interest in the lab (<it>dsx</it>) and previous analyses showing them to vary within species (<it>Cyp21c, CG10824, CG2604</it>) <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> or between species (<it>CG11459</it>) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. We resequenced most of the coding region of each gene and then designed our gene-specific primers based on the location of a single nucleotide polymorphism (SNP) between the W lines and the <it>st e </it>tester line. The <it>Cyp21c </it>primers non-specifically amplified a homologous gene (<it>Cyp4ac3</it>), thus it was not analyzed further.</p>
         </sec>
         <sec>
            <st>
               <p>Extraction of nucleic acids and cDNA synthesis</p>
            </st>
            <p>We extracted DNA and RNA in parallel from 14 whole-body adult male flies for each sample using a modified protocol for Promega's SV Total RNA Extraction Kit <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Experimental samples included a parental mix and a heterozygous DNA sample, along with cDNA from the parental mix, F1, and introgression A and B (see above) for each W line. We included replicate biological samples (separately collected material from the same genetic cross) for all experimental samples involving W line 181 and 84. Also, we included technical replicate (cDNA templates created separately from the same RNA pool) samples for all biological samples involving W lines 84 (Additional file <supplr sid="S3">3</supplr>). To extract nucleic acid from the parental mix, we homogenized seven homozygous male flies from a given W line together with seven homozygous male flies from the <it>st e </it>line. The heterozygous and the introgression genotypes were extracted from 14 male progeny. Before extraction, adult male flies (three to five days old) were snap-frozen in liquid nitrogen in the morning and stored at -70&#176;C until extraction. Briefly, we passed the supernatant from fly homogenate through an affinity column optimized for binding DNA. Then the flow-through was run through another column optimized to bind all RNA. We then treated RNA columns with DNAse, followed by subsequent washing steps and elution. Using Applied Biosystem's (AB) High-Capacity cDNA Reverse Transcription Kit we immediately synthesized cDNA from the RNA samples using AB's standard protocols with RNAse inhibitor. Following extraction, we stored all cDNA and DNA samples at -70&#176;C until further preparation.</p>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p><b>Sequencing counts</b>. "Raw counts" All sequencing counts for each sample.</p>
               </text>
               <file name="1471-2164-10-422-S3.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>PCR and purification</p>
            </st>
            <p>PCR primers for the ASE assay were designed such that one annealed immediately flanking the five prime (5') end of the SNP to be sequenced and the other, 200-300 base pairs downstream (3'). Then we modified each primer design with a custom 5' tail consisting of Solexa adapter sequences. Furthermore, to allow for multiplexing, a sample-specific barcode (one to three base pairs) was added to the design between the adapter and the annealing primer (Figure <figr fid="F1">1</figr> and Additional file <supplr sid="S2">2</supplr>).</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>Detailed protocol</b>. "ase using solexa protocol " Detailed Protocol</p>
               </text>
               <file name="1471-2164-10-422-S2.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>All PCR reactions were performed using Finnzymes Phusion High-Fidelity DNA Polymerase and a touchdown cycling program stepping down 1&#176; each cycle from 70&#176;C to 60&#176;C, followed by an additional 25 cycles at 60&#176;C annealing. Samples were run on a 2% agarose gel for amplicon size confirmation and agarose gel purification. PCR products were gel purified using Qiagen's Qiaex II Gel extraction kit and standard protocols. Following purification, a portion of each of the 300 PCR enriched samples was pooled together, ethanol precipitated and then resuspended to a concentration of 10 nM. The pooled sample was sequenced at Cornell's core sequencing center using a custom primer.</p>
         </sec>
         <sec>
            <st>
               <p>Data analysis</p>
            </st>
            <p>Using a custom Perl script, the 6 million reads generated from a single lane of a Solexa flow cell were assigned to one of our five genes based on the first eight base pairs (allowing for one mismatch) of the annealing primer. Within each gene pool, the reads were then separated into 60 unique ASE assays using the one to three base pair sample-specific barcodes (Figure <figr fid="F1">1</figr>). We then checked for a known sequence surrounding the SNP, including five base pairs 5' of the SNP and eight base pairs 3' of the SNP. If these 13 base pairs (5 bp pre-SNP + 8 bp post-SNP) could not be identified (allowing for a single mismatch) the reads were discarded. We then scored each read for the nucleotide in the SNP position to determine the allele-specific read count (Additional file <supplr sid="S3">3</supplr>).</p>
            <p>To determine the respective contributions of <it>cis</it>- and <it>trans</it>-regulatory variation, we compared ASE in the parental mix flies and the heterozygous flies. ASE differences in the parental mix reflect total variation in gene expression, including <it>cis </it>and <it>trans</it>. In contrast, <it>trans </it>is shared between alleles in the heterozygote, thus only <it>cis </it>contributes to ASE differences in this genotype. As a result, <it>trans </it>can be inferred from the percent of the total variation estimated in the parental mix (<it>cis+trans</it>) that is not explained by the variation estimated in the heterozygote (<it>cis</it>). To perform this test this we used the model: Y<sub>ij </sub>= &#956;+BX<sub>ij</sub>+e<sub>ij </sub>where for the i<sup>th </sup>W line and the j<sup>th </sup>gene, Y is the estimated difference between alleles in the heterozygote (<it>cis</it>) and X is the total difference in expression estimated from the parental mix. Then, we estimated <it>trans</it>-regulatory variation from the percent of the overall expression differences found in the parental mix that are not explained by this model (Figure <figr fid="F5">5</figr>). Missing data points are due to lack of a SNP within a given gene and W line.</p>
            <p>To identify <it>cis</it>-by<it>-trans </it>interactions (<it>i.e</it>. non-additive effects), we tested whether the allelic bias in heterozygotes was significantly different from the allelic bias in introgressions. To examine this effect, we fit the model: Y<sub>ij </sub>= &#956;+BZ<sub>ij</sub>+e<sub>ij </sub>where for the i<sup>th </sup>W line and for the j<sup>th </sup>gene, Y is the estimated difference between alleles for the heterozygote and Z is the estimated difference between alleles for the introgression line. The <it>cis</it>-regulatory elements for each allele in the heterozygote are identical to the <it>cis</it>-regulatory elements in the introgression, hence the allelic bias cannot vary due to <it>cis </it>between these genotypes. Furthermore, although the <it>trans</it>-factors change between genotypes (heterozygous W/<it>st e </it>to homozygous <it>st e</it>) the <it>trans</it>-factors are shared equally between alleles within each genotype and thus, differences in <it>trans </it>cannot contribute to allelic bias. As a result, any change in allelic bias between these genotypes must be due to <it>cis</it>-by-<it>trans </it>interactions.</p>
            <p>To test for natural variation (<it>i.e</it>. variation between W lines) in the relative contribution of <it>cis </it>and <it>trans</it>, we fit the model Y<sub>ij </sub>= &#956;+B<sub>1</sub>X<sub>ij</sub>+B<sub>2</sub>L<sub>i</sub>+e<sub>ij </sub>where for the i<sup>th </sup>line and the j<sup>th </sup>gene, Y is the average bias between alleles for the parental mix, and X is the average bias between alleles for the heterozygote. We then tested for the effect of line.</p>
            <p>To verify the accuracy of ASE assays using Solexa, we used a separate lane of a flow cell to analyze three replicate dilution series (1:9, 2:8, 3:7, 5:5, 7:3, 8:2, 9:1). These dilutions were created using genomic DNA extracted separately from homozygous W line 84 and the homozygous <it>st e </it>tester line (Figure <figr fid="F4">4</figr>). Each of the four genes analyzed in this study were tested for the ability to accurately detect known allele frequencies. Heterozygous DNA from a cross between these lines was also analyzed in parallel to correct for allele-specific amplification bias (<abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and Additional file <supplr sid="S1">1</supplr>). To correct for differences in starting concentration of the homozygous DNA used for the dilution series, we corrected the expected values by the average allelic bias found in the 1:1 mix <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Correlations between the known dilution and the measured ASE using Solexa were estimated separately for each gene using Pearson's correlation coefficient <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>BJM conceived the method and performed the experiments. SVN BJM RMG designed the experiment. RDB SVN BJM LMM PPC analyzed the data. BJM RDB LMM SVN wrote the paper. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We would like to thank Hyo-sik Jang for fly work. Also, we thank Joe Dunham and Johanna Main for discussions and comments on the method and paper, respectively. This work was supported by NIH grant RGM076643 (SVN) and 1R01GM077618 (LMM, SVN).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Quantitative genetic analyses of complex behaviours in Drosophila</p>
            </title>
            <aug>
               <au>
                  <snm>Anholt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mackay</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>838</fpage>
            <lpage>49</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1472</pubid>
                  <pubid idtype="pmpid" link="fulltext">15520793</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Genetic basis of fitness differences in natural populations</p>
            </title>
            <aug>
               <au>
                  <snm>Ellegren</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sheldon</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2008</pubdate>
            <volume>452</volume>
            <fpage>169</fpage>
            <lpage>75</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06737</pubid>
                  <pubid idtype="pmpid" link="fulltext">18337813</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Co-regulated transcriptional networks contribute to natural genetic variation in Drosophila sleep</p>
            </title>
            <aug>
               <au>
                  <snm>Harbison</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Carbone</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ayroles</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stone</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lyman</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mackay</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nat genet</source>
            <pubdate>2009</pubdate>
            <volume>41</volume>
            <fpage>371</fpage>
            <lpage>5</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng.330</pubid>
                  <pubid idtype="pmcid">2683981</pubid>
                  <pubid idtype="pmpid" link="fulltext">19234472</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Quantitative monitoring by polymerase colony assay of known mutations resistant to ABL kinase inhibitors</p>
            </title>
            <aug>
               <au>
                  <snm>Nardi</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Raz</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cao</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Stone</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cortes</snm>
                  <fnm>J</fnm>
               </au>
               <etal/>
            </aug>
            <source>Oncogene</source>
            <pubdate>2008</pubdate>
            <volume>27</volume>
            <fpage>775</fpage>
            <lpage>82</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.onc.1210698</pubid>
                  <pubid idtype="pmpid" link="fulltext">17684485</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Genetic mapping in human disease</p>
            </title>
            <aug>
               <au>
                  <snm>Altshuler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2008</pubdate>
            <volume>322</volume>
            <fpage>881</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1156409</pubid>
                  <pubid idtype="pmcid">2694957</pubid>
                  <pubid idtype="pmpid" link="fulltext">18988837</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Genetics of global gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Rockman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>862</fpage>
            <lpage>72</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1964</pubid>
                  <pubid idtype="pmpid" link="fulltext">17047685</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals</p>
            </title>
            <aug>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bird</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat genet</source>
            <pubdate>2003</pubdate>
            <volume>33</volume>
            <fpage>245</fpage>
            <lpage>54</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1089</pubid>
                  <pubid idtype="pmpid" link="fulltext">12610534</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Using genomics to study how chromatin influences gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Higgs</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Vernimmen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gibbons</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Annu Rev Genomics Hum Genet</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>299</fpage>
            <lpage>325</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genom.8.080706.092323</pubid>
                  <pubid idtype="pmpid" link="fulltext">17506662</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Segmental copy number variation shapes tissue transcriptomes</p>
            </title>
            <aug>
               <au>
                  <snm>Henrichsen</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Vinckenbosch</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Z&#246;llner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chaignat</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Pradervand</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sch&#252;tz</snm>
                  <fnm>F</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat genet</source>
            <pubdate>2009</pubdate>
            <volume>41</volume>
            <fpage>424</fpage>
            <lpage>429</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng.345</pubid>
                  <pubid idtype="pmpid" link="fulltext">19270705</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The impact of copy number variation on local gene expression in mouse hematopoietic stem and progenitor cells</p>
            </title>
            <aug>
               <au>
                  <snm>Cahan</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Izumi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Graubert</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nat genet</source>
            <pubdate>2009</pubdate>
            <volume>41</volume>
            <fpage>430</fpage>
            <lpage>437</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng.350</pubid>
                  <pubid idtype="pmpid" link="fulltext">19270704</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>MicroRNAs: genomics, biogenesis, mechanism, and function</p>
            </title>
            <aug>
               <au>
                  <snm>Bartel</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2004</pubdate>
            <volume>116</volume>
            <fpage>281</fpage>
            <lpage>97</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(04)00045-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">14744438</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Regulatory changes underlying expression differences within and between Drosophila species</p>
            </title>
            <aug>
               <au>
                  <snm>Wittkopp</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Haerum</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat genet</source>
            <pubdate>2008</pubdate>
            <volume>40</volume>
            <fpage>346</fpage>
            <lpage>350</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng.77</pubid>
                  <pubid idtype="pmpid" link="fulltext">18278046</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The genomics of gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Stamatoyannopoulos</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>2004</pubdate>
            <volume>84</volume>
            <fpage>449</fpage>
            <lpage>57</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ygeno.2004.05.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">15498452</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Allelic variation in human gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Yan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yuan</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Velculescu</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Vogelstein</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kinzler</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>297</volume>
            <fpage>1143</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1072545</pubid>
                  <pubid idtype="pmpid" link="fulltext">12183620</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Evolutionary changes in cis and trans gene regulation</p>
            </title>
            <aug>
               <au>
                  <snm>Wittkopp</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Haerum</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>430</volume>
            <fpage>85</fpage>
            <lpage>88</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02698</pubid>
                  <pubid idtype="pmpid" link="fulltext">15229602</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Allele-specific gene expression uncovered</p>
            </title>
            <aug>
               <au>
                  <snm>Knight</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Trends genet</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>113</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2004.01.001</pubid>
                  <pubid idtype="pmpid">15049300</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Independent effects of cis- and trans-regulatory variation on gene expression in Drosophila melanogaster</p>
            </title>
            <aug>
               <au>
                  <snm>Wittkopp</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Haerum</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2008</pubdate>
            <volume>178</volume>
            <fpage>1831</fpage>
            <lpage>5</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1534/genetics.107.082032</pubid>
                  <pubid idtype="pmcid">2278090</pubid>
                  <pubid idtype="pmpid" link="fulltext">18245838</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Detection of regulatory variation in mouse genes</p>
            </title>
            <aug>
               <au>
                  <snm>Cowles</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hirschhorn</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Altshuler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nat genet</source>
            <pubdate>2002</pubdate>
            <volume>32</volume>
            <fpage>432</fpage>
            <lpage>437</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng992</pubid>
                  <pubid idtype="pmpid" link="fulltext">12410233</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Epistasis: too often neglected in complex trait studies?</p>
            </title>
            <aug>
               <au>
                  <snm>Carlborg</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Haley</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>618</fpage>
            <lpage>25</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1407</pubid>
                  <pubid idtype="pmpid" link="fulltext">15266344</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Extensive Sex-Specific Nonadditivity of Gene Expression in Drosophila melanogaster</p>
            </title>
            <aug>
               <au>
                  <snm>Gibson</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genet</source>
            <pubdate>2004</pubdate>
            <volume>167</volume>
            <fpage>1791</fpage>
            <lpage>1799</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1534/genetics.104.026583</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Allelic variation of gene expression in maize hybrids</p>
            </title>
            <aug>
               <au>
                  <snm>Guo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rupe</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zinselmeier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Habben</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bowen</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2004</pubdate>
            <volume>7</volume>
            <fpage>1707</fpage>
            <lpage>16</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1105/tpc.022087</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Allelic variation in gene expression is common in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Lo</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gere</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Buetow</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>1855</fpage>
            <lpage>1862</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.885403</pubid>
                  <pubid idtype="pmcid">403776</pubid>
                  <pubid idtype="pmpid" link="fulltext">12902379</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Quantitative RT-PCR-Based Analysis of Allele-Specific Gene Expression</p>
            </title>
            <aug>
               <au>
                  <snm>Singer-Sam</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gao</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Meth Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>181</volume>
            <fpage>145</fpage>
            <lpage>52</lpage>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Single-nucleotide polymorphism analysis by pyrosequencing</p>
            </title>
            <aug>
               <au>
                  <snm>Ahmadian</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gharizadeh</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gustafsson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sterky</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Nyr&#233;n</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Uhl&#233;n</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Anal Biochem</source>
            <pubdate>2000</pubdate>
            <volume>280</volume>
            <fpage>103</fpage>
            <lpage>10</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/abio.2000.4493</pubid>
                  <pubid idtype="pmpid" link="fulltext">10805527</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Serial analysis of gene expression (LongSAGE) and 3' LongSAGE for transcriptome characterization and genome annotation</p>
            </title>
            <aug>
               <au>
                  <snm>Velculescu</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Vogelstein</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kinzler</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>11701</fpage>
            <lpage>6</lpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>5' Long serial analysis of gene expression (LongSAGE) and 3' LongSAGE for transcriptome characterization and genome annotation</p>
            </title>
            <aug>
               <au>
                  <snm>Wei</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chiu</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lipovich</snm>
                  <fnm>L</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>11701</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1073/pnas.0403514101</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays</p>
            </title>
            <aug>
               <au>
                  <snm>Brenner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bridgham</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Golda</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lloyd</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>D</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>18</volume>
            <fpage>630</fpage>
            <lpage>4</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/76469</pubid>
                  <pubid idtype="pmpid" link="fulltext">10835600</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Differential allelic expression in the human genome: a robust approach to identify genetic and epigenetic cis-acting mechanisms regulating gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Serre</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gurd</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ge</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sladek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sinnett</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Harmsen</snm>
                  <fnm>E</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2008</pubdate>
            <volume>2</volume>
            <fpage>1000006</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1371/journal.pgen.1000006</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>The genetic architecture of quantitative traits</p>
            </title>
            <aug>
               <au>
                  <snm>Mackay</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>2001</pubdate>
            <volume>35</volume>
            <fpage>303</fpage>
            <lpage>39</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genet.35.102401.090633</pubid>
                  <pubid idtype="pmpid" link="fulltext">11700286</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Common Pattern of Evolution of Gene Expression Level and Protein Sequence in Drosophila</p>
            </title>
            <aug>
               <au>
                  <snm>Nuzhdin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wayne</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Harmon</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>McIntyre</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>1308</fpage>
            <lpage>1317</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh128</pubid>
                  <pubid idtype="pmpid" link="fulltext">15034135</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Parent-of-origin effects on mRNA expression in Drosophila melanogaster not caused by genomic imprinting</p>
            </title>
            <aug>
               <au>
                  <snm>Wittkopp</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Haerum</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genet</source>
            <pubdate>2006</pubdate>
            <volume>173</volume>
            <fpage>1817</fpage>
            <lpage>1821</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1534/genetics.105.054684</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>On the Appropriateness of the Correlation Coefficient with a 0, 1 Dependent Variable</p>
            </title>
            <aug>
               <au>
                  <snm>Neter</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Maynes</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>J Am Stat Assoc</source>
            <pubdate>1970</pubdate>
            <volume>65</volume>
            <fpage>501</fpage>
            <lpage>509</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2284562</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
