<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1753-6561-1-S1-S67</ui>
   <ji>1753-6561</ji>
   <fm>
      <dochead>Proceedings</dochead>
      <bibl>
         <title>
            <p>Detecting epistatic interactions contributing to human gene expression using the CEPH family data</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Li</snm>
               <fnm>Hua</fnm>
               <insr iid="I1"/>
               <email>hul@stowers-institute.org
</email>
            </au>
            <au id="A2">
               <snm>Gao</snm>
               <fnm>Guimin</fnm>
               <insr iid="I2"/>
               <email>guiming@uab.edu</email>
            </au>
            <au id="A3">
               <snm>Li</snm>
               <fnm>Jian</fnm>
               <insr iid="I3"/>
               <email>jili@vbi.vt.edu</email>
            </au>
            <au id="A4">
               <snm>Page</snm>
               <mi>P</mi>
               <fnm>Grier</fnm>
               <insr iid="I2"/>
               <email>gpage@ms.soph.uab.edu</email>
            </au>
            <au id="A5">
               <snm>Zhang</snm>
               <fnm>Kui</fnm>
               <insr iid="I2"/>
               <email>kzhang@ms.soph.uab.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Bioinformatics Center, Stowers Institute for Medical Research, 1000 East 50th Street, Kansas City, Missouri 64110, USA</p>
            </ins>
            <ins id="I2">
               <p>Section on Statistical Genetics, Department of Biostatistics, RPHB 327, 1530 3rd Avenue South, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA</p>
            </ins>
            <ins id="I3">
               <p>Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA 24061, USA</p>
            </ins>
         </insg>
         <source>BMC Proceedings</source>
         <supplement>
            <title>
               <p>Genetic Analysis Workshop 15: Gene Expression Analysis and Approaches to Detecting Multiple Functional Loci</p>
            </title>
            <editor>Heather J Cordell, Mariza de Andrade, Marie-Claude Babron, Christopher W Bartlett, Joseph Beyene, Heike Bickeb&#246;ller, Robert Culverhouse, Adrienne Cupples, E Warwick Daw, Jos&#233;e Dupuis, Catherine T Falk, Saurabh Ghosh, Katrina A Goddard, Ellen L Goode, Elizabeth R Hauser, Lisa J Martin, Maria Martinez, Kari E North, Nancy L Saccone, Silke Schmidt, William Tapper, Duncan Thomas, David Tritchler, Veronica J Vieland, Ellen M Wijsman, Marsha A Wilcox, John S Witte, Qiong Yang, Andreas Ziegler, Laura Almasy and Jean W MacCluer</editor>
            <note>Proceedings</note>
         </supplement>
         <conference>
            <title>
               <p>Genetic Analysis Workshop 15</p>
            </title>
            <location>St. Pete Beach, Florida, USA</location>
            <date-range>11&#8211;15 November 2006</date-range>
            <url>http://www.gaworkshop.org/</url>
         </conference>
         <issn>1753-6561</issn>
         <pubdate>2007</pubdate>
         <volume>1</volume>
         <issue>Suppl 1</issue>
         <fpage>S67</fpage>
         <url>http://www.biomedcentral.com/1753-6561/1/S1/S67</url>
         <xrefbib>
            <pubid idtype="pmpid">18466568</pubid>
         </xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>18</day>
               <month>12</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Li et al; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>It is believed that epistatic interactions among loci contribute to variations in quantitative traits. Several methods are available to detect epistasis using population-based data. However, methods to characterize epistasis for quantitative traits in family-based association analysis are not well developed, especially for studying thousands of gene expression traits. Here, we proposed a linear mixed-model approach to detect epistasis for quantitative traits using family data. The proposed method was implemented in a widely used software program SOLAR. We evaluated the power of the method by simulation studies and applied this method to the analysis of the Centre d'Etude du Polymorphisme Humain family gene expression data provided by Genetics Analysis Workshop 15 (GAW15).</p>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>With the ability to measure simultaneously thousands of gene expression traits, understanding the causes of transcriptional variation has been of great interest. Genetic interactions, also called epistasis, have been shown to affect gene expression phenotypes. For example, Brem and Kruglyak <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> found that the genetic basis of transcripts in yeast is more often likely to be polygenic rather than monogenic and that in yeast epistasis effects are present in more than 15% of transcripts. Therefore, it is essential to analyze epistatic interactions between loci that contribute to variations in gene expression traits.</p>
         <p>Several statistical methods for studying epistatic interactions between loci for quantitative traits using populations of unrelated individuals or from experimental designs have been developed <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. For quantitative traits using family-based samples (related individuals), epistatic testing has been incorporated into the variance-component linkage analysis and implemented in the software SOLAR <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. However, epistatic detection on the basis of the linkage analysis can only locate the two interacting loci in wide confidence intervals and will have small power for data sets with small sample sizes, such as in the GAW15 (Genetic Analysis Workshop 15) CEPH (Centre d'Etude du Polymorphisme Humain) data set, which only contains 194 individuals. In this situation, association-based methods are expected to have higher power for detecting epistasis. A variety of approaches <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp> that focus on association testing can be used to detect epistasis <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. However, these are transmission-disequilibrium test (TDT)-based methods, which also have lower power in epistasis studies. Also, it is complicated to identify the best statistical model for the joint effects of loci including their interactions through model selection, particularly for analyzing thousands of gene expression traits for thousands of markers.</p>
         <p>In this paper, we have extended the association-based linear regression model <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp> by adding a random polygenic effect into the model to allow for familial data for epistasis detection of quantitative traits. The proposed linear mixed model was implemented in the widely used software program SOLAR <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, which calculates significance levels for each covariate, and performs covariate screening in the model. We applied the proposed method to a subset of the gene expression profiles in the CEPH data set as provided by GAW15.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Statistical methods</p>
            </st>
            <p>Based on the linear regression model of Cokerham and Zeng <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> (also see Cordell <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>), we propose a linear mixed model for detecting epistatic interactions for quantitative traits using family-based data:</p>
            <p>
               <display-formula id="M1">y = <it>&#956; </it>+ <it>a</it><sub>1</sub><it>x</it><sub>1 </sub>+ <it>d</it><sub>1</sub><it>z</it><sub>1 </sub>+ <it>a</it><sub>2</sub><it>x</it><sub>2 </sub>+ <it>d</it><sub>2</sub><it>z</it><sub>2 </sub>+ <it>i</it><sub><it>aa</it></sub><it>x</it><sub>1</sub><it>x</it><sub>2 </sub>+ <it>i</it><sub><it>ad</it></sub><it>x</it><sub>1</sub><it>z</it><sub>2 </sub>+ <it>i</it><sub><it>da</it></sub><it>z</it><sub>1</sub><it>x</it><sub>2 </sub>+ <it>i</it><sub><it>dd</it></sub><it>z</it><sub>1</sub><it>z</it><sub>2 </sub>+ <it>W&#946; </it>+ <it>v </it>+ <it>&#949;</it>.</display-formula>
            </p>
            <p>This model assumes diallelic marker loci and that <it>y </it>is a normally distributed quantitative gene expression phenotype from related individuals, <it>W </it>is a vector of fixed covariates such as sex effects, <it>&#946; </it>is the corresponding vector of coefficients, <it>v </it>is the random polygenic effect within a family, the vector of polygenic effects in each family follows multi-normal distribution N(0, 2A<inline-formula><m:math name="1753-6561-1-S1-S67-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>f</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiaacqWFdpWCdaqhaaWcbaacbaGae4NzaygabaGaeGOmaidaaaaa@30EC@</m:annotation></m:semantics></m:math></inline-formula>) where A is the kinship matrix and <inline-formula><m:math name="1753-6561-1-S1-S67-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>f</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiaacqWFdpWCdaqhaaWcbaacbaGae4NzaygabaGaeGOmaidaaaaa@30EC@</m:annotation></m:semantics></m:math></inline-formula> is the variance associated with vectors of polygenic effects, <it>a</it><sub><it>i </it></sub>and <it>d</it><sub><it>i </it></sub>are the additive and dominant effects, and <it>x</it><sub><it>i </it></sub>and <it>z</it><sub><it>i </it></sub>are dummy variables related to the genotypes at the locus <it>i</it>. For example, for a diallelic locus, we might set <it>x</it><sub><it>i </it></sub>= 1 and <it>z</it><sub><it>i </it></sub>= -0.5 for genotype <it>BB</it>, <it>x</it><sub><it>i </it></sub>= 0 and <it>z</it><sub><it>i </it></sub>= 0.5 for genotype <it>Bb</it>, and <it>x</it><sub><it>i </it></sub>= -1 and <it>z</it><sub><it>i </it></sub>= -0.5 for genotype <it>bb</it>, respectively. <it>i</it><sub><it>aa</it></sub>, <it>i</it><sub><it>ad</it></sub>, <it>i</it><sub><it>da</it></sub>, and <it>i</it><sub><it>dd </it></sub>are additive-additive, additive-dominant, and dominant-dominant interaction effects between the two loci, respectively, corresponding to epistatic interaction effects, and <it>&#949; </it>is the residual error, following normal distribution N(0, <inline-formula><m:math name="1753-6561-1-S1-S67-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>e</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiaacqWFdpWCdaqhaaWcbaacbaGae4xzaugabaGaeGOmaidaaaaa@30EA@</m:annotation></m:semantics></m:math></inline-formula>). Significant interaction effects imply presence of epistasis.</p>
            <p>To detect epistasis, for each gene expression phenotype, we ran Model (1) in SOLAR for each pair of single-nucleotide polymorphisms (SNPs) in the selected candidate regions (see Description of the data set for more details). The number of tests for each gene expression phenotype ranges from 6 to 820, depending on the marker density and size of the candidate regions selected for the epistasis search. For each gene expression phenotype, individual <it>p</it>-values were adjusted using false-discovery rate (FDR) under the general dependency assumption <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> within each phenotype. FDR-adjusted <it>p</it>-values equal to or less than 0.05 (FDR &#8804; 0.05) are considered to be significant.</p>
         </sec>
         <sec>
            <st>
               <p>Simulation study</p>
            </st>
            <p>We simulated a data set based on the pedigree structure from CEPH family data, which has 14 three-generation families of 194 individuals. We considered two unlinked diallelic markers in our analysis with allele frequency of 0.5. Marker genotypes for the grandparents were generated assuming Hardy-Weinberg equilibrium at each locus. Genotypes for parents and children were simulated conditional on their parental genotypes following Mendel's law. As an example we evaluated the power (true negative rate) and type I error (false-positive rate) of the proposed method in identifying additive-additive epistatic effect <it>i</it><sub><it>aa</it></sub>. Phenotypes of each individual was generated based on the Model (1), where <it>a</it><sub>1 </sub>= 0.2, <it>d</it><sub>1 </sub>= 0.01, <it>a</it><sub>2 </sub>= 0.2, <it>d</it><sub>2 </sub>= 0.01, <it>i</it><sub><it>ad </it></sub>= <it>i</it><sub><it>da </it></sub>= <it>i</it><sub><it>dd </it></sub>= 0, <it>&#946; </it>= 0.1 (the vector W only contains sex), <inline-formula><m:math name="1753-6561-1-S1-S67-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>f</m:mi><m:mn>2</m:mn></m:msubsup><m:mo>=</m:mo><m:msubsup><m:mi>&#963;</m:mi><m:mi>e</m:mi><m:mn>2</m:mn></m:msubsup><m:mo>=</m:mo><m:mn>0.5</m:mn></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiaacqWFdpWCdaqhaaWcbaacbaGae4NzaygabaGaeGOmaidaaOGamaeGg2da9iadGa4FdpWCdGaia2baaSqaiacGcWaia6xzaugabGaiakadGaiIYaGmaaGccWaGac3=aaGH9aqpcWaGac0=aaaIWaamcWaGac0=aaGGUaGlcWaGac0=aaaI1aqnaaa@4805@</m:annotation></m:semantics></m:math></inline-formula>. When evaluating power and type I error we set <it>i</it><sub><it>aa </it></sub>= 0.7 and <it>i</it><sub><it>aa </it></sub>= 0, respectively. These values were chosen based on the estimated values from analysis of selected 27 traits in the CEPH family data. We plotted receiver operating characteristic (ROC) curve by calculating specificity and sensitivity as we varied the nominal threshold for determining the significant epistasis, where:</p>
            <p>
               <display-formula>1 - specificity = (false positive)/(true negative + false positive)</display-formula>
            </p>
            <p>
               <display-formula>sensitivity = (true positive)/(true positive + false negative).</display-formula>
            </p>
         </sec>
         <sec>
            <st>
               <p>Description of the dataset</p>
            </st>
            <p>The CEPH family data provided by GAW15 includes 3554 Affymetrix<sup>&#174; </sup>gene expressions measured for 194 individuals from 14 three-generation CEPH families. In addition, 2882 autosomal and X-linked SNPs were typed for these individuals.</p>
            <p>The software package pedStat as distributed in Merlin version 1.0.1 <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> was used to check for Mendelian inconsistence, genotyping proportions, and heterozygosity of SNPs. SNP markers with minor allele frequencies less than 1% (equivalent to heterozygousity &lt; 1.98%), markers with greater than 30% missing genotypes, and markers with only two of the three possible genotypes were removed from analysis, which left 2436 SNPs for our analysis.</p>
            <p>We limited our analysis to the 27 gene expression phenotypes with the strongest linkage evidence of <it>cis </it>effects (Table <tblr tid="T1">1</tblr> from Cheung et al <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>). Their Table 1 listed one to two peak markers for each phenotype, where a peak marker is the SNP with the most significant finding in the genome-wide association analysis (GWA) for this phenotype. Fourteen of the 27 gene expression phenotypes exhibited <it>cis </it>regulation (with <it>cis </it>peak markers) by the GWA analysis. For the phenotype <it>PPAT </it>there are two peak markers that point to both <it>cis </it>and <it>trans </it>regulation for this gene. For the remaining 12 phenotypes, the peak markers are <it>trans </it>markers. In our study, for each of the 27 phenotypes we selected a 15-Mb candidate region centered on the target gene location. If a <it>trans </it>peak marker was identified in the GWA analysis, we also selected an additional 15-Mb candidate region centred on that marker. We analyzed the epistatic effects for all possible combinations of the SNPs within the candidate regions.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Simulated data</p>
            </st>
            <p>We calculated the ROC profiles for detecting interacting SNPs using linear mixed Model (1). An ROC curve measures the trade-off between sensitivity (true-positive rate, TPR) and 1 - specificity (false-positive rate, FPR) for different cut-offs (Figure <figr fid="F1">1</figr>). Each threshold value results in a sensitivity and 1 - specificity, which is represented by a point on the ROC curve. For 100 simulated data sets, the TPR starts at 0.82 and quickly increases to 0.95, with FPR less than 0.11. When the nominal threshold is 0.05 for the significant calls of epistasis, the power is 0.95 and the type I error is 0.08 in identifying the additive by additive epistasis effect <it>i</it><sub><it>aa</it></sub>. In summary, simulation results indicate the proposed linear mixed model has high sensitivity (power) and specificity for detecting SNP-pair interaction in family data for two unlinked markers.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Receiver operating characteristic (ROC) curves for 100 simulated data</p>
               </caption>
               <text>
                  <p>Receiver operating characteristic (ROC) curves for 100 simulated data.</p>
               </text>
               <graphic file="1753-6561-1-S1-S67-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>CEPH family data</p>
            </st>
            <p>When applying the linear mixed model to the analysis of the CEPH data, a total of six SNP pairs showed significant epistatic interactions for three gene expression phenotypes (Table <tblr tid="T1">1</tblr>). Two out of six significant epistatic effects are from SNPs that are located on two regions from different chromosomes. These two regions are corresponding to the <it>cis </it>linkage region and the <it>trans </it>association marker. These results show gene expression regulation cannot be simplified as <it>cis </it>or <it>trans </it>regulation because both <it>cis </it>or <it>trans </it>effects simultaneously contribute to variation in message RNA expression level. The strong epistatic interactions between SNPs could be interpreted as the strong epistatic interactions between genes if the nearby genes are in linkage disequilibrium (LD) with SNPs. For example, the SNP rs1537638 is within the gene <it>PTK7 </it>(protein tyrosine kinase 7), which is located on chromosome 6 and shows significant interaction with the SNP rs1505694, which is close to the gene <it>ITGB1BP1 </it>(integrin <it>&#946;</it>1 binding protein 1) on chromosome 2. This suggests that these two genes might physically or genetically interact with each other or be involved in the same biological process. In fact, proteins encoded by these two genes are indeed involved in the cell adhesion process, in which <it>PTK7 </it>is a receptor tyrosine kinase transducing extracellular signals across the cell membrane and <it>ITGB1BP1 </it>plays an important role during integrin-dependent cell adhesion by binding the <it>&#946;</it>1 integrin cytoplasmic domain.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>FDR adjusted <it>p</it>-values of significant epistatic effects detected under the full mode with the model selection between two loci located on different chromosomes in the analysis of the CEPH family data</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="center">
                        <p>Gene (location)</p>
                     </c>
                     <c ca="center">
                        <p>SNP1 (location)</p>
                     </c>
                     <c ca="center">
                        <p>SNP2 (location)</p>
                     </c>
                     <c ca="center">
                        <p>a<sub>1</sub>a<sub>2</sub></p>
                     </c>
                     <c ca="center">
                        <p>a<sub>1</sub>d<sub>2</sub></p>
                     </c>
                     <c ca="center">
                        <p>d<sub>1</sub>a<sub>2</sub></p>
                     </c>
                     <c ca="center">
                        <p>d<sub>1</sub>d<sub>2</sub></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>ITGB1BP1 (2p25.1)</p>
                     </c>
                     <c ca="center">
                        <p>rs1505694 (2p25.2)</p>
                     </c>
                     <c ca="center">
                        <p>rs1537638 (6p21.1)</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0291<sup>a</sup></b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0092</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.3725</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>TM7SF3 (12p11.23)</p>
                     </c>
                     <c ca="center">
                        <p>rs575030</p>
                        <p>11q23.3</p>
                     </c>
                     <c ca="center">
                        <p>rs725291</p>
                        <p>11q24.2</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0341</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0387</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>rs725291</p>
                        <p>11q24.2</p>
                     </c>
                     <c ca="center">
                        <p>rs1944819</p>
                        <p>11q24.2</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0351</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0341</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>rs725291</p>
                        <p>11q24.2</p>
                     </c>
                     <c ca="center">
                        <p>rs674237</p>
                        <p>11q24.2</p>
                     </c>
                     <c ca="center">
                        <p>0.0800</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0341</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>rs753013</p>
                        <p>12p11.22</p>
                     </c>
                     <c ca="center">
                        <p>rs1492332</p>
                        <p>12q12</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0341</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.4308</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>PPAT (4q12)</p>
                     </c>
                     <c ca="center">
                        <p>rs1824965 (4q12)</p>
                     </c>
                     <c ca="center">
                        <p>rs39068 (7p15.1)</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.1377</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0351</b>
                        </p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>Significant interaction effects are in bold.</p>
               </tblfn>
            </tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion and discussion</p>
         </st>
         <p>We have presented an association-based method for detecting epistatic interactions for quantitative traits using family data, and applied this method to the analysis of gene expression phenotypes of CEPH family data provided by GAW15. When we applied the proposed method to the CEPH data, we detected six SNP pairs that showed significant epistatic interactions for 3 gene expression phenotypes among the 27 phenotypes analyzed. We study the epistasis among genes by analyzing the interactions of SNPs located in the corresponding genes. This kind of epistasis detected from statistical tests does not necessarily correspond to the classic model of epistasis. Strong epistatic interactions among SNPs may not always indicate biological interactions among genes.</p>
         <p>Although we demonstrated the association-based linear mixed-model approach for analyzing 27 phenotypes, the method is mainly proposed for analyzing thousands of phenotypes in genome-wide study. In general, one could identify two interacting linkage regions (QTL intervals) using two-dimensional genome linkage scan by allowing a higher a false-positive rate. Or one could do stepwise search of two interacting QTLs by identifying one primary QTL and then searching for the secondary QTL conditional on the primary locus being linked <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Once the two candidate intervals (regions) have been identified, the proposed linear mixed-model approach in this work could be used for epistasis detection between SNPS.</p>
         <p>The proposed linear mixed model could be implemented in two steps by first regressing out the fixed effects (not including genetic effects) and polygenic effects and then detecting genetic interaction effects using predicted residuals from the first-step analysis. This strategy is attractive because of its flexibility in identifying the best statistical model for the joint effects of loci and computational efficiency for analyzing thousands of gene expression traits in genome-wide study.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The author(s) declare that they have no competing interests.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors thank Arcady Mushegian, and Dongxiao Zhu for helpful discussion and scientific editing. The authors acknowledge the financial support from Stowers Institute for Medical Research and as well as grants from the National Institutes of Health (R01ES09912 (GG), U54CA100949 (GPP, GG), R01NS043530 (GPP), and R01GM074913 (KZ)).</p>
            <p>This article has been published as part of <it>BMC Proceedings </it>Volume 1 Supplement 1, 2007: Genetic Analysis Workshop 15: Gene Expression Analysis and Approaches to Detecting Multiple Functional Loci. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/1753-6561/1?issue=S1</url>.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The landscape of genetic complexity across 5700 gene expression traits in yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Brem</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>1572</fpage>
            <lpage>1577</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0408709102</pubid>
                  <pubid idtype="pmpid" link="fulltext">15659551</pubid>
                  <pubid idtype="pmcid">547855</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Design III with marker loci</p>
            </title>
            <aug>
               <au>
                  <snm>Cokerham</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>ZB</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1996</pubdate>
            <volume>143</volume>
            <fpage>1437</fpage>
            <lpage>1456</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1207411</pubid>
                  <pubid idtype="pmpid" link="fulltext">8807314</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Epistasis. What it means, what it doesn't mean, and statistical methods to detect it in human</p>
            </title>
            <aug>
               <au>
                  <snm>Cordell</snm>
                  <fnm>HJ</fnm>
               </au>
            </aug>
            <source>Hum Mol Genet</source>
            <pubdate>2002</pubdate>
            <volume>11</volume>
            <fpage>2463</fpage>
            <lpage>2468</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/hmg/11.20.2463</pubid>
                  <pubid idtype="pmpid" link="fulltext">12351582</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation</p>
            </title>
            <aug>
               <au>
                  <snm>Nelson</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Kardia</snm>
                  <fnm>SLR</fnm>
               </au>
               <au>
                  <snm>Ferrell</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Sing</snm>
                  <fnm>CF</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>458</fpage>
            <lpage>470</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311041</pubid>
                  <pubid idtype="pmpid" link="fulltext">11230170</pubid>
                  <pubid idtype="doi">10.1101/gr.172901</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Detecting epistatic interactions contributing to quantitative traits</p>
            </title>
            <aug>
               <au>
                  <snm>Culverhouse</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Klein</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Shannon</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Genet Epidemiol</source>
            <pubdate>2004</pubdate>
            <volume>27</volume>
            <fpage>141</fpage>
            <lpage>152</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/gepi.20006</pubid>
                  <pubid idtype="pmpid" link="fulltext">15305330</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Multiple interval mapping for quantitative trait loci</p>
            </title>
            <aug>
               <au>
                  <snm>Kao</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>ZB</fnm>
               </au>
               <au>
                  <snm>Teasdale</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1999</pubdate>
            <volume>152</volume>
            <fpage>1203</fpage>
            <lpage>1216</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1460657</pubid>
                  <pubid idtype="pmpid" link="fulltext">10388834</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Multipoint quantitative trait linkage analysis in general pedigrees</p>
            </title>
            <aug>
               <au>
                  <snm>Almasy</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Blangero</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1998</pubdate>
            <volume>62</volume>
            <fpage>1198</fpage>
            <lpage>1211</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1377101</pubid>
                  <pubid idtype="pmpid" link="fulltext">9545414</pubid>
                  <pubid idtype="doi">10.1086/301844</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A unified framework for transmission-disequilibrium test analysis of discrete and continuous traits</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Tritchler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bull</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>Genet Epidemiol</source>
            <pubdate>2002</pubdate>
            <volume>22</volume>
            <fpage>26</fpage>
            <lpage>40</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/gepi.1041</pubid>
                  <pubid idtype="pmpid" link="fulltext">11754471</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Family-based tests of association and linkage that use unaffected sibs covariates, and interactions</p>
            </title>
            <aug>
               <au>
                  <snm>Lunetta</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Faraone</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Biederman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Laird</snm>
                  <fnm>NM</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2000</pubdate>
            <volume>66</volume>
            <fpage>605</fpage>
            <lpage>614</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1288113</pubid>
                  <pubid idtype="pmpid" link="fulltext">10677320</pubid>
                  <pubid idtype="doi">10.1086/302782</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The control of the false discovery rate in multiple testing under dependency</p>
            </title>
            <aug>
               <au>
                  <snm>Benjamini</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yekutieli</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Ann Stat</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>1165</fpage>
            <lpage>1188</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1214/aos/1013699998</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Merlin-rapid analysis of dense genetic maps using sparse gene flow trees</p>
            </title>
            <aug>
               <au>
                  <snm>Abecasis</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Cherny</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Cookson</snm>
                  <fnm>WO</fnm>
               </au>
               <au>
                  <snm>Cardon</snm>
                  <fnm>LR</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>97</fpage>
            <lpage>101</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng786</pubid>
                  <pubid idtype="pmpid" link="fulltext">11731797</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Mapping determinants of human gene expression by regional and whole genome association</p>
            </title>
            <aug>
               <au>
                  <snm>Cheung</snm>
                  <fnm>VG</fnm>
               </au>
               <au>
                  <snm>Spielman</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Ewens</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>Weber</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Morley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Burdick</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <fpage>1365</fpage>
            <lpage>1369</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04244</pubid>
                  <pubid idtype="pmpid" link="fulltext">16251966</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Multiple locus linkage analysis of genomewide expression in yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Akey</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>1380</fpage>
            <lpage>1390</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1371/journal.pbio.0030267</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
