<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1753-6561-1-S1-S8</ui>
   <ji>1753-6561</ji>
   <fm>
      <dochead>Proceedings</dochead>
      <bibl>
         <title>
            <p>A comparison of founder-only and all-pedigree-members genotype-expression association by regression analysis</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Suh</snm>
               <fnm>Young Ju</fnm>
               <insr iid="I1"/>
               <email>yjsprite@yahoo.co.kr</email>
            </au>
            <au id="A2">
               <snm>Lee</snm>
               <fnm>Hye-Soon</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>lhsberon@hanyang.ac.kr</email>
            </au>
            <au id="A3">
               <snm>Batliwalla</snm>
               <fnm>Franak</fnm>
               <insr iid="I2"/>
               <email>fb@nshs.edu</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Li</snm>
               <fnm>Wentian</fnm>
               <insr iid="I2"/>
               <email>wli@nslij-genetics.org</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>BK21 Research Division of Medicine and Department of Preventive Medicine, School of Medicine, Ewha Womans University, Seoul 158-710, Korea</p>
            </ins>
            <ins id="I2">
               <p>The Robert S. Boas Center for Genomics and Human Genetics, Feinstein Institute for Medical Research, North Shore LIJ Health System, 350 Community Drive, Manhasset, New York 11030 USA</p>
            </ins>
            <ins id="I3">
               <p>Division of Rheumatology, Department of Internal Medicine, Hanyang University Medical College, Seoul 133-792, Korea</p>
            </ins>
         </insg>
         <source>BMC Proceedings</source>
         <supplement>
            <title>
               <p>Genetic Analysis Workshop 15: Gene Expression Analysis and Approaches to Detecting Multiple Functional Loci</p>
            </title>
            <editor>Heather J Cordell, Mariza de Andrade, Marie-Claude Babron, Christopher W Bartlett, Joseph Beyene, Heike Bickeb&#246;ller, Robert Culverhouse, Adrienne Cupples, E Warwick Daw, Jos&#233;e Dupuis, Catherine T Falk, Saurabh Ghosh, Katrina A Goddard, Ellen L Goode, Elizabeth R Hauser, Lisa J Martin, Maria Martinez, Kari E North, Nancy L Saccone, Silke Schmidt, William Tapper, Duncan Thomas, David Tritchler, Veronica J Vieland, Ellen M Wijsman, Marsha A Wilcox, John S Witte, Qiong Yang, Andreas Ziegler, Laura Almasy and Jean W MacCluer</editor>
            <note>Proceedings</note>
         </supplement>
         <conference>
            <title>
               <p>Genetic Analysis Workshop 15</p>
            </title>
            <location>St. Pete Beach, Florida, USA</location>
            <date-range>11&#8211;15 November 2006</date-range>
            <url>http://www.gaworkshop.org/</url>
         </conference>
         <issn>1753-6561</issn>
         <pubdate>2007</pubdate>
         <volume>1</volume>
         <issue>Suppl 1</issue>
         <fpage>S8</fpage>
         <url>http://www.biomedcentral.com/1753-6561/1/S1/S8</url>
         <xrefbib>
            <pubid idtype="pmpid">18466582</pubid>
         </xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>18</day>
               <month>12</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Suh et al; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>Genotype-expression association analysis using linear regression may produce different test results depending on whether founders only or all pedigreed members are used. This difference is not due to the correlation of samples within a pedigree, because linear mixed models have been applied to account for that correlation. We investigated the possibility that the difference is due to a dependence of expression levels on, among other things, the generation number in the pedigree. Indeed, of the 30 or so studied expression quantitative traits, several of them show significant dependence on the generation number. We propose to use all pedigree members in genotype-expression association analyses whenever the complete genotyping information is available.</p>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The genotype-expression association analyses were performed by Morley et al. <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and Cheung et al. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> in unrelated CEPH founders only to avoid the issue of correlated samples. On the other hand, the problem of correlated samples has been dealt with using the mixed model approach in many fields <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, including animal and plant genetics <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. The mixed model with random effects accounts for the correlation by assuming a particular form of covariance structure <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The parameters in the mixed model are estimated by the maximum likelihood (or restricted maximum likelihood) approach with iterative algorithms. The correlation among pedigree members apparently can be handled by the mixed model. In fact, a recent paper used the genotype-expression data in Morley et al. <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> as an example to illustrate their version of linear mixed model <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>.</p>
         <p>In linear regression models (<it>y ~ ax+b</it>), the <it>x </it>variable in different samples is allowed to be correlated as long as the <it>y </it>variable is independent conditional on <it>x</it>. Using pedigree members might be a potential problem in genotype-expression association by regression analysis if expression levels in some pedigrees are systematically higher or lower than other pedigrees, thus the parameter <it>b</it>, for example, can vary from one pedigree to another in a random fashion. Note that this is a different issue from the correlated samples for allele frequency estimation in case-control analysis <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. In allele frequency estimation, correlation among pedigree members increases the variance of the estimation, while keeping the estimator unbiased <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>Morley et al. found 27 genes that exhibit strong <it>cis-</it>acting genotype-expression (or SNP-eQT) association signal <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. These signals are believed to be true positives not only because <it>cis </it>action makes much biological sense, but also because these genes are under linkage peaks. We used these genes in our study, but relaxed the <it>cis</it>-acting requirement: the SNP that shows the strongest association signal will be checked for our comparative study between using founders only and using all pedigree members. This procedure usually selects <it>trans</it>-acting SNPs, and thus the chance for false-positive SNP-eQT association is higher.</p>
         <p>A comparison of using founders and using all pedigree members might detect something else: consistency between gene expression in founders and in non-founders. If pedigree founders are not a representative subset of samples of a pedigree, using founders only will lead to biased selection. Recent studies have shown that many eQTs may be age-dependent <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, which may provide a biological basis for a possible founder bias. If an eQT is more closely associated with age than with a SNP's genotype, the genotype only partially explains the expression level. We will examine this issue here.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Selection of the SNP-eQT pairs</p>
            </st>
            <p>We selected 28 eQTs: 26 of the 27 gene expressions (there is no probe set in GAW data that matches ICAP-1A) listed in Table <tblr tid="T1">1</tblr> of Cheung et al. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, which show both <it>cis </it>linkage signal and strong association signals, and 2 eQTs that are of great interests for our investigation of rheumatoid arthritis, HLA-DRB1 <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and PTPN22 <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. After using a filtering procedure, we selected the SNP (out of 2263) that has the strongest association signal with the given eQT by the linear regression of eQT over three genotypes (coded as 0, 1, 2) using the 56 founder samples. Three eQTs (CSTB, DDX17, and HLA-DRB1) contribute two SNP-eQT pairs either because of the second SNP strongly associated with the eQT, or because of the multiple probe sets corresponding to the same eQT, leading to total 31 SNP-eQT pairs. Note that given an eQT, our procedure usually selects the SNP not located in the same region as the gene of that eQT (<it>tran</it>-acting); and SNP-eQT pairs used here are mostly different from those in Cheung et al. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, even though the list of eQTs is the same. For the last subsection of Results section, we also examined all possible SNP-eQT pairs, for 2263 SNPs and 3554 eQTs.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Averaged percentile value of 28 eQTs</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>eQT<sup>a</sup></p>
                     </c>
                     <c ca="left">
                        <p>mean(P)<sup>b</sup></p>
                     </c>
                     <c ca="left">
                        <p>sd(P)<sup>c</sup></p>
                     </c>
                     <c ca="left">
                        <p>median(P)<sup>d</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>LRAP</p>
                     </c>
                     <c ca="left">
                        <p>55.4</p>
                     </c>
                     <c ca="left">
                        <p>30.61</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>AA827892</p>
                     </c>
                     <c ca="left">
                        <p>52.57</p>
                     </c>
                     <c ca="left">
                        <p>31.16</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>PSPHL</p>
                     </c>
                     <c ca="left">
                        <p>51.63</p>
                     </c>
                     <c ca="left">
                        <p>29.51</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CPN &#215; 101</p>
                     </c>
                     <c ca="left">
                        <p>62.14</p>
                     </c>
                     <c ca="left">
                        <p>31.05</p>
                     </c>
                     <c ca="left">
                        <p>67.86</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CSTB</p>
                     </c>
                     <c ca="left">
                        <p>67.22</p>
                     </c>
                     <c ca="left">
                        <p>29.79</p>
                     </c>
                     <c ca="left">
                        <p>71.43</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RPS26</p>
                     </c>
                     <c ca="left">
                        <p>63.18</p>
                     </c>
                     <c ca="left">
                        <p>30.68</p>
                     </c>
                     <c ca="left">
                        <p>71.43</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GSTM2</p>
                     </c>
                     <c ca="left">
                        <p>51.28</p>
                     </c>
                     <c ca="left">
                        <p>30.09</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HLA-DRB2</p>
                     </c>
                     <c ca="left">
                        <p>50.1</p>
                     </c>
                     <c ca="left">
                        <p>29.14</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>IRF5</p>
                     </c>
                     <c ca="left">
                        <p>53.35</p>
                     </c>
                     <c ca="left">
                        <p>30.85</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HSD17B12</p>
                     </c>
                     <c ca="left">
                        <p>46.74</p>
                     </c>
                     <c ca="left">
                        <p>29.64</p>
                     </c>
                     <c ca="left">
                        <p>42.86</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GSTM1</p>
                     </c>
                     <c ca="left">
                        <p>52.27</p>
                     </c>
                     <c ca="left">
                        <p>32.44</p>
                     </c>
                     <c ca="left">
                        <p>46.43</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>PPAT</p>
                     </c>
                     <c ca="left">
                        <p>59.16</p>
                     </c>
                     <c ca="left">
                        <p>27.14</p>
                     </c>
                     <c ca="left">
                        <p>57.14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DDX17</p>
                     </c>
                     <c ca="left">
                        <p>67.63</p>
                     </c>
                     <c ca="left">
                        <p>26.24</p>
                     </c>
                     <c ca="left">
                        <p>71.43</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CTSH</p>
                     </c>
                     <c ca="left">
                        <p>59.7</p>
                     </c>
                     <c ca="left">
                        <p>29.84</p>
                     </c>
                     <c ca="left">
                        <p>62.91</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>POMZP3</p>
                     </c>
                     <c ca="left">
                        <p>56.65</p>
                     </c>
                     <c ca="left">
                        <p>31.54</p>
                     </c>
                     <c ca="left">
                        <p>57.14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CGI-96</p>
                     </c>
                     <c ca="left">
                        <p>75.87</p>
                     </c>
                     <c ca="left">
                        <p>20.58</p>
                     </c>
                     <c ca="left">
                        <p>78.57</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CHI3L2</p>
                     </c>
                     <c ca="left">
                        <p>57.42</p>
                     </c>
                     <c ca="left">
                        <p>30.25</p>
                     </c>
                     <c ca="left">
                        <p>57.14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>VAMP8</p>
                     </c>
                     <c ca="left">
                        <p>45.34</p>
                     </c>
                     <c ca="left">
                        <p>30.89</p>
                     </c>
                     <c ca="left">
                        <p>42.86</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>&#215;10IF3S8</p>
                     </c>
                     <c ca="left">
                        <p>60</p>
                     </c>
                     <c ca="left">
                        <p>28.59</p>
                     </c>
                     <c ca="left">
                        <p>64.29</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TM7SF3</p>
                     </c>
                     <c ca="left">
                        <p>64.07</p>
                     </c>
                     <c ca="left">
                        <p>29.1</p>
                     </c>
                     <c ca="left">
                        <p>66.76</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>IL16</p>
                     </c>
                     <c ca="left">
                        <p>44.2</p>
                     </c>
                     <c ca="left">
                        <p>26.2</p>
                     </c>
                     <c ca="left">
                        <p>42.86</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TC &#215; 10A1</p>
                     </c>
                     <c ca="left">
                        <p>54.61</p>
                     </c>
                     <c ca="left">
                        <p>30.17</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>S100A13</p>
                     </c>
                     <c ca="left">
                        <p>43.97</p>
                     </c>
                     <c ca="left">
                        <p>26.06</p>
                     </c>
                     <c ca="left">
                        <p>42.86</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SMARCB1</p>
                     </c>
                     <c ca="left">
                        <p>45.37</p>
                     </c>
                     <c ca="left">
                        <p>30.15</p>
                     </c>
                     <c ca="left">
                        <p>35.71</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CTBP1</p>
                     </c>
                     <c ca="left">
                        <p>51.41</p>
                     </c>
                     <c ca="left">
                        <p>32.13</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZNF85</p>
                     </c>
                     <c ca="left">
                        <p>34.72</p>
                     </c>
                     <c ca="left">
                        <p>24.54</p>
                     </c>
                     <c ca="left">
                        <p>28.57</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>PTPN22</p>
                     </c>
                     <c ca="left">
                        <p>31.67</p>
                     </c>
                     <c ca="left">
                        <p>19.13</p>
                     </c>
                     <c ca="left">
                        <p>28.57</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HLA-DRB1</p>
                     </c>
                     <c ca="left">
                        <p>36.2</p>
                     </c>
                     <c ca="left">
                        <p>28.94</p>
                     </c>
                     <c ca="left">
                        <p>23.08</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>eQT, name of expression quantitative trait.</p>
                  <p><sup>b</sup>mean(P), averaged percentile value of founders for a given eQT.</p>
                  <p><sup>c</sup>sd(P), standard deviation of founders' percentile values.</p>
                  <p><sup>d</sup>median(P), dmedian of founders' percentile value.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Linear regression with mixed effects and linear regression of covariates</p>
            </st>
            <p>When all pedigree members are used (194 samples), we considered two models to account for possible pedigree-specific effects on eQT: a random effect on the intercept a (MM1 for mixed model 1):<it>y</it><sub><it>ij </it></sub>= <it>a </it>+ <it>&#949;</it><sub><it>i </it></sub>+ <it>bx</it><sub><it>ij </it></sub>+ <it>&#949;</it><sub><it>ij</it></sub>, where <it>i </it>is the pedigree index and <it>j </it>is the person index; and random effects on both <it>a </it>and the slope <it>b </it>(MM2 for mixed model 2): <it>y</it><sub><it>ij </it></sub>+ <it>a </it>+ <it>&#949;</it><sub><it>i </it></sub>+ (<it>b </it>+ <it>&#948;</it><sub><it>i</it></sub>)<it>x</it><sub><it>ij </it></sub>+ <it>&#949;</it><sub><it>ij</it></sub>. As for the age effect, we simplified the issue by examining the effect of generation number (1, 2, 3) on expression level, disregarding pedigrees (labeled as "GC model" for generation as a covariate): <it>y</it><sub><it>ij </it></sub>= <it>a </it>+ <it>bx</it><sub><it>ij </it></sub>+ <it>c</it><sub>2</sub><it>g</it><sub>2,<it>ij </it></sub>+ <it>c</it><sub>3</sub><it>g</it><sub>3,<it>ij </it></sub>+ <it>&#949;</it><sub><it>ij</it></sub>. Note that the generation covariate is coded as a factor with two dummy variables, g<sub>2 </sub>(1 for generation 2; 0 otherwise) and g<sub>3 </sub>(1 for generation 3; 0 otherwise).</p>
         </sec>
         <sec>
            <st>
               <p><it>t</it>-Test assuming dominant or recessive model</p>
            </st>
            <p>To check the robustness of linear regression in which an additive model is assumed, we also carried out two <it>t</it>-tests for the expression levels by either combining genotypes 0 and 1 as one group (and genotype 2 as the second group), or combining genotype 1 and 2 as a group (and genotype 0 as another group). The best <it>p</it>-value of the two tests was selected.</p>
         </sec>
         <sec>
            <st>
               <p>Averaged percentile of founders</p>
            </st>
            <p>To examine whether founders tend to have higher or lower expression levels with respect to the non-founders, we determined the percentile value for each founder in a pedigree, for a specific eQT. For that eQT, this percentile value was averaged over all pedigrees. If the averaged founder percentile was close to 50%, there was no founder selection bias; and if it was close to 100% or 0%, founders were considered as a biased subset.</p>
         </sec>
         <sec>
            <st>
               <p>Programs used</p>
            </st>
            <p>Both SAS and R statistical packages were used.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results &amp; discussion</p>
         </st>
         <sec>
            <st>
               <p>Using founders only and using all pedigree members may lead to different association results</p>
            </st>
            <p>As seen in Additional file <supplr sid="S1">1</supplr>, a significant association for a SNP-eQT pair using founders only may lose its significance when all pedigree members are used (e.g., the DDX17-rs243404 pair). This observation is somewhat surprising, because there are several reasons to believe the opposite. First, extra samples in the 194-sample data set are offspring of those in the 56-sample data set, so if there is relatedness in their eQTs, it will only reinforce the association signal. Second, with a larger sample size in the 194-sample data set, we would expect a smaller <it>p</it>-value, instead of a larger, insignificant one. For the 31 SNP-eQT pairs listed in Additional file <supplr sid="S1">1</supplr>, 25 pairs' <it>p</it>-values are increased in the all-pedigree-member data set (using the naive approach, assuming all samples are independent), despite a tripling of the sample size.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Comparison of <it>p</it>-values in regression analyses for 31 selected SNP-eQT pairs</p>
               </text>
               <file name="1753-6561-1-S1-S8-S1.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Checking robustness of result by <it>t</it>-tests using dominant and recessive models</p>
            </st>
            <p>Because the linear regression used here implies an additive model, we also carry out two <it>t</it>-tests by grouping samples with the heterozygous genotype to those with one of the homozygous genotypes. For founders, the only SNP-eQT pair with <it>p</it>-value smaller than 4.4 &#215; 10<sup>-6</sup>(0.01/2263) is CSTB-rs157334 (<it>p</it>-value = 1.4 &#215; 10<sup>-6</sup>). When all pedigree members are used, CSTB-rs157334 is still the only pair that is significant at this level (<it>p</it>-value = 1.6 &#215; 10<sup>-10</sup>).</p>
         </sec>
         <sec>
            <st>
               <p>Correcting for sample relatedness by random mixed models</p>
            </st>
            <p>When the MM1 mixed model is applied to the 194-sample data, only two SNP-eQT pairs remain significant at the 4.4 &#215; 10<sup>-6 </sup>level: CSTB-rs157334 and HSD17B12-rs1334334. When MM2 mixed model is applied, only CSTB-rs157334 exhibits a <it>p</it>-value close to that level (5.3 &#215; 10<sup>-6</sup>). Interestingly, this is the only <it>cis</it>-acting pair among those listed in Additional file <supplr sid="S1">1</supplr>. If an eQT has stronger dependence on a SNP genotype and a lower variation from pedigree to pedigree, we expect the significant SNP-eQT association to survive the application of a mixed model, at least with the random effect on the intercept. However, some SNP-eQT pairs lose the test significance in both MM1 and MM2. The MM2 model describes a situation in which not only the eQT varies among pedigrees, but also its degree of dependence on SNP genotype changes with pedigrees. MM2 is a less realistic model than MM1.</p>
         </sec>
         <sec>
            <st>
               <p>Age/generation effect on eQT</p>
            </st>
            <p>To investigate why all-pedigree-member data set may exhibit a different regression result from the founders-only data set, we examined whether eQT changes from generation to generation. This is a simplified version of examining the age effect, as generation 1 (founders) are older than the second and the third generations. Under the GC regression model described in the Methods section, three <it>p</it>-values are listed in Additional file <supplr sid="S1">1</supplr>: these are for testing zero coefficients for the genotype (<it>b</it>), generation 2 (<it>c</it><sub>2</sub>), and generation 3 (<it>c</it><sub>3</sub>). Note that in this regression, pedigree member information is discarded.</p>
            <p>Five SNP-eQT pairs in Additional file <supplr sid="S1">1</supplr> show significant genotype association at the 4.4 &#215; 10<sup>-6 </sup>level after accounting for the generation effect: the two pairs mentioned above as well as RPS26-rs720428, CTSH-rs1021639, and GSTM1-rs1039337. On the other hand, significant association with the generation variable has been observed for these eQTs: DDX17 (4.8 &#215; 10<sup>-15</sup>) on generation 2; PTPN22 (2.8 &#215; 10<sup>-20</sup>), CGI-96 (8.4 &#215; 10<sup>-15</sup>), HLA-DRB1 (1.7 &#215; 10<sup>-8</sup>), ZNF85 (3.5 &#215; 10<sup>-8</sup>), IL16 (2.2 &#215; 10<sup>-6</sup>), and CSTB (6.6 &#215; 10<sup>-6</sup>) on generation 3. Interestingly, even though CSTB has a significant association with generation 3, it has an even stronger association with the genotype of rs157334.</p>
            <p>A very simple check on whether founders tend to have different expression levels from non-founders is to calculate their percentile value with respect to other samples in the pedigree. For example, for a 14-member pedigree, each member has a percentile value ranging from 1/14 to 1, in an increment of 1/14, and is determined by their expression level of a particular eQT. For 28 eQTs (the probe sets that represent the "consensus sequence" of DDX17 and HLA-DRB1 are selected, whereas those that are "example sequences" are discarded), their average, standard deviation, and median percentile values are listed in Table <tblr tid="T1">1</tblr>. It is clear that for some eQTs, founders are indeed a biased subset of the pedigree. Most of the examples mentioned above for having a significant generation dependence is regression model also show up in Additional file <supplr sid="S1">1</supplr> for having higher or lower averaged founder percentiles: DDX17 (68%), PTPN22 (32%), CGI-96 (76%), HLA-DRB1 (36%), ZNF85 (35%), and CSTB (67%).</p>
            <p>To view more directly the relative location of founders' expression with respect to other pedigree members, Figure <figr fid="F1">1</figr> shows the box-and-whisker plot of the 28 eQTs, with the founders marked by crosses.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Box-and-whisker plot of 28 eQTs for 14 pedigrees</p>
               </caption>
               <text>
                  <p><b>Box-and-whisker plot of 28 eQTs for 14 pedigrees</b>. The founders' expression level is marked by crosses (four founders per pedigree).</p>
               </text>
               <graphic file="1753-6561-1-S1-S8-1"/>
            </fig>
            <p>Yet another simple check for the founder effect on expression is the two-way ANOVA with pedigree as one factor and founder/non-founder as the second factor. Our result shows that 1) pedigree-dependence of the expression is significant for most eQTs (with the exception of four to five eQTs); 2) founder-dependence of expression is significant for nine eQTs at the <it>p</it>-value = 0.001 level. These nine eQTs are CSTB, RPS26, DDX17, CGI96, TM7SF3, IL16, ZNF85, PTPN22, and HLA-DRB1, consistent with the result from the percentile value calculation; 3) for the founder-pedigree interaction term, four eQTs are significant at the <it>p</it>-value = 0.001 level.</p>
         </sec>
         <sec>
            <st>
               <p>One-stage versus two-stage analysis</p>
            </st>
            <p>Although it does not apply to the GAW data, in a practical setting, genome-wide genotyping information may only be available for the first stage of a study. In this situation, one may select SNPs that show promising association signals, and only type these SNPs for the rest of the pedigree (second stage) in order to save cost. It is also suggested by Van Steen et al. that two-stage design also helps to ease the multiple testing problem <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. If the whole genome genotyping information is available for all pedigree members, one should use all samples in the analysis, while correcting the sample correlation by appropriate procedures (such as the mixed model procedure discussed here).</p>
            <p>To test where a two-stage design would lead us, we imagine a hypothetical situation in which we do not have the genotyping information for non-founders. Then we would first exhaustively perform all possible genotype-expression linear regression analyses for the 56 founders. There are more than 8 millions possible SNP-eQT pairs, leading to an equal number of <it>p</it>-values. From these results, for each eQT, the minimum <it>p</it>-values of all 2263 SNPs can be recorded. There are 47 eQTs that have a minimum <it>p</it>-value lower than 4.4 &#215; 10<sup>-6</sup>. Now we assume in the second-stage that these selective SNPs are typed for non-founders. For the corresponding eQT-SNP pair, another linear regression analysis on all 194 members can be carried out, as well as a linear regression analysis with the generation variable as covariates. If we do that, among these 47 eQT-SNP pairs, only two remain significant at the 4.4 &#215; 10<sup>-6 </sup>level for the 194-sample data set: CSTB-rs157334 and HSD17B12-rs1334334. Interestingly, these two pairs are the only overlap between the 47-pair set and the 31-pair set listed in Additional file <supplr sid="S1">1</supplr>. Furthermore, the two pairs are also the only ones showing significant association with genotype after the generation effect has been removed.</p>
            <p>Because the eQTs listed in Table <tblr tid="T1">1</tblr> of Cheung et al. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> (and Additional file <supplr sid="S1">1</supplr> here) are selected based on a larger data set and, more importantly, extra information (e.g., linkage signal), they stand at a better chance to be true positives. The fact that the only two eQTs whose significant association with a SNP survive the test using all pedigree members cautions us on the practice of not using all available data, but only relying on a subset of the data set.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We have shown that besides pedigree founders, pedigree members can also be used in a SNP-eQT association analysis. The issue of relatedness among pedigree members can be handled by mixed models, either with one or two random effects to account for pedigree-specific expression variations. Using all pedigree members has other advantages, such as increasing the sample size, checking the consistency among samples, and detecting possible age dependence in expression levels.</p>
      </sec>
      <sec>
         <st>
            <p>List of Abbreviations</p>
         </st>
         <p>CEPH: Centre d'Etude du Polymorphisme Human</p>
         <p>GAW: Genetic Analysis Workshop 15</p>
         <p>GC: generation as a covariate (linear regression model)</p>
         <p>eQT: expression quantitative trait</p>
         <p>MM: mixed model (with 1 or 2 random effects)</p>
         <p>SNP: single-nucleotide polymorphism</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The author(s) declare that they have no competing interests.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Yaning Yang and Peter Gregersen for discussions, Jianxin Shi and Harald Goring for discussing their results prior to publication, and Houman Khalili for reading the manuscript.</p>
            <p>This article has been published as part of <it>BMC Proceedings </it>Volume 1 Supplement 1, 2007: Genetic Analysis Workshop 15: Gene Expression Analysis and Approaches to Detecting Multiple Functional Loci. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/1753-6561/1?issue=S1</url>.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Genetic analysis of genome-wide variation in human gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Morley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Molony</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Weber</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Devlin</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Ewens</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>Spielman</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Cheung</snm>
                  <fnm>VG</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2004</pubdate>
            <volume>430</volume>
            <fpage>743</fpage>
            <lpage>747</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1038/nature02797</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Mapping determinants of human gene expression by regional and genome-wide association</p>
            </title>
            <aug>
               <au>
                  <snm>Cheung</snm>
                  <fnm>VG</fnm>
               </au>
               <au>
                  <snm>Spielman</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Ewens</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>Weber</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Morley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Burdick</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <fpage>1365</fpage>
            <lpage>1369</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04244</pubid>
                  <pubid idtype="pmpid" link="fulltext">16251966</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <aug>
               <au>
                  <snm>Pinheiro</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Pinheiro</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bates</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>Mixed-Effects Models in S and S-Plus</source>
            <publisher>New York: Springer-Verlag</publisher>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Estimation of effects of single genes on quantitative traits</p>
            </title>
            <aug>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>BW</fnm>
               </au>
               <au>
                  <snm>Quinton</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>van Arendonk</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>J Anim Sci</source>
            <pubdate>1992</pubdate>
            <volume>70</volume>
            <fpage>2000</fpage>
            <lpage>2012</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">1644672</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Mixed model analysis of quantitative trait loci</p>
            </title>
            <aug>
               <au>
                  <snm>Xu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yi</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>14542</fpage>
            <lpage>14547</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">18955</pubid>
                  <pubid idtype="pmpid" link="fulltext">11114174</pubid>
                  <pubid idtype="doi">10.1073/pnas.250235197</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Random-effects models for longitudinal data</p>
            </title>
            <aug>
               <au>
                  <snm>Laird</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Ware</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>Biometrics</source>
            <pubdate>1982</pubdate>
            <volume>38</volume>
            <fpage>963</fpage>
            <lpage>974</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2307/2529876</pubid>
                  <pubid idtype="pmpid">7168798</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>A unified mixed-model method for association mapping that accounts for multiple levels of relatedness</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pressoir</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Briggs</snm>
                  <fnm>WH</fnm>
               </au>
               <au>
                  <snm>Bi</snm>
                  <fnm>IV</fnm>
               </au>
               <au>
                  <snm>Yamasaki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Doebley</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>McMullen</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Gaut</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Holland</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Kresovich</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Buckler</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <fpage>203</fpage>
            <lpage>208</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1702</pubid>
                  <pubid idtype="pmpid" link="fulltext">16380716</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Estimation of allele frequencies with data on sibships</p>
            </title>
            <aug>
               <au>
                  <snm>Broman</snm>
                  <fnm>KW</fnm>
               </au>
            </aug>
            <source>Genet Epidemiol</source>
            <pubdate>2000</pubdate>
            <volume>20</volume>
            <fpage>307</fpage>
            <lpage>315</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/gepi.2</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Evaluation of candidate genes in case-control studies: a statistical method to account for related subjects</p>
            </title>
            <aug>
               <au>
                  <snm>Slager</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Schaid</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2001</pubdate>
            <volume>68</volume>
            <fpage>1457</fpage>
            <lpage>1462</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1226132</pubid>
                  <pubid idtype="pmpid" link="fulltext">11353403</pubid>
                  <pubid idtype="doi">10.1086/320608</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Effective sample size: quick estimation of the effect of related samples in genetic case-control association analyses</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Remmers</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ogunwole</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kastner</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gregersen</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nature Precedings</source>
            <pubdate>2007</pubdate>
            <url>http://hdl.nature.com/10101/npre.2007.400.1</url>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Sex and age effects on genome-wide linkage analysis of gene expression in transformed lymphoblasts</p>
            </title>
            <aug>
               <au>
                  <snm>Rangrej</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Beyene</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Paterson</snm>
                  <fnm>AD</fnm>
               </au>
            </aug>
            <source>BMC Proc</source>
            <pubdate>2007</pubdate>
            <volume>1</volume>
            <issue>Suppl 1</issue>
            <fpage>S92</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2367486</pubid>
                  <pubid idtype="pmpid" link="fulltext">18466596</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Microsatellite typing for DRB1 alleles: application to the analysis of HLA associations with rheumatoid arthritis</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rodine</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Graham</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Ortman</snm>
                  <fnm>WA</fnm>
               </au>
               <au>
                  <snm>Batliwalla</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Bae</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Behrens</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Gregersen</snm>
                  <fnm>PK</fnm>
               </au>
            </aug>
            <source>Gene Immun</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>533</fpage>
            <lpage>543</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1038/sj.gene.6364325</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p><it>PTPN22 </it>: setting thresholds for autoimmunity</p>
            </title>
            <aug>
               <au>
                  <snm>Gregersen</snm>
                  <fnm>PK</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Batliwalla</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Begovich</snm>
                  <fnm>AB</fnm>
               </au>
            </aug>
            <source>Semin Immunol</source>
            <pubdate>2006</pubdate>
            <volume>18</volume>
            <fpage>214</fpage>
            <lpage>223</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.smim.2006.03.009</pubid>
                  <pubid idtype="pmpid" link="fulltext">16731003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Genomic screening and replication using the same data set in family-based association testing</p>
            </title>
            <aug>
               <au>
                  <snm>Van Steen</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>McQueen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Herbert</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Raby</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lyon</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Demeo</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Datta</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rosenow</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Christman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Silverman</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Laird</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Lange</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <fpage>683</fpage>
            <lpage>691</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1582</pubid>
                  <pubid idtype="pmpid" link="fulltext">15937480</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
