<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-8-154</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Evidence against the energetic cost hypothesis for the short introns in highly expressed genes</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Huang</snm>
               <fnm>Yi-Fei</fnm>
               <insr iid="I1"/>
               <email>huangyifei@mail.bnu.edu.cn</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Niu</snm>
               <fnm>Deng-Ke</fnm>
               <insr iid="I1"/>
               <email>dengkeniu@hotmail.com</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing 100875, P R China</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2008</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>154</fpage>
         <url>http://www.biomedcentral.com/1471-2148/8/154</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18492248</pubid>
               <pubid idtype="doi">10.1186/1471-2148-8-154</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>10</day>
               <month>12</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>20</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>20</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Huang and Niu; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>In animals, the moss <it>Physcomitrella patens </it>and the pollen of <it>Arabidopsis thaliana</it>, highly expressed genes have shorter introns than weakly expressed genes. A popular explanation is selection for transcription efficiency, which comprises two sub-hypotheses: selection to minimize the energetic cost of transcription and selection to minimize its time cost.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>In an individual human, different organs may differ by up to hundreds of times in cell number (for example, a liver versus a hypothalamus). Considered at the individual level, a gene specifically expressed in a large organ is actually transcribed tens or hundreds of times more than a gene with a similar expression level (a measure of mRNA abundance per cell) specifically expressed in a small organ. According to the energetic cost hypothesis, the former should have shorter introns than the latter. However, in humans and mice we have not found significant differences in intron length between large-tissue/organ-specific genes and small-tissue/organ-specific genes with similar expression levels. Qualitative estimation shows that the deleterious effect (that is, the energetic burden) of long introns in highly expressed genes is too small to be efficiently selected against in mammals.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The short introns in highly expressed genes should not be attributed to energy constraint. We evaluated evidence for the time cost hypothesis and other alternatives.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>In animals (including humans, mice and <it>Caenorhabditis elegans</it>), the moss <it>Physcomitrella patens </it>and the pollen of <it>Arabidopsis thaliana</it>, highly expressed genes have been found to have short introns and exons <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. Several hypotheses have been proposed to explain the compactness of highly expressed genes. The first, based on the fact that transcription is a slow and expensive process, suggests that natural selection for transcriptional efficiency favors the compactness of highly expressed genes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. The second hypothesis, called "genome design", suggests that highly expressed genes are short because most of them are housekeeping genes whose epigenetic regulation is less complex than that of weakly expressed tissue-specific genes <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. In line with this hypothesis, expression level and breadth are strongly positively correlated, and human housekeeping genes are more compact than tissue-specific genes <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. However, by comparing artificially selected pairs of housekeeping and narrowly expressed genes with similar average expression levels, Li et al. <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> recently found that housekeeping genes are no more compact than narrowly expressed genes if the expression level is controlled. This implies that expression level rather than breadth determines the compactness of genes. 
The third hypothesis is mutational bias, which supposes that highly expressed genes tend to localize in chromosomal regions with high deletion rates, or that there is a transcription-associated deletion bias <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B5">5</abbr></abbrgrp>. Urrutia and Hurst <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> found that the introns of highly expressed genes are still small even if the effects of chromosomal regions are controlled. Housekeeping genes are expected to have much higher germline transcriptional frequencies, and thus more transcription-associated deletions, than genes that are narrowly expressed in somatic tissues. However, Li et al. <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> found that housekeeping genes are no more compact than genes that are narrowly expressed in somatic tissues with similar average expression levels.</p>
         <p>The transcription efficiency hypothesis includes two sub-hypotheses: an energetic cost hypothesis and a time cost hypothesis. Selection for short introns and short exons may be driven either by minimizing the energetic cost of transcription or by the requirement to transcribe large numbers of mRNA molecules within a limited time. Human antisense genes that have very short response times have been found to have short introns <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>, which directly supports the time cost hypothesis. Furthermore, Jeffares et al. <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> found that the intron density in common eukaryotes is positively correlated with the duration of the life cycle. However, the time cost hypothesis has been argued against or overlooked in recent studies <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B6">6</abbr></abbrgrp>. Seoighe et al. <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> pointed out that the transcription of multiple copies of mRNA does not necessarily require a much longer period of time than that required to transcribe the first copy, because multiple polymerases may be simultaneously working on one template <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. This paper presents evidence against the energetic cost hypothesis and evaluates evidence for the time cost hypothesis and other alternatives.</p>
         <p>In animals, different organs may differ by up to hundreds of times in cell number and weight. For example, in an adult human, a lung weighs about 1000 g while a prostate weighs only about 20 g. Thus, humans produce tens of times more mRNA molecules for a lung-specific gene (for example, <it>SFTPD</it>) than for a prostate-specific gene (for example, <it>SEMG1</it>) with a similar expression level (considered to be a measure of mRNA abundance per cell in this paper; see Methods for the source of the expression data of these two example genes). Expression of <it>SFTPD </it>is thus expected to impose an energetic cost on the human body tens of times higher than that of <it>SEMG1</it>, if the two genes have similar lengths. According to the energetic cost hypothesis, <it>SFTPD </it>should have much shorter introns than <it>SEMG1</it>. On the contrary, <it>SFTPD </it>has a longer average intron length and total intron length than <it>SEMG1 </it>(Additional File <supplr sid="S1">1</supplr>). The present paper surveys large-tissue/organ-specific (LTS) genes and small-tissue/organ-specific (STS) genes at a genome-wide scale and compares their compactness for a statistically convincing result.</p>
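         <p>The arithmetic behind this example can be sketched in a few lines of Python. The sketch assumes, as a simplification that the text does not state explicitly, that cell number scales linearly with organ weight. The function name and the default per-cell expression values are hypothetical; the organ weights are those quoted above.</p>

```python
def relative_transcript_output(weight_a_g, weight_b_g, expr_a=1.0, expr_b=1.0):
    """Ratio of whole-organ mRNA output of gene A to that of gene B.

    Assumes cell number is proportional to organ weight (a simplification)
    and that expr_a and expr_b measure mRNA abundance per cell.
    """
    if weight_b_g == 0 or expr_b == 0:
        raise ValueError("weight_b_g and expr_b must be non-zero")
    return (weight_a_g * expr_a) / (weight_b_g * expr_b)


# Lung (about 1000 g) versus prostate (about 20 g), equal per-cell expression.
ratio = relative_transcript_output(1000.0, 20.0)
print(f"Lung-specific gene output is about {ratio:.0f} times higher")
```

         <p>With these weights the ratio is 50, which is the sense in which a lung-specific gene is transcribed "tens of times" more than a prostate-specific gene at the level of the whole individual.</p>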
         <suppl id="S1">
            <title>
               <p>Additional file 1</p>
            </title>
            <text>
               <p>A list of all the human tissue/organ-specific genes counted in Table <tblr tid="T1">1</tblr>. This list includes the gene symbols, gene features and some other details. Gene expression was defined by the conservative criterion described in the Methods and probe sets annotated with an "_x" appended to the probe set name were retained.</p>
            </text>
            <file name="1471-2148-8-154-S1.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <sec>
            <st>
               <p>Large-tissue/organ-specific genes and small-tissue/organ-specific genes have similar sizes</p>
            </st>
            <p>The gene expression datasets we used include the gene expression levels in 69 non-disease adult tissue/organ samples from humans and 55 non-disease adult tissue/organ samples from mice <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. The weights of these tissues/organs lie on a continuum spanning several orders of magnitude. For reliability, only the largest samples are defined as large tissues/organs and only the smallest samples are defined as small tissues/organs (Table <tblr tid="T1">1</tblr>). The sizes of tissues/organs were determined by searching the literature <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp> and internet resources (for example, Wikipedia, the free encyclopedia), or estimated from experience. A conservative estimate of the difference in average tissue/organ weight between large tissue/organ samples and small tissue/organ samples is more than 50-fold.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Tissue/organ samples and the number of specific genes analyzed in this study<sup>a</sup></p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Large tissue/organ (number of specific genes; tissue/organ weight)<sup>b</sup></p>
                     </c>
                     <c ca="left">
                        <p>Small tissue/organ (number of specific genes; tissue/organ weight)<sup>b</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Homo sapiens</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Cultured adipocytes (18; 9 kg)</p>
                     </c>
                     <c ca="left">
                        <p>Brain amygdala (22; --)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Liver (79; 1.5 kg)</p>
                     </c>
                     <c ca="left">
                        <p>Hypothalamus (7; 4 g)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Lung (18; 1 kg)</p>
                     </c>
                     <c ca="left">
                        <p>Pituitary (6; 5 g)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Skeletal muscle (4; 27 kg)</p>
                     </c>
                     <c ca="left">
                        <p>Tonsil (1; 30&#8211;40 g)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Skin (6; 5 kg)</p>
                     </c>
                     <c ca="left">
                        <p>Prostate (13; 20 g)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Smooth muscle (24; --)</p>
                     </c>
                     <c ca="left">
                        <p>Thymus (11; 30&#8211;40 g)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Thyroid (25; 18&#8211;60 g)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Tongue (11; 70 g)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mus musculus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Adipose tissue (13; --)</p>
                     </c>
                     <c ca="left">
                        <p>Amygdala (4; --)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Liver (76; 2 g)</p>
                     </c>
                     <c ca="left">
                        <p>Hypothalamus (12; &lt; 60 mg)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Skeletal muscle (47; --)</p>
                     </c>
                     <c ca="left">
                        <p>Pituitary (29; 3 mg)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Epidermis (4; --)</p>
                     </c>
                     <c ca="left">
                        <p>Trigeminal (7; --)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Prostate (24; 0.11 g)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Thymus (64; &lt; 60 mg)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Thyroid (21; 15 mg)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Tongue epidermis (14; --)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Retina (71; --)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>See Additional File <supplr sid="S1">1</supplr> and Additional File <supplr sid="S2">2</supplr> for full lists of these genes.</p>
                  <p><sup>b </sup>Some samples, like the subthalamic nucleus and trigeminal ganglion, are undoubtedly small tissues/organs. Such samples may not be included in this study because we could not find any specific genes for them. The tissue/organ weights were obtained directly from the literature and internet resources (for example, Wikipedia, the free encyclopedia) or calculated from their ratio to body weight by assuming that the weights of adult human and mouse bodies are about 70 kg and 30 g, respectively (when different sources of data were not consistent, we retained the conservative estimate) [16&#8211;24, 34]. Some samples (like smooth muscle, tongue epidermis and retina) were categorized as large or small tissues/organs on the basis of experience. Some mouse tissues/organs were categorized by consulting their human homologs. In humans, the lower limit of large tissue/organ samples was lung (about 1000 g), while the upper limit of small tissue/organ samples was tongue (70 g). In mice, the lower limit of large tissue/organ samples was liver (about 2 g), while the upper limit of small tissue/organ samples was prostate (0.11 g).</p>
               </tblfn>
            </tbl>
            <p>Tissue/organ-specific genes are those that are expressed only in one particular tissue/organ sample. In total, we found 149 LTS genes and 96 STS genes in humans and 140 LTS genes and 246 STS genes in mice (Table <tblr tid="T1">1</tblr>, Additional Files <supplr sid="S1">1</supplr>, <supplr sid="S2">2</supplr>). As the tissue/organ weights differed by tens or even hundreds of times, an LTS gene is expected to produce tens or even hundreds of times more mRNA molecules per tissue/organ than an STS gene with a similar expression level. If the compactness of highly expressed genes has evolved to minimize the energetic cost of transcription, the LTS genes should be more compact than the STS genes with similar expression levels. However, pairwise comparisons of LTS-STS gene pairs with similar expression levels (for details, see Methods) do not show significant differences in average intron length, total intron length, intron number, coding sequence (CDS) length or untranslated region (UTR) length between LTS genes and STS genes, in either humans or mice (Figure <figr fid="F1">1</figr>).</p>
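            <p>The kind of paired comparison described above can be illustrated with a minimal pure-Python sketch of a two-sided Wilcoxon signed-rank test. It uses the large-sample normal approximation without tie or continuity corrections, so it is an illustration under stated assumptions rather than the procedure actually used for Figure 1. The function name and the intron lengths in the usage lines are hypothetical.</p>

```python
import math


def wilcoxon_signed_rank(x, y):
    """Two-sided Wilcoxon signed-rank test (normal approximation).

    Zero differences are dropped; tied absolute differences receive
    average ranks. No tie or continuity correction is applied.
    Returns (w_plus, w_minus, p_value).
    """
    diffs = [a - b for a, b in zip(x, y) if a != b]
    n = len(diffs)
    if n == 0:
        raise ValueError("all paired differences are zero")
    # Indices of the differences, ordered by absolute size.
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    i = 0
    while i != n:
        j = i
        # Extend j over the run of tied absolute differences.
        while j + 1 != n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        avg_rank = (i + j) / 2 + 1  # mean of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = n * (n + 1) / 2 - w_plus  # all ranks sum to n(n+1)/2
    mean = n * (n + 1) / 4
    sd = math.sqrt(n * (n + 1) * (2 * n + 1) / 24)
    z = (min(w_plus, w_minus) - mean) / sd
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided tail probability
    return w_plus, w_minus, p_value


# Hypothetical total intron lengths (bp) for six LTS-STS gene pairs.
lts = [1200, 850, 3100, 640, 2200, 1500]
sts = [1100, 990, 2800, 1000, 2300, 1250]
w_plus, w_minus, p = wilcoxon_signed_rank(lts, sts)
print(f"W+ = {w_plus}, W- = {w_minus}, P = {p:.2f}")
```

            <p>A large <it>P </it>value from such a test indicates that the paired LTS and STS values do not differ systematically. For the small numbers of pairs used in this illustration an exact test would be preferable to the normal approximation.</p>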
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>A list of all the mouse tissue/organ-specific genes counted in Table <tblr tid="T1">1</tblr>. This list includes the gene symbols, gene features and some other details. Gene expression was defined by the conservative criterion described in the Methods and probe sets annotated with an "_x" appended to the probe set name were retained.</p>
               </text>
               <file name="1471-2148-8-154-S2.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Comparison of large-tissue/organ-specific genes and small-tissue/organ-specific genes with similar expression levels</p>
               </caption>
               <text>
                  <p><b>Comparison of large-tissue/organ-specific genes and small-tissue/organ-specific genes with similar expression levels.</b> The logarithm (base 10) values are shown. The Y axis represents small-tissue/organ-specific genes, while the X axis shows their large-tissue/organ-specific counterparts. The numbers of dots above (marked at the top left corner) and below (marked at the bottom right corner) the diagonal line illustrate the comparison between large-tissue/organ-specific genes and small-tissue/organ-specific genes. We performed Wilcoxon signed-rank tests to determine the significance of the differences. The number of gene pairs and the significance levels are: (A) 82, <it>P </it>= 0.59; (B) 116, <it>P </it>= 0.39; (C) 82, <it>P </it>= 0.57; (D) 116, <it>P </it>= 0.81; (E) 82, <it>P </it>= 0.90; (F) 116, <it>P </it>= 0.57; (G) 82, <it>P </it>= 0.86; (H) 116, <it>P </it>= 0.50; (I) 67, <it>P </it>= 0.89; (J) 63, <it>P </it>= 0.83.</p>
               </text>
               <graphic file="1471-2148-8-154-1"/>
            </fig>
            <p>How large a difference in expression level is required to generate a significant difference in gene compactness? The genes analyzed above were divided on the basis of expression level, rather than the size of tissue/organ; genes in the top 30% quantile were considered to be highly expressed and those in the bottom 30% quantile were considered to be weakly expressed. As shown in Table <tblr tid="T2">2</tblr>, the introns and UTRs of highly expressed genes are significantly shorter than those of weakly expressed genes, but there is no significant difference in intron number or CDS length. This result is in contrast to a previous study <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, but is in line with another study, which found that total exon length is much more weakly related to expression level than intron length <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. We suspect that the small number of genes analyzed in this study may have obscured a weak trend. One might expect that increasing the difference in expression level between highly expressed and weakly expressed genes (for example, comparing genes in the top 10% quantile with those in the bottom 10% quantile) would reveal significant differences in intron number and CDS length. In fact, selecting 10% quantiles resulted in a much smaller number of genes being analyzed and, consequently, statistically less convincing results (data not shown). The difference in expression level between the top and bottom 30% quantiles of human genes or mouse genes is about 20-fold (Table <tblr tid="T2">2</tblr>). As the expression value detected by microarray is linear with the concentration of target RNA (Affymetrix 2001, technical note, new statistical algorithms for monitoring gene expression on GeneChip<sup>&#174; </sup>probe arrays), this difference in expression level can reflect the difference in the concentrations of the target mRNAs.</p>
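            <p>The grouping described above can be sketched as follows. This is a minimal illustration that assumes genes are simply ranked by expression level and the top and bottom 30% of the ranked list are taken; the exact procedure is given in the Methods and may differ. The function names and the ten expression values are hypothetical, while the group means in the last line are those reported in Table 2.</p>

```python
from statistics import mean


def split_by_expression(genes, percent=30):
    """Split (name, expression) pairs into top and bottom groups.

    Genes are ranked by expression; the top and bottom `percent` of the
    ranked list are returned. Integer arithmetic keeps the group size exact.
    """
    ranked = sorted(genes, key=lambda g: g[1], reverse=True)
    k = len(ranked) * percent // 100
    return ranked[:k], ranked[len(ranked) - k:]


def fold_difference(top, bottom):
    """Ratio of mean expression in the top group to that in the bottom group."""
    return mean(e for _, e in top) / mean(e for _, e in bottom)


# Hypothetical expression values for ten genes (arbitrary units).
genes = [(f"gene{i}", float(v)) for i, v in
         enumerate([5200, 310, 4800, 150, 900, 2600, 275, 6100, 1200, 430], 1)]
top, bottom = split_by_expression(genes)
print([name for name, _ in top], [name for name, _ in bottom])
print(f"fold difference: {fold_difference(top, bottom):.1f}")

# Fold differences implied by the group means reported in Table 2.
print(f"human: {5369 / 267:.1f}, mouse: {6219 / 365:.1f}")
```

            <p>Applied to the mean expression levels in Table 2, the ratios are about 20 for human genes and about 17 for mouse genes, consistent with the roughly 20-fold difference quoted above.</p>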
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Comparison of compactness between genes expressed at different levels<sup>a</sup></p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Average intron length</p>
                     </c>
                     <c ca="left">
                        <p>Total intron length</p>
                     </c>
                     <c ca="left">
                        <p>Intron number</p>
                     </c>
                     <c ca="left">
                        <p>CDS length</p>
                     </c>
                     <c ca="left">
                        <p>UTR length</p>
                     </c>
                     <c ca="left">
                        <p>Expression level</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="7" ca="left">
                        <p>Human genes</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Top 30% quantile</p>
                     </c>
                     <c ca="left">
                        <p>2768 &#177; 608</p>
                     </c>
                     <c ca="left">
                        <p>28117 &#177; 7347</p>
                     </c>
                     <c ca="left">
                        <p>8 &#177; 1</p>
                     </c>
                     <c ca="left">
                        <p>1313 &#177; 90</p>
                     </c>
                     <c ca="left">
                        <p>775 &#177; 107</p>
                     </c>
                     <c ca="left">
                        <p>5369 &#177; 770</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Versus</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Bottom 30% quantile</p>
                     </c>
                     <c ca="left">
                        <p>10448 &#177; 4237</p>
                     </c>
                     <c ca="left">
                        <p>901046 &#177; 33210</p>
                     </c>
                     <c ca="left">
                        <p>9 &#177; 1</p>
                     </c>
                     <c ca="left">
                        <p>1764 &#177; 232</p>
                     </c>
                     <c ca="left">
                        <p>1478 &#177; 244</p>
                     </c>
                     <c ca="left">
                        <p>267 &#177; 14</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p><it>P </it>= 0.001</p>
                     </c>
                     <c ca="left">
                        <p><it>P </it>= 0.019</p>
                     </c>
                     <c ca="left">
                        <p><it>P </it>= 0.844</p>
                     </c>
                     <c ca="left">
                        <p><it>P </it>= 0.273</p>
                     </c>
                     <c ca="left">
                        <p><it>P </it>= 0.019</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="7" ca="left">
                        <p>Mouse genes</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Top 30% quantile</p>
                     </c>
                     <c ca="left">
                        <p>2631 &#177; 290</p>
                     </c>
                     <c ca="left">
                        <p>16190 &#177; 1828</p>
                     </c>
                     <c ca="left">
                        <p>7 &#177; 1</p>
                     </c>
                     <c ca="left">
                        <p>1214 &#177; 65</p>
                     </c>
                     <c ca="left">
                        <p>779 &#177; 136</p>
                     </c>
                     <c ca="left">
                        <p>6219 &#177; 794</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Versus</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>bottom 30% quantile</p>
                     </c>
                     <c ca="left">
                        <p>8032 &#177; 2706</p>
                     </c>
                     <c ca="left">
                        <p>37391 &#177; 4615</p>
                     </c>
                     <c ca="left">
                        <p>8 &#177; 1</p>
                     </c>
                     <c ca="left">
                        <p>1450 &#177; 128</p>
                     </c>
                     <c ca="left">
                        <p>1496 &#177; 190</p>
                     </c>
                     <c ca="left">
                        <p>365 &#177; 16</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p><it>P </it>= 0.001</p>
                     </c>
                     <c ca="left">
                        <p><it>P </it>= 0.001</p>
                     </c>
                     <c ca="left">
                        <p><it>P </it>= 0.444</p>
                     </c>
                     <c ca="left">
                        <p><it>P </it>= 0.589</p>
                     </c>
                     <c ca="left">
                        <p><it>P </it>= 0.001</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a </sup>The human genes and the mouse genes are those analyzed in Figure 1. We used the Mann-Whitney U test to determine the significance of differences. For each case, we present the average value &#177; standard error of mean.</p>
               </tblfn>
            </tbl>
            <p>The weight ratio of a large tissue/organ to a small tissue/organ is much larger than the ratio in mRNA abundance required to produce a significant difference in average intron length, total intron length and UTR length. However, large differences in tissue/organ weights do not produce significant differences in intron length or UTR length (Figure <figr fid="F1">1</figr>). This result is unexpected on the basis of the energetic cost hypothesis.</p>
         </sec>
         <sec>
            <st>
               <p>Quantitatively estimating the energetic burden of long introns in highly expressed genes</p>
            </st>
            <p>We also quantitatively estimated the length and number of introns in genomes that may be selected against because of their energetic cost during transcription. In a highly expressed housekeeping gene (housekeeping genes are expressed in all cells in the human body, so their cumulative energetic burden is higher), let us assume that there is an intron with the threshold length (<it>L</it>) to trigger natural selection. Several studies have shown that most eukaryotic genes are expressed at the level of two or three copies of mRNA per cell <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>, so a gene that produces 30 mRNA copies in each cell can be viewed as a highly expressed gene. The median half-life of human mRNA is about 10 h, and fast-decaying mRNAs have half-lives of &lt; 2 h <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. For a conservative estimate, we can assume that the gene needs to synthesize 30 mRNA copies every 2 h, that is, 360 mRNA copies per day, per cell. The expense of transcription is two ATP molecules per nucleotide. Therefore, transcription of the intron requires 360 &#215; 2 <it>L </it>= 720 <it>L </it>ATP molecules per day in each cell. Estimates of the number of cells in an adult human body vary from 10<sup>13 </sup>to 10<sup>14 </sup><abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. For a conservative estimate of the energetic cost of gene transcription, we used the higher value, 10<sup>14 </sup>cells. As an adult human consumes about 200 mol of ATP per day <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B30">30</abbr></abbrgrp>, the energy consumption of each human cell is (200 &#215; 6.02 &#215; 10<sup>23</sup>)/10<sup>14 </sup>= 1.2 &#215; 10<sup>12 </sup>ATP molecules per day. 
It should be noted that this is a conservative estimate; the energy consumption involved in strenuous exercise (for example, mountain climbing) may be as much as 10 times more than that used when resting <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. The proportion of human daily energy consumption representing the energetic cost of the long putative intron of a highly expressed housekeeping gene (which can be considered as the coefficient of natural selection, <it>S</it>) is 720 <it>L</it>/(1.2 &#215; 10<sup>12</sup>) = 6 <it>L </it>&#215; 10<sup>-10</sup>. The recent effective population size (<it>Ne</it>) of humans is &#8804; 10<sup>4 </sup><abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. Taking <it>S </it>= 1/(2 <it>Ne</it>) as the margin above which natural selection is stronger than genetic drift, <it>L </it>= 1/(2 &#215; 10<sup>4</sup>&#215; 6 &#215; 10<sup>-10</sup>) = 8.3 &#215; 10<sup>4 </sup>nt. In the human genome, only 0.9% of introns are longer than this threshold. In principle, this estimate is also applicable to the energetic cost of the transcription of a CDS or UTR.</p>
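            <p>As an illustration only, the arithmetic of the human estimate can be retraced with a short Python sketch. The function and variable names are hypothetical and are not part of the original analysis; all numerical inputs are the values quoted in the text. Because the sketch does not round the per-cell ATP budget to 1.2 &#215; 10<sup>12</sup>, it returns a threshold of about 8.4 &#215; 10<sup>4 </sup>nt, which agrees with the 8.3 &#215; 10<sup>4 </sup>nt given above to within rounding.</p>
            <p><![CDATA[
```python
AVOGADRO = 6.02e23  # molecules per mole, the value used in the text


def selection_coefficient_per_nt(copies_per_day, atp_per_cell_per_day, atp_per_nt=2):
    """Fraction of a cell's daily ATP budget spent per transcribed intron nucleotide."""
    return copies_per_day * atp_per_nt / atp_per_cell_per_day


def threshold_length(s_per_nt, effective_population_size):
    """Intron length L (nt) at which S = s_per_nt * L equals 1 / (2 * Ne)."""
    return 1.0 / (2 * effective_population_size * s_per_nt)


# Human inputs quoted in the text.
copies_per_day = 30 * (24 // 2)            # 30 mRNA copies every 2 h
atp_per_cell = 200 * AVOGADRO / 1e14       # 200 mol ATP per day shared by 1e14 cells
s_per_nt = selection_coefficient_per_nt(copies_per_day, atp_per_cell)
human_threshold = threshold_length(s_per_nt, 1e4)

print(f"mRNA copies per day:  {copies_per_day}")
print(f"ATP per cell per day: {atp_per_cell:.2e}")
print(f"S per nucleotide:     {s_per_nt:.2e}")
print(f"Threshold length:     {human_threshold:.3g} nt")
```
]]></p>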
            <p>The major differences between humans and mice are in their body sizes, their metabolic rates and their effective population sizes. We could not find an estimate of the number of cells in a mouse body. However, we did find data on mass-specific metabolic rates <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>, from which we can estimate energy consumption per mouse cell by assuming that human and mouse cells do not differ greatly in mass. The mass-specific metabolic rate of mice is 0.0151 W/g and that of humans is 0.00118 W/g <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, so a mouse cell uses ~12.8 times more energy than a human cell. As estimated above, the energy consumption of each human cell is about 1.2 &#215; 10<sup>12 </sup>ATP molecules per day, so that of each mouse cell is about 1.5 &#215; 10<sup>13 </sup>ATP molecules per day. The proportion of mouse daily energy consumption (<it>S</it>) representing the energetic cost of the long putative intron of a highly expressed housekeeping gene is (360 &#215; 2 <it>L</it>)/(1.5 &#215; 10<sup>13</sup>) = 4.8 <it>L </it>&#215; 10<sup>-11</sup>, where <it>L </it>is defined as described in the previous paragraph. Different sources of data on the effective population size of mice are not consistent <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>; we retained a higher value (<it>Ne </it>= 8.1 &#215; 10<sup>5</sup>) for a conservative estimate. Thus, in mice, the threshold length of introns to trigger natural selection is <it>L </it>= 1/(2 &#215; 8.1 &#215; 10<sup>5</sup>&#215; 4.8 &#215; 10<sup>-11</sup>) = 1.3 &#215; 10<sup>4 </sup>nt. Similar to the situation in humans, only a small fraction of introns in the mouse genome (6.8%) are longer than this threshold.</p>
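            <p>The scaling step used for the mouse can be sketched in the same way. This is a minimal illustration with hypothetical variable names; it assumes, as the text does, that human and mouse cells have similar mass, so that per-cell energy use scales with the mass-specific metabolic rate.</p>
            <p><![CDATA[
```python
# Mass-specific metabolic rates quoted in the text (W/g).
human_rate_w_per_g = 0.00118
mouse_rate_w_per_g = 0.0151

# Assumption from the text: similar cell mass, so per-cell energy use
# scales with the mass-specific metabolic rate.
rate_ratio = mouse_rate_w_per_g / human_rate_w_per_g

human_atp_per_cell = 1.2e12                      # ATP molecules per cell per day
mouse_atp_per_cell = human_atp_per_cell * rate_ratio

copies_per_day = 360                             # 30 mRNA copies every 2 h
atp_per_nt = 2                                   # ATP molecules per transcribed nucleotide
mouse_ne = 8.1e5                                 # effective population size retained in the text

s_per_nt = copies_per_day * atp_per_nt / mouse_atp_per_cell
mouse_threshold = 1.0 / (2 * mouse_ne * s_per_nt)

print(f"Mouse/human energy ratio:   {rate_ratio:.1f}")
print(f"Mouse ATP per cell per day: {mouse_atp_per_cell:.2e}")
print(f"S per nucleotide:           {s_per_nt:.2e}")
print(f"Threshold length:           {mouse_threshold:.2g} nt")
```
]]></p>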
            <p>Owing to a lack of the required information (such as mRNA decay rates), it is impossible to accurately estimate the burden of long introns in other vertebrates and invertebrates. Considering that the effective population size of vertebrates is only about 10<sup>4 </sup><abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, we suggest that long introns in highly expressed vertebrate genes are unlikely to be selected against. However, for invertebrates, with an effective population size of about 10<sup>6 </sup><abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, it would be too bold to give a rough estimation.</p>
            <p>Benefiting from the extensive studies on the yeast <it>Saccharomyces cerevisiae</it>, we also found enough data to estimate the energetic burden of a long intron in a unicellular eukaryote. A gene that produces 30 mRNA copies in each cell can also be viewed as a highly expressed gene in yeasts <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. The median half-life of yeast mRNAs is about 21 min, and the 90th percentile of mRNA half-lives is 10 min <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Conservatively, we assumed that such a gene would need to synthesize 30 mRNA copies every 10 min; that is, 30 &#215; 24 &#215; 60/10 = 4320 copies of mRNA every day. To transcribe a long intron, a yeast cell consumes 4320 &#215; 2 <it>L </it>= 8640 <it>L </it>ATP molecules per day, where <it>L </it>is defined as previously. A yeast cell weighs 3.35 &#215; 10<sup>-11 </sup>g and the median value of yeast metabolic rates at eight different temperatures is 0.267 W/g <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, so the metabolic rate of a yeast cell is 8.9 &#215; 10<sup>-12 </sup>W, which can be converted to 1.39 &#215; 10<sup>13 </sup>ATP molecules per day. The proportion of yeast daily energy consumption representing the energetic cost of the putative long intron in a highly expressed gene is 8640 <it>L</it>/(1.39 &#215; 10<sup>13</sup>) = 6.2 <it>L </it>&#215; 10<sup>-10</sup>. The effective population size of yeasts is about 10<sup>7 </sup><abbrgrp><abbr bid="B37">37</abbr><abbr bid="B39">39</abbr></abbrgrp>. Thus, in yeasts, the threshold length of introns to trigger natural selection is <it>L </it>= 1/(2 &#215; 10<sup>7</sup>&#215; 6.2 &#215; 10<sup>-10</sup>) = 81 nt. Unlike the situation in humans and mice, 86.5% of the introns in the genome of <it>S. cerevisiae </it>are longer than this threshold length. 
The fractional energetic cost of long introns may be overestimated here; thus the extant long introns, even in highly expressed genes, may not be under negative selection. At least, this result helps to explain the fact that unicellular eukaryotes generally have much shorter introns than mammals, and it is consistent with a previous study, which showed that energy is a constraint on evolutionary changes in yeast gene expression <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. However, these estimates are at least seemingly contradictory to the observations that highly expressed genes have longer introns than weakly expressed genes in yeasts <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>. To reach a conclusion, further investigations are required.</p>
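            <p>The yeast figures can be checked with the following sketch, again using hypothetical variable names. The per-cell ATP budget (1.39 &#215; 10<sup>13 </sup>molecules per day) is taken directly from the text, because the energy-per-ATP factor used for the conversion from watts is not stated there. With unrounded intermediate values the threshold comes out at about 80 nt, in line with the 81 nt obtained above from rounded values.</p>
            <p><![CDATA[
```python
# Turnover assumed in the text: 30 mRNA copies resynthesized every 10 min.
copies_per_day = 30 * 24 * 60 // 10
atp_per_nt = 2                                   # ATP molecules per transcribed nucleotide
atp_per_intron_nt_per_day = copies_per_day * atp_per_nt

# Per-cell power from cell mass and mass-specific metabolic rate.
cell_mass_g = 3.35e-11
metabolic_rate_w_per_g = 0.267
cell_power_w = cell_mass_g * metabolic_rate_w_per_g

# Taken from the text as given; the watt-to-ATP conversion factor is not stated.
yeast_atp_per_cell = 1.39e13
yeast_ne = 1e7                                   # effective population size

s_per_nt = atp_per_intron_nt_per_day / yeast_atp_per_cell
yeast_threshold = 1.0 / (2 * yeast_ne * s_per_nt)

print(f"mRNA copies per day:         {copies_per_day}")
print(f"ATP per intron nt per day:   {atp_per_intron_nt_per_day}")
print(f"Cell power:                  {cell_power_w:.2e} W")
print(f"S per nucleotide:            {s_per_nt:.2e}")
print(f"Threshold length:            {yeast_threshold:.0f} nt")
```
]]></p>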
            <p>Considered just from the point of view of the energetic cost of transcription, loss of entire introns may be favored in yeasts, but is unlikely to be favored in mammals. On the other hand, intron gain may be selected against in yeasts, but is most likely neutral, and thus subject to genetic drift, in mammals. This idea is consistent with the paucity of introns in yeast genes and the abundance of introns in animal genes <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. Previously, the existence of different rates of intron loss in the evolution of different lineages was explained by differential retrotransposon activities <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>. We look forward to further evidence to determine whether selection to reduce energetic cost is a complementary explanation. In evolution, insertion of several nucleotides or various transposons into introns and deletion of short sequences from introns are much more frequent than gain and loss of entire introns. Considered just from the point of view of the energetic cost of transcription, the effects of common indels are negligible in mammals, but visible to natural selection in yeasts. This idea is similar to the theory of Lynch on the evolution of genome complexity <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Alternate hypotheses for short introns in highly expressed genes</p>
            </st>
            <p>The first alternate hypothesis is the time cost hypothesis. RNA polymerase II can elongate only about 20&#8211;40 nt per second <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B49">49</abbr></abbrgrp>. Recent evidence indicates that elongation, instead of RNA polymerase II recruitment, may be the predominant rate-limiting event in gene activation <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>. Therefore, gene length should have an important impact on the duration of gene expression. To be completely transcribed, a large gene in the human genome, such as <it>DMD </it>(2.3 Mb), requires 16 hours <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, a medium-sized gene (for example, <it>TUBE1</it>, 16.7 kb) requires about 7&#8211;14 minutes, and a small gene (for example, <it>HBA2</it>, 834 bp) requires only about 20&#8211;40 seconds. Seoighe et al. <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> argued that the time required to transcribe multiple copies of mRNA is not a multiple of the transcription period of the first copy, because one template can be transcribed by several polymerases simultaneously <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Assuming a normal elongation rate of 0.03 seconds per nucleotide, the completion of the transcription of the first copy of a gene with <it>L </it>nt requires 0.03 <it>L </it>seconds. Assuming that there are <it>k </it>polymerases attached to the same template simultaneously, the completion of an additional copy of this transcript requires an additional 0.03 <it>L</it>/<it>k </it>seconds. Thus, the completion of the transcription of <it>n </it>copies of an mRNA requires <it>T</it><sub><it>n </it></sub>= 0.03 <it>L </it>(1 + (<it>n</it>-1)/<it>k</it>) seconds. If <it>n </it>&lt;&lt;<it>k</it>, then <it>T</it><sub><it>n</it></sub>&#8776; 0.03 <it>L</it>, and the duration of transcription depends on gene length but hardly at all on transcript copy number. 
However, in highly expressed genes, <it>n </it>is unlikely to be much smaller than <it>k</it>; thus, both gene length (<it>L</it>) and transcript copy number (<it>n</it>) contribute to the duration of transcription. To produce a large number of transcripts in a limited period of time, natural selection may decrease <it>L </it>or increase <it>k</it>. Unfortunately, no genome-wide data on the values of <it>k </it>are currently available for animals.</p>
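            <p>To make the behaviour of this timing model concrete, the expression for <it>T</it><sub><it>n</it></sub> can be evaluated with a few lines of Python. This is only a sketch: the function name is hypothetical, the gene length is that of <it>TUBE1 </it>quoted above, and the values of <it>n </it>and <it>k </it>are arbitrary illustrations, not measurements.</p>
            <p><![CDATA[
```python
def transcription_time(length_nt, n_copies, k_polymerases, sec_per_nt=0.03):
    """Seconds needed to complete n_copies transcripts of a gene of length_nt.

    Implements T_n = sec_per_nt * L * (1 + (n - 1) / k), where k is the number
    of polymerases transcribing the same template simultaneously.
    """
    if n_copies < 1 or k_polymerases < 1:
        raise ValueError("n_copies and k_polymerases must be at least 1")
    return sec_per_nt * length_nt * (1 + (n_copies - 1) / k_polymerases)


tube1_length = 16_700  # nt, the TUBE1 length quoted in the text

# Illustrative (n, k) pairs only; genome-wide values of k are not available.
for n, k in [(1, 10), (5, 50), (30, 10), (300, 10)]:
    minutes = transcription_time(tube1_length, n, k) / 60
    print(f"n={n:>3}, k={k:>2}: {minutes:6.1f} min")
```
]]></p>
            <p>When <it>n </it>is small relative to <it>k</it>, the result stays close to the time for a single transcript; once <it>n </it>exceeds <it>k</it>, the total time grows roughly in proportion to <it>n</it>.</p>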
            <p>On the other side of the same coin, the time taken to transcribe introns has long been proposed to contribute to the timing mechanisms during development <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>. An extension of this hypothesis is that long introns may be maintained in some genes to limit the number of mRNA products made during an otherwise excessively long period of gene activation.</p>
            <p>Another alternate hypothesis is that short genes may experience lower frequencies of abortive transcription and/or erroneous splicing than long genes. Successful transcription requires the polymerase to be stably associated with the DNA template during the elongation process. However, in some cases, the RNA-DNA duplex may not be stable enough to avoid abnormal pausing and arrest of elongation <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. In a study of the human <it>DMD </it>gene, Tennyson et al. <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> found that 30&#8211;40% of transcription events were terminated or stopped at premature sites. Recently, Guenther et al. <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> found that many genes that have experienced transcription initiation do not produce complete transcripts. The short lengths of highly expressed genes may lead to a decreased possibility of a gene containing such sequences that are difficult to transcribe and cause abortion of elongation. In addition, evidence shows that long introns increase the frequency of erroneous splicing of nearby exons <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>.</p>
            <p>Long introns (and long UTRs) in highly expressed genes may also be selected against because of the crowding of active genes in a restricted interchromatin compartment <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>.</p>
            <p>A slightly more speculative and seemingly less likely hypothesis is that long introns are selected for in weakly expressed genes to avoid DNA damage resulting from transcriptional R-loops <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B58">58</abbr></abbrgrp>. The fact that mRNA lengths have a similar correlation with expression levels as intron lengths <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B6">6</abbr><abbr bid="B9">9</abbr></abbrgrp> negates this hypothesis.</p>
            <p>In addition, there is also the possibility that highly expressed genes are compact because their epigenetic regulation is relatively simple, as suggested by the "genome design" hypothesis <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Although there is some evidence against this idea, indicating that the lengths of intergenic spacers rather than those of introns are correlated with the complexity of epigenetic regulation <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B59">59</abbr></abbrgrp>, there is also evidence supporting it <abbrgrp><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp>.</p>
            <p>In contrast to the observations that highly expressed genes have short introns in animals, <it>P. patens </it>and the pollen of <it>A. thaliana</it>, highly expressed genes were found to have longer introns than weakly expressed genes in unicellular organisms, the sporophytes of <it>A. thaliana </it>and <it>Oryza sativa</it>, and, at least, the vegetative stage of the slime mould <it>Dictyostelium discoideum </it>[<abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B65">65</abbr></abbrgrp>, Y.F. Huang and D.K. Niu, unpublished results from analyzing the data from <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>]. To date, there has been no satisfactory explanation for this difference <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B65">65</abbr></abbrgrp>. Perhaps, the compact genomes and compact genes in large genomes have lost most of their nonfunctional sequences; thus, most of the retained intronic sequences have regulatory functions in gene expression <abbrgrp><abbr bid="B67">67</abbr><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr><abbr bid="B70">70</abbr></abbrgrp>. Surprisingly, a weak, but significant negative correlation of mRNA length (and protein length) with expression level was found in all studied organisms <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B71">71</abbr><abbr bid="B72">72</abbr><abbr bid="B73">73</abbr><abbr bid="B74">74</abbr></abbrgrp>, which is also generally explained by minimizing the energetic cost of gene expression. In light of this study, we suggest other potential reasons for the short introns of highly expressed genes: to minimize the duration of gene expression, or to reduce the frequencies of abortive transcription and/or erroneous splicing. 
However, we do not wish to completely discount the energetic cost hypothesis for mRNA compactness, because we have insufficient data on protein abundance (note that translation is also an expensive process).</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>If intronic sequences are assumed to be mostly junk, it is reasonable to attribute the fact that highly expressed genes have short introns to potential selection to minimize the energetic cost of gene expression. However, this hypothesis is not supported by our comparison of tissue/organ-specific genes between large tissue/organs and small tissue/organs in humans or mice. In addition, by conservatively selecting the values of a series of parameters, we quantitatively estimated the energetic burden of a long intron in highly expressed genes. In mammals, the burden seems to be too small to trigger purifying selection against long introns. Further investigations are required to establish a new theory from a series of alternate hypotheses.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <p>The reference genomes of <it>Homo sapiens </it>(build 36, version 2) and <it>Mus musculus </it>(build 36, version 1) were downloaded from the NCBI genome database <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. These genomes have been reviewed by NCBI staff. Genes with obvious annotation errors were excluded from our analyses. In the case of alternative splicing variants, we used the longest mRNA for analysis (although similar results were obtained by analyzing the shortest mRNA, data not shown). UTRs shorter than 30 nt were considered unreliable annotations. In analyzing UTR length, we retained only those genes with both 5' UTRs and 3' UTRs of 30 nt or longer. The UTR length of a gene is the sum of the lengths of its 5' UTR and 3' UTR.</p>
         <p>The microarray gene expression datasets of <it>H. sapiens </it>and <it>M. musculus </it>were downloaded from GNF Genome Informatics Applications &amp; Datasets <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B76">76</abbr></abbrgrp>. These are the most extensive gene expression datasets freely available online. Besides quantitative signals, the datasets contain qualitative indicators of gene expression for each Affymetrix probe set in each tissue/organ sample: P (present), M (marginal), A (absent). Several probe sets may be annotated as one gene and each probe set has two replicates. In this study, we defined a gene as being expressed in a tissue/organ sample by a conservative criterion and a relaxed one. Under the conservative criterion, all probe sets and replicates of a gene must be marked as P in the datasets; under the relaxed criterion, both replicates of at least one probe set must be marked as P or M. These two criteria gave similar results. We present the results of analysis based on the conservative criterion in the main text of this paper, and those based on the relaxed criterion as Figure S1 and Table S1 of Additional File <supplr sid="S3">3</supplr>. Some probes of the probe sets annotated with a "_x" appended to the probe set name may cross-hybridize with other sequences, and so the resulting signal may partially arise from transcripts other than the one being intentionally measured (Affymetrix Technical Note, Array Design for the HGU133 set). We repeated our analysis after removing such probe sets from the gene expression datasets and obtained similar results (see Figure S2 and Table S2 of Additional File <supplr sid="S3">3</supplr>).</p>
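         <p>The two expression criteria can be written down compactly as a predicate. The Python sketch below is illustrative only: the function name, the dictionary layout and the probe set identifiers are hypothetical, and the relaxed criterion is read here as requiring that every replicate call of at least one probe set is P or M.</p>
         <p><![CDATA[
```python
def is_expressed(calls_by_probe_set, criterion="conservative"):
    """Decide whether a gene counts as expressed in one tissue/organ sample.

    calls_by_probe_set maps a probe set id to its list of detection calls
    ('P', 'M' or 'A'), one call per replicate (hypothetical data layout).
    """
    if not calls_by_probe_set:
        return False
    if criterion == "conservative":
        # Every replicate of every probe set must be called present.
        return all(call == "P"
                   for calls in calls_by_probe_set.values()
                   for call in calls)
    if criterion == "relaxed":
        # Both replicates of at least one probe set must be present or marginal.
        return any(len(calls) >= 2 and all(call in ("P", "M") for call in calls)
                   for calls in calls_by_probe_set.values())
    raise ValueError(f"unknown criterion: {criterion!r}")


# Hypothetical gene measured by two probe sets, two replicates each.
example_gene = {"probe_set_1": ["P", "P"], "probe_set_2": ["P", "M"]}
print(is_expressed(example_gene, "conservative"))
print(is_expressed(example_gene, "relaxed"))
```
]]></p>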
         <suppl id="S3">
            <title>
               <p>Additional file 3</p>
            </title>
            <text>
               <p>Comparisons of compactness between LTS-STS gene pairs with similar expression levels and compactness between genes expressed at different levels. Figure S1 &#8211; Figure S3 present the results of the comparisons of LTS-STS gene pairs with similar expression levels selected based on criteria different from Figure <figr fid="F1">1</figr>. Table S1 &#8211; Table S3 show the results of the comparison of compactness between genes expressed at different levels.</p>
            </text>
            <file name="1471-2148-8-154-S3.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <p>A greedy algorithm was used to match LTS genes and STS genes with similar expression levels. To maximize the number of gene pairs, the category with the smaller number of genes (STS genes in humans and LTS genes in mice) was used as the query set, and the category with the larger number of genes was used as the target set. For each gene in the query set, we selected the gene with the most similar expression level from the target set as the candidate target gene. If the within-pair difference was equal to or smaller than the threshold of 20%, the query gene and the candidate target gene were viewed as a gene pair with similar expression levels. Adjusting this threshold to 10% gave similar results (Figure S3 and Table S3 of Additional File <supplr sid="S3">3</supplr>); a much lower threshold would result in too small a sample size to study. Similar to a previous study <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, the within-pair difference in expression levels was defined as</p>
         <p>
            <display-formula>
               <m:math name="1471-2148-8-154-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mrow>
                           <m:mo>|</m:mo>
                           <m:mrow>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:mi>A</m:mi>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mi>B</m:mi>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mi>M</m:mi>
                                    <m:mi>a</m:mi>
                                    <m:mi>x</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mi>A</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>B</m:mi>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:mfrac>
                           </m:mrow>
                           <m:mo>|</m:mo>
                        </m:mrow>
                     </m:mrow>
                  </m:semantics>
               </m:math>
            </display-formula>
         </p>
         <p>where A is the expression level of an LTS gene and B is the expression level of an STS gene. As shown in Figure S4 of Additional File <supplr sid="S3">3</supplr>, the within-pair differences in expression levels were not biased to either LTS genes or STS genes.</p>
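         <p>A minimal Python sketch of such a greedy matching is given below for illustration. It is not the code used in this study: the function names and gene identifiers are hypothetical, expression levels are assumed to be positive, and two details that the description leaves open are filled in as assumptions, namely that similarity is measured by the within-pair difference itself and that each target gene is used in at most one pair.</p>
         <p><![CDATA[
```python
def within_pair_difference(a, b):
    """|A - B| / max(A, B) for two positive expression levels."""
    return abs(a - b) / max(a, b)


def match_pairs(query, target, threshold=0.20):
    """Greedily pair each query gene with its most similar unused target gene.

    query and target map gene ids to expression levels. A pair is kept only
    if its within-pair difference does not exceed the threshold.
    """
    available = dict(target)  # assumption: a target gene is used at most once
    pairs = []
    for q_gene, q_level in query.items():
        if not available:
            break
        best = min(available,
                   key=lambda g: within_pair_difference(q_level, available[g]))
        if within_pair_difference(q_level, available[best]) <= threshold:
            pairs.append((q_gene, best))
            del available[best]
    return pairs


# Hypothetical expression levels.
query_genes = {"q1": 100.0, "q2": 50.0}
target_genes = {"t1": 90.0, "t2": 500.0, "t3": 55.0}
print(match_pairs(query_genes, target_genes))
print(match_pairs(query_genes, target_genes, threshold=0.05))
```
]]></p>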
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>D-KN and Y-FH conceived and designed the research. Y-FH performed the analysis. D-KN wrote the paper. Both authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank the anonymous referees for their comments. This study was supported by Beijing Normal University and Program for NCET-07-0094.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Selection for short introns in highly expressed genes</p>
            </title>
            <aug>
               <au>
                  <snm>Castillo-Davis</snm>
                  <fnm>CI</fnm>
               </au>
               <au>
                  <snm>Mekhedov</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Hartl</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Kondrashov</snm>
                  <fnm>FA</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <issue>4</issue>
            <fpage>415</fpage>
            <lpage>418</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12134150</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Selective and mutational patterns associated with gene expression in humans: Influences on synonymous composition and intron presence</p>
            </title>
            <aug>
               <au>
                  <snm>Comeron</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2004</pubdate>
            <volume>167</volume>
            <issue>3</issue>
            <fpage>1293</fpage>
            <lpage>1304</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1470943</pubid>
                  <pubid idtype="pmpid" link="fulltext">15280243</pubid>
                  <pubid idtype="doi">10.1534/genetics.104.026351</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Gametophytic selection in <it>Arabidopsis thaliana</it> supports the selective model of intron length reduction</p>
            </title>
            <aug>
               <au>
                  <snm>Seoighe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gehring</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <issue>2</issue>
            <fpage>e13</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1186733</pubid>
                  <pubid idtype="pmpid" link="fulltext">16110339</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.0010013</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Compact genes are highly expressed in the moss <it>Physcomitrella patens</it></p>
            </title>
            <aug>
               <au>
                  <snm>Stenoien</snm>
                  <fnm>HK</fnm>
               </au>
            </aug>
            <source>J Evol Biol</source>
            <pubdate>2007</pubdate>
            <volume>20</volume>
            <issue>3</issue>
            <fpage>1223</fpage>
            <lpage>1229</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1420-9101.2007.01301.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">17465932</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The signature of selection mediated by expression on human genes</p>
            </title>
            <aug>
               <au>
                  <snm>Urrutia</snm>
                  <fnm>AO</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>10</issue>
            <fpage>2260</fpage>
            <lpage>2264</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403694</pubid>
                  <pubid idtype="pmpid" link="fulltext">12975314</pubid>
                  <pubid idtype="doi">10.1101/gr.641103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Selection for the miniaturization of highly expressed genes</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Feng</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Niu</snm>
                  <fnm>DK</fnm>
               </au>
            </aug>
            <source>Biochem Biophys Res Commun</source>
            <pubdate>2007</pubdate>
            <volume>360</volume>
            <issue>3</issue>
            <fpage>586</fpage>
            <lpage>592</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.bbrc.2007.06.085</pubid>
                  <pubid idtype="pmpid" link="fulltext">17610841</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Extraordinary diversity among members of the large gene family, 185/333, from the purple sea urchin, <it>Strongylocentrotus purpuratus</it></p>
            </title>
            <aug>
               <au>
                  <snm>Buckley</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>LC</fnm>
               </au>
            </aug>
            <source>BMC Mol Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>68</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1988830</pubid>
                  <pubid idtype="pmpid" link="fulltext">17697382</pubid>
                  <pubid idtype="doi">10.1186/1471-2199-8-68</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Imprinted genes have few and small introns</p>
            </title>
            <aug>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>McVean</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1996</pubdate>
            <volume>12</volume>
            <issue>3</issue>
            <fpage>234</fpage>
            <lpage>237</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng0396-234</pubid>
                  <pubid idtype="pmpid" link="fulltext">8589711</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Human housekeeping genes are compact</p>
            </title>
            <aug>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Levanon</snm>
                  <fnm>EY</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>7</issue>
            <fpage>362</fpage>
            <lpage>365</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(03)00140-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">12850439</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Compactness of human housekeeping genes: selection for economy or genomic design?</p>
            </title>
            <aug>
               <au>
                  <snm>Vinogradov</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>5</issue>
            <fpage>248</fpage>
            <lpage>253</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2004.03.006</pubid>
                  <pubid idtype="pmpid" link="fulltext">15109779</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Human antisense genes have unusually short introns: evidence for selection for rapid transcription</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Carmichael</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Rowley</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>4</issue>
            <fpage>203</fpage>
            <lpage>207</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2005.02.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">15797613</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>The small introns of antisense genes are better explained by selection for rapid transcription than by "genomic design"</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rowley</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
            <volume>171</volume>
            <issue>4</issue>
            <fpage>2151</fpage>
            <lpage>2155</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1456133</pubid>
                  <pubid idtype="pmpid" link="fulltext">16143605</pubid>
                  <pubid idtype="doi">10.1534/genetics.105.048066</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The biology of intron gain and loss</p>
            </title>
            <aug>
               <au>
                  <snm>Jeffares</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Mourier</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>1</issue>
            <fpage>16</fpage>
            <lpage>22</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2005.10.006</pubid>
                  <pubid idtype="pmpid" link="fulltext">16290250</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Visualization of single RNA transcripts in situ</p>
            </title>
            <aug>
               <au>
                  <snm>Femino</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Fay</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Fogarty</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Singer</snm>
                  <fnm>RH</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1998</pubdate>
            <volume>280</volume>
            <issue>5363</issue>
            <fpage>585</fpage>
            <lpage>590</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.280.5363.585</pubid>
                  <pubid idtype="pmpid" link="fulltext">9554849</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>A gene atlas of the mouse and human protein-encoding transcriptomes</p>
            </title>
            <aug>
               <au>
                  <snm>Su</snm>
                  <fnm>AI</fnm>
               </au>
               <au>
                  <snm>Wiltshire</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Batalov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lapp</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ching</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Block</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Soden</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hayakawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kreiman</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cooke</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Hogenesch</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <issue>16</issue>
            <fpage>6062</fpage>
            <lpage>6067</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">395923</pubid>
                  <pubid idtype="pmpid" link="fulltext">15075390</pubid>
                  <pubid idtype="doi">10.1073/pnas.0400782101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Organ weight in 684 adult autopsies: new tables for a Caucasoid population</p>
            </title>
            <aug>
               <au>
                  <snm>de la Grandmaison</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>Clairand</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Durigon</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Forensic Sci Int</source>
            <pubdate>2001</pubdate>
            <volume>119</volume>
            <issue>2</issue>
            <fpage>149</fpage>
            <lpage>154</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0379-0738(00)00401-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">11376980</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Abnormal size of the amygdala predicts impaired emotional memory in major depressive disorder</p>
            </title>
            <aug>
               <au>
                  <snm>Weniger</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lange</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Irle</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>J Affect Disord</source>
            <pubdate>2006</pubdate>
            <volume>94</volume>
            <issue>1-3</issue>
            <fpage>219</fpage>
            <lpage>229</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jad.2006.04.017</pubid>
                  <pubid idtype="pmpid" link="fulltext">16740316</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Amazing Numbers in Biology</p>
            </title>
            <aug>
               <au>
                  <snm>Flindt</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <publisher>Berlin, Springer-Verlag</publisher>
            <pubdate>2006</pubdate>
            <fpage>295</fpage>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Skeletal muscle mass and distribution in 468 men and women aged 18-88 yr</p>
            </title>
            <aug>
               <au>
                  <snm>Janssen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Heymsfield</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>ZM</fnm>
               </au>
               <au>
                  <snm>Ross</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Appl Physiol</source>
            <pubdate>2000</pubdate>
            <volume>89</volume>
            <issue>1</issue>
            <fpage>81</fpage>
            <lpage>88</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10904038</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Reference Man: Anatomical, Physiological and Metabolic Characteristics</p>
            </title>
            <aug>
               <au>
                  <cnm>International Commission on Radiological Protection</cnm>
               </au>
            </aug>
            <publisher>Elsevier</publisher>
            <pubdate>1975</pubdate>
            <fpage>512</fpage>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Effects of p-nonylphenol and resveratrol on body and organ weight and in vivo fertility of outbred CD-1 mice</p>
            </title>
            <aug>
               <au>
                  <snm>Kyselova</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Peknicova</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Buckiova</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Boubelik</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Reprod Biol Endocrinol</source>
            <pubdate>2003</pubdate>
            <volume>1</volume>
            <issue>1</issue>
            <fpage>30</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">155686</pubid>
                  <pubid idtype="pmpid" link="fulltext">12749770</pubid>
                  <pubid idtype="doi">10.1186/1477-7827-1-30</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Opioid peptides and alpha-melanocyte-stimulating hormone in genetically obese (ob/ob) mice during development</p>
            </title>
            <aug>
               <au>
                  <snm>Rossier</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shibasaki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Guillemin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bloom</snm>
                  <fnm>FE</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1979</pubdate>
            <volume>76</volume>
            <issue>4</issue>
            <fpage>2077</fpage>
            <lpage>2080</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">383537</pubid>
                  <pubid idtype="pmpid">287046</pubid>
                  <pubid idtype="doi">10.1073/pnas.76.4.2077</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Partial glucocorticoid agonist-like effects of imipramine on hypothalamic-pituitary-adrenocortical activity, thymus weight, and hippocampal glucocorticoid receptors in male C57BL/6 mice</p>
            </title>
            <aug>
               <au>
                  <snm>Mukherjee</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Knisely</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jacobson</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Endocrinology</source>
            <pubdate>2004</pubdate>
            <volume>145</volume>
            <issue>9</issue>
            <fpage>4185</fpage>
            <lpage>4191</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1210/en.2004-0147</pubid>
                  <pubid idtype="pmpid" link="fulltext">15155572</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Induction of thyroid tumours in (C57BL/6N x C3H/N)F1 mice by oral administration of kojic acid</p>
            </title>
            <aug>
               <au>
                  <snm>Fujimoto</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nakatani</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ito</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Food Chem Toxicol</source>
            <pubdate>1998</pubdate>
            <volume>36</volume>
            <issue>8</issue>
            <fpage>697</fpage>
            <lpage>703</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0278-6915(98)00030-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">9734720</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Dissecting the regulatory circuitry of a eukaryotic genome</p>
            </title>
            <aug>
               <au>
                  <snm>Holstege</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Jennings</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Wyrick</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Hengartner</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Golub</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <issue>5</issue>
            <fpage>717</fpage>
            <lpage>728</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(00)81641-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">9845373</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Precision and functional specificity in mRNA decay</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Herschlag</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <issue>9</issue>
            <fpage>5860</fpage>
            <lpage>5865</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">122867</pubid>
                  <pubid idtype="pmpid" link="fulltext">11972065</pubid>
                  <pubid idtype="doi">10.1073/pnas.092538799</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Transcript copy number estimation using a mouse whole-genome oligonucleotide microarray</p>
            </title>
            <aug>
               <au>
                  <snm>Carter</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Sharov</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>VanBuren</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Dudekula</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Carmack</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ko</snm>
                  <fnm>MSH</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>7</issue>
            <fpage>R61</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1175992</pubid>
                  <pubid idtype="pmpid" link="fulltext">15998450</pubid>
                  <pubid idtype="doi">10.1186/gb-2005-6-7-r61</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Decay rates of human mRNAs: Correlation with functional characteristics and sequence attributes</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>van Nimwegen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Zavolan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rajewsky</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Schroeder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Magnasco</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Darnell</snm>
                  <fnm>JE</fnm>
                  <suf>Jr.</suf>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>8</issue>
            <fpage>1863</fpage>
            <lpage>1872</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403777</pubid>
                  <pubid idtype="pmpid" link="fulltext">12902380</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Nanomedicine, Volume I: Basic Capabilities</p>
            </title>
            <aug>
               <au>
                  <snm>Freitas</snm>
                  <fnm>RA</fnm>
                  <suf>Jr.</suf>
               </au>
            </aug>
            <publisher>Georgetown, TX, Landes Bioscience</publisher>
            <pubdate>1999</pubdate>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Biochemistry</p>
            </title>
            <aug>
               <au>
                  <snm>Voet</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Voet</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <publisher>New York, John Wiley &amp; Sons</publisher>
            <pubdate>1990</pubdate>
            <fpage>1223</fpage>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Recent human effective population size estimated from linkage disequilibrium</p>
            </title>
            <aug>
               <au>
                  <snm>Tenesa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Navarro</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hayes</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Duffy</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Goddard</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Visscher</snm>
                  <fnm>PM</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <issue>4</issue>
            <fpage>520</fpage>
            <lpage>526</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1832099</pubid>
                  <pubid idtype="pmpid" link="fulltext">17351134</pubid>
                  <pubid idtype="doi">10.1101/gr.6023607</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Allelic genealogy and human evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Takahata</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1993</pubdate>
            <volume>10</volume>
            <issue>1</issue>
            <fpage>2</fpage>
            <lpage>22</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8450756</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Scaling of number, size, and metabolic rate of cells with body size in mammals</p>
            </title>
            <aug>
               <au>
                  <snm>Savage</snm>
                  <fnm>VM</fnm>
               </au>
               <au>
                  <snm>Allen</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Gillooly</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Herman</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Woodruff</snm>
                  <fnm>WH</fnm>
               </au>
               <au>
                  <snm>West</snm>
                  <fnm>GB</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <issue>11</issue>
            <fpage>4718</fpage>
            <lpage>4723</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1838666</pubid>
                  <pubid idtype="pmpid" link="fulltext">17360590</pubid>
                  <pubid idtype="doi">10.1073/pnas.0611235104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>The predominance of quarter-power scaling in biology</p>
            </title>
            <aug>
               <au>
                  <snm>Savage</snm>
                  <fnm>VM</fnm>
               </au>
               <au>
                  <snm>Gillooly</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Woodruff</snm>
                  <fnm>WH</fnm>
               </au>
               <au>
                  <snm>West</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Allen</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Enquist</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>Funct Ecol</source>
            <pubdate>2004</pubdate>
            <volume>18</volume>
            <issue>2</issue>
            <fpage>257</fpage>
            <lpage>282</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.0269-8463.2004.00856.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Patterns of DNA variability at X-linked loci in <it>Mus domesticus</it></p>
            </title>
            <aug>
               <au>
                  <snm>Nachman</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1997</pubdate>
            <volume>147</volume>
            <issue>3</issue>
            <fpage>1303</fpage>
            <lpage>1316</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1208253</pubid>
                  <pubid idtype="pmpid" link="fulltext">9383072</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Evidence for widespread degradation of gene control regions in hominid genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Keightley</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Lercher</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Eyre-Walker</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <issue>2</issue>
            <fpage>282</fpage>
            <lpage>288</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1371/journal.pbio.0030042</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The origins of eukaryotic gene structure</p>
            </title>
            <aug>
               <au>
                  <snm>Lynch</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2006</pubdate>
            <volume>23</volume>
            <issue>2</issue>
            <fpage>450</fpage>
            <lpage>468</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msj050</pubid>
                  <pubid idtype="pmpid" link="fulltext">16280547</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Effects of size and temperature on metabolic rate</p>
            </title>
            <aug>
               <au>
                  <snm>Gillooly</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>West</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Savage</snm>
                  <fnm>VM</fnm>
               </au>
               <au>
                  <snm>Charnov</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>293</volume>
            <issue>5538</issue>
            <fpage>2248</fpage>
            <lpage>2251</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1061967</pubid>
                  <pubid idtype="pmpid" link="fulltext">11567137</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Energy constraints on the evolution of gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Wagner</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <issue>6</issue>
            <fpage>1365</fpage>
            <lpage>1374</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi126</pubid>
                  <pubid idtype="pmpid" link="fulltext">15758206</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Intron length and codon usage</p>
            </title>
            <aug>
               <au>
                  <snm>Vinogradov</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2001</pubdate>
            <volume>52</volume>
            <issue>1</issue>
            <fpage>2</fpage>
            <lpage>5</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11139289</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Introns regulate RNA and protein abundance in yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Juneau</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Miranda</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hillenmeyer</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Nislow</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>174</volume>
            <issue>1</issue>
            <fpage>511</fpage>
            <lpage>518</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1569799</pubid>
                  <pubid idtype="pmpid" link="fulltext">16816425</pubid>
                  <pubid idtype="doi">10.1534/genetics.106.058560</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Eukaryotic intron loss</p>
            </title>
            <aug>
               <au>
                  <snm>Mourier</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Jeffares</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>300</volume>
            <issue>5624</issue>
            <fpage>1393</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1080559</pubid>
                  <pubid idtype="pmpid" link="fulltext">12775832</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>mRNA-mediated intron losses: evidence from extraordinarily large exons</p>
            </title>
            <aug>
               <au>
                  <snm>Niu</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Hou</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>SW</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <issue>6</issue>
            <fpage>1475</fpage>
            <lpage>1481</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi138</pubid>
                  <pubid idtype="pmpid" link="fulltext">15788745</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Very little intron loss/gain in <it>Plasmodium</it>: Intron loss/gain mutation rates and intron number</p>
            </title>
            <aug>
               <au>
                  <snm>Roy</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Hartl</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <issue>6</issue>
            <fpage>750</fpage>
            <lpage>756</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1473185</pubid>
                  <pubid idtype="pmpid" link="fulltext">16702411</pubid>
                  <pubid idtype="doi">10.1101/gr.4845406</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Large-scale intron conservation and order-of-magnitude variation in intron loss/gain rates in apicomplexan evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Roy</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <issue>10</issue>
            <fpage>1270</fpage>
            <lpage>1275</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1581436</pubid>
                  <pubid idtype="pmpid" link="fulltext">16963708</pubid>
                  <pubid idtype="doi">10.1101/gr.5410606</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Widespread intron loss suggests retrotransposon activity in ancient apicomplexans</p>
            </title>
            <aug>
               <au>
                  <snm>Roy</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <issue>9</issue>
            <fpage>1926</fpage>
            <lpage>1933</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msm102</pubid>
                  <pubid idtype="pmpid" link="fulltext">17522085</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>The origins of genome complexity</p>
            </title>
            <aug>
               <au>
                  <snm>Lynch</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Conery</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <issue>5649</issue>
            <fpage>1401</fpage>
            <lpage>1404</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1089370</pubid>
                  <pubid idtype="pmpid" link="fulltext">14631042</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>The Origins of Genome Architecture</p>
            </title>
            <aug>
               <au>
                  <snm>Lynch</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <publisher>Sunderland, Sinauer Associates, Inc.</publisher>
            <fpage>494</fpage>
            <note>2007</note>
         </bibl>
         <bibl id="B49">
            <title>
               <p>The human dystrophin gene requires 16 hours to be transcribed and is cotranscriptionally spliced</p>
            </title>
            <aug>
               <au>
                  <snm>Tennyson</snm>
                  <fnm>CN</fnm>
               </au>
               <au>
                  <snm>Klamut</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Worton</snm>
                  <fnm>RG</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1995</pubdate>
            <volume>9</volume>
            <issue>2</issue>
            <fpage>184</fpage>
            <lpage>190</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng0295-184</pubid>
                  <pubid idtype="pmpid" link="fulltext">7719347</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>A chromatin landmark and transcription initiation at most promoters in human cells</p>
            </title>
            <aug>
               <au>
                  <snm>Guenther</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Boyer</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2007</pubdate>
            <volume>130</volume>
            <issue>1</issue>
            <fpage>77</fpage>
            <lpage>88</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2007.05.042</pubid>
                  <pubid idtype="pmpid" link="fulltext">17632057</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p><it>In vivo</it> dynamics of RNA polymerase II transcription</p>
            </title>
            <aug>
               <au>
                  <snm>Darzacq</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Shav-Tal</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>de Turris</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Brody</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Shenoy</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Phair</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Singer</snm>
                  <fnm>RH</fnm>
               </au>
            </aug>
            <source>Nat Struct Mol Biol</source>
            <pubdate>2007</pubdate>
            <volume>14</volume>
            <issue>9</issue>
            <fpage>796</fpage>
            <lpage>806</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nsmb1280</pubid>
                  <pubid idtype="pmpid" link="fulltext">17676063</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Intron-delay and the precision of expression of homoeotic gene products in <it>Drosophila</it></p>
            </title>
            <aug>
               <au>
                  <snm>Gubb</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Dev Genet</source>
            <pubdate>1986</pubdate>
            <volume>7</volume>
            <issue>3</issue>
            <fpage>119</fpage>
            <lpage>131</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/dvg.1020070302</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Mechanisms of transcriptional timing in <it>Drosophila</it></p>
            </title>
            <aug>
               <au>
                  <snm>Thummel</snm>
                  <fnm>CS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1992</pubdate>
            <volume>255</volume>
            <issue>5040</issue>
            <fpage>39</fpage>
            <lpage>40</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1553530</pubid>
                  <pubid idtype="pmpid" link="fulltext">1553530</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Intron delays and transcriptional timing during development</p>
            </title>
            <aug>
               <au>
                  <snm>Swinburne</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Silver</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>Dev Cell</source>
            <pubdate>2008</pubdate>
            <volume>14</volume>
            <issue>3</issue>
            <fpage>324</fpage>
            <lpage>330</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.devcel.2008.02.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">18331713</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Roles of RNA:DNA hybrid stability, RNA structure, and active site conformation in pausing by human RNA polymerase II</p>
            </title>
            <aug>
               <au>
                  <snm>Palangat</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Landick</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>311</volume>
            <issue>2</issue>
            <fpage>265</fpage>
            <lpage>282</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2001.4842</pubid>
                  <pubid idtype="pmpid" link="fulltext">11478860</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>The architecture of pre-mRNAs affects mechanisms of splice-site pairing</p>
            </title>
            <aug>
               <au>
                  <snm>Fox-Walsh</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Dou</snm>
                  <fnm>YM</fnm>
               </au>
               <au>
                  <snm>Lam</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Hung</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Baldi</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Hertel</snm>
                  <fnm>KJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>45</issue>
            <fpage>16176</fpage>
            <lpage>16181</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1283478</pubid>
                  <pubid idtype="pmpid" link="fulltext">16260721</pubid>
                  <pubid idtype="doi">10.1073/pnas.0508489102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Intron size correlates positively with recombination rate in <it>Caenorhabditis elegans</it></p>
            </title>
            <aug>
               <au>
                  <snm>Prachumwat</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>DeVincentis</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Palopoli</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2004</pubdate>
            <volume>166</volume>
            <issue>3</issue>
            <fpage>1585</fpage>
            <lpage>1590</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1470791</pubid>
                  <pubid idtype="pmpid" link="fulltext">15082572</pubid>
                  <pubid idtype="doi">10.1534/genetics.166.3.1585</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Protecting exons from deleterious R-loops: a potential advantage of having introns</p>
            </title>
            <aug>
               <au>
                  <snm>Niu</snm>
                  <fnm>DK</fnm>
               </au>
            </aug>
            <source>Biol Direct</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <issue>1</issue>
            <fpage>11</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1863416</pubid>
                  <pubid idtype="pmpid" link="fulltext">17459149</pubid>
                  <pubid idtype="doi">10.1186/1745-6150-2-11</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Housekeeping genes tend to show reduced upstream sequence conservation</p>
            </title>
            <aug>
               <au>
                  <snm>Farre</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bellora</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mularoni</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Messeguer</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Alba</snm>
                  <fnm>MM</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <issue>7</issue>
            <fpage>R140</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2323216</pubid>
                  <pubid idtype="pmpid" link="fulltext">17626644</pubid>
                  <pubid idtype="doi">10.1186/gb-2007-8-7-r140</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Genome mapping and expression analyses of human intronic noncoding RNAs reveal tissue-specific patterns and enrichment in genes related to regulation of transcription</p>
            </title>
            <aug>
               <au>
                  <snm>Nakaya</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Amaral</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Louro</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lopes</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fachel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Moreira</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>El-Jundi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>da Silva</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Reis</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Verjovski-Almeida</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <issue>3</issue>
            <fpage>R43</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1868932</pubid>
                  <pubid idtype="pmpid" link="fulltext">17386095</pubid>
                  <pubid idtype="doi">10.1186/gb-2007-8-3-r43</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Intron size in mammals: complexity comes to terms with economy</p>
            </title>
            <aug>
               <au>
                  <snm>Pozzoli</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Menozzi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Comi</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Cagliani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bresolin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sironi</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <issue>1</issue>
            <fpage>20</fpage>
            <lpage>24</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2006.10.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">17070957</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>"Genome design" model: Evidence from conserved intronic sequence in human-mouse comparison</p>
            </title>
            <aug>
               <au>
                  <snm>Vinogradov</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <issue>3</issue>
            <fpage>347</fpage>
            <lpage>354</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1415212</pubid>
                  <pubid idtype="pmpid" link="fulltext">16461636</pubid>
                  <pubid idtype="doi">10.1101/gr.4318206</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Protein polymorphism is negatively correlated with conservation of intronic sequences and complexity of expression patterns in <it>Drosophila melanogaster</it></p>
            </title>
            <aug>
               <au>
                  <snm>Petit</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Casillas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ruiz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barbadilla</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2007</pubdate>
            <volume>64</volume>
            <issue>5</issue>
            <fpage>511</fpage>
            <lpage>518</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00239-006-0047-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">17460807</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Patterns of intron sequence evolution in <it>Drosophila</it> are dependent upon length and GC content</p>
            </title>
            <aug>
               <au>
                  <snm>Haddrill</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Halligan</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Andolfatto</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>8</issue>
            <fpage>R67</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1273634</pubid>
                  <pubid idtype="pmpid" link="fulltext">16086849</pubid>
                  <pubid idtype="doi">10.1186/gb-2005-6-8-r67</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>In plants, highly expressed genes are the least compact</p>
            </title>
            <aug>
               <au>
                  <snm>Ren</snm>
                  <fnm>XY</fnm>
               </au>
               <au>
                  <snm>Vorst</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Fiers</snm>
                  <fnm>MWEJ</fnm>
               </au>
               <au>
                  <snm>Stiekema</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Nap</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>10</issue>
            <fpage>528</fpage>
            <lpage>532</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2006.08.008</pubid>
                  <pubid idtype="pmpid" link="fulltext">16934358</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Transcriptional regulation of post-aggregation genes in <it>Dictyostelium</it> by a feed-forward loop involving GBF and LagC</p>
            </title>
            <aug>
               <au>
                  <snm>Iranfar</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fuller</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Loomis</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Dev Biol</source>
            <pubdate>2006</pubdate>
            <volume>290</volume>
            <issue>2</issue>
            <fpage>460</fpage>
            <lpage>469</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ydbio.2005.11.035</pubid>
                  <pubid idtype="pmpid" link="fulltext">16386729</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>The evolution of noncoding DNA: how much junk, how much func?</p>
            </title>
            <aug>
               <au>
                  <snm>Castillo-Davis</snm>
                  <fnm>CI</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>10</issue>
            <fpage>533</fpage>
            <lpage>536</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2005.08.001</pubid>
                  <pubid idtype="pmpid" link="fulltext">16098630</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Rapid, transcript-specific changes in splicing in response to environmental stress</p>
            </title>
            <aug>
               <au>
                  <snm>Pleiss</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Whitworth</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Bergkessel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Guthrie</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2007</pubdate>
            <volume>27</volume>
            <issue>6</issue>
            <fpage>928</fpage>
            <lpage>937</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molcel.2007.07.018</pubid>
                  <pubid idtype="pmpid" link="fulltext">17889666</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Minimal introns are not "junky"</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>ZY</fnm>
               </au>
               <au>
                  <snm>Kibukawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Paddock</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Passey</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>GKS</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <issue>8</issue>
            <fpage>1185</fpage>
            <lpage>1189</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">186636</pubid>
                  <pubid idtype="pmpid" link="fulltext">12176926</pubid>
                  <pubid idtype="doi">10.1101/gr.224602</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Patterns and rates of intron divergence between humans and chimpanzees</p>
            </title>
            <aug>
               <au>
                  <snm>Gazave</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Marques-Bonet</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Fernando</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Navarro</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <issue>2</issue>
            <fpage>R21</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1852421</pubid>
                  <pubid idtype="pmpid" link="fulltext">17309804</pubid>
                  <pubid idtype="doi">10.1186/gb-2007-8-2-r21</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Relationship of codon bias to mRNA concentration and protein length in <it>Saccharomyces cerevisiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Coghlan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Yeast</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>12</issue>
            <fpage>1131</fpage>
            <lpage>1145</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/1097-0061(20000915)16:12&lt;1131::AID-YEA609&gt;3.0.CO;2-F</pubid>
                  <pubid idtype="pmpid" link="fulltext">10953085</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>Analysis of the yeast transcriptome with structural and functional categories: characterizing highly expressed proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Jansen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <issue>6</issue>
            <fpage>1481</fpage>
            <lpage>1488</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">111042</pubid>
                  <pubid idtype="pmpid" link="fulltext">10684945</pubid>
                  <pubid idtype="doi">10.1093/nar/28.6.1481</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Translational selection and yeast proteome evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Akashi</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2003</pubdate>
            <volume>164</volume>
            <issue>4</issue>
            <fpage>1291</fpage>
            <lpage>1303</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462678</pubid>
                  <pubid idtype="pmpid" link="fulltext">12930740</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Evolutionary constraints on yeast protein size</p>
            </title>
            <aug>
               <au>
                  <snm>Warringer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Blomberg</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <issue>1</issue>
            <fpage>61</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1560397</pubid>
                  <pubid idtype="pmpid" link="fulltext">16911784</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-6-61</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>NCBI genome database</p>
            </title>
            <url>ftp://ftp.ncbi.nih.gov/genomes/</url>
         </bibl>
         <bibl id="B76">
            <title>
               <p>GNF Genome Informatics Applications &amp; Datasets</p>
            </title>
            <url>http://wombat.gnf.org/index.html</url>
         </bibl>
      </refgrp>
   </bm>
</art>
