<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1471-2148-11-182</ui><ji>1471-2148</ji><fm>
<dochead>Research article</dochead>
<bibl>
<title>
<p>Increased gene sampling strengthens support for higher-level groups within leaf-mining moths and relatives (Lepidoptera: Gracillariidae)</p>
</title>
<aug>
<au ca="yes" id="A1"><snm>Kawahara</snm><mi>Y</mi><fnm>Akito</fnm><insr iid="I1"/><email>kawahara@flmnh.ufl.edu</email></au>
<au id="A2"><snm>Ohshima</snm><fnm>Issei</fnm><insr iid="I2"/><email>issei@nibb.ac.jp</email></au>
<au id="A3"><snm>Kawakita</snm><fnm>Atsushi</fnm><insr iid="I3"/><email>kawakita@ecology.kyoto-u.ac.jp</email></au>
<au id="A4"><snm>Regier</snm><mi>C</mi><fnm>Jerome</fnm><insr iid="I4"/><email>regier@umd.edu</email></au>
<au id="A5"><snm>Mitter</snm><fnm>Charles</fnm><insr iid="I1"/><email>cmitter@umd.edu</email></au>
<au id="A6"><snm>Cummings</snm><mi>P</mi><fnm>Michael</fnm><insr iid="I5"/><email>mike@umiacs.umd.edu</email></au>
<au id="A7"><snm>Davis</snm><mi>R</mi><fnm>Donald</fnm><insr iid="I6"/><email>davisd@si.edu</email></au>
<au id="A8"><snm>Wagner</snm><mi>L</mi><fnm>David</fnm><insr iid="I7"/><email>david.wagner@uconn.edu</email></au>
<au id="A9"><snm>De Prins</snm><fnm>Jurate</fnm><insr iid="I8"/><email>jurate.de.prins@africamuseum.be</email></au>
<au id="A10"><snm>Lopez-Vaamonde</snm><fnm>Carlos</fnm><insr iid="I9"/><email>carlos.lopez-vaamonde@orleans.inra.fr</email></au>
</aug>
<insg>
<ins id="I1"><p>Department of Entomology, University of Maryland, College Park, MD, USA</p></ins>
<ins id="I2"><p>Division of Evolutionary Biology, National Institute for Basic Biology, Okazaki, Japan</p></ins>
<ins id="I3"><p>Center for Ecological Research, Kyoto University, Kyoto, Japan</p></ins>
<ins id="I4"><p>Institute for Bioscience and Biotechnology Research, University of Maryland, College Park, MD, USA</p></ins>
<ins id="I5"><p>Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, USA</p></ins>
<ins id="I6"><p>Department of Entomology, Smithsonian Institution, Washington, D.C., USA</p></ins>
<ins id="I7"><p>Department of Ecology &amp; Evolutionary Biology, University of Connecticut, Storrs, CT, USA</p></ins>
<ins id="I8"><p>Royal Museum for Central Africa, Tervuren, Belgium</p></ins>
<ins id="I9"><p>INRA, UR0633 Zoologie Foresti&#232;re, F-45000, Orl&#233;ans, France</p></ins>
</insg>
<source>BMC Evolutionary Biology</source>
<issn>1471-2148</issn>
<pubdate>2011</pubdate>
<volume>11</volume>
<issue>1</issue>
<fpage>182</fpage>
<url>http://www.biomedcentral.com/1471-2148/11/182</url>
<xrefbib><pubidlist><pubid idtype="pmpid">21702958</pubid><pubid idtype="doi">10.1186/1471-2148-11-182</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>8</day><month>11</month><year>2010</year></date></rec><acc><date><day>24</day><month>6</month><year>2011</year></date></acc><pub><date><day>24</day><month>6</month><year>2011</year></date></pub></history>
<cpyrt><year>2011</year><collab>Kawahara et al; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>Researchers conducting molecular phylogenetic studies are frequently faced with the decision of what to do when weak branch support is obtained for key nodes of importance. As one solution, the researcher may choose to sequence additional orthologous genes of appropriate evolutionary rate for the taxa in the study. However, generating large, complete data matrices can become increasingly difficult as the number of characters increases. A few empirical studies have shown that augmenting genes even for a subset of taxa can improve branch support. However, because each study differs in the number of characters and taxa, there is still a need for additional studies that examine whether incomplete sampling designs are likely to aid at increasing deep node resolution. We target Gracillariidae, a Cretaceous-age (~100 Ma) group of leaf-mining moths to test whether the strategy of adding genes for a subset of taxa can improve branch support for deep nodes. We initially sequenced ten genes (8,418 bp) for 57 taxa that represent the major lineages of Gracillariidae plus outgroups. After finding that many deep divergences remained weakly supported, we sequenced eleven additional genes (6,375 bp) for a 27-taxon subset. We then compared results from different data sets to assess whether one sampling design can be favored over another. The concatenated data set comprising all genes and all taxa and three other data sets of different taxon and gene sub-sampling design were analyzed with maximum likelihood. Each data set was subject to five different models and partitioning schemes of non-synonymous and synonymous changes. Statistical significance of non-monophyly was examined with the Approximately Unbiased (AU) test.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>Partial augmentation of genes led to high support for deep divergences, especially when non-synonymous changes were analyzed alone. Increasing the number of taxa without an increase in number of characters led to lower bootstrap support; increasing the number of characters without increasing the number of taxa generally increased bootstrap support. More than three-quarters of nodes were supported with bootstrap values greater than 80% when all taxa and genes were combined. Gracillariidae, Lithocolletinae + <it>Leucanthiza</it>, and <it>Acrocercops </it>and <it>Parectopa </it>groups were strongly supported in nearly every analysis. <it>Gracillaria </it>group was well supported in some analyses, but less so in others. We find strong evidence for the exclusion of Douglasiidae from Gracillarioidea sensu Davis and Robinson (1998). Our results strongly support the monophyly of a G.B.R.Y. clade, a group comprised of Gracillariidae + Bucculatricidae + Roeslerstammiidae + Yponomeutidae, when analyzed with non-synonymous changes only, but this group was frequently split when synonymous and non-synonymous substitutions were analyzed together.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>1) Partially or fully augmenting a data set with more characters increased bootstrap support for particular deep nodes, and this increase was dramatic when non-synonymous changes were analyzed alone. Thus, the addition of sites that have low levels of saturation and compositional heterogeneity can greatly improve results. 2) Gracillarioidea, as defined by Davis and Robinson (1998), clearly do not include Douglasiidae, and changes to current classification will be required. 3) Gracillariidae were monophyletic in all analyses conducted, and nearly all species can be placed into one of six strongly supported clades though relationships among these remain unclear. 4) The difficulty in determining the phylogenetic placement of Bucculatricidae is probably attributable to compositional heterogeneity at the third codon position. From our tests for compositional heterogeneity and strong bootstrap values obtained when synonymous changes are excluded, we tentatively conclude that Bucculatricidae is closely related to Gracillariidae + Roeslerstammiidae + Yponomeutidae.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>Researchers conducting molecular phylogenetic studies are frequently faced with the decision of what to do when weak branch support is obtained for key nodes of importance. As one solution, the researcher may choose to sequence additional orthologous genes of appropriate evolutionary rate. Indeed, it is well known that increasing the number of characters can improve branch support (e.g. <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B5">5</abbr>
</abbrgrp>). However, generating large, complete data matrices can become increasingly difficult as the number of characters increases. Two empirical studies <abbrgrp>
<abbr bid="B6">6</abbr>
<abbr bid="B7">7</abbr>
</abbrgrp> have concluded that augmenting genes even for a subset of taxa can improve branch support. However, because each study differs in the number of characters and taxa, there is still a need for additional studies that examine whether incomplete sampling designs are likely to aid at increasing deep node resolution.</p>
<p>In this paper, we target Gracillariidae, a Cretaceous-age (~100 Ma) group of leaf-mining moths <abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp> to test whether the strategy of adding genes for a subset of taxa can improve branch support for deep nodes. Gracillariidae, with 1,855 species <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
</abbrgrp>, is one of the largest groups of leaf-mining Lepidoptera with numerous economically important species that cause agricultural damage <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B11">11</abbr>
<abbr bid="B12">12</abbr>
<abbr bid="B13">13</abbr>
<abbr bid="B14">14</abbr>
<abbr bid="B15">15</abbr>
<abbr bid="B16">16</abbr>
</abbrgrp>. Gracillariids show a diversity of life-history strategies, such as fruit mining, stem mining, leaf rolling, boring, and galling <abbrgrp>
<abbr bid="B11">11</abbr>
<abbr bid="B17">17</abbr>
</abbrgrp>, and some species change strategies during development <abbrgrp>
<abbr bid="B17">17</abbr>
<abbr bid="B18">18</abbr>
<abbr bid="B19">19</abbr>
<abbr bid="B20">20</abbr>
</abbrgrp>. Despite the agricultural importance and diversity of life-history strategies, the systematics of Gracillariidae is poorly understood. Monophyly of the superfamily Gracillarioidea as currently defined by Davis and Robinson <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp> remains uncertain. The phylogenetic position of Gracillarioidea in Lepidoptera is also relatively unclear, though recent molecular studies strongly support a close relationship to Yponomeutoidea <abbrgrp>
<abbr bid="B7">7</abbr>
<abbr bid="B21">21</abbr>
<abbr bid="B22">22</abbr>
</abbrgrp>.</p>
<p>Davis and Robinson's classification includes four families in Gracillarioidea, Bucculatricidae, Douglasiidae, Gracillariidae, and Roeslerstammiidae. Bucculatricidae and Douglasiidae were included in Gracillarioidea based on nine morphological features that they share with Gracillariidae and Roeslerstammiidae, including two from the larva, two from the pupa, and five from the adult <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>. Others have included Bucculatricidae, Gracillariidae, and Phyllocnistidae (the latter now in Gracillariidae <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
<abbr bid="B23">23</abbr>
<abbr bid="B24">24</abbr>
</abbrgrp>), Bucculatricidae, Gracillariidae, and Lyonetiidae <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp>, or Bucculatricidae, Gracillariidae and Roeslerstammiidae <abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>. While some putative relationships have been postulated for higher-level relationships within Gracillarioidea based on morphology (e.g. <abbrgrp>
<abbr bid="B24">24</abbr>
<abbr bid="B27">27</abbr>
</abbrgrp>, the trees presented in these studies were based on phenetic similarity rather than discrete character analysis. The only phylogenetic study that examined higher-level gracillariid relationships was a recent study aimed at resolving broader relationships of Lepidoptera that included 14 gracillarioid species <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>. The authors suggested that Gracillarioidea might not include Bucculatricidae or Douglasiidae. Most phylogenetic studies within Gracillarioidea have focused mainly at the genus level or below (e.g. host races of <it>Acrocercops transecta </it>
<abbrgrp>
<abbr bid="B28">28</abbr>
<abbr bid="B29">29</abbr>
</abbrgrp>, <it>Epicephala </it>and relatives <abbrgrp>
<abbr bid="B30">30</abbr>
<abbr bid="B31">31</abbr>
<abbr bid="B32">32</abbr>
</abbrgrp>, and <it>Phyllonorycter </it>
<abbrgrp>
<abbr bid="B33">33</abbr>
<abbr bid="B34">34</abbr>
</abbrgrp>).</p>
<p>This study utilizes 21 nuclear protein-coding genes to evaluate the effect of augmenting sequence data for a subsample of taxa and to tackle the problem of the phylogeny of Gracillariidae and their relatives. Fifty-seven taxa, including exemplars representing the major lineages of Gracillarioidea plus outgroups, were initially sequenced for ten genes (8,418 bp). After discovering that many deep divergences within the superfamily could not be recovered with strong branch support, we sequenced 11 additional genes (6,375 bp) for 27 taxa representing the major lineages of Gracillarioidea (21 genes total, 14,793 bp). We compared results from four data sets differing in gene and taxon sampling design (Figure <figr fid="F1">1</figr>), to assess whether one design can be favored over another. We also examined the effect of excluding synonymous changes, which at deeper levels in our taxon sample are subject both to saturation and to divergence in base composition, possibly obscuring phylogenetic signal.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Four data sets representing different taxon/gene sampling strategies</p></caption><text>
   <p><b>Four data sets representing different taxon/gene sampling strategies</b>. A) 27 taxa &#215; 10 genes, B) 27 taxa &#215; 21 genes, C) 57 taxa &#215; 10 genes, D) combination of B and C into a single data set with a block of missing data accounting for approximately a quarter of total data.</p>
</text><graphic file="1471-2148-11-182-1" hint_layout="single"/></fig>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<sec>
<st>
<p>Taxon sampling</p>
</st>
<p>The present study included 45 species of Gracillarioidea, of which 39 were Gracillariidae (Additional file <supplr sid="S1">1</supplr>). Taxa were chosen to represent the major lineages as defined by the classification of Davis and Robinson <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>. Whenever possible, we included the type species or genus. Twelve outgroups were chosen based on the availability of sequence data and their phylogenetic proximity to Gracillarioidea in two recent molecular phylogenetic studies of ditrysian Lepidoptera <abbrgrp>
<abbr bid="B7">7</abbr>
<abbr bid="B21">21</abbr>
</abbrgrp>. While we mention Oecophyllembiinae in our discussion, we follow Davis and Robinson <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp> and V&#225;ri et al. <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp> and do not formally recognize this subfamily.</p>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>Exemplar species included, their classification, and GenBank accession numbers</b>. For Gracillariidae the number of taxa in each subfamily and genus is listed in parentheses (number of taxa sampled/number of taxa known). "x" denotes a sequence that could not be amplified.</p>
</text>
<file name="1471-2148-11-182-S1.XLS">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Gene sampling</p>
</st>
<p>Ten nuclear protein-coding genes, totalling an alignment length of 8,418 bp, were initially chosen for this study (Table <tblr tid="T1">1</tblr>). These genes were included because they had some of the highest amplification success rates and had proven useful for estimating a "backbone" phylogeny of Lepidoptera (see <url>http://www.leptree.net/</url>) and are among the 68 gene regions originally developed for deep-level phylogenetics of Arthropoda <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>. We created two data sets for ten genes, one with 27 taxa (data set A, Figure <figr fid="F1">1A</figr>) and another with 57 taxa (data set C, Figure <figr fid="F1">1C</figr>). After discovering that ten genes did not adequately resolve phylogenetic relationships among subfamilies, we chose a resource-efficient approach to tackle the problem of weak branch support for deep nodes in the tree. We additionally sequenced eleven genes for 27 taxa and combined this additional sequence data with the original ten (data set B, 14,793 bp; Figure <figr fid="F1">1B</figr>). These additional eleven genes were chosen based on amplification success and examining the average rate of non-synonymous change from a previous study <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>. Throughout the paper, we refer to the "ten genes" and "eleven genes" as the original ten and additional eleven genes. Because our primary goal was to produce the best estimate for relationships of Gracillariidae and relatives, we combined data sets A through C to create data set D (57 taxa &#215; 21 genes, 14,793 bp; Figure <figr fid="F1">1D</figr>). We assessed the differences in tree topology and branch support among these data sets and tested the effect of synonymous changes on each.</p>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>Representation of genes and their amplicon names in each of the four data sets.</p></caption><tblbdy cols="7">
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center" cspan="4">
            <p>
               <b>Data set</b>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <b>A</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>B</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>C</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>D</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="7">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Gene</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Amplicon name and reference</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Length</b>
            </p>
            <p>
               <b>(bp)</b>
            </p>
         </c>
         <c indent="1" ca="left">
            <p>
               <b>10 g &#215; 27 t</b>
            </p>
         </c>
         <c indent="1" ca="left">
            <p>
               <b>21 g &#215; 27 t</b>
            </p>
         </c>
         <c indent="1" ca="left">
            <p>
               <b>10 g &#215; 57 t</b>
            </p>
         </c>
         <c indent="1" ca="left">
            <p>
               <b>21 g &#215; 27 t</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="7">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>40fin2_3</p>
         </c>
         <c ca="left">
            <p>Phosphogluconate dehydrogenase <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>750</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>42fin1_2</p>
         </c>
         <c ca="left">
            <p>Putative GTP-binding protein <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>840</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>109fin1_2</p>
         </c>
         <c ca="left">
            <p>Gelsolin <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>552</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>192fin1_2</p>
         </c>
         <c ca="left">
            <p>Glutamyl- &amp; prolyl-tRNA sybphetase <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>402</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>197fin1_2</p>
         </c>
         <c ca="left">
            <p>Triosephosphate isomerase <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>444</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>262fin1_2</p>
         </c>
         <c ca="left">
            <p>Proteasome subunit <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>501</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>265fin2_3</p>
         </c>
         <c ca="left">
            <p>Histidyl-tRNA sybphetase <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>447</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>268fin1_2</p>
         </c>
         <c ca="left">
            <p>AMP deaminase <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>768</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3007fin1_2</p>
         </c>
         <c ca="left">
            <p>Glucose phosphosphate dehydrogenase <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>621</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3017fin1_2</p>
         </c>
         <c ca="left">
            <p>Tetrahydrofolate sybphase <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>594</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3070fin4_5</p>
         </c>
         <c ca="left">
            <p>Alanyl-tRNA sybphetase <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>705</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>8028fin1_2</p>
         </c>
         <c ca="left">
            <p>Nucleolar cysteine-rich protein <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>324</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>8091fin1_2</p>
         </c>
         <c ca="left">
            <p>Glucose phosphate isomerase <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>666</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>acc2_4</p>
         </c>
         <c ca="left">
            <p>Acetyl-coA carboxylase <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>501</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CAD</p>
         </c>
         <c ca="left">
            <p>Pyrimidine biosynthesis <abbrgrp><abbr bid="B85">85</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>2913</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>DDC</p>
         </c>
         <c ca="left">
            <p>Dopa-decarboxylase <abbrgrp><abbr bid="B86">86</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>708</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>EF-1alpha</p>
         </c>
         <c ca="left">
            <p>Elongation factor-1 alpha <abbrgrp><abbr bid="B37">37</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>519</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>enolase</p>
         </c>
         <c ca="left">
            <p>Enolase <abbrgrp><abbr bid="B87">87</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>1134</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>histone 3</p>
         </c>
         <c ca="left">
            <p>Histone 3 <abbrgrp><abbr bid="B38">38</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>273</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>period</p>
         </c>
         <c ca="left">
            <p>Period <abbrgrp><abbr bid="B88">88</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>747</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>wingless</p>
         </c>
         <c ca="left">
            <p>Wingless <abbrgrp><abbr bid="B89">89</abbr></abbrgrp></p>
         </c>
         <c ca="left">
            <p>402</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>A box with an "X" indicates a gene that was included in that particular data set.</p>
   </tblfn></tbl>
<p>Specifically, data set A (27 taxa &#215; 10 genes) was constructed to examine whether modest taxa and gene sampling can strongly resolve relationships of Gracillariidae and relatives. Data set B (27 taxa &#215; 21 genes) was constructed to examine whether nearly doubling the number of characters would boost branch support for deep splits that were not well supported with ten genes. Data set C (57 taxa &#215; 10 genes) was built to assess the effects of adding more taxa to data set A. Finally, data set D (57 taxa &#215; 21 genes) was constructed to examine how trees generated from a data set that included all taxa and genes (but also approximately a quarter of the characters without data) compared to those from the other three data sets. The amount of missing data for each of the four data sets (A through D) was 13.6%, 14.3%, 31.2%, and 43.6% respectively.</p>
<p>For all genes except elongation factor-1 alpha (EF-1&#945; <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp>) and histone 3 (H3 <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp>), nucleic acid sequences were generated from mRNAs amplified with RT-PCR following laboratory protocols, primer sequences, and amplification strategies of Regier et al. <abbrgrp>
<abbr bid="B39">39</abbr>
</abbrgrp>. For EF-1&#945; and H3, we followed methods outlined in Kawakita et al. <abbrgrp>
<abbr bid="B30">30</abbr>
</abbrgrp>, Kawakita and Kato <abbrgrp>
<abbr bid="B40">40</abbr>
</abbrgrp>, Ogden and Whiting <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp>, and amplified directly from genomic DNA. Each single-gene data set was individually translated and aligned with the "Translation Align" option in Geneious 5.1 <abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp> after making sure the data set began with the first codon position (nt1). The alignment was visually inspected, and checked twice for frame-shifts and the presence of termination codons. Difficult to align regions were assessed in GBlocks 0.91b <abbrgrp>
<abbr bid="B42">42</abbr>
</abbrgrp> and removed as they can cause problems in phylogeny estimation <abbrgrp>
<abbr bid="B43">43</abbr>
<abbr bid="B44">44</abbr>
</abbrgrp>. Sequences were also assessed for contamination and sample-switching error, by generating pairwise distance tables for nt12, nt3, and nt123 in PAUP* 4b10 <abbrgrp>
<abbr bid="B45">45</abbr>
</abbrgrp> and ML bootstrap trees in GARLI 1.0 <abbrgrp>
<abbr bid="B46">46</abbr>
</abbrgrp> for each gene before all genes were concatenated. The 21 data matrices were concatenated with Geneious <abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp> and the entire edited sequence data set visually checked. GenBank accession numbers are listed in Additional file <supplr sid="S1">1</supplr>.</p>
</sec>
<sec>
<st>
<p>Phylogenetic analysis</p>
</st>
<p>Phylogenetic analyses were conducted with maximum likelihood (ML) as implemented in GARLI 1.0 <abbrgrp>
<abbr bid="B46">46</abbr>
</abbrgrp> and GARLI-PART 0.97 <abbrgrp>
<abbr bid="B47">47</abbr>
</abbrgrp>. All settings were kept as default except where indicated below. We used jModelTest <abbrgrp>
<abbr bid="B48">48</abbr>
</abbrgrp> to determine the best substitution model for each data set, which in nearly all cases was the General-Time-Reversible (GTR) model <abbrgrp>
<abbr bid="B49">49</abbr>
<abbr bid="B50">50</abbr>
</abbrgrp>, with among-site rate heterogeneity modeled according to a gamma (&#915;) distribution <abbrgrp>
<abbr bid="B51">51</abbr>
</abbrgrp> while allowing for a proportion of invariable sites (I) <abbrgrp>
<abbr bid="B52">52</abbr>
</abbrgrp>. Two thousand ML and bootstrap tree searches were conducted for analyses that applied a nucleotide substitution model. We also ran codon model analyses <abbrgrp>
<abbr bid="B53">53</abbr>
</abbrgrp> as implemented in GARLI. Due to computational limitations at the time of this study, each codon analysis was conducted with 100 ML tree searches and 100 bootstrap replicates. To expedite tree searches, we used Grid computing <abbrgrp>
<abbr bid="B54">54</abbr>
</abbrgrp> through The Lattice Project <abbrgrp>
<abbr bid="B55">55</abbr>
</abbrgrp>. For consistency in the characterization of results, we will refer to bootstrap support (BP) of 70-79% as "moderate," support &#8805; 80% (but &lt; 90%) as "strong," and &#8805; 90% as "very strong." We use the arbitrary cutoff of 80% BP as a measure to compare the number of nodes with strong support across individual genes.</p>
</sec>
<sec>
<st>
<p>Base compositional heterogeneity</p>
</st>
<p>Base compositional bias can cause unrelated lineages to incorrectly group together (e.g. <abbrgrp>
<abbr bid="B56">56</abbr>
<abbr bid="B57">57</abbr>
<abbr bid="B58">58</abbr>
<abbr bid="B59">59</abbr>
<abbr bid="B60">60</abbr>
</abbrgrp>). While models for phylogenetic analysis assume compositional homogeneity, strong compositional bias is common at sites capable of undergoing synonymous substitution <abbrgrp>
<abbr bid="B21">21</abbr>
<abbr bid="B36">36</abbr>
<abbr bid="B61">61</abbr>
</abbrgrp>. In order to examine the effect of compositional heterogeneity, we examined five different character partitions, with and without synonymous change: (a) "nt123": all nucleotides and all changes; (b) "codon": all nucleotides and changes, but implementing a codon model, which "down-weights" synonymous changes because of their relatively rapid evolution; (c) "degen1" <abbrgrp>
<abbr bid="B62">62</abbr>
<abbr bid="B63">63</abbr>
</abbrgrp>: all sequence sites with the potential to undergo synonymous changes fully degenerated, an extension of the RY coding scheme of Phillips et al. <abbrgrp>
<abbr bid="B64">64</abbr>
</abbrgrp>; (d) "partitioned": all nucleotides partitioned into mostly synonymously evolving and mostly non-synonymously evolving sites, specifically, the partition, "noLRall1 + nt2" versus "LRall1 + nt3" of Regier et al. <abbrgrp>
<abbr bid="B62">62</abbr>
</abbrgrp>; and (e) amino acids. As an alternative means to filter synonymous substitutions, in some cases we also analyzed the noLRall1 + nt2 data set alone (see Discussion).</p>
<p>To further investigate the potential influence of compositional heterogeneity, we conducted chi-square tests of among-taxon heterogeneity on data set B (27 taxa &#215; 21 genes). We chose data set B because it includes the largest number of characters (14,793 bp) with a relatively low percentage of missing data (14.3%) out of the four data sets. Chi-square tests were conducted on a character set undergoing mostly synonymous change, nt3, and one undergoing mostly non-synonymous change, noLRall1 + nt2. We conducted the test for various groups in Gracillariidae and outgroups on both the entire character set, and after eliminating invariable sites in the degen1 data set. To gauge the possible effect of compositional heterogeneity on phylogeny inference, we compared neighbor-joining trees using two different distances: ML distances based on the GTR model, which can be influenced by compositional heterogeneity; and Euclidean distances calculated on the proportions of the four nucleotide states treated as independent characters, which will reflect only compositional heterogeneity. Compositional distances were generated using a Perl script that was written with modification of the MBE Toolbox <abbrgrp>
<abbr bid="B65">65</abbr>
</abbrgrp> and calculated with PAUP* <abbrgrp>
<abbr bid="B45">45</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Testing alternative hypotheses</p>
</st>
<p>Morphology and larval mining patterns predict the monophyly of Gracillariidae + Bucculatricidae + Roeslerstammiidae <abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>, Gracillariinae + Lithocolletinae <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp>, and Oecophyllembiinae + Phyllocnistinae <abbrgrp>
<abbr bid="B11">11</abbr>
<abbr bid="B35">35</abbr>
<abbr bid="B66">66</abbr>
</abbrgrp>, but some of these proposed higher-level groups were not recovered. To test whether these differences between morphological (and behavioral) versus molecular inferences were "real," i.e. not attributable to sampling error in the molecular data, we used the Approximately Unbiased (AU) test of Shimodaira <abbrgrp>
<abbr bid="B67">67</abbr>
</abbrgrp>. The AU test ranks trees and determines if trees under a topological constraint describe the data significantly worse than the best tree.</p>
<p>To compare the confidence between our results and prior morphology-based hypotheses, we conducted separate analyses in which groups believed to be monophyletic were constrained. ML trees were calculated with constraints enforced, and the ML tree from the constrained and the unconstrained analyses compared with the AU test. Each analysis applied the same number of ML runs determined to be appropriate for that character partition as described above. Site likelihoods were estimated with PAUP* <abbrgrp>
<abbr bid="B45">45</abbr>
</abbrgrp>. For each data set, we combined the site likelihoods generated from all ML constraint analyses together into a single file with the unconstrained site likelihoods. In CONSEL 0.1j <abbrgrp>
<abbr bid="B68">68</abbr>
</abbrgrp>, the AU test statistic of Shimodaira <abbrgrp>
<abbr bid="B67">67</abbr>
</abbrgrp> was used to determine the difference in fit to data of the constrained and unconstrained trees.</p>
</sec>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<sec>
<st>
<p>Gene versus taxon sampling</p>
</st>
<p>Sampling design had the greatest influence on the recovery and bootstrap support of deep nodes in Gracillariidae and relatives, which was especially pronounced for the G.B.R.Y. clade (Gracillariidae + Bucculatricidae + Roeslerstammiidae + Yponomeutidae). Bootstrap support for this clade rose for all five analytical methods when the number of sampled characters was nearly doubled (data set A [27 taxa &#215; 10 genes] versus B [27 taxa &#215; 21 genes]; Table <tblr tid="T2">2</tblr>). Degen1 provided the strongest support for the G.B.R.Y. clade, rising from 74% (data set A) to 90% (data set B). An increase was also seen when we analyzed the complementary 11 gene, 27 taxa data set (data set B minus A), which had 84% BP for the G.B.R.Y. clade (data not shown). Conversely, doubling the number of taxa in data set A, yielding data set C, lowered support from 74% to &lt; 50% BP for degen1 (data set A versus C; Table <tblr tid="T2">2</tblr>). Augmenting data set C with sequence data for 11 genes for just over half the number of taxa greatly improved branch support (all five analyses resulted in &gt; 50% BP; data set C versus D, Table <tblr tid="T2">2</tblr>). There was little difference in bootstrap support for the G.B.R.Y. clade between the two data sets with the greatest amount of gene sampling (data set B versus D). Bootstrap support for other deep, non-G.B.R.Y. clades changed very little.</p>
<tbl id="T2"><title><p>Table 2</p></title><caption><p>Bootstrap support values across data sets for selected clades.</p></caption><tblbdy cols="12">
      <r>
         <c ca="left">
            <p>
               <b>Data set</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Analysis</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>'G.B.R.Y.' clade</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Gracillariidae + Bucculatricidae + Yponomeutidae</b>
            </p>
            <p>
               <b>('G.B.Y.' clade)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Gracillariidae + Roeslerstammiidae + Yponomeutidae</b>
            </p>
            <p>
               <b>('G.R.Y.' clade)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Gracillariidae + Yponomeutidae ('G.Y.' clade)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Gracillariidae</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Lithocolletinae + <it>Leucanthiza</it></b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b><it>Acrocercops </it>group</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b><it>Gracillaria </it>group</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b><it>Parectopa </it>group</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Phyllocnistinae + Oecophyllembiinae + <it>Dendrorycter </it>+ <it>Marmara</it></b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="12">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>A</p>
         </c>
         <c ca="left">
            <p>nt123</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>99</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>87</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>codon</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>99</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>86</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>degen</p>
         </c>
         <c ca="left">
            <p>74</p>
         </c>
         <c ca="left">
            <p>55</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>99</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>partitioned</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>99</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>89</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>aa</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>97</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>90</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
      </r>
      <r>
         <c cspan="12">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>B</p>
         </c>
         <c ca="left">
            <p>nt123</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B53">53</abbr>
               </abbrgrp>
            </p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>98</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>codon</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B54">54</abbr>
               </abbrgrp>
            </p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>93</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>degen</p>
         </c>
         <c ca="left">
            <p>90</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>92</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>partitioned</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B62">62</abbr>
               </abbrgrp>
            </p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>94</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>aa</p>
         </c>
         <c ca="left">
            <p>66</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>98</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>97</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>N/A</p>
         </c>
      </r>
      <r>
         <c cspan="12">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>nt123</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>99</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>98</p>
         </c>
         <c ca="left">
            <p>71</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>codon</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>99</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>98</p>
         </c>
         <c ca="left">
            <p>58</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>degen</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>89</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>partitioned</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>99</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>97</p>
         </c>
         <c ca="left">
            <p>77</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>aa</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>90</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>93</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
      </r>
      <r>
         <c cspan="12">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>D</p>
         </c>
         <c ca="left">
            <p>nt123</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B55">55</abbr>
               </abbrgrp>
            </p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>99</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>67</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>51</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>codon</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B61">61</abbr>
               </abbrgrp>
            </p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>97</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>degen</p>
         </c>
         <c ca="left">
            <p>83</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>93</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>partitioned</p>
         </c>
         <c ca="left">
            <p>
               <abbrgrp>
                  <abbr bid="B59">59</abbr>
               </abbrgrp>
            </p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>99</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>97</p>
         </c>
         <c ca="left">
            <p>67</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>51</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>aa</p>
         </c>
         <c ca="left">
            <p>75</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>[&lt; 50]</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>89</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>94</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
         <c ca="left">
            <p>100</p>
         </c>
         <c ca="left">
            <p>&lt; 50</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>Square brackets indicate support values for clades that were not present in the ML tree.</p>
   </tblfn></tbl>
<p>Gracillariinae was polyphyletic in all analyses conducted (data sets A through D), and the position of Phyllocnistinae remains unclear, as it was ancestral to the Lithocolletinae + <it>Leucanthiza </it>in data set D, but was sister to the <it>Parectopa </it>group in data sets A through C for degen1 and amino acids. For nt123 and codon analyses, the Phyllocnistinae was sister to the <it>Parectopa </it>group (nt123, data sets A, C), sister to <it>Dendrorcyter </it>+ <it>Marmara </it>(nt123, data set D; codon, data sets C, D), or <it>Eumetriochroa </it>(nt123, data set C, codon, data sets A, B). Partitioned analyses ("noLRall1 + nt2" versus "LRall1 + nt3") gave the same placement for the Phyllocnistinae as did nt123 in all four data sets.</p>
</sec>
<sec>
<st>
<p>Agreement and conflict among individual genes</p>
</st>
<p>There were no strongly supported groups that conflicted with each other across genes, and few nodes above the subfamily level were moderately or strongly supported by any one gene alone. Instances of strong bootstrap support by only one gene were: 83% for Gracillariidae (CAD), 96% for the <it>Acrocercops </it>group (CAD), and 82% for <it>Eumetriochroa </it>+ Phyllocnistinae (<it>Period</it>; Additional file <supplr sid="S2">2</supplr>).</p>
<suppl id="S2">
<title>
<p>Additional file 2</p>
</title>
<text>
<p>
<b>Single gene bootstrap values for all nodes in the nt123 tree of data set B</b>. Shaded boxes are those with &gt; 80% bootstrap support. "ALL" refers to dataset B (all genes included). See Additional file <supplr sid="S1">1</supplr> for taxon code names.</p>
</text>
<file name="1471-2148-11-182-S2.XLS">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Base compositional heterogeneity</p>
</st>
<p>Results of the chi-square tests for compositional heterogeneity are shown in Table <tblr tid="T3">3</tblr>. Homogeneity was not rejected for any group in the noLRall1 + nt2 character set. In contrast, nt3 showed highly significant heterogeneity across all taxa and the five taxon subsets. As a gauge of the possible misleading signal produced by compositional heterogeneity, we calculated neighbor-joining trees on distances reflecting only composition for nt123 and nt3. In these trees, Bucculatricidae clustered with five other gracillarioid and non-gracillarioid taxa that are together separated by long internal branches from the Tineidae and the remaining species in the tree (Additional file <supplr sid="S3">3</supplr>).</p>
<tbl id="T3"><title><p>Table 3</p></title><caption><p>Results of Chi-square tests on nucleotide compositional homogeneity.</p></caption><tblbdy cols="3">
      <r>
         <c>
            <p/>
         </c>
         <c ca="center" cspan="2">
            <p>
               <b><it>P </it>value for character set</b>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="2">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Taxon (number of species)</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>noLRall1 + nt2</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>nt3</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>All (27)</p>
         </c>
         <c ca="center">
            <p>> 0.999</p>
         </c>
         <c ca="center">
            <p>&lt; 0.001</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Gracillariidae (11)</p>
         </c>
         <c ca="center">
            <p>> 0.999</p>
         </c>
         <c ca="center">
            <p>&lt; 0.001</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Oecophyllembiinae <it>sensu </it>Kumata + Phyllocnistinae (3)</p>
         </c>
         <c ca="center">
            <p>0.969</p>
         </c>
         <c ca="center">
            <p>&lt; 0.001</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Bucculatricidae + Tineidae (3)</p>
         </c>
         <c ca="center">
            <p>0.953</p>
         </c>
         <c ca="center">
            <p>&lt; 0.001</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Bucculatricidae + Outgroups + <it>Klimeschia </it>- Tineidae (10)</p>
         </c>
         <c ca="center">
            <p>>0.999</p>
         </c>
         <c ca="center">
            <p>&lt; 0.001</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Outgroups + <it>Klimeschia </it>- Tineidae (9)</p>
         </c>
         <c ca="center">
            <p>> 0.999</p>
         </c>
         <c ca="center">
            <p>&lt; 0.001</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Total number of characters</p>
         </c>
         <c ca="center">
            <p>8701</p>
         </c>
         <c ca="center">
            <p>4937</p>
         </c>
      </r>
   </tblbdy></tbl>
<suppl id="S3">
<title>
<p>Additional file 3</p>
</title>
<text>
<p>
<b>Comparison of Euclidean compositional distance (NJ), GTR ML distance (NJ), and ML trees for nt123 and nt3</b>. Arrows indicate a long internal branch in the Euclidean compositional distance trees.</p>
</text>
<file name="1471-2148-11-182-S3.EPS">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Relationships of Gracillariidae and Gracillarioidea</p>
</st>
<p>All analyses resulted in a polyphyletic Gracillarioidea <it>sensu </it>Davis and Robinson <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>; specifically, Douglasiidae was consistently separated from Bucculatricidae, Gracillariidae, and Roeslerstammiidae by two or more nodes. Monophyly of the superfamily was significantly rejected at <it>P </it>&#8804; 0.015 in six of eight AU tests (Table <tblr tid="T4">4</tblr>). In all analyses, branch support for the monophyly of Gracillariidae was robust (&#8805; 97% BP, Table <tblr tid="T2">2</tblr>). In general, nt123, codon and nt123 partitioned results were similar in topology and branch support, while degen1 and amino acids were similar to each other, but differed from the other results in topology. For data set D (57 taxa &#215; 21 genes), degen1 resulted in a monophyletic 'G.B.R.Y.' clade with strong bootstrap support (83%, Figure <figr fid="F2">2</figr>). The G.B.R.Y. clade was never recovered in nt123, codon and nt123 partitioned ML trees, but the bootstrap consensus from these analyses supported the G.B.R.Y. clade with weak, but evident signal (up to 62% BP, Table <tblr tid="T2">2</tblr>). The latter three methods resulted in Bucculatricidae diverging before all taxa except the designated outgroup, Tineidae (e.g. Additional files <supplr sid="S4">4</supplr>, <supplr sid="S5">5</supplr>, <supplr sid="S6">6</supplr>).</p>
<tbl id="T4"><title><p>Table 4</p></title><caption><p>Results of Approximately Unbiased (AU) significance tests <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> for non-monophyly of predicted clades for data sets C and D.</p></caption><tblbdy cols="5">
      <r>
         <c>
            <p/>
         </c>
         <c ca="center" cspan="4">
            <p>
               <b><it>P </it>values: data sets C/D</b>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Predicted clade</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>nt123</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>codon</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>degen</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>AA</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Gracillarioidea <it>sensu </it>Davis &amp; Robinson</p>
         </c>
         <c ca="center">
            <p>
               <b>0.015/0.002</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.007/0.006</b>
            </p>
         </c>
         <c ca="center">
            <p>0.083/&lt;<b>0.001</b></p>
         </c>
         <c ca="center">
            <p>0.081/<b>0.008</b></p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Gracillariinae + Lithocolletinae</p>
         </c>
         <c ca="center">
            <p>
               <b>0.011/0.002</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.013/0.011</b>
            </p>
         </c>
         <c ca="center">
            <p>0.455/0.161</p>
         </c>
         <c ca="center">
            <p>0.152/0.119</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Gracillariinae</p>
         </c>
         <c ca="center">
            <p>
               <b>&lt; 0.001/&lt; 0.001</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>&lt; 0.001/&lt; 0.001</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>&lt; 0.001/&lt; 0.001</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>&lt; 0.001/&lt; 0.001</b>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Gracillariinae minus <it>Leucanthiza</it></p>
         </c>
         <c ca="center">
            <p>
               <b>0.015/0.001</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.021/&lt; 0.001</b>
            </p>
         </c>
         <c ca="center">
            <p>0.107/0.084</p>
         </c>
         <c ca="center">
            <p>0.104/0.202</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Oecophyllembiinae <it>sensu </it>Kumata + Phyllocnistinae</p>
         </c>
         <c ca="center">
            <p>0.467/0.165</p>
         </c>
         <c ca="center">
            <p>0.385/0.739</p>
         </c>
         <c ca="center">
            <p>0.339/0.352</p>
         </c>
         <c ca="center">
            <p>0.472/0.073</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>nt123, all nucleotides; Codon, codon model; degen, degeneracy1; AA, amino acids. Groups that were significant at alpha = 0.05 are shown in bold.</p>
   </tblfn></tbl>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Maximum likelihood degen1 tree for data set D</p></caption><text>
   <p><b>Maximum likelihood degen1 tree for data set D</b>. Large numbers denote six major clades in Gracillariidae (see Results). Asterisks indicate taxa sequenced for 21 genes. Hyphens denote support values &lt; 50%. Square brackets, shown only for nodes with support > 50% that conflict with the nt123 ML tree, denote groupings not present in the ML tree for that analysis. Green branches lead to taxa placed in Gracillariinae. Morphological and behavioral traits that are characteristic of each group are also noted.</p>
</text><graphic file="1471-2148-11-182-2" hint_layout="double"/></fig>
<suppl id="S4">
<title>
<p>Additional file 4</p>
</title>
<text>
<p>
<b>Maximum likelihood nt123 trees for data sets A through D</b>. Scale bar = 0.07 substitutions/site.</p>
</text>
<file name="1471-2148-11-182-S4.EPS">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S5">
<title>
<p>Additional file 5</p>
</title>
<text>
<p>
<b>Maximum likelihood trees based on a codon model</b>. Scale bar = 0.03 substitutions/site.</p>
</text>
<file name="1471-2148-11-182-S5.EPS">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S6">
<title>
<p>Additional file 6</p>
</title>
<text>
<p>
<b>Maximum likelihood trees based on a partitioned model</b>. Scale bar = 0.2 substitutions/site.</p>
</text>
<file name="1471-2148-11-182-S6.EPS">
   <p>Click here for file</p>
</file>
</suppl>
<p>While there was some support for the G.B.R.Y. clade, none of the analyses provided &gt; 55% BP for any inter-family relationships within this clade. To assess the possible sister group of Gracillariidae, we examined whether the ML trees consistently recovered Gracillariidae plus any combination of Bucculatricidae, Roeslerstammiidae, or Yponomeutidae. ML analyses recovered a 'G.R.Y.' clade eight times, a 'G.Y.' clade five times, and a 'G.B.Y.' clade two times (Table <tblr tid="T2">2</tblr>). No 'G.B.', 'G.R.', or 'G.B.R.' clades were found in any of the best ML trees.</p>
<p>Within Gracillariidae, there was strong support for 28 of the 36 total nodes (78%) with data set D, and nearly every species was placed into one of six lineages that are either monotypic or strongly supported in our sampling. These lineages, labeled on Figure <figr fid="F2">2</figr>, include: 1) Lithocolletinae + <it>Leucanthiza</it>, 2) Phyllocnistinae (excluding <it>Eumetriochroa</it>), and four clades within Gracillariinae: 3) <it>Acrocercops </it>group, 4) <it>Dendrorycter </it>+ <it>Marmara</it>; 5) <it>Gracillaria </it>group, and 6) <it>Parectopa </it>group. Kumata compared the morphology of these groups and recognized that each has unique features absent in others (e.g. <abbrgrp>
<abbr bid="B69">69</abbr>
<abbr bid="B70">70</abbr>
<abbr bid="B71">71</abbr>
<abbr bid="B72">72</abbr>
</abbrgrp>, some noted in Figure <figr fid="F2">2</figr>).</p>
<p>Branch support within these six gracillariid clades was strong. There was &gt; 90% BP for eight of the eleven nodes within Lithocolletinae + <it>Leucanthiza</it>, three of five nodes within the <it>Acrocercops </it>group, five of six nodes in the <it>Parectopa </it>group, and all nodes in the <it>Gracillaria </it>group. Monophyly of the subfamily Gracillariinae Stainton 1854, as previously defined, was not recovered, and statistically rejected in all eight AU tests (<it>P </it>&lt; 0.001, Table <tblr tid="T4">4</tblr>). Rejection of gracillariine monophyly was also evident even when <it>Leucanthiza </it>was excluded from Gracillariinae in the AU tests (see Gracillariinae minus <it>Leucanthiza</it>, Table <tblr tid="T4">4</tblr>). Monophyly of Gracillariinae + Lithocolletinae, as previously proposed by Kuznetzov and Stekol'nikov <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp> was rejected by nt123 and codon model analyses (<it>P </it>&lt; 0.05). Monophyly cannot be rejected for the sister-group relationship of the Oecophyllembiinae (<it>sensu </it>Kumata) + Phyllocnistinae, however, as <it>P </it>&gt; 0.073 under the AU test in all cases (Table <tblr tid="T4">4</tblr>). While we do not formally designate any new taxonomic names in this study, we recognize these well-supported clades as the first step toward a phylogenetic reclassification of Gracillariidae.</p>
</sec>
</sec>
<sec>
<st>
<p>Discussion</p>
</st>
<sec>
<st>
<p>Augmentation of sequence data and its effect on branch support</p>
</st>
<p>Partial augmentation of gene sampling can improve estimates of deep relationships of Gracillariidae (comparison of data set C and D), as it increased support for the G.B.R.Y. clade for all five character treatments, most strikingly for degen1 (a BP increase from &lt; 50% to 83%). While partial or full augmentation of gene sampling generally improved branch support for deep nodes, other nodes at the superfamily level are not robustly supported even with &gt; 14 kb of sequence data (Table <tblr tid="T2">2</tblr>). Short internal branches for deep divergences and the difficulty of achieving strong support for some nodes even with 21 genes may reflect a rapid radiation, which likely characterizes many divergences among lepidopteran families <abbrgrp>
<abbr bid="B21">21</abbr>
<abbr bid="B22">22</abbr>
<abbr bid="B73">73</abbr>
</abbrgrp> and other insect orders <abbrgrp>
<abbr bid="B74">74</abbr>
</abbrgrp>.</p>
<p>The amount of missing data in data set D, accounting for roughly a quarter of the total matrix, does not apparently induce phylogenetic artifacts of the kind postulated by Lemmon et al. <abbrgrp>
<abbr bid="B75">75</abbr>
</abbrgrp>. For degen1, the problematic Bucculatricidae is left out of the G.B.R.Y. clade in the ML tree from non-augmented data set C (Figure <figr fid="F3">3C</figr>), but partially augmented data set D pulls it into the G.B.R.Y. clade. We favor the topology from the partially augmented data set D over the non-augmented data set C, as the close relationship of Bucculatricidae to Gracillariidae corroborates morphological <abbrgrp>
<abbr bid="B24">24</abbr>
<abbr bid="B26">26</abbr>
</abbrgrp> and molecular studies <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>.</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>ML trees based on non-synonymous differences only (degen1) of data sets A through C</p></caption><text>
   <p><b>ML trees based on non-synonymous differences only (degen1) of data sets A through C</b>. Bucculatricidae + Gracillariidae + Roeslerstammiidae + Yponomeutidae form a monophyletic group for data sets A and B. Scale bar = 0.02 substitutions/site.</p>
</text><graphic file="1471-2148-11-182-3" hint_layout="double"/></fig>
<p>A comparison of data sets A and B reveals that increasing the number of genes from 10 to 21 can improve bootstrap support values, a result consistent with other studies (e.g. <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B5">5</abbr>
</abbrgrp>). In contrast, increasing the number of taxa (comparison of data set A and C) depressed branch support for higher groups (e.g. G.B.R.Y. clade). Therefore, if the goal is to achieve strong branch support for deep divergences, it may be favorable to focus on sequencing many genes for fewer taxa. However, sequencing many characters for few taxa can lead to problems such as long-branch attraction <abbrgrp>
<abbr bid="B76">76</abbr>
<abbr bid="B77">77</abbr>
</abbrgrp> and produce relationships that are misleading (e.g. <abbrgrp>
<abbr bid="B78">78</abbr>
<abbr bid="B79">79</abbr>
<abbr bid="B80">80</abbr>
<abbr bid="B81">81</abbr>
</abbrgrp>). Thus, if the researcher is faced with limited resources and seeks to improve support for deep divergences without introducing misleading artifacts, a solution may be to partially augment more characters for a selected but broad taxonomic sample. In our study, a deep divergence that was initially weak in support (the G.B.R.Y. clade) now has stronger branch support and the relationships are largely consistent with prior morphological hypotheses. While we have chosen a broad taxonomic sample to augment additional characters, future studies should examine how the choice of taxa for augmentation can influence the support for the correct tree.</p>
</sec>
<sec>
<st>
<p>Synonymous changes and base compositional heterogeneity</p>
</st>
<p>Overall, two non-synonymous characters sets, degen1 and amino acids, provided the highest support for deep-level relationships in Gracillarioidea. These analyses resulted in a monophyletic G.B.R.Y. clade, for which support in some cases was very robust. In contrast, nt123, the codon model and nt123 partitioned analyses provided little or no support for deep relationships among gracillarioid families. When synonymous sites are added, only weak signal (&#8804; 62% BP) remains for the G.B.R.Y. clade. We speculate that analyses that include synonymous changes, even when that signal is down-weighted or modeled separately, do not effectively correct for the strong compositional heterogeneity found at nt3, leaving that non-phylogenetic signal to conflict with and obscure the true signal from non-synonymous change.</p>
<p>Strong compositional heterogeneity can incorrectly group unrelated taxa together <abbrgrp>
<abbr bid="B56">56</abbr>
<abbr bid="B57">57</abbr>
</abbrgrp>, or equivalently, widely separate a taxon with strongly divergent base composition from its true relatives. Under the three character treatments, nt123, codon model, and nt123 partitioned, <it>Bucculatrix </it>(Bucculatricidae) was placed between the Tineidae and the remaining taxa (Additional files <supplr sid="S4">4</supplr>, <supplr sid="S5">5</supplr>, <supplr sid="S6">6</supplr>), a position difficult to reconcile with morphology. Compositional heterogeneity may account for the strikingly different placement of Bucculatricidae. Because non-synonymous changes strongly support the monophyly of the G.B.R.Y. clade, synonymous changes, mostly at nt3, must account for the less decisive placement of Bucculatricidae in nearly all nt123 trees. These results support our previous findings (e.g. <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>) that filtering synonymous substitutions (and thereby compositional heterogeneity) can result in more robust phylogenetic inference at deep levels.</p>
<p>A comparison of the ML topology with the neighbor-joining GTR ML distance and with Euclidean compositional distance trees for nt123 and nt3 suggests that the uncertain placement of Bucculatricidae in the nt123 data set is largely due to nt3 (Additional file <supplr sid="S3">3</supplr>). In the compositional distance trees, six taxa (<it>Bucculatrix sp., Atteva punctella</it>, <it>Eumetriochroa hederae</it>, <it>Hemerophila felis</it>, <it>Phyllocnistis citrella</it>, and <it>P. magnoliella</it>) fall between the Tineidae and the remaining taxa in a long internal branch. In the nt123 ML tree, in contrast, all taxa but <it>Bucculatrix </it>move to parts of the tree that are generally well supported and expected based on morphology and existing classifications (e.g. <abbrgrp>
<abbr bid="B11">11</abbr>
<abbr bid="B35">35</abbr>
<abbr bid="B82">82</abbr>
</abbrgrp>, <it>Eumetriochroa </it>with <it>Phyllocnistis</it>, and <it>Atteva </it>with <it>Eucalantica</it>).</p>
<p>Results of the ML nt3 analysis are very different, providing further evidence that compositional heterogeneity can affect trees based on nt3 alone. Despite providing about 90% of the total character change, the nt3 character set alone yields &gt; 50% BP for only 7 nodes as compared to the full data set (nt123; 15 supported nodes), fewer even than the degen1 character set (13 supported nodes). Some unexpected relationships are found, such as <it>Bucculatrix </it>+ <it>Eumetriochroa</it>, which break up well-supported groups, in this case the monophyletic Gracillariidae (Additional file <supplr sid="S3">3</supplr>, F).</p>
</sec>
<sec>
<st>
<p>Phylogenetic relationships of Gracillariidae and Gracillarioidea</p>
</st>
<p>Our results provide some of the first molecular evidence on the composition of and relationships within Gracillariidae and Gracillarioidea <it>sensu </it>Davis and Robinson <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>. Some previous hypotheses about those relationships were confirmed, and several novel ones proposed. Because rate variation between synonymous and non-synonymous sites was dramatic in the present study (see Table <tblr tid="T3">3</tblr>), we focus our discussion on the degen1 analyses unless otherwise noted. The discussion focuses on the degen1 tree from data set D (Figure <figr fid="F2">2</figr>) because it includes the most number of taxa and characters.</p>
<p>Gracillarioidea <it>sensu </it>Davis and Robinson <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>, i.e. including Douglasiidae, was polyphyletic in all analyses, and monophyly of the superfamily was rejected significantly in six of eight AU tests (Table <tblr tid="T4">4</tblr>). Recently, Mutanen et al. <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp> came to a similar conclusion based on fewer gracillarioid taxa and genes. In their analyses, Gracillarioidea were never monophyletic, and Douglasiidae was consistently placed in Apoditrysia. Mutanen et al. <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp> also had difficulty in placing Bucculatricidae, which was paraphyletic with respect to <it>Tritymba </it>(Plutellidae), and Bucculatricidae + <it>Tritymba </it>was sister to Gracillariidae with weak (&lt; 50%) ML bootstrap support. Based on our study and Mutanen et al. <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>, Gracillarioidea as currently defined, will need to be reevaluated. Future studies should also sequence the genes included in the present study for <it>Ogmograptis </it>(Bucculatricidae), a genus that could not be obtained.</p>
<p>To resolve a possible sister group of Gracillariidae remains a challenging problem. Comparing ML trees from all analyses, Gracillariidae was most often closely related to Roeslerstammiidae + Yponomeutidae (the latter relationship which is congruent with morphology <abbrgrp>
<abbr bid="B83">83</abbr>
<abbr bid="B84">84</abbr>
</abbrgrp>). The close relationship of Yponomeutidae to Gracillarioidea (excluding Douglasiidae) is also consistent with other molecular studies <abbrgrp>
<abbr bid="B7">7</abbr>
<abbr bid="B21">21</abbr>
<abbr bid="B22">22</abbr>
</abbrgrp>. These reports suggest, at least tentatively, that the putative morphological apomorphies proposed for Gracillarioidea by Davis and Robinson <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp> may be homoplasies. In order to restore monophyly of the superfamily, we would need to exclude Douglasiidae from Gracillarioidea or include Yponomeutidae. However, more convincing resolution of inter-family relationships is desirable before any formal taxonomic changes are made.</p>
<p>Monophyly of Gracillariidae was strongly supported in nearly all analyses, a relationship that is corroborated by at least two morphological synapomorphies <abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>. The grouping of Oecophyllembiinae (<it>sensu </it>Kumata) + Phyllocnistinae, which share unique serpentine mine morphology <abbrgrp>
<abbr bid="B11">11</abbr>
<abbr bid="B66">66</abbr>
</abbrgrp> and a highly specialized spinning instar <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>, was supported weakly or not at all in our analyses. However, this sister-group relationship could not be rejected by any AU test, and this grouping was well supported by <it>Period </it>(82% BP, Additional file <supplr sid="S2">2</supplr>). The sister-group relationship of Gracillariinae to Lithocolletinae proposed by Kuznetzov and Stekol'nikov <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp> was rejected by four of eight AU tests (Table <tblr tid="T4">4</tblr>). Our results strongly support the inclusion of <it>Leucanthiza </it>in Lithocolletinae, suggesting that this genus should be transferred from Gracillariinae, a conclusion that is also supported by larval morphology (Wagner and Davis unpubl. data). Monophyly of Gracillariinae (both with and without <it>Leucanthiza</it>) was rejected by the AU test in more than half of the data sets, suggesting that this subfamily needs to be revised. However, we did identify four genus-level groups with strong support within Gracillariinae: <it>Acrocercops</it>, <it>Gracillaria</it>, <it>Parectopa </it>groups and <it>Dendrorycter + Marmara</it>, all which were previously postulated based on morphology and life-history comparisons <abbrgrp>
<abbr bid="B18">18</abbr>
<abbr bid="B69">69</abbr>
<abbr bid="B70">70</abbr>
<abbr bid="B71">71</abbr>
<abbr bid="B72">72</abbr>
</abbrgrp>.</p>
</sec>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>Several main conclusions can be drawn from this study. First, branch support was maximized when gene sampling was increased, especially when we analyzed only the non-synonymous changes. Second, augmenting a taxon-rich data set (data set C; 57 taxa &#215; 10 genes) with additional sequence data for approximately half the taxa substantially increased deep node support in a resource-efficient manner, apparently without inducing phylogenetic artifacts due to large blocks of missing data. While these two conclusions were drawn specifically from the data sets in this study, they are congruent with the results of Cho et al. <abbrgrp>
<abbr bid="B7">7</abbr>
</abbrgrp>. Third, Gracillariidae were monophyletic in all our analyses, and nearly all species can be placed into one of six strongly supported clades, though relationships among them remain largely unclear. Fourth, Gracillarioidea, as defined by Davis and Robinson <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>, clearly do not include Douglasiidae, and changes to the classification will be required. Fifth, the difficulty in placing Bucculatricidae is probably attributable to base compositional heterogeneity at nt3. From our tests for compositional heterogeneity and strong bootstrap values obtained when synonymous changes are excluded, we tentatively conclude that Bucculatricidae is closely related to Gracillariidae + Roeslerstammiidae + Yponomeutidae. Finally, the different levels of branch support seen under our different character treatments reinforce the importance of assessing confidence in groups under multiple phylogenetic approaches. Factors such as compositional heterogeneity, which can influence phylogenetic accuracy, are most easily assessed when data are partitioned into largely non-synonymous and mostly synonymous character sets. Branch support overall is strongest when all changes are included, but for several deep divergences, strong support is obtained only when synonymous changes were excluded. Because branch support for many deep splits was weak, we are exploring whether greater branch support for gracillariids and relatives can be achieved by means of genomic (next-generation) sequencing -- the focus of a future project.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>AYK carried out the RT-PCR experiments, sequence alignment, phylogenetic analyses, and drafted the manuscript. IO and AK conducted PCR work on two genes, H3 and EF-1&#945;. JCR and CM provided funds for conducting the molecular work and helped design the study. MPC provided phylogenetic programs and hardware through the Lattice Project. AK, DRD, DLW, IO, JDP, and CLV collected valuable specimens necessary for the project. All authors read and approved the final manuscript.</p>
<suppl id="S7">
<title>
<p>Additional file 7</p>
</title>
<text>
<p>
<b>Maximum likelihood trees based on inferred amino acids</b>. Scale bar = 0.03 substitutions/site.</p>
</text>
<file name="1471-2148-11-182-S7.EPS">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>We thank Andreas Zwick (State Museum of Natural History, Stuttgart) for continuous support, critical comments and advice, and Tosio Kumata (Hokkaido University) for many insights into the systematics of Gracillariidae. Zaile Du and Anamika Verma assisted with sequencing. Adam Bazinet (University of Maryland) was instrumental in helping access and utilize The Lattice Grid. Joaquin Baixeras (University of Valencia), Soowon Cho (Chungbuk National University), Terry Harrison (University of Illinois), Kenji Nishida (University of Costa Rica), Jae-Cheon Sohn (University of Maryland), and Kazuhiro Sugisima (Okayama City, Japan) provided some important taxa used in this study. We also thank Rodolphe Rougerie (Canadian Centre for DNA Barcoding) for sending valuable extracts, and Kim Mitter (University of Maryland) for assisting with the preparation of samples. Jean-Fran&#231;ois Landry (Canadian National Collection) helped clarify taxonomic inconsistencies related to some taxa in the study. Julie Byrd (University of Maryland) and two anonymous reviewers provided comments that improved the quality of the manuscript. The U.S. National Science Foundation awards 0531626, 0531769, and 0755048 provided funding for this project. Financial support also came from the Belgian Science Policy Research Programmes for the Federal Scientific Institutions, project number MO/37/022; The Research Fellowships of the Japan Society for the Promotion of Science (JSPS) for Young Scientists (award 20-5555); and a Grant-in Aid from JSPS (20870021).</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Genome-scale approaches to resolving incongruence in molecular phylogenies</p></title><aug><au><snm>Rokas</snm><fnm>A</fnm></au><au><snm>Williams</snm><fnm>BL</fnm></au><au><snm>King</snm><fnm>N</fnm></au><au><snm>Carroll</snm><fnm>SB</fnm></au></aug><source>Nature</source><pubdate>2003</pubdate><volume>425</volume><fpage>798</fpage><lpage>804</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature02053</pubid><pubid idtype="pmpid" link="fulltext">14574403</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>The analysis of 100 genes supports the grouping of three highly divergent amoebae: <it>Dictyostelium</it>, <it>Entamoeba</it>, and <it>Mastigamoeba</it></p></title><aug><au><snm>Bapteste</snm><fnm>E</fnm></au><au><snm>Brinkmann</snm><fnm>H</fnm></au><au><snm>Lee</snm><fnm>JA</fnm></au><au><snm>Moore</snm><fnm>DV</fnm></au><au><snm>Sensen</snm><fnm>CW</fnm></au><au><snm>Gordon</snm><fnm>P</fnm></au><au><snm>Durufl&#233;</snm><fnm>L</fnm></au><au><snm>Gaasterland</snm><fnm>T</fnm></au><au><snm>Lopez</snm><fnm>P</fnm></au><au><snm>M&#252;ller</snm><fnm>M</fnm></au><etal/></aug><source>Proceedings of the National Academy of Sciences, USA</source><pubdate>2002</pubdate><volume>99</volume><fpage>1414</fpage><lpage>1419</lpage><xrefbib><pubid idtype="doi">10.1073/pnas.032662799</pubid></xrefbib></bibl><bibl id="B3"><title><p>Branch lengths, support, and congruence: testing the phylogenomic approach with 20 nuclear loci in snakes</p></title><aug><au><snm>Wiens</snm><fnm>JJ</fnm></au><au><snm>Kuczynski</snm><fnm>CA</fnm></au><au><snm>Smith</snm><fnm>SA</fnm></au><au><snm>Mulcahy</snm><fnm>DG</fnm></au><au><snm>Sites</snm><fnm>JW</fnm><suf>Jr</suf></au><au><snm>Townsend</snm><fnm>TM</fnm></au><au><snm>Reeder</snm><fnm>TW</fnm></au></aug><source>Systematic Biology</source><pubdate>2008</pubdate><volume>57</volume><issue>3</issue><fpage>420</fpage><lpage>431</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1080/10635150802166053</pubid><pubid idtype="pmpid" link="fulltext">18570036</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Increased gene sampling yields robust support for higher-level clades within Bombycoidea (Lepidoptera)</p></title><aug><au><snm>Zwick</snm><fnm>A</fnm></au><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Mitter</snm><fnm>C</fnm></au><au><snm>Cummings</snm><fnm>MP</fnm></au></aug><source>Systematic Entomology</source><pubdate>2011</pubdate><volume>36</volume><fpage>31</fpage><lpage>43</lpage><xrefbib><pubid idtype="doi">10.1111/j.1365-3113.2010.00543.x</pubid></xrefbib></bibl><bibl id="B5"><title><p>Broad phylogenomic sampling improves resolution of the animal tree of life</p></title><aug><au><snm>Dunn</snm><fnm>CW</fnm></au><au><snm>Hejnol</snm><fnm>A</fnm></au><au><snm>Matus</snm><fnm>DQ</fnm></au><au><snm>Pang</snm><fnm>K</fnm></au><au><snm>Browne</snm><fnm>WE</fnm></au><au><snm>Smith</snm><fnm>SA</fnm></au><au><snm>Seaver</snm><fnm>E</fnm></au><au><snm>Rouse</snm><fnm>GW</fnm></au><au><snm>Obst</snm><fnm>M</fnm></au><au><snm>Edgecombe</snm><fnm>GD</fnm></au><etal/></aug><source>Nature</source><pubdate>2008</pubdate><volume>452</volume><fpage>745</fpage><lpage>749</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature06614</pubid><pubid idtype="pmpid" link="fulltext">18322464</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms</p></title><aug><au><snm>Burleigh</snm><fnm>JG</fnm></au><au><snm>Hilu</snm><fnm>KW</fnm></au><au><snm>Soltis</snm><fnm>DE</fnm></au></aug><source>BMC Evolutionary Biology</source><pubdate>2009</pubdate><volume>9</volume><fpage>61</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2148-9-61</pubid><pubid idtype="pmcid">2674047</pubid><pubid idtype="pmpid" link="fulltext">19292928</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Deliberately unequal gene sampling: boon or bane for phylogenetics of Lepidoptera (Hexapoda)?</p></title><aug><au><snm>Cho</snm><fnm>S</fnm></au><au><snm>Zwick</snm><fnm>A</fnm></au><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Mitter</snm><fnm>C</fnm></au><au><snm>Cummings</snm><fnm>MP</fnm></au><au><snm>Yao</snm><fnm>J</fnm></au><au><snm>Du</snm><fnm>Z</fnm></au><au><snm>Zhao</snm><fnm>H</fnm></au><au><snm>Kawahara</snm><fnm>AY</fnm></au><au><snm>Weller</snm><fnm>SJ</fnm></au><etal/></aug><source>Systematic Biology</source><pubdate>2011</pubdate></bibl><bibl id="B8"><title><p>Ninety-seven million years of angiosperm-insect association: paleobiological insights into the meaning of coevolution</p></title><aug><au><snm>Labandeira</snm><fnm>CC</fnm></au><au><snm>Dilcher</snm><fnm>DL</fnm></au><au><snm>Davis</snm><fnm>DR</fnm></au><au><snm>Wagner</snm><fnm>DL</fnm></au></aug><source>Proceedings of the National Academy of Sciences, USA</source><pubdate>1994</pubdate><volume>91</volume><fpage>12278</fpage><lpage>12282</lpage><xrefbib><pubid idtype="doi">10.1073/pnas.91.25.12278</pubid></xrefbib></bibl><bibl id="B9"><title><p>Global Taxonomic Database of Gracillariidae (Lepidoptera)</p></title><aug><au><snm>De Prins</snm><fnm>J</fnm></au><au><snm>De Prins</snm><fnm>W</fnm></au></aug><publisher>World Wide Web electronic publication</publisher><url>http://www.gracillariidae.net</url><note>accessed 1 Feb 2011</note></bibl><bibl id="B10"><title><p>Order Lepidoptera Linnaeus, 1758</p></title><aug><au><snm>Nieukerken</snm><fnm>EJ</fnm></au><au><snm>Kaila</snm><fnm>L</fnm></au><au><snm>Kitching</snm><fnm>IJ</fnm></au><au><snm>Kristensen</snm><fnm>NP</fnm></au><au><snm>Lees</snm><fnm>DC</fnm></au><au><snm>Minet</snm><fnm>J</fnm></au><au><snm>Mitter</snm><fnm>C</fnm></au><au><snm>Mutanen</snm><fnm>M</fnm></au><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Simonsen</snm><fnm>TJ</fnm></au><etal/></aug><source>Zootaxa</source><publisher>Animal biodiversity: An outline of higher-level classification and survey of taxonomic richness</publisher><editor>Zhang ZQ</editor><pubdate>2011</pubdate><inpress/></bibl><bibl id="B11"><title><p>The Tineoidea and Gracillarioidea</p></title><aug><au><snm>Davis</snm><fnm>DR</fnm></au><au><snm>Robinson</snm><fnm>GS</fnm></au></aug><source>Lepidoptera, moths and butterflies 1: Evolution, Systematics and Biogeography</source><publisher>Berlin, New York: De Gruyter</publisher><editor>Kristensen NP</editor><edition>35</edition><pubdate>1998</pubdate><fpage>91</fpage><lpage>117</lpage><note>4. Arthropoda: Insecta, Part</note></bibl><bibl id="B12"><title><p>Forecasting <it>Cameraria ohridella </it>invasion dynamics in recently invaded countries: from validation to prediction</p></title><aug><au><snm>Gilbert</snm><fnm>M</fnm></au><au><snm>Guichard</snm><fnm>S</fnm></au><au><snm>Freise</snm><fnm>J</fnm></au><au><snm>Gr&#233;goire</snm><fnm>J-C</fnm></au><au><snm>Heitland</snm><fnm>W</fnm></au><au><snm>Straw</snm><fnm>N</fnm></au><au><snm>Tilbury</snm><fnm>C</fnm></au><au><snm>Augustin</snm><fnm>S</fnm></au></aug><source>Journal of Applied Ecology</source><pubdate>2005</pubdate><volume>45</volume><fpage>805</fpage><lpage>813</lpage></bibl><bibl id="B13"><title><p>Citrus leafminer, <it>Phyllocnistis citrella </it>Stainton (Lepidoptera: Gracillariidae: Phyllocnistinae)</p></title><aug><au><snm>Heppner</snm><fnm>JB</fnm></au></aug><source>Entomology Circular No 359, Florida Department of Agriculture &amp; Consumer Services</source><pubdate>1993</pubdate><fpage>1</fpage><lpage>2</lpage></bibl><bibl id="B14"><title><p><it>Conopomorpha cramerella </it>(Lepidoptera: Gracillariidae) in the Malay Archipelago: genetic signature of a bottlenecked population?</p></title><aug><au><snm>Shapiro</snm><fnm>LH</fnm></au><au><snm>Scheffer</snm><fnm>SJ</fnm></au><au><snm>Maisin</snm><fnm>N</fnm></au><au><snm>Lambert</snm><fnm>S</fnm></au><au><snm>Purung</snm><fnm>HB</fnm></au><au><snm>Sulistyowati</snm><fnm>E</fnm></au><au><snm>Vega</snm><fnm>FE</fnm></au><au><snm>Gende</snm><fnm>P</fnm></au><au><snm>Laup</snm><fnm>S</fnm></au><au><snm>Rosmana</snm><fnm>A</fnm></au><etal/></aug><source>Annals of the Entomological Society of America</source><pubdate>2008</pubdate><volume>101</volume><fpage>930</fpage><lpage>938</lpage><xrefbib><pubid idtype="doi">10.1603/0013-8746(2008)101[930:CCLGIT]2.0.CO;2</pubid></xrefbib></bibl><bibl id="B15"><title><p>Mitochondrial and microsatellite DNA markers reveal a Balkan origin for the highly invasive horse-chestnut leaf miner <it>Cameraria ohridella </it>(Lepidoptera, Gracillariidae)</p></title><aug><au><snm>Valade</snm><fnm>R</fnm></au><au><snm>Kenis</snm><fnm>M</fnm></au><au><snm>Hernandez-Lopez</snm><fnm>A</fnm></au><au><snm>Augustin</snm><fnm>S</fnm></au><au><snm>Mari Mena</snm><fnm>N</fnm></au><au><snm>Magnoux</snm><fnm>E</fnm></au><au><snm>Rougerie</snm><fnm>R</fnm></au><au><snm>Lakatos</snm><fnm>F</fnm></au><au><snm>Roques</snm><fnm>A</fnm></au><au><snm>Lopez-Vaamonde</snm><fnm>C</fnm></au></aug><source>Molecular Ecology</source><pubdate>2009</pubdate><volume>18</volume><fpage>3458</fpage><lpage>3470</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-294X.2009.04290.x</pubid><pubid idtype="pmpid" link="fulltext">19627490</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Biology and systematics of the New World <it>Phyllocnistis </it>leafminers of the avocado genus <it>Persea </it>(Lepidoptera: Gracillariidae)</p></title><aug><au><snm>Davis</snm><fnm>DR</fnm></au><au><snm>Wagner</snm><fnm>DL</fnm></au></aug><source>ZooKeys</source><pubdate>2011</pubdate><volume>97</volume><fpage>39</fpage><lpage>73</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3095128</pubid><pubid idtype="pmpid">21594066</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Gracillariidae</p></title><aug><au><snm>Davis</snm><fnm>DR</fnm></au></aug><source>Immature insects</source><publisher>Dubuque: Kendall/Hunt</publisher><editor>Stehr FW</editor><pubdate>1987</pubdate><volume>1</volume><fpage>372</fpage><lpage>374</lpage></bibl><bibl id="B18"><title><p>A hidden past: The hypermetamorphic development of <it>Marmara arbutiella </it>(Lepidoptera: Gracillariidae)</p></title><aug><au><snm>Wagner</snm><fnm>DL</fnm></au><au><snm>Loose</snm><fnm>JL</fnm></au><au><snm>Fitzgerald</snm><fnm>TD</fnm></au><au><snm>DeBenedictis</snm><fnm>JA</fnm></au><au><snm>Davis</snm><fnm>DR</fnm></au></aug><source>Annals of the Entomological Society of America</source><pubdate>2000</pubdate><volume>93</volume><fpage>59</fpage><lpage>64</lpage><xrefbib><pubid idtype="doi">10.1603/0013-8746(2000)093[0059:AHPTHD]2.0.CO;2</pubid></xrefbib></bibl><bibl id="B19"><title><p>Japanese species of the subfamily Oecophyllembiinae R&#233;al and Balachowsky (Lepidoptera: Gracillariidae), with descriptions of a new genus and eight new species</p></title><aug><au><snm>Kumata</snm><fnm>T</fnm></au></aug><source>Insecta Matsumurana, New Series</source><pubdate>1998</pubdate><volume>54</volume><fpage>77</fpage><lpage>131</lpage></bibl><bibl id="B20"><title><p>Serpentine miner <it>Marmara fraxinicola </it>(Lepidoptera: Gracillariidae) in stems of white ash</p></title><aug><au><snm>Fitzgerald</snm><fnm>TD</fnm></au><au><snm>Simeone</snm><fnm>JB</fnm></au></aug><source>Annals of the Entomological Society of America</source><pubdate>1971</pubdate><volume>64</volume><fpage>770</fpage><lpage>773</lpage></bibl><bibl id="B21"><title><p>Toward reconstructing the evolution of advanced moths and butterflies (Lepidoptera: Ditrysia): an initial molecular study</p></title><aug><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Zwick</snm><fnm>A</fnm></au><au><snm>Cummings</snm><fnm>MP</fnm></au><au><snm>Kawahara</snm><fnm>AY</fnm></au><au><snm>Cho</snm><fnm>S</fnm></au><au><snm>Weller</snm><fnm>S</fnm></au><au><snm>Roe</snm><fnm>A</fnm></au><au><snm>Baixeras</snm><fnm>J</fnm></au><au><snm>Brown</snm><fnm>JW</fnm></au><au><snm>Parr</snm><fnm>C</fnm></au><etal/></aug><source>BMC Evolutionary Biology</source><pubdate>2009</pubdate><volume>9</volume><fpage>280</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2148-9-280</pubid><pubid idtype="pmcid">2796670</pubid><pubid idtype="pmpid" link="fulltext">19954545</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Comprehensive gene and taxon coverage elucidates radiation patterns in moths and butterflies</p></title><aug><au><snm>Mutanen</snm><fnm>M</fnm></au><au><snm>Wahlberg</snm><fnm>N</fnm></au><au><snm>Kaila</snm><fnm>L</fnm></au></aug><source>Proceedings of the Royal Society of London, Series B</source><pubdate>2010</pubdate><volume>277</volume><fpage>2839</fpage><lpage>2848</lpage><xrefbib><pubid idtype="doi">10.1098/rspb.2010.0392</pubid></xrefbib></bibl><bibl id="B23"><title><p>Lepidoptera-the butterflies</p></title><aug><au><snm>Gerasimov</snm><fnm>AM</fnm></au></aug><source>Opredelitel Nasekomykh Evropeyskoy Chasti SSSR</source><pubdate>1948</pubdate><fpage>920</fpage><lpage>1094</lpage></bibl><bibl id="B24"><title><p>Functional morphology of the male genitalia and notes on the classification and phylogenetic relationships of mining moths of superfamily Gracillarioidea (Lepidoptera)</p></title><aug><au><snm>Kuznetzov</snm><fnm>VI</fnm></au><au><snm>Stekol&apos;nikov</snm><fnm>AA</fnm></au></aug><source>Entomological Review</source><pubdate>1987</pubdate><volume>66</volume><fpage>16</fpage><lpage>30</lpage></bibl><bibl id="B25"><title><p>Insects of Hawaii 9, Microlepidoptera, pt. 1</p></title><aug><au><snm>Zimmerman</snm><fnm>EC</fnm></au></aug><publisher>Honolulu: University of Hawaii Press</publisher><pubdate>1978</pubdate></bibl><bibl id="B26"><title><p>A phylogeny for the Tineoidea (Lepidoptera)</p></title><aug><au><snm>Robinson</snm><fnm>GS</fnm></au></aug><source>Entomologica Scandinavica</source><pubdate>1988</pubdate><volume>19</volume><fpage>117</fpage><lpage>129</lpage></bibl><bibl id="B27"><title><p>To the systematics and phylogeny of mining moths Gracillariidae, Bucculatricidae and Lyonetiidae (Lepidoptera) with consideration of functional and comparative morphology of male genitalia</p></title><aug><au><snm>Kuznetzov</snm><fnm>VI</fnm></au><au><snm>Kozlov</snm><fnm>MV</fnm></au><au><snm>Seksyaeva</snm><fnm>SV</fnm></au></aug><source>Trudy Zoologicheskogo Instituta, Akademija Nauk SSSR</source><pubdate>1988</pubdate><volume>176</volume><fpage>52</fpage><lpage>71</lpage></bibl><bibl id="B28"><title><p>Host race formation in the leaf-mining moth <it>Acrocercops transecta </it>(Lepidoptera: Gracillariidae)</p></title><aug><au><snm>Ohshima</snm><fnm>I</fnm></au></aug><source>Biological Journal of the Linnaean Society</source><pubdate>2008</pubdate><volume>93</volume><fpage>135</fpage><lpage>145</lpage></bibl><bibl id="B29"><title><p>Differential introgression causes genealogical discordance in host races of <it>Acrocercops transecta </it>(Insecta: Lepidoptera)</p></title><aug><au><snm>Ohshima</snm><fnm>I</fnm></au></aug><source>Molecular Ecology</source><pubdate>2010</pubdate><volume>19</volume><fpage>2106</fpage><lpage>2119</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-294X.2010.04624.x</pubid><pubid idtype="pmpid" link="fulltext">20374488</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Cospeciation analysis of an obligate pollination mutualism: have <it>Glochidion </it>trees (Euphorbiaceae) and pollinating <it>Epicephala </it>moths (Gracillariidae) diversified in parallel?</p></title><aug><au><snm>Kawakita</snm><fnm>A</fnm></au><au><snm>Takimura</snm><fnm>A</fnm></au><au><snm>Terachi</snm><fnm>T</fnm></au><au><snm>Sota</snm><fnm>T</fnm></au><au><snm>Kato</snm><fnm>M</fnm></au></aug><source>Evolution</source><pubdate>2004</pubdate><volume>10</volume><fpage>2201</fpage><lpage>2214</lpage></bibl><bibl id="B31"><title><p>Repeated independent evolution of obligate pollination mutualism in the Phyllantheae-<it>Epicephala </it>association</p></title><aug><au><snm>Kawakita</snm><fnm>A</fnm></au><au><snm>Kato</snm><fnm>M</fnm></au></aug><source>Proceedings of the Royal Society of London, Series B</source><pubdate>2009</pubdate><volume>276</volume><fpage>417</fpage><lpage>426</lpage><xrefbib><pubid idtype="doi">10.1098/rspb.2008.1226</pubid></xrefbib></bibl><bibl id="B32"><title><p>Mutualism favours higher host specificity than does antagonism in plant-herbivore interaction</p></title><aug><au><snm>Kawakita</snm><fnm>A</fnm></au><au><snm>Okamoto</snm><fnm>T</fnm></au><au><snm>Goto</snm><fnm>R</fnm></au><au><snm>Kato</snm><fnm>M</fnm></au></aug><source>Proceedings of the Royal Society of London, Series B</source><pubdate>2010</pubdate><volume>277</volume><issue>1695</issue><fpage>2765</fpage><lpage>2774</lpage><xrefbib><pubid idtype="doi">10.1098/rspb.2010.0355</pubid></xrefbib></bibl><bibl id="B33"><title><p>Evolutionary dynamics of host-plant use in a genus of leaf mining moth</p></title><aug><au><snm>Lopez-Vaamonde</snm><fnm>C</fnm></au><au><snm>Godfray</snm><fnm>C</fnm></au><au><snm>Cook</snm><fnm>JM</fnm></au></aug><source>Evolution</source><pubdate>2003</pubdate><volume>57</volume><fpage>1804</fpage><lpage>1821</lpage><xrefbib><pubid idtype="pmpid">14503622</pubid></xrefbib></bibl><bibl id="B34"><title><p>Fossil-calibrated molecular phylogenies reveal that leaf-mining moths radiated millions of years after their host plants</p></title><aug><au><snm>Lopez-Vaamonde</snm><fnm>C</fnm></au><au><snm>Wikstr&#246;m</snm><fnm>N</fnm></au><au><snm>Labandeira</snm><fnm>C</fnm></au><au><snm>Godfrey</snm><fnm>HCJ</fnm></au><au><snm>Goodman</snm><fnm>SJ</fnm></au><au><snm>Cook</snm><fnm>JM</fnm></au></aug><source>Journal of Evolutionary Biology</source><pubdate>2006</pubdate><volume>19</volume><fpage>1314</fpage><lpage>1326</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1420-9101.2005.01070.x</pubid><pubid idtype="pmpid" link="fulltext">16780532</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Classification and checklist of the species of Lepidoptera recorded in southern Africa</p></title><aug><au><snm>V&#225;ri</snm><fnm>L</fnm></au><au><snm>Kroon</snm><fnm>DM</fnm></au><au><snm>Kr&#252;ger</snm><fnm>M</fnm></au></aug><publisher>Chatswood: Simple Solutions</publisher><pubdate>2002</pubdate></bibl><bibl id="B36"><title><p>Resolving arthropod phylogeny: exploring phylogenetic signal within 41 kb of protein-coding nuclear gene sequence</p></title><aug><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Shultz</snm><fnm>JW</fnm></au><au><snm>Ganley</snm><fnm>ARD</fnm></au><au><snm>Hussey</snm><fnm>A</fnm></au><au><snm>Shi</snm><fnm>D</fnm></au><au><snm>Ball</snm><fnm>B</fnm></au><au><snm>Zwick</snm><fnm>A</fnm></au><au><snm>Stajich</snm><fnm>JE</fnm></au><au><snm>Cummings</snm><fnm>MP</fnm></au><au><snm>Martin</snm><fnm>JW</fnm></au><etal/></aug><source>Systematic Biology</source><pubdate>2008</pubdate><volume>57</volume><fpage>920</fpage><lpage>938</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1080/10635150802570791</pubid><pubid idtype="pmpid" link="fulltext">19085333</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>A highly conserved nuclear gene for low-level phylogenetics: elongation factor-1&#945; recovers morphology-based tree for heliothine moth</p></title><aug><au><snm>Cho</snm><fnm>S</fnm></au><au><snm>Mitchell</snm><fnm>A</fnm></au><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Mitter</snm><fnm>C</fnm></au><au><snm>Poole</snm><fnm>RW</fnm></au><au><snm>Friedlander</snm><fnm>TP</fnm></au><au><snm>Zhao</snm><fnm>S</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>1995</pubdate><volume>12</volume><fpage>650</fpage><lpage>656</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">7659020</pubid></xrefbib></bibl><bibl id="B38"><title><p>The problem with "the Paleoptera problem:" sense and sensitivity</p></title><aug><au><snm>Ogden</snm><fnm>TH</fnm></au><au><snm>Whiting</snm><fnm>MF</fnm></au></aug><source>Cladistics</source><pubdate>2003</pubdate><volume>19</volume><fpage>432</fpage><lpage>442</lpage></bibl><bibl id="B39"><title><p>Protocols, Concepts, and Reagents for preparing DNA sequencing templates</p></title><aug><au><snm>Regier</snm><fnm>JC</fnm></au></aug><url>http://www.umbi.umd.edu/users/jcrlab/PCR_primers.pdf</url><note>Version 12/4/08</note></bibl><bibl id="B40"><title><p>Assessment of the diversity and species specificity of the mutualistic association between <it>Epicephala </it>moths and <it>Glochidion </it>trees</p></title><aug><au><snm>Kawakita</snm><fnm>A</fnm></au><au><snm>Kato</snm><fnm>M</fnm></au></aug><source>Molecular Ecology</source><pubdate>2006</pubdate><volume>15</volume><fpage>3567</fpage><lpage>3581</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-294X.2006.03037.x</pubid><pubid idtype="pmpid" link="fulltext">17032258</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Geneious ver. 5.1</p></title><aug><au><snm>Drummond</snm><fnm>AJ</fnm></au><au><snm>Ashton</snm><fnm>B</fnm></au><au><snm>Cheung</snm><fnm>M</fnm></au><au><snm>Heled</snm><fnm>J</fnm></au><au><snm>Kearse</snm><fnm>M</fnm></au><au><snm>Moir</snm><fnm>R</fnm></au><au><snm>Stones-Havas</snm><fnm>S</fnm></au><au><snm>Thierer</snm><fnm>T</fnm></au><au><snm>Wilson</snm><fnm>A</fnm></au></aug><pubdate>2010</pubdate><url>http://www.geneious.com</url></bibl><bibl id="B42"><title><p>GBlocks, ver. 0.91b</p></title><aug><au><snm>Castresana</snm><fnm>J</fnm></au></aug><pubdate>2002</pubdate><url>http://molevol.cmima.csic.es/castresana/</url></bibl><bibl id="B43"><title><p>Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments</p></title><aug><au><snm>Talavera</snm><fnm>G</fnm></au><au><snm>Castresana</snm><fnm>J</fnm></au></aug><source>Systematic Biology</source><pubdate>2007</pubdate><volume>56</volume><fpage>564</fpage><lpage>577</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1080/10635150701472164</pubid><pubid idtype="pmpid" link="fulltext">17654362</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis</p></title><aug><au><snm>Castresana</snm><fnm>J</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2000</pubdate><volume>17</volume><fpage>540</fpage><lpage>552</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">10742046</pubid></xrefbib></bibl><bibl id="B45"><title><p>PAUP*: Phylogenetic analysis using parsimony (*and other methods). Version 4.0b10</p></title><aug><au><snm>Swofford</snm><fnm>DL</fnm></au></aug><publisher>Sunderland: Sinauer Associates</publisher><pubdate>2002</pubdate></bibl><bibl id="B46"><title><p>Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criterion</p></title><aug><au><snm>Zwickl</snm><fnm>DJ</fnm></au></aug><publisher>Ph.D. dissertation: The University of Texas at Austin</publisher><pubdate>2006</pubdate></bibl><bibl id="B47"><title><p>GARLI-PART Version 0.97. Genetic Algorithm for Rapid Likelihood Inference</p></title><aug><au><snm>Zwickl</snm><fnm>DJ</fnm></au></aug><url>https://www.nescent.org/wg_garli/Partitioned_version</url><note>accessed 12 Aug. 2010</note></bibl><bibl id="B48"><title><p>jModelTest: Phylogenetic Model Averaging</p></title><aug><au><snm>Posada</snm><fnm>D</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2008</pubdate><volume>25</volume><fpage>1253</fpage><lpage>1256</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msn083</pubid><pubid idtype="pmpid" link="fulltext">18397919</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>A new method for calculating evolutionary substitution rates</p></title><aug><au><snm>Lanave</snm><fnm>C</fnm></au><au><snm>Preparata</snm><fnm>G</fnm></au><au><snm>Saccone</snm><fnm>C</fnm></au><au><snm>Serio</snm><fnm>G</fnm></au></aug><source>Journal of Molecular Evolution</source><pubdate>1984</pubdate><volume>20</volume><fpage>86</fpage><lpage>93</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/BF02101990</pubid><pubid idtype="pmpid">6429346</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Some probablistic and statistical problems on the analysis of DNA sequences</p></title><aug><au><snm>Tavar&#233;</snm><fnm>S</fnm></au></aug><source>Lectures on Mathematics in the Life Sciences</source><pubdate>1986</pubdate><volume>17</volume><fpage>57</fpage><lpage>86</lpage></bibl><bibl id="B51"><title><p>Maximum-likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods</p></title><aug><au><snm>Yang</snm><fnm>Z</fnm></au></aug><source>Journal of Molecular Evolution</source><pubdate>1994</pubdate><volume>39</volume><fpage>306</fpage><lpage>314</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/BF00160154</pubid><pubid idtype="pmpid">7932792</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Maximum likelihood estimation of the heterogeneity of substitution rate among nucleotide sites</p></title><aug><au><snm>Gu</snm><fnm>X</fnm></au><au><snm>Fu</snm><fnm>YX</fnm></au><au><snm>Li</snm><fnm>WH</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>1995</pubdate><volume>12</volume><fpage>546</fpage><lpage>557</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">7659011</pubid></xrefbib></bibl><bibl id="B53"><title><p>A codon-based model of nucleotide substitution for protein-coding DNA sequences</p></title><aug><au><snm>Goldman</snm><fnm>N</fnm></au><au><snm>Yang</snm><fnm>Z</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>1994</pubdate><volume>11</volume><fpage>725</fpage><lpage>736</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">7968486</pubid></xrefbib></bibl><bibl id="B54"><title><p>Grid computing</p></title><aug><au><snm>Cummings</snm><fnm>MP</fnm></au><au><snm>Huskamp</snm><fnm>JC</fnm></au></aug><source>EDUCAUSE Review</source><pubdate>2005</pubdate><volume>40</volume><fpage>116</fpage><lpage>117</lpage></bibl><bibl id="B55"><title><p>The Lattice Project: a Grid research and production environment combining multiple Grid computing models</p></title><aug><au><snm>Bazinet</snm><fnm>AL</fnm></au><au><snm>Cummings</snm><fnm>MP</fnm></au></aug><source>Distributed &amp; Grid Computing - Science Made Transparent for Everyone Principles, Applications and Supporting Communities Tectum</source><publisher>Marburg</publisher><editor>Weber WHW</editor><pubdate>2009</pubdate><fpage>2</fpage><lpage>13</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">21706448</pubid></xrefbib></bibl><bibl id="B56"><title><p>Recovering evolutionary trees under a more realistic model of sequence evolution</p></title><aug><au><snm>Lockhart</snm><fnm>PJ</fnm></au><au><snm>Stell</snm><fnm>MA</fnm></au><au><snm>Hendy</snm><fnm>MD</fnm></au><au><snm>Penny</snm><fnm>D</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>1994</pubdate><volume>11</volume><fpage>605</fpage><lpage>612</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">19391266</pubid></xrefbib></bibl><bibl id="B57"><title><p>Compositional bias may affect both DNA-based and protein-based phylogenetic reconstructions</p></title><aug><au><snm>Foster</snm><fnm>PG</fnm></au><au><snm>Hickey</snm><fnm>DA</fnm></au></aug><source>Journal of Molecular Evolution</source><pubdate>1999</pubdate><volume>48</volume><fpage>284</fpage><lpage>290</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/PL00006471</pubid><pubid idtype="pmpid" link="fulltext">10093217</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>Compositional heterogeneity and phylogenomic inference of metazoan relationships</p></title><aug><au><snm>Nesnidal</snm><fnm>MP</fnm></au><au><snm>Helmkampf</snm><fnm>M</fnm></au><au><snm>Bruchhaus</snm><fnm>I</fnm></au><au><snm>Hausdorf</snm><fnm>B</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2010</pubdate><volume>27</volume><fpage>2095</fpage><lpage>2104</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msq097</pubid><pubid idtype="pmpid" link="fulltext">20382658</pubid></pubidlist></xrefbib></bibl><bibl id="B59"><title><p>When phylogenetic assumptions are violated: base compositional heterogeneity and among-site rate variation in beetle mitochondrial phylogenomics</p></title><aug><au><snm>Song</snm><fnm>H</fnm></au><au><snm>Sheffield</snm><fnm>NC</fnm></au><au><snm>Cameron</snm><fnm>SL</fnm></au><au><snm>Miller</snm><fnm>KB</fnm></au><au><snm>Whiting</snm><fnm>MF</fnm></au></aug><source>Systematic Entomology</source><pubdate>2010</pubdate><volume>35</volume><fpage>429</fpage><lpage>448</lpage><xrefbib><pubid idtype="doi">10.1111/j.1365-3113.2009.00517.x</pubid></xrefbib></bibl><bibl id="B60"><title><p>A comprehensive analysis of mammalian mitochondrial genome base composition and improved phylogenetic methods</p></title><aug><au><snm>Gibson</snm><fnm>A</fnm></au><au><snm>Gowri-Shankar</snm><fnm>V</fnm></au><au><snm>Higgs</snm><fnm>PG</fnm></au><au><snm>Rattray</snm><fnm>M</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2005</pubdate><volume>22</volume><fpage>251</fpage><lpage>264</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">15483324</pubid></xrefbib></bibl><bibl id="B61"><title><p>A phylogenetic study of the 'bombycoid complex' (Lepidoptera) using five protein-coding nuclear genes, with comments on the problem of macrolepidopteran phylogeny</p></title><aug><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Cook</snm><fnm>CP</fnm></au><au><snm>Mitter</snm><fnm>C</fnm></au><au><snm>Hussey</snm><fnm>A</fnm></au></aug><source>Systematic Entomology</source><pubdate>2008</pubdate><volume>33</volume><fpage>175</fpage><lpage>189</lpage><xrefbib><pubid idtype="doi">10.1111/j.1365-3113.2007.00409.x</pubid></xrefbib></bibl><bibl id="B62"><title><p>Arthropod relationships revealed by phylogenomic analysis of nuclear protein-coding sequences</p></title><aug><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Shultz</snm><fnm>JW</fnm></au><au><snm>Zwick</snm><fnm>A</fnm></au><au><snm>Hussey</snm><fnm>A</fnm></au><au><snm>Ball</snm><fnm>B</fnm></au><au><snm>Wetzer</snm><fnm>R</fnm></au><au><snm>Martin</snm><fnm>JW</fnm></au><au><snm>Cunningham</snm><fnm>CW</fnm></au></aug><source>Nature</source><pubdate>2010</pubdate><volume>463</volume><fpage>1079</fpage><lpage>1083</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08742</pubid><pubid idtype="pmpid" link="fulltext">20147900</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><title><p>Degeneracy Coding Web Service. PhyloTools</p></title><aug><au><snm>Zwick</snm><fnm>A</fnm></au></aug><pubdate>2010</pubdate><url>http://www.phylotools.com/ptdegen1webservice.htm</url><note>Web, accessed 25 Feb. 2010</note></bibl><bibl id="B64"><title><p>Genome-scale phylogeny and the detection of systematic biases</p></title><aug><au><snm>Phillips</snm><fnm>MJ</fnm></au><au><snm>Delsuc</snm><fnm>F</fnm></au><au><snm>Penny</snm><fnm>D</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2004</pubdate><volume>21</volume><fpage>1455</fpage><lpage>1458</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msh137</pubid><pubid idtype="pmpid" link="fulltext">15084674</pubid></pubidlist></xrefbib></bibl><bibl id="B65"><title><p>MBE Toolbox: a Matlab toolbox for sequence data analysis in molecular biology and evolution</p></title><aug><au><snm>Cai</snm><fnm>JJ</fnm></au><au><snm>Smith</snm><fnm>DK</fnm></au><au><snm>Xia</snm><fnm>X</fnm></au><au><snm>Yuen</snm><fnm>KY</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2005</pubdate><volume>6</volume><fpage>64</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-6-64</pubid><pubid idtype="pmcid">1274259</pubid><pubid idtype="pmpid" link="fulltext">15780146</pubid></pubidlist></xrefbib></bibl><bibl id="B66"><title><p>New leaf-mining moths from Chile, with remarks on the history and composition of Phyllocnistinae (Lepidoptera: Gracillariidae)</p></title><aug><au><snm>Davis</snm><fnm>DR</fnm></au></aug><source>Tropical Lepidoptera</source><pubdate>1994</pubdate><volume>5</volume><fpage>65</fpage><lpage>75</lpage></bibl><bibl id="B67"><title><p>An approximately unbiased test of phylogenetic tree selection</p></title><aug><au><snm>Shimodaira</snm><fnm>H</fnm></au></aug><source>Systematic Biology</source><pubdate>2002</pubdate><volume>51</volume><fpage>492</fpage><lpage>508</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1080/10635150290069913</pubid><pubid idtype="pmpid" link="fulltext">12079646</pubid></pubidlist></xrefbib></bibl><bibl id="B68"><title><p>CONSEL: for assessing the confidence of phylogenetic tree selection</p></title><aug><au><snm>Shimodaira</snm><fnm>H</fnm></au><au><snm>Hasegawa</snm><fnm>M</fnm></au></aug><source>Bioinformatics</source><pubdate>2001</pubdate><volume>17</volume><fpage>1246</fpage><lpage>1247</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/17.12.1246</pubid><pubid idtype="pmpid" link="fulltext">11751242</pubid></pubidlist></xrefbib></bibl><bibl id="B69"><title><p>A taxonomic revision of the <it>Gracillaria </it>group occurring in Japan (Lepidoptera: Gracillariidae)</p></title><aug><au><snm>Kumata</snm><fnm>T</fnm></au></aug><source>Insecta Matsumurana, New Series</source><pubdate>1982</pubdate><volume>26</volume><fpage>1</fpage><lpage>186</lpage></bibl><bibl id="B70"><title><p>Japanese species of the <it>Acrocercops</it>-group (Lepidoptera: Gracillariidae)</p></title><aug><au><snm>Kumata</snm><fnm>T</fnm></au><au><snm>Kuroko</snm><fnm>H</fnm></au><au><snm>Ermolaev</snm><fnm>VP</fnm></au></aug><source>Insecta Matsumurana, New Series</source><pubdate>1988</pubdate><volume>38</volume><fpage>1</fpage><lpage>111</lpage><note>40:1-133</note></bibl><bibl id="B71"><title><p>A new stem-miner of alder in Japan, with a review of the larval transformation in the Gracillariidae (Lepidoptera)</p></title><aug><au><snm>Kumata</snm><fnm>T</fnm></au></aug><source>Insecta Matsumurana, New Series</source><pubdate>1978</pubdate><volume>13</volume><fpage>1</fpage><lpage>27</lpage></bibl><bibl id="B72"><title><p>On the Japanese species of the genera <it>Macarostola</it>, <it>Aristaea </it>and <it>Systoloneura</it>, with descriptions of three new species (Lepidoptera: Gracillariidae)</p></title><aug><au><snm>Kumata</snm><fnm>T</fnm></au></aug><source>Insecta Matsumurana, New Series</source><pubdate>1977</pubdate><volume>9</volume><fpage>1</fpage><lpage>51</lpage></bibl><bibl id="B73"><title><p>Lepidoptera phylogeny and systematics: the state of inventorying moth and butterfly diversity. Linnaeus Tercentenary: Progress in Invertebrate Taxonomy</p></title><aug><au><snm>Kristensen</snm><fnm>NP</fnm></au><au><snm>Scoble</snm><fnm>M</fnm></au><au><snm>Karsholt</snm><fnm>O</fnm></au></aug><source>Zootaxa</source><pubdate>2007</pubdate><volume>1668</volume><fpage>699</fpage><lpage>747</lpage></bibl><bibl id="B74"><title><p>Ancient rapid radiations of insects: challenges for phylogenetic analysis</p></title><aug><au><snm>Whitfield</snm><fnm>JB</fnm></au><au><snm>Kjer</snm><fnm>KM</fnm></au></aug><source>Annual Review of Entomology</source><pubdate>2008</pubdate><volume>53</volume><fpage>449</fpage><lpage>472</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.ento.53.103106.093304</pubid><pubid idtype="pmpid" link="fulltext">17877448</pubid></pubidlist></xrefbib></bibl><bibl id="B75"><title><p>The effect of ambiguous data on phylogenetic estimates obtained by maximum likelihood and Bayesian inference</p></title><aug><au><snm>Lemmon</snm><fnm>AR</fnm></au><au><snm>Brown</snm><fnm>JM</fnm></au><au><snm>Stanger-Hall</snm><fnm>K</fnm></au><au><snm>Lemmon</snm><fnm>EM</fnm></au></aug><source>Systematic Biology</source><pubdate>2009</pubdate><volume>58</volume><fpage>130</fpage><lpage>145</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/sysbio/syp017</pubid><pubid idtype="pmpid" link="fulltext">20525573</pubid></pubidlist></xrefbib></bibl><bibl id="B76"><title><p>Cases in which parsimony or compatibility methods will be positively misleading</p></title><aug><au><snm>Felsenstein</snm><fnm>J</fnm></au></aug><source>Systematic Zoology</source><pubdate>1978</pubdate><volume>27</volume><fpage>401</fpage><lpage>410</lpage><xrefbib><pubid idtype="doi">10.2307/2412923</pubid></xrefbib></bibl><bibl id="B77"><title><p>A framework for the quantitative study of evolutionary trees</p></title><aug><au><snm>Hendy</snm><fnm>MD</fnm></au><au><snm>Penny</snm><fnm>D</fnm></au></aug><source>Systematic Zoology</source><pubdate>1989</pubdate><volume>38</volume><fpage>297</fpage><lpage>309</lpage><xrefbib><pubid idtype="doi">10.2307/2992396</pubid></xrefbib></bibl><bibl id="B78"><title><p><it>Amborella </it>not a "basal angiosperm"? Not so fast</p></title><aug><au><snm>Soltis</snm><fnm>DE</fnm></au><au><snm>Soltis</snm><fnm>PS</fnm></au></aug><source>American Journal of Botany</source><pubdate>2004</pubdate><volume>2004</volume><fpage>997</fpage><lpage>1001</lpage></bibl><bibl id="B79"><title><p>Phylogeny of eukaryotes based on ribosomal RNA: long-branch attraction and models of sequence evolution</p></title><aug><au><snm>Philippe</snm><fnm>H</fnm></au><au><snm>Germot</snm><fnm>A</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2000</pubdate><volume>17</volume><fpage>830</fpage><lpage>834</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">10917801</pubid></xrefbib></bibl><bibl id="B80"><title><p>Resolution of phylogenetic conflict in large data sets by increased taxon sampling</p></title><aug><au><snm>Hedtke</snm><fnm>SM</fnm></au><au><snm>Townsend</snm><fnm>TM</fnm></au><au><snm>Hillis</snm><fnm>DM</fnm></au></aug><source>Systematic Biology</source><pubdate>2006</pubdate><volume>55</volume><issue>3</issue><fpage>522</fpage><lpage>529</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1080/10635150600697358</pubid><pubid idtype="pmpid" link="fulltext">16861214</pubid></pubidlist></xrefbib></bibl><bibl id="B81"><title><p>Phylogeny and biogeography of hawkmoths (Lepidoptera: Sphingidae): evidence from five nuclear genes</p></title><aug><au><snm>Kawahara</snm><fnm>AY</fnm></au><au><snm>Mignault</snm><fnm>AA</fnm></au><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Kitching</snm><fnm>IJ</fnm></au><au><snm>Mitter</snm><fnm>C</fnm></au></aug><source>PLoS ONE</source><pubdate>2009</pubdate><volume>4</volume><issue>5</issue><fpage>e5719</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0005719</pubid><pubid idtype="pmcid">2683934</pubid><pubid idtype="pmpid" link="fulltext">19492095</pubid></pubidlist></xrefbib></bibl><bibl id="B82"><title><p>The Yponomeutoidea: a reassessment of the superfamily and its suprageneric groups (Lepidoptera)</p></title><aug><au><snm>Kyrki</snm><fnm>J</fnm></au></aug><source>Entomologica Scandinavica</source><pubdate>1984</pubdate><volume>15</volume><fpage>71</fpage><lpage>84</lpage></bibl><bibl id="B83"><title><p>Phylogenetic relationship between the superfamilies Psychoidea, Tineoidea, and Yponomeutoidea (Lepidoptera), taking into account the functional morphology of the male genital apparatus. Part 1. Functional morphology of the male genitalia</p></title><aug><au><snm>Kuznetzov</snm><fnm>VI</fnm></au><au><snm>Stekol&apos;nikov</snm><fnm>AA</fnm></au></aug><source>Entomological Review</source><pubdate>1976</pubdate><volume>55</volume><fpage>533</fpage><lpage>548</lpage></bibl><bibl id="B84"><title><p>Phylogenetic relationship between the superfamilies Psychoidea, Tineoidea, and Yponomeutoidea (Lepidoptera), taking into account the functional morphology of the male genital apparatus. Part 2. Phylogenetic relationships of the families and subfamilies</p></title><aug><au><snm>Kuznetzov</snm><fnm>VI</fnm></au><au><snm>Stekol&apos;nikov</snm><fnm>AA</fnm></au></aug><source>Entomological Review</source><pubdate>1977</pubdate><volume>56</volume><fpage>14</fpage><lpage>21</lpage></bibl><bibl id="B85"><title><p>Evolution and phylogenetic utility of CAD (rudimentary) among Mesozoic-aged eremoneuran Diptera (Insecta)</p></title><aug><au><snm>Moulton</snm><fnm>JK</fnm></au><au><snm>Wiegmann</snm><fnm>B</fnm></au></aug><source>Molecular Phylogenetics and Evolution</source><pubdate>2003</pubdate><volume>31</volume><fpage>363</fpage><lpage>378</lpage></bibl><bibl id="B86"><title><p>A new nuclear gene for insect phylogenetics: dopa decarboxylase is informative of relationships within Heliothinae (Lepidoptera: Noctuidae)</p></title><aug><au><snm>Fang</snm><fnm>QQ</fnm></au><au><snm>Cho</snm><fnm>S</fnm></au><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Mitter</snm><fnm>C</fnm></au><au><snm>Matthews</snm><fnm>M</fnm></au><au><snm>Poole</snm><fnm>RW</fnm></au><au><snm>Friedlander</snm><fnm>TP</fnm></au><au><snm>Zhao</snm><fnm>SW</fnm></au></aug><source>Systematic Biology</source><pubdate>1997</pubdate><volume>46</volume><fpage>269</fpage><lpage>283</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/sysbio/46.2.269</pubid><pubid idtype="pmpid">11975343</pubid></pubidlist></xrefbib></bibl><bibl id="B87"><title><p>Evolutionary assembly of the milkweed fauna: Cytochrome oxidase I and the age of <it>Tetraopes </it>beetles</p></title><aug><au><snm>Farrell</snm><fnm>BD</fnm></au></aug><source>Molecular Phylogenetics and Evolution</source><pubdate>2001</pubdate><volume>18</volume><fpage>467</fpage><lpage>478</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/mpev.2000.0888</pubid><pubid idtype="pmpid" link="fulltext">11277638</pubid></pubidlist></xrefbib></bibl><bibl id="B88"><title><p>Evolution and phylogenetic utility of the <it>period </it>gene in Lepidoptera</p></title><aug><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Fang</snm><fnm>QQ</fnm></au><au><snm>Mitter</snm><fnm>C</fnm></au><au><snm>Peigler</snm><fnm>RS</fnm></au><au><snm>Friedlander</snm><fnm>TP</fnm></au><au><snm>Solis</snm><fnm>MA</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>1998</pubdate><volume>15</volume><fpage>1172</fpage><lpage>1182</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">9729881</pubid></xrefbib></bibl><bibl id="B89"><title><p>Patterns of mitochondrial versus nuclear DNA sequence divergence among nymphalid butterflies: the utility of <it>wingless </it>as a source of characters for phylogenetic inference</p></title><aug><au><snm>Brower</snm><fnm>AVZ</fnm></au><au><snm>DeSalle</snm><fnm>R</fnm></au></aug><source>Insect Molecular Biology</source><pubdate>1998</pubdate><volume>7</volume><fpage>73</fpage><lpage>82</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1365-2583.1998.71052.x</pubid><pubid idtype="pmpid" link="fulltext">9459431</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>