<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2229-7-38</ui>
   <ji>1471-2229</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Novel exon combinations generated by alternative splicing of gene fragments mobilized by a CACTA transposon in <it>Glycine max</it></p>
         </title>
         <aug>
            <au id="A1">
               <snm>Zabala</snm>
               <fnm>Gracia</fnm>
               <insr iid="I1"/>
               <email>g-zabala@uiuc.edu</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Vodkin</snm>
               <fnm>Lila</fnm>
               <insr iid="I1"/>
               <email>l-vodkin@uiuc.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Crop Sciences, University of Illinois, Urbana, Illinois 61801, USA</p>
            </ins>
         </insg>
         <source>BMC Plant Biology</source>
         <issn>1471-2229</issn>
         <pubdate>2007</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>38</fpage>
         <url>http://www.biomedcentral.com/1471-2229/7/38</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17629935</pubid>
               <pubid idtype="doi">10.1186/1471-2229-7-38</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>26</day>
               <month>1</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>14</day>
               <month>7</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>14</day>
               <month>7</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Zabala and Vodkin; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The recent discoveries of transposable elements carrying host gene fragments such as the Pack-MULEs (Mutator-like transposable elements) of maize (<it>Zea mays</it>), rice (<it>Oryza sativa</it>) and <it>Arabidopsis thaliana</it>, the <it>Helitrons </it>of maize and the <it>Tgm-Express </it>of soybeans, revealed a widespread genetic mechanism with the potential to rearrange genomes and create novel chimeric genes affecting genomic and proteomic diversity. Not much is known with regard to the mechanisms of gene fragment capture by those transposon elements or the expression of the captured host gene fragments. There is some evidence that chimeric transcripts can be assembled and exist in EST collections.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We report results obtained from analysis of RT-PCR derived cDNAs of the <it>Glycine max </it>mutant flower color gene, <it>wp</it>, that contains a 5.7-kb transposon (<it>Tgm-Express1</it>) in Intron 2 of the flavanone 3-hydroxylase gene (<it>F3H</it>) and is composed of five unrelated host gene fragments. The collection of cDNAs derived from the <it>wp </it>allele represents a multiplicity of processed RNAs varying in length and sequence that includes some identical to the correctly processed mature F3H transcript with three properly spliced exons. Surprisingly, the five gene fragments carried by the <it>Tgm-Express1 </it>were processed through complex alternative splicing as additional exons of the <it>wp </it>transcript.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The gene fragments carried by the <it>Tgm </it>inverted repeat ends appear to be retained as functional exons/introns within the element. The spliceosomes then select indiscriminately the canonical intron splice sites from a pre-mRNA to assemble diverse chimeric transcripts from the exons contained in the <it>wp </it>allele. The multiplicity and randomness of these events provide some insights into the origin and mechanism of alternatively spliced genes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>A mutation in a soybean flower color gene <it>(Wp) </it>encoding a flavanone 3-hydroxylase (F3H) was characterized as a novel transposon insertion, <it>Tgm-Express1</it>, of the CACTA superfamily, that carried multiple captured host gene fragments <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. The most visible effect of the <it>wp </it>mutation is production of pink rather than purple flowers and lighter color in the seed coats (Figure <figr fid="F1">1</figr>). It has also been associated with lower oil and higher seed protein content than the purple flowered <it>Wp </it>isoline <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Illustration of the effect of <it>wp </it>on flower and seed coat phenotypes</p>
            </caption>
            <text>
               <p><b>Illustration of the effect of <it>wp </it>on flower and seed coat phenotypes</b>. <b>(A) </b>Stable purple flower of plants with <it>Wp </it>genotype (left panel) or stable pink flower of plants with <it>wp </it>genotype (right panel) in lines LN89-5320-6 (<it>i</it><sup><it>i</it></sup><it>RtW1Wp) and </it>LN89-5322-2 (<it>i</it><sup><it>i</it></sup><it>RtW1wp</it>) both of which have yellow seed coats. In soybean <it>I </it>(<it>CHS</it>), <it>R </it>and <it>T </it>(<it>F3'H</it>) are three independent loci that control pigmentation in seed coats and <it>W1 </it>(<it>F5'3'H</it>) and <it>Wp </it>(<it>F3H</it>) were described as flower color markers, but all five loci seem to be encoding genes of the anthocyanin and proanthocyanidin pathways. Mutant alleles of those loci (<it>i, i</it><sup><it>i</it></sup>, <it>r, t, w1 </it>and <it>wp</it>) affect flower, seed coat, hypocotyle or pubescence coloration [1, 28, 30, 31]. <b>(B) </b>Imperfect black color of seed coats of plants with <it>iRtW1Wp </it>genotype (left panel) as contrasted with the lighter shaded seed coats of plants with <it>iRtW1wp </it>genotype (right panel). Effect on the seed coat phenotype was revealed by crossing the <it>wp </it>allele into lines having the recessive <it>i </it>allele that allows spatial pigmentation of the entire seed coat [24]. The cracks on both seed coat types result from an epistatic effect of <it>t </it>[31]. <b>(C) </b>Black seed coats of plants with <it>iRTW1Wp </it>genotype where the <it>T </it>allele drives the synthesis of cyanidins (left panel) contrasted with the lighter seed coats of plants with <it>iRTW1wp </it>genotype (right panel). <b>(D) </b>Abbreviated schematic representation of the three branches leading to the synthesis of the three anthocyanin classes and the genes encoding the enzymes relevant to the present study.</p>
            </text>
            <graphic file="1471-2229-7-38-1"/>
         </fig>
         <p>The <it>Tgm-Express1 </it>element, like the Pack-MULEs (Mutator-like elements) of maize, rice and <it>Arabidopsis</it>, retroelements in rice and the <it>Helitrons </it>of maize contains several host gene fragments. It carries intronic and exonic regions of five genes: unknown protein (<it>UP</it>), cell division cycle 2 (<it>CDC2</it>), fructose-6-phosphate 2-kinase/fructose-2-6-biphosphatase (<it>FPK</it>), malate dehydrogenase (<it>M</it>) and cysteine synthase (<it>CS</it>) <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Little is known about how any of those transposons or retroelements acquired the gene pieces but there is some evidence that they are transcribed creating chimeric cDNAs that exist in EST (expressed sequence tag) collections <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. Of 475 Pack-MULEs identified on chromosomes 1 and 10 of rice via computer searches, 5% were transcribed based on exact matches to full-length cDNAs <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Most of the transcripts (91%) were initiated from promoters at the TIRs (terminal inverted repeats) or within the element while 9% of the transcripts initiated outside the element. Three chimeric transcripts were seen in an RNA blot probed with both a <it>sh2 </it>and a <it>Helitron </it>insertion fragment probes <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. A single 2,620 bp chimeric transcript spanning the entire <it>Helitron </it>including several gene fragments contained within the element has also been described <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The promoter was predicted to reside upstream of the <it>Helitron </it>insertion site.</p>
         <p>We present here an array of 12 distinct alternatively spliced chimeric transcripts obtained via RT-PCR that were derived from the soybean <it>wp </it>mutant allele in which the second intron of the <it>F3H1 </it>gene is interrupted by the 5.7 kb transposon containing five captured host gene-fragments. The chimeric transcripts analyzed were more abundant in seed coats than in cotyledons and ranged in size from 3,108 bp to a correctly processed one of 1,422 bp that was identical to the transcript derived from the wild type <it>Wp </it>allele. All transcripts isolated initiated at the <it>F3H1 </it>gene (<it>Wp</it>, <it>wp</it>) promoter that is strongly expressed in seed coats.</p>
         <p>Alternative splicing is a common regulatory mechanism in higher eukaryotes and the mechanisms governing it have been studied extensively in mammalian systems but sparingly in plants <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. In general, the splicing pattern of a multiexon pre-mRNA can be altered in many ways. Exons that are always spliced and included in the mature mRNA are known as constitutive exons. However, mechanistic decisions of the splicing components can result in exons that are included at times but excluded at others times from the mature mRNA. These are referred to as cassette exons. There are also occurrences of 5' and 3' alternative splice sites altering the length of some exons. In addition, the failure to remove an intron, referred to as intron retention, is also found. Genes whose pre-mRNAs have multiple locations of alternative splicing produce a family of related proteins with different allosteric regulation, protein localization, or enzymatic activity <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
         <p>We show that the exon/intron regions of gene fragments carried by the <it>Tgm-Express1 </it>of the <it>wp </it>allele are alternatively spliced and assembled with the constitutive exons of the <it>F3H1 </it>gene to generate an array of chimeric transcripts encoding a variety of open reading frames (orfs). Analysis of the derived amino acid sequence from the 12 distinct <it>wp </it>chimeric cDNAs predicted putative chimeric orfs varying in length and frame locations.</p>
         <p>The splicing machinery at times eliminates all extraneous (cassette) exons and introns of the <it>Tgm-Express1 </it>element to generate a full length transcript identical to that of the wild type gene and thus likely functional. The number of F3H molecules may be extensive enough to allow the synthesis of sufficient anthocyanin pigment that could account for the pink flower and the lighter seed coat phenotypes in the <it>wp </it>lines (Figure <figr fid="F1">1</figr>). On the other hand, the more complex chimeric transcripts containing cassette exons (UP, CDC2, FPK, M and CS) from the captured gene fragments in the transposon may upon translation generate products that could interfere with function of the wild type host-gene counterparts leading to secondary phenotypes. Whether any of the novel exon combinations derived from alternative splicing of the mobile exons of the <it>Tgm-Express1 </it>element create new phenotypes is unknown. However, there is growing evidence from both plant <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B7">7</abbr></abbrgrp> and animal <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp> systems that repeat sequences derived from mobile elements play a significant role in generation and evolution of novel genes and exons. The <it>wp </it>locus in soybean is a unique example of an insertional mutation in the act of <it>de novo </it>generation of fused, multiple chimeric exons through inclusion or exclusion of cassette exons carried by the element into the affected gene.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Complex aberrant expression of the flower color mutant gene wp</p>
            </st>
            <p>We discovered that a pink flower locus (<it>Wp</it>) of soybean encoded a flavanone 3- hydroxylase gene (<it>F3H1</it>) by differential screening of a cDNA soybean microarray using RNAs from mutant pink (<it>wp</it>) and standard purple (<it>Wp</it>) flower isolines <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. We also showed that the <it>Tgm-Express1 </it>transposon insertion impaired expression of the mutated gene and that the <it>F3H1 </it>gene was strongly expressed in the seed coats but not in cotyledons <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p>
            <p>Analysis of the <it>wp </it>allele expression by RT-PCR with a pair of <it>F3H1 </it>outermost 5' and 3'-primers revealed a bizarre pattern of amplification resulting in a variety of cDNA sizes from both seed coat and cotyledon RNAs (Figure <figr fid="F2">2</figr>). The broad bright band of PCR product obtained with the seed coat samples (Figure <figr fid="F2">2A</figr>) represents multiple size bands. Shorter exposure photograph of that same gel revealed at least 4 distinct bands (left most lane, Figure <figr fid="F2">2A</figr>). The <it>wp </it>transcriptional activity between the two tissues, cotyledons and seed coats, could be deduced from the difference in the intensity of the PCR products obtained from the two <it>wp </it>RNA sets (Figure <figr fid="F2">2A</figr> and <figr fid="F2">2B</figr>). Even though no hybridization to a F3H probe was apparent by RNA blots with cotyledon RNAs of either genotype (<it>Wp </it>and <it>wp</it>) <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, RT-PCR showed the existence of 1.4 kb transcripts representing the mature <it>F3H1 </it>gene (data not shown) and the aberrant larger transcripts from mutant line RNAs (Figure <figr fid="F2">2B</figr>).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Variant flavanone 3-hydroxylase cDNAs from isolines containing mutant <it>wp </it>alleles</p>
               </caption>
               <text>
                  <p><b>Variant flavanone 3-hydroxylase cDNAs from isolines containing mutant <it>wp </it>alleles. (A) </b>Ethidium bromide-stained gel showing an array of cDNA bands between 5 and 1.4 kb in size that were amplified from RNAs of seed coats of the <it>wp </it>mutant line LN89-5322-2 through RT-PCR reactions. The (+) and (-) at top indicate reactions with and without Superscript RTII. The bright broad bands obtained from mutant RNA samples in the (+) reactions were resolved into a group of discreet bands with shorter photographic exposure of the same gel (far left lane). <b>(B) </b>Ethidium bromide-stained gel showing cDNAs amplified via RT-PCR with RNA from cotyledons of the LN89-5322-2 (<it>wp</it>) mutant line.</p>
               </text>
               <graphic file="1471-2229-7-38-2"/>
            </fig>
            <p>Cloning the larger sized RT-PCR cDNAs from plants homozygous for the <it>wp </it>allele resulted in a surprising array of alternatively spliced transcripts. Sequence analysis of the multiple size cDNAs cloned from both seed coat and cotyledons revealed multiple transcripts derived from the <it>wp </it>allele containing the wild type gene (<it>F3H1</it>) exons (1, 2, 3) plus varying portions of exonic and intronic regions of the gene fragments captured by the <it>Tgm-Express1 </it>element that interrupts Intron 2 in the <it>wp </it>allele (Figure <figr fid="F3">3</figr>).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Schematic representation of the <it>wp </it>recessive allele and the novel exon combinations generated in its transcribed RNAs</p>
               </caption>
               <text>
                  <p><b>Schematic representation of the <it>wp </it>recessive allele and the novel exon combinations generated in its transcribed RNAs. (A) </b>Represents the genomic sequence of the mutant <it>wp </it>allele obtained from the line LN89-5322-2 (<it>i</it><sup><it>i</it></sup><it>RtW1wp</it>). The introns are indicated and their length given in bp. The 5,725 bp <it>Tgm-Express </it>insertion in Intron 2 is drawn at top with the arrow heads representing inverted repeats and the five captured gene fragments color coded. The full length of the mutant gene is 9,251 bp. The three Exons in purple represent the cDNA of the proper spliced wild type gene 1,422 bp in size. The 7F and 1428R primers used in the PCR reactions that generated the chimeric cDNA clones shown in Figure 3B and C map at the 5' end of Exon 1 (7F) and the 3' end of Exon 3 (1428R) respectively. <b>(B) </b>Graphic representation of six chimeric, multi-exon cDNA clones (<it>wp</it>-25s, -22s, -28s, -9s, 12s, -4s) derived from seed coat RNAs of the <it>wp </it>mutant line via RT-PCR. These clones contained besides the F3H three Exons (1, 2, 3) varying numbers of alternatively spliced exons (solid color boxes) and introns (dashed narrower boxes) from 3 or 5 of the <it>Tgm-Express1 </it>captured gene fragments (<it>UP, CDC2, FPK, M </it>and <it>CS</it>). A seventh cDNA clone, <it>wp</it>-15s, also derived from the mutant <it>wp </it>line is composed only of the wild type gene (<it>Wp</it>) Exons 1, 2 and 3. <b>(C) </b>Six chimeric cDNA clones (<it>wp</it>-9c, -8c, -2c, -13c, -12c, -6c) derived from cotyledon RNAs of the <it>wp </it>mutant line via RT-PCR. All clones contained the F3H Exons (1, 2,3) with varying numbers of alternatively spliced exonic and intronic regions from the <it>Tgm-Express1 </it>acquired host-gene fragments separating the Exon 2-Exon 3 junction. Abbreviations: <it>UP</it>, unknown protein; <it>CDC2</it>, cell division cycle 2; <it>FPK</it>, fructose-6-phosphate 2-kinase/fructose-2-6-biphosphatase; <it>M</it>, malate dehydrogenase; <it>CS</it>, cysteine synthase. Two CDC2 intronic regions captured by the transposon element and sandwished between the three exonic regions (C, D and C2, Figure 2A) were spliced out to form the CDC2 exon in the chimeric transcripts (Figure 2B and C). One FPK intronic fragment captured by the transposon between two flanking exons (F and PK, Figure 2A) was also spliced out to form the FPK exon in the chimeric transcripts (Figure 2B and C). A smaller FPK intron flanked by 15 bp exon fragment (narrow orange block not named) at the 5'end (Figure 2A) is not always spliced out (Figure 2B and C).</p>
               </text>
               <graphic file="1471-2229-7-38-3"/>
            </fig>
            <p>Figure <figr fid="F3">3A</figr> shows the schematic representation of the genomic sequence of the <it>wp </it>allele with the <it>Tgm-Express1 </it>insertion in Intron 2. The number of gene fragments contained within the element and their exons (solid colored boxes) and introns (striped boxes) were revealed upon sequence analysis of the multiple transcripts derived from <it>wp </it>expression in seed coats (See additional file <supplr sid="S1">1</supplr>: Seed coat <it>wp </it>RT-PCR cDNA sequence alignment) and cotyledons (See additional file <supplr sid="S2">2</supplr>: Cotyledons <it>wp </it>RT-PCR cDNA sequence alignment). There were exonic portions of five distinct genes: unknown protein (<it>UP</it>), cell division cycle 2 (<it>CDC2</it>), fructose-6-phosphate 2-kinase/fructose-2-6-biphosphatase (<it>FPK</it>), malate dehydrogenase (<it>M</it>) and cysteine synthase (<it>CS</it>). Some of the intronic regions could be assigned to specific genes (one color stripes) while others (two colors) could not. The solid black line between the solid arrow heads (inverted repeats) could be introns or intergenic DNA regions. Including the latter, all marked intronic regions conform to the canonical 5'GT donor and 3'AG acceptor splice sites.</p>
            <p>Figure <figr fid="F3">3B</figr> is a graphic summary of seven distinct RT-PCR cDNA clones derived from seed coat RNAs. The larger clones (<it>wp</it>-25s, <it>wp</it>-22s, <it>wp</it>-28s, <it>wp</it>-9s and <it>wp</it>-12s) contain beside the three exons of the <it>F3H1 </it>gene, all cassette exons of the <it>Tgm-express1 </it>gene fragments and varying intron pieces. The smaller clones (<it>wp</it>-4s and <it>wp</it>-15s) had only the F3H exons (<it>wp</it>-15s) or the F3H exons and three cassette exons (<it>wp</it>-4s). Sequence data from these clones have been deposited with the EMBL/GenBank Data Libraries under accession numbers: <ext-link ext-link-type="gen" ext-link-id="EF100865">EF100865</ext-link> (<it>wp-</it>25s), <ext-link ext-link-type="gen" ext-link-id="EF100866">EF100866</ext-link> (<it>wp-</it>22s), <ext-link ext-link-type="gen" ext-link-id="EF100867">EF100867</ext-link> (<it>wp-</it>28s), <ext-link ext-link-type="gen" ext-link-id="EF100868">EF100868</ext-link> (<it>w-p</it>9s), <ext-link ext-link-type="gen" ext-link-id="EF100869">EF100869</ext-link> (<it>wp-</it>12s), <ext-link ext-link-type="gen" ext-link-id="EF100870">EF100870</ext-link> (<it>wp-</it>4s), <ext-link ext-link-type="gen" ext-link-id="EF100871">EF100871</ext-link> (<it>wp</it>-15s).</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Seed coat <it>wp </it>RT-PCR cDNA sequence alignment.</p>
               </text>
               <file name="1471-2229-7-38-S1.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>Cotyledons <it>wp </it>RT-PCR cDNA sequence alignment.</p>
               </text>
               <file name="1471-2229-7-38-S2.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Likewise, Figure <figr fid="F3">3C</figr> shows six cDNA clones obtained via RT-PCR from cotyledon RNAs. As in the case of the seed coat derived cDNA clones, the larger cotyledon cDNA clones (<it>wp</it>-9c, <it>wp</it>-8c, <it>wp</it>-2c and <it>wp</it>-13c) contained some intron fragments besides the three F3H exons and cassette exons from the <it>Tgm-Express1 </it>element. The smaller clones (<it>wp</it>-12c and <it>wp</it>-6c) contained only exons, the three F3H exons and the five cassette exons correctly spliced. The latter two clones diverged only by 61 bp mostly due to two splicing errors in <it>wp-</it>6c deleting 15 bp at the beginning of Exon 2 and 47 bp at the CDC2/FPK exons junction (See additional file <supplr sid="S2">2</supplr>: Cotyledons <it>wp </it>RT-PCR cDNA sequence alignment). Of the six cotyledon cDNAs cloned, only one (<it>wp</it>-8c) was identical to one (<it>wp</it>-22s) of the seed coat cDNA clones. Sequence data from these clones have been deposited with the EMBL/GenBank Data Libraries under accession numbers: <ext-link ext-link-type="gen" ext-link-id="EF100872">EF100872</ext-link> (<it>wp</it>-9c), <ext-link ext-link-type="gen" ext-link-id="EF100873">EF100873</ext-link> (<it>wp</it>-8c), <ext-link ext-link-type="gen" ext-link-id="EF100874">EF100874</ext-link> (<it>wp</it>-2c), <ext-link ext-link-type="gen" ext-link-id="EF100875">EF100875</ext-link> (<it>wp</it>-13c), <ext-link ext-link-type="gen" ext-link-id="EF100876">EF100876</ext-link> (<it>wp</it>-12c), and <ext-link ext-link-type="gen" ext-link-id="EF100877">EF100877</ext-link> (<it>wp</it>-6c).</p>
            <p>Overall, we isolated 12 different transcripts synthesized from the <it>wp </it>allele. These are a good representation of the chimeric transcripts generated by the spliceosome machinery in the tissues examined. We conclude that the most abundant transcripts shown by the discrete bands in figure <figr fid="F2">2A</figr> (left lane) have been cloned based on their size. The four most abundant bands are between 2 and 3 kb in size as are 11 of the 12 different cloned cDNAs. Our results also demonstrated that alternative splicing at the <it>wp </it>allele occurs in two tissues, one (the seed coats) in which the F3H promoter is highly expressed and another (cotyledons) in which it is not.</p>
         </sec>
         <sec>
            <st>
               <p>Open reading frames of chimeric, multi-exon wp transcripts</p>
            </st>
            <p>The amino acid sequences derived from the cDNA sequences of seed coat and cotyledon <it>wp</it>-cDNA clones shown in Figure <figr fid="F3">3B</figr> and <figr fid="F3">3C</figr>, varied significantly from clone to clone and consequently the putative open reading frames (orfs) of these chimeric transcripts. A search for orfs consisting of more than 100 amino acids (aa) found that many were chimeric (Figure <figr fid="F4">4</figr> and additional file <supplr sid="S3">3</supplr>: Seed coat <it>wp </it>RT-PCR cDNA derived amino acid sequences and open reading frames, and additional file <supplr sid="S4">4</supplr>: Cotyledon <it>wp </it>RT-PCR cDNA derived amino acid sequences and open reading frames). Of interest were two putative chimeric orfs present in several of the <it>wp</it>-cDNAs. One was composed of approximately 210 bp (70 aa) of the <it>UP </it>exon fragment and 192 bp (64 aa) of the <it>CDC2 </it>exon fragment. It was present in seed coat transcripts <it>wp</it>-25s, -22s, -28s, -9s, -12s, -4s, and cotyledon transcripts <it>wp</it>-9c, -8c, -2c, -13c, -12c, -6c, always in a (+) frame. In half the clones this orf appears as just described (seed coat <it>wp</it>-25s, -22s, -12s and cotyledon <it>wp</it>-9c, -8c, -2c) (Figure <figr fid="F4">4-D</figr>). In the other half the orf is part of a larger chimeric orf containing also the <it>F3H1 </it>Exon 1 and Exon 2 sequences (Figure <figr fid="F4">4-A</figr> and <figr fid="F4">4C</figr>) (seed coat <it>wp</it>-28s, -9s, -4s and cotyledon <it>wp</it>-13c, -6c, -12c). A second chimeric orf predicted for five of the <it>wp</it>-cDNAs (seed coat <it>wp</it>-25s, -22s, -9s and cotyledon <it>wp</it>-9c, -8c) consisted of approximately 103 bp (35 aa) of <it>FPK/M </it>intronic fragment and 212 bp (70 aa) of <it>FPK </it>exonic region, always in one of the three (-) frames (Figure <figr fid="F4">4-E</figr>). A related chimeric orf containing <it>FPK </it>exonic sequence of approximately 231/199 bp (77/66 aa) appears in seed coat <it>wp</it>-12s clone and cotyledon <it>wp</it>-13c and <it>wp</it>-6c, all three in a (+) frame (Figure <figr fid="F4">4-F</figr>).</p>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p>See coat <it>wp </it>RT-PCR cDNA derived anubi acxud sequences and open reading frames.</p>
               </text>
               <file name="1471-2229-7-38-S3.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p>Cotyledon <it>wp </it>RT-PCR cDNA derived amino acid sequences and open reading frames.</p>
               </text>
               <file name="1471-2229-7-38-S4.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Schematic of relevant chimeric and non-chimeric orfs generated by the <it>wp </it>allele</p>
               </caption>
               <text>
                  <p><b>Schematic of relevant chimeric and non-chimeric orfs generated by the <it>wp </it>allele</b>. In order of decreasing aa length the chimeric orfs from several of the chimeric mRNAs isolated were: A (418 aa) containing F3H Exons 1 and 2, the UP and CDC2 Exons; C (293 aa) with F3H Exon 1, 3'end 16 aa, and Exon 2 plus UP and CDC2 Exons; D (143 aa) had 9 aa of the UP intron plus the UP and CDC2 Exons. E (105 aa) with 34 aa of the FPK/MDH Intron plus 71 aa of the FPK Exon; F (105 aa) had 19 aa of FPK Intron, 77 aa of FPK Exon and 9 aa of MDH Exon. The non chimeric orf B (394 aa) had the three F3H Exons identical to the ones translated from the <it>Wp </it>allele the only cDNA clone with this orf was <it>wp</it>-15s. The chimeric orfs were generated from several of the cDNAs sequenced and they are listed underneath each orf class and also the frame in each one of the clones. * The orf from the <it>wp</it>-6c clone was 5 aa shorter. ** The orf from the <it>wp</it>-6c was 2 aa longer.</p>
               </text>
               <graphic file="1471-2229-7-38-4"/>
            </fig>
            <p>The products of these chimeric orfs may not serve enzymatic functions <it>per se </it>but if translated, they could potentially affect the function of wild type proteins synthesized from the intact host genes (<it>UP</it>, <it>CDC2 </it>or <it>FPK</it>). In addition to the chimeric orfs, we also found a cDNA that reconstituted the <it>F3H1</it>. The seed coat <it>wp</it>-15s cDNA clone in the (+3) frame contained an orf of 394 amino acids identical to the orf derived from the functional allele <it>F3H1 </it>of the purple flower isoline (<it>Wp</it>) (Figure <figr fid="F4">4-B</figr>). The product of this <it>F3H1</it>orf has the full potential to be translated into a functional F3H enzyme.</p>
         </sec>
         <sec>
            <st>
               <p>Expression of host genes with homology to the <it>Tgm-Express1 </it>captured gene fragments</p>
            </st>
            <p>To analyze the expression of the host genes related to the exons captured by the <it>Tgm-Express1 </it>element, we amplified the cassette exons from the seed coat derived <it>wp</it>-12 cDNA clone (Figure 3B) to generate a chimeric radiolabeled probe that would hybridized to all RNAs with homology to the probe's exon fragments. These include those transcripts derived from the related host genes as well as the chimeric transcripts expressed from the <it>wp </it>mutant allele. This probe contained sequence fragments with similarity to an unknown protein (<it>UP</it>), cell division cycle 2 (<it>CDC2</it>), fructose-6-phosphate 2-kinase/fructose-2-6-biphosphatase (<it>FPK</it>), malate dehydrogenase (<it>M</it>) and cysteine synthase (<it>CS</it>) genes.</p>
            <p>Figure <figr fid="F5">5</figr> shows the hybridization of the chimeric radiolabelled probe to an RNA blot containing flower bud, seed coat and cotyledon RNAs from the <it>wp</it>, <it>Wp </it>and <it>wp</it><sup><it>m </it></sup>isolines. It appears that the hybridization signal in the seed coats from the <it>wp </it>line that contains the <it>Tgm-Express1 </it>insertion is greater than the signal in the <it>Wp </it>line that lacks the insertion, suggesting that the chimeric fragments within the <it>wp </it>allele are detected in the RNA blots along with hybridization to the host genes. In cotyledons on the other hand, hybridization levels to RNAs from both isolines <it>Wp </it>and <it>wp </it>were similar. Since we have previously shown that the <it>F3H1 </it>promoter is strongly expressed in seed coats but with practically undetectable expression in the cotyledons by RNA blot analysis <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, we deduced that the hybridization seen with cotyledon RNAs is likely to represent expression from one or more of the host genes. We have RNA blot data showing transcripts hybridizing to a soybean FPK cDNA clone (Gm-c1023-5325) of a size similar to those of the chimeric transcripts and that were more highly expressed in cotyledons of all three isolines than in seed coats (data not shown). This would explain the hybridization results observed for the cotyledons of the <it>Wp </it>where no chimeric transcripts should be synthesized.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Expression of the <it>Tgm-Express1 </it>captured gene fragments and related host genes in the <it>Wp</it>-flower color isolines</p>
               </caption>
               <text>
                  <p><b>Expression of the <it>Tgm-Express1 </it>captured gene fragments and related host genes in the <it>Wp</it>-flower color isolines. (A) </b>RNA gel blot with total RNA samples purified from flower buds, seed coats of three developmental stages and cotyledons of two developmental stages of the flower color isolines: LN89-5320-8-53 (<it>wp</it><sup><it>m </it></sup><it>wp</it><sup><it>m</it></sup>), LN89-5320-6 (<it>WpWp</it>) and LN89-5322-2 (<it>wpwp</it>). Seed fresh weight of each seed coat and cotyledon sample in mg is shown at bottom. The chimeric DNA probe containing <it>UP, CDC2, FPK, M </it>and <it>CS </it>sequences was an amplification product from the seed coat RT-PCR derived <it>wp</it>-12 cDNA clone (Figure 3). <b>(B) </b>Ethidium bromide-stained gel prior to membrane transfer. The 25 S rRNA is shown to compare RNA sample loading.</p>
               </text>
               <graphic file="1471-2229-7-38-5"/>
            </fig>
            <p>In addition, the hybridizing RNAs shown in Figure <figr fid="F5">5</figr> are large transcripts of size similar to those recovered by RT-PCR suggesting that most likely there are no internal transcription initiation sites within the <it>Tgm-Express1 </it>element to generate truncated smaller transcripts.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Plant transposable elements have long been known to cause changes in gene expression as a result of insertion or deletion in coding regions and gene promoters and their effect on RNA processing <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. However, more than 20 years of research on tracking the movements of individual transposable elements at the molecular level has revealed only a few examples that these elements were capable of transposing non-element associated coding regions from the host genome. A part of a host gene was carried by a <it>Mutator </it>(<it>Mu</it>) related element <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and another by the <it>Bs1 </it>retroviral element <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. More recently, with the availability of entire genome sequences for rice and <it>Arabidopsis</it>, many <it>Mu</it>-like elements associated with fragments of host cellular genes have been found in those two plant species <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B4">4</abbr></abbrgrp>. Bioinformatics analysis of the rice genome uncovered 898 intact retrogenes of which 380 were predicted to have chimeral protein coding sequences and several of these were confirmed by expression data <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. A novel family of maize transposons, the <it>Helitrons</it>, has been found recently to be embedded with portions of cellular gene fragments <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B19">19</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. The <it>Tgm-Express1</it>, a member of the CACTA family, was shown to carry four genic fragments <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. An additional exonic region, UP, was identified upon analysis of the RT-PCR cDNA sequences obtained in the present study. More significantly, we show also that the <it>wp </it>allele carrying the <it>Tgm-Express1 </it>spawns an array of chimeric transcripts resulting from alternative splicing events at this locus, some of which lead to novel open reading frames.</p>
         <p>If one were to envision an evolutionary advantageous mechanism for <it>the novo </it>generation of mutations, it likely would be one that would alter the splicing of a gene's constitutive exons to produce novel proteins without the complete loss of the wild-type protein. The <it>wp </it>pink flower mutation of soybean that we describe offers an example of how a transposon insertion in the intron of the wild-type gene (<it>F3H1</it>) could generate an array of alternatively spliced transcripts with potential to be translated into novel proteins without totally losing the ability to synthesize the wild-type transcript and encoded protein. The transposon insertion, <it>Tgm-Express1</it>, responsible for this mutation carries five host gene fragments, <it>UP, CDC2, FPK, M </it>and <it>CS</it>, which we show are exons that are alternatively spliced with <it>F3H1 </it>exons. Sequence analysis of multiple RT-PCR derived cDNAs from two different tissues, the seed coat and the cotyledon, revealed the processing of at least 12 distinct putative transcripts, 11 of which contained besides the F3H exons (1, 2 and 3) varying numbers of cassette exons with or without small host gene-fragment intronic portions. In addition, transcripts with only the constitutive F3H exons were also synthesized from the <it>wp </it>mutant allele. Although observations of element removal from maize exons <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp> and introns <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> has previously been observed, the splicing of the very large <it>wp </it>intron of 6.44 kb (consisting of the 718 bp F3H intron 2 and the 5722 bp <it>Tgm-Express1 </it>transpon within it) is an unusual event. Plant introns are generally short and splicing of extremely large introns is rare <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>In alternative splicing processes in other eukaryotic systems, splicing of the pre-messenger RNA required five ribonucleoproteins and numerous accessory proteins to form four complexes (E, A, B and C) to join two adjacent exons with the release of an intervening sequence <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. It is remarkable to see the variety of processed transcripts that were created from the <it>wp </it>mutant allele leading to the precise splicing of the F3H and the cassette exons from within the <it>Tgm-Express1 </it>element (Figure <figr fid="F3">3B</figr> and <figr fid="F3">3C</figr>, seed coat <it>wp-</it>4s, <it>wp-</it>15s and cotyledon <it>wp</it>-12c and <it>wp</it>-6c, for example). Of greater interest also is the proper processing of the three CDC2 and two FPK exons in some of the transcripts that results in fusion of unrelated exons.</p>
         <p>Analysis of the derived amino acid sequence from the 12 distinct <it>wp </it>chimeric cDNA sequences predicted putative chimeric open reading frames (orfs) varying in location and length within the cDNAs (Figure <figr fid="F4">4</figr>). Alternative splicing of the cassette exons within <it>Tgm-Express1 </it>creates premature termination codons in many of the transcripts which could lead to their being targets for nonsense-mediated mRNA decay, the surveillance mechanism that degrades selectively nonsense mRNAs <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. However the larger 5'-end chimeric orfs (seed coats <it>wp</it>-28s, -9s, and cotyledons <it>wp</it>-13c and <it>wp-</it>6c) may have a higher chance to be translated (Figure <figr fid="F4">4-A</figr>). This orf is 418 aa, 24 aa longer than the one like the F3H wild type orf of 394 aa (Figure <figr fid="F4">4-B</figr>).</p>
         <p>The correctly processed transcript, represented by the isolated <it>wp</it>-15s clone (Figure <figr fid="F4">4-B</figr>), would allow the synthesis of functional F3H protein which could explain the existence of residual anthocyanin pigment in the pink flowers and the lighter colored seed coats in plants with <it>wpwp </it>genotype (Figure <figr fid="F1">1</figr>). F3H function is a required step in the synthesis of all three branches of anthocyanin and proanthocyanidins (Figure <figr fid="F1">1D</figr>). The results presented here demonstrate that <it>wp </it>is not a null mutation and that correctly spliced F3H1 transcripts are synthesized in sufficient amounts to allow the synthesis of anthocyanin pigment coloring the pink flowers (Figure <figr fid="F1">1A</figr>) and seed coats of plants with the <it>i</it>, <it>t</it>, <it>wp </it>(Figure <figr fid="F1">1B</figr>) and <it>i</it>, <it>T</it>, <it>wp</it>, (Figure <figr fid="F1">1C</figr>) genotypes. The fact that <it>wp </it>is not a null mutation suggests that the residual anthocyanins being synthesized in the <it>wp </it>flowers are most likely the same as in the <it>Wp </it>purple flowered isolines. The only difference between the colors of the two phenotypes may be the amount of pigment being synthesized.</p>
         <p>We cloned two similar F3H genomic sequences (<it>F3H1 </it>and <it>F3H2</it>) and found that the <it>F3H2 </it>gene is not expressed in the tissues discussed here <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Therefore, the residual flower and seed coat color- phenotypes displayed by <it>wp </it>allele (Figure <figr fid="F1">1</figr> right panels) are not the result of the expression of other <it>F3H </it>family member genes elsewhere in the genome.</p>
         <p>Except for the wild-type like orf in the <it>wp</it>-15 cDNA clone with the three correctly spliced Exons (1, 2 and 3), that reconstitute F3H, most other orfs of more than 100 amino acids were chimeric. The more extensive ones were composed of the two first exons of <it>F3H </it>and varying portions of <it>UP </it>and <it>CDC2 </it>(418 aa and 293 aa; Figure <figr fid="F4">4-A</figr> and <figr fid="F4">4C</figr>). In addition, two other chimeric orfs encoding 143 and 105 amino acids respectively were created in several of the cDNA clones analyzed. The first contains portions of <it>UP </it>and <it>CDC2 </it>sequences in (+) frames while the second has <it>FPK </it>and <it>FPK/M </it>intron-derived sequences in (-) frames (Figure <figr fid="F4">4-D</figr> and <figr fid="F4">4E</figr>). Since both the host genes and the <it>wp </it>chimeric transcripts appear to be weakly expressed in the tissues examined (Figure <figr fid="F5">5</figr> and data not shown), any polypeptide fragments translated from the putative orfs, might interfere with the assembly and function of their active host protein counterparts. Thus, the CDC2 and FPK polypeptide pieces translated from the <it>wp </it>mutant allele may have potential to interfere with the functional enzymes translated from intact <it>CDC2 </it>or <it>FPK </it>host genes. Intriguingly, the second phenotype manifested in the <it>wp </it>mutant plants is seeds with lower oil and higher protein content than the <it>Wp </it>plants <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Inhibition of key metabolic enzymes such as FPK could potentially influence the direction of metabolic flux resulting in decreased fatty acid metabolism and linked increases in protein synthesis.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The multiplicity of transcript isoforms described here add an additional layer of complexity reinforcing the tremendous potential these gene-fragment-loaded-transposon elements such as <it>Tgm-Express</it>, Pack-MULEs, <it>Helitrons</it>, and retroelements can have not only in disrupting or modifying gene function but in the creation of new or modified genes leading to an increase in plant genome and possibly proteome diversity. Analysis of human and other vertebrate genomes <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp> revealed that recently evolved exons are more likely to be alternatively spliced cassette exons originating from highly repeated DNA elements including transposons, SINEs, LINEs, and Alu repeats, emphasizing the importance of mobile elements in creating diversity during evolution of both animal and plant species.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Plant material and genotypes</p>
            </st>
            <p>The <it>Glycine max </it>cultivars and isolines used for this study were: LN89-5320-6 (<it>Wp</it>, purple-flowered plants), LN89-5322-2 (<it>wp</it>, a stable line with pink flowered plants), and LN89-5320-8-53 (<it>wp</it><sup><it>m</it></sup>, a mutable line with chimeric flower colors or sectors of pink and purple flowers). Each is homozygous for the indicated alleles of the <it>Wp </it>locus. The origin, genetics, and isolation of the <it>wp </it>allele have been described previously <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B25">25</abbr><abbr bid="B1">1</abbr></abbrgrp>. Plants were grown in the greenhouse. Seed coats dissected from seeds at varying stages of development, cotyledons of various stages of seed development, flower buds, stems, mature leaves and roots from two week old plants, and shoot tips (meristems surrounded by primordial leaves) were frozen in liquid nitrogen, freeze dried (Multi-dry lyophilazer; FTS systems), and stored at -20&#176;C. For seed coat's (cotyledon's) developmental stages, seeds were divided into the following groups according to the fresh weight of the entire seed: 10&#8211;25 mg, 25&#8211;50 mg, 50&#8211;75 mg.</p>
         </sec>
         <sec>
            <st>
               <p>RNA extraction, purification and cDNA synthesis</p>
            </st>
            <p>Total RNA was isolated from seed coats and other soybean tissues using a phenol-chloroform and lithium chloride precipitation method <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. RNA was stored at -70&#176;C until used.</p>
            <p>cDNA copies of the <it>F3H1 </it>gene from the three isolines (LN89-5320-6, LN89-5322-2 and LN89-5320-8-53) were amplified from a first-strand cDNA pool synthesized using 1 &#956;g of seed coat or cotyledon total RNA and the Superscript first strand synthesis system for reverse transcriptase (RT)-PCR with oligo (dT)<sub>12&#8211;18 </sub>primers (Invitrogen, San Diego). The total RNAs used for these RT-PCR reactions were treated with DNAaseI using Ambion's DNA-free kit and concentrated in Microcon YM-30 columns (Millipore, Bedford, MA). For each RNA sample, parallel reactions were allowed in the absence of Superscript (- controls) to assess the extent of DNA contamination. The sequences of the two primers used were: 5'-GCATTGCATTCTGCTATTTAATTCC-3' (7F) and 5'-AAAGACAGTGCCACTTATTTTCATT-3' (1428R). These primers map at the 5' and 3' ends of the <it>F3H1 </it>gene respectively. The numbering correspond to the base pair of the <it>F3H1 </it>gene cDNA sequence (Figure <figr fid="F3">3A</figr>)</p>
         </sec>
         <sec>
            <st>
               <p>Primer synthesis, PCR reaction conditions, cDNA cloning and DNA sequencing</p>
            </st>
            <p>Oligonucleotide primers were synthesized on an Applied Biosystems (Foster City, CA) model 394A DNA synthesizer at the Keck Center, a unit of the University of Illinois Biotechnology Center. PCR reactions were performed by an initial denaturation step at 94&#176;C for 2 min followed by 30 cycles of denaturing at 94&#176;C for 30 sec, annealing at 56&#176;C for 1 min, extension at 68&#176;C for 9 min, to end with a 10 min extension at 72&#176;C. High-fidelity and -efficiency <it>ExTaq </it>(Takara Bio Inc. Otsu, Japan) polymerase was used at 0.75 units per 50 &#956;l reaction. Amplified cDNAs were separated from oligonucleotides with a QIAquick PCR Purification kit (QIAGEN), cloned into pGem-T easy and sequenced in an ABI 3730 &#215; l (Applied Biosystems, Inc. Foster City, CA) at the Keck Center.</p>
         </sec>
         <sec>
            <st>
               <p>RNA gel-blot analysis and cDNA probe synthesis</p>
            </st>
            <p>RNA (10 &#956;g/sample) was electrophoresed in a 1.2% agarose-3% formaldehyde gel <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Size-fractionated RNAs were transferred to Optitran-supported nitrocellulose membrane (Midwest Scientific, Valley Park, MO) by capillary action as described in Sambrook <it>et al</it>. (1989) <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> and cross-linked with UV light (Stratagene, La Jolla, CA). Nitrocellulose RNA blots were prehybridized, hybridized, washed, and exposed to Hyperfilm (Amersham, Arlington Heights, IL) as described by Todd and Vodkin (1996) <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. All the RNA blot results presented are from autoradiographs exposed for 7 days.</p>
            <p>Cloned DNAs used as probes were PCR amplified, electrophoresed, and purified from the agarose using the QIAquick gel extraction kit (QIAGEN, Valencia, CA). DNA concentration of the final eluate was determined with a NanoDrop (NanoDrop Technologies, Inc. Rockland, DE). Purified DNA fragments (25&#8211;250 ng) were labeled with [a-<sup>32</sup>P]dATP by random primer reaction <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>GZ designed and performed the experiments, analyzed the results and drafted the manuscript.</p>
         <p>LV led the research and edited the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Laura Guest and Virginia Lukas who directed the DNA sequencing and synthesis of the oligonucleotide primers. Special thanks to Kay Wallheimer for her assistance with the graphic display of figures <figr fid="F2">2</figr> and <figr fid="F3">3</figr>, and to Katie Carberry who helped with the display of the <it>wp</it>-cDNAs open reading frames in the Additional Material files <supplr sid="S3">3</supplr> and <supplr sid="S4">4</supplr>. We gratefully acknowledge support from grants of the Illinois Soybean Association, USDA, and United Soybean Board.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The <it>wp </it>mutation of <it>Glycine max </it>carries a gene-fragment-rich transposon of the CACTA superfamily</p>
            </title>
            <aug>
               <au>
                  <snm>Zabala</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Vodkin</snm>
                  <fnm>LO</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2005</pubdate>
            <volume>17</volume>
            <fpage>2619</fpage>
            <lpage>2632</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1242261</pubid>
                  <pubid idtype="pmpid" link="fulltext">16141454</pubid>
                  <pubid idtype="doi">10.1105/tpc.105.033506</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Inheritance of pink flower color in soybean</p>
            </title>
            <aug>
               <au>
                  <snm>Stephens</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Nickell</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>Crop Sci</source>
            <pubdate>1992</pubdate>
            <volume>32</volume>
            <fpage>1131</fpage>
            <lpage>1132</lpage>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Pink flower color associated with increased protein and seed size in soybean</p>
            </title>
            <aug>
               <au>
                  <snm>Stephens</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Nickell</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Vodkin</snm>
                  <fnm>LO</fnm>
               </au>
            </aug>
            <source>Crop Sci</source>
            <pubdate>1993</pubdate>
            <volume>33</volume>
            <fpage>1135</fpage>
            <lpage>1137</lpage>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Pack-MULE transposable elements mediate gene evolution in plants</p>
            </title>
            <aug>
               <au>
                  <snm>Jiang</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>431</volume>
            <fpage>569</fpage>
            <lpage>573</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02953</pubid>
                  <pubid idtype="pmpid" link="fulltext">15457261</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Gene duplication and exon shuffling by helitron-like transposons generate intraspecies diversity in maize</p>
            </title>
            <aug>
               <au>
                  <snm>Morgante</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pea</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Fengler</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zuccolo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rafalski</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nature Genetics</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <fpage>997</fpage>
            <lpage>627</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1615</pubid>
                  <pubid idtype="pmpid" link="fulltext">16056225</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Origins, genetic organization and transcription of a family of non-autonomous helitron elements in maize</p>
            </title>
            <aug>
               <au>
                  <snm>Brunner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pea</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rafalski</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2005</pubdate>
            <volume>43</volume>
            <fpage>799</fpage>
            <lpage>810</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-313X.2005.02497.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16146520</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>High rate of chimeric gene origination by retroposition in plant genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cai</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>GK-S</fnm>
               </au>
               <au>
                  <snm>Long</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2006</pubdate>
            <volume>18</volume>
            <fpage>1791</fpage>
            <lpage>1802</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1105/tpc.106.041905</pubid>
                  <pubid idtype="pmpid" link="fulltext">16829590</pubid>
                  <pubid idtype="pmcid">1533979</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The maize genome contains a Helitron Insertion</p>
            </title>
            <aug>
               <au>
                  <snm>Lal</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Giroux</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Brendel</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Vallejos</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hannah</snm>
                  <fnm>LC</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2003</pubdate>
            <volume>15</volume>
            <fpage>381</fpage>
            <lpage>391</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">141208</pubid>
                  <pubid idtype="pmpid" link="fulltext">12566579</pubid>
                  <pubid idtype="doi">10.1105/tpc.008375</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Mechanisms of alternative pre-messenger RNA splicing</p>
            </title>
            <aug>
               <au>
                  <snm>Black</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Annu Rev Biochem</source>
            <pubdate>2003</pubdate>
            <volume>72</volume>
            <fpage>291</fpage>
            <lpage>336</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.biochem.72.121801.161720</pubid>
                  <pubid idtype="pmpid" link="fulltext">12626338</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Genomewide comparative analysis of alternative splicing in plants</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>B-B</fnm>
               </au>
               <au>
                  <snm>Brendel</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>7175</fpage>
            <lpage>7180</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1459036</pubid>
                  <pubid idtype="pmpid" link="fulltext">16632598</pubid>
                  <pubid idtype="doi">10.1073/pnas.0602039103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Transposable elements, gene creation and genome rearrangement in flowering plants</p>
            </title>
            <aug>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Current Opinion in Genet &amp; Dev</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>621</fpage>
            <lpage>627</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.gde.2005.09.010</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Comparison of multiple vertebrate genomes reveals the birth and evolution of human exons</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>XH-F</fnm>
               </au>
               <au>
                  <snm>Chasin</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>13427</fpage>
            <lpage>13432</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1569180</pubid>
                  <pubid idtype="pmpid" link="fulltext">16938881</pubid>
                  <pubid idtype="doi">10.1073/pnas.0603042103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Emergence of primate genes by retrotransposon-mediated sequence transduction</p>
            </title>
            <aug>
               <au>
                  <snm>Xing</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Belancio</snm>
                  <fnm>VP</fnm>
               </au>
               <au>
                  <snm>Cordaux</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Deininger</snm>
                  <fnm>PL</fnm>
               </au>
               <au>
                  <snm>Batzer</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>17608</fpage>
            <lpage>17613</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1693794</pubid>
                  <pubid idtype="pmpid" link="fulltext">17101974</pubid>
                  <pubid idtype="doi">10.1073/pnas.0603224103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The effects of plant transposable element insertion on transcription initiation and RNA processing</p>
            </title>
            <aug>
               <au>
                  <snm>Weil</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Annu Rev Plant Physiol Plant Mol Biol</source>
            <pubdate>1990</pubdate>
            <volume>41</volume>
            <fpage>527</fpage>
            <lpage>552</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1146/annurev.pp.41.060190.002523</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Characterization of a highly conserved sequence related to mutator transposable elements in maize</p>
            </title>
            <aug>
               <au>
                  <snm>Talbert</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Chandler</snm>
                  <fnm>VL</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1988</pubdate>
            <volume>5</volume>
            <fpage>519</fpage>
            <lpage>529</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2848175</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Integration and nonrandom mutation of a plasma membrane proton ATPase gene fragment within the <it>Bs1 </it>retroelement of maize</p>
            </title>
            <aug>
               <au>
                  <snm>Jin</snm>
                  <fnm>Y-K</fnm>
               </au>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>1994</pubdate>
            <volume>6</volume>
            <fpage>1177</fpage>
            <lpage>1186</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">160511</pubid>
                  <pubid idtype="pmpid" link="fulltext">7919987</pubid>
                  <pubid idtype="doi">10.1105/tpc.6.8.1177</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Mutator-like elements in Arabidopsis thaliana: structure, diversity, and evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wright</snm>
                  <fnm>SI</fnm>
               </au>
               <au>
                  <snm>Bureau</snm>
                  <fnm>TE</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>156</volume>
            <fpage>2019</fpage>
            <lpage>2031</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461377</pubid>
                  <pubid idtype="pmpid" link="fulltext">11102392</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Survey of transposable elements from rice genomic sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Turcotte</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Srinvasan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bureau</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2001</pubdate>
            <volume>25</volume>
            <fpage>169</fpage>
            <lpage>179</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-313x.2001.00945.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11169193</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>A novel class of Helitron- related transposable elements in maize contain portions of multiple pseudogenes</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gallavotti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stryker</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Lal</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>57</volume>
            <fpage>115</fpage>
            <lpage>127</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s11103-004-6636-z</pubid>
                  <pubid idtype="pmpid" link="fulltext">15821872</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>The maize transposable element Ds is spliced from RNA</p>
            </title>
            <aug>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Baran</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Varagona</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1987</pubdate>
            <volume>237</volume>
            <fpage>916</fpage>
            <lpage>918</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.3039661</pubid>
                  <pubid idtype="pmpid" link="fulltext">3039661</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p><it>De novo </it>synthesis of an intron by the maize transposable element, Dissociation</p>
            </title>
            <aug>
               <au>
                  <snm>Giroux</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Clancy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Baier</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ingham</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>McCarty</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hannah</snm>
                  <fnm>LC</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>1994</pubdate>
            <volume>91</volume>
            <fpage>12150</fpage>
            <lpage>12154</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">45394</pubid>
                  <pubid idtype="pmpid" link="fulltext">7991598</pubid>
                  <pubid idtype="doi">10.1073/pnas.91.25.12150</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Retrotransposon insertion into the maize waxy gene results in tissue-specific RNA processing</p>
            </title>
            <aug>
               <au>
                  <snm>Marillonnet</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>1997</pubdate>
            <volume>9</volume>
            <fpage>967</fpage>
            <lpage>978</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">156971</pubid>
                  <pubid idtype="pmpid" link="fulltext">9212470</pubid>
                  <pubid idtype="doi">10.1105/tpc.9.6.967</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans</p>
            </title>
            <aug>
               <au>
                  <snm>Lewis</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>189</fpage>
            <lpage>192</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">140922</pubid>
                  <pubid idtype="pmpid" link="fulltext">12502788</pubid>
                  <pubid idtype="doi">10.1073/pnas.0136770100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Instability of a novel multicolored flower trait in inbred and outcrossed soybean lines</p>
            </title>
            <aug>
               <au>
                  <snm>Johnson</snm>
                  <fnm>EOC</fnm>
               </au>
               <au>
                  <snm>Stephens</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Fasoula</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Nickell</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Vodkin</snm>
                  <fnm>LO</fnm>
               </au>
            </aug>
            <source>J Hered</source>
            <pubdate>1998</pubdate>
            <volume>89</volume>
            <fpage>508</fpage>
            <lpage>515</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/jhered/89.6.508</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>A simple method for extraction of RNA from maize tissue</p>
            </title>
            <aug>
               <au>
                  <snm>McCarty</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Maize Genet Coop Newsl</source>
            <pubdate>1986</pubdate>
            <volume>60</volume>
            <fpage>61</fpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Chalcone synthase mRNA and activity are reduced in yellow soybean seed coats with dominant I alleles</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Todd</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vodkin</snm>
                  <fnm>LO</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>1994</pubdate>
            <volume>105</volume>
            <fpage>739</fpage>
            <lpage>748</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">159416</pubid>
                  <pubid idtype="pmpid" link="fulltext">8066134</pubid>
                  <pubid idtype="doi">10.1104/pp.105.2.739</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <aug>
               <au>
                  <snm>Sambrook</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fritsch</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Maniatis</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Molecular cloning: A laboratory manual</source>
            <publisher>Cold Spring Harbor: Cold Spring Harbor Laboratory Press</publisher>
            <edition>2</edition>
            <pubdate>1989</pubdate>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Duplications that suppress and deletions that restore expression from a chalcone synthase multigene family</p>
            </title>
            <aug>
               <au>
                  <snm>Todd</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Vodkin</snm>
                  <fnm>LO</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>1996</pubdate>
            <volume>8</volume>
            <fpage>687</fpage>
            <lpage>699</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">161129</pubid>
                  <pubid idtype="pmpid" link="fulltext">12239396</pubid>
                  <pubid idtype="doi">10.1105/tpc.8.4.687</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>A technique for radiolabeling DNA restriction fragments to high specific activity</p>
            </title>
            <aug>
               <au>
                  <snm>Feinberg</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Vogelstein</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Anal Biochem</source>
            <pubdate>1983</pubdate>
            <volume>132</volume>
            <fpage>6</fpage>
            <lpage>13</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0003-2697(83)90418-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">6312838</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>A rearrangement resulting in small tandem repeats in the F3'5'H gene of white flower genotypes is associated with the soybean <it>W1 </it>locus</p>
            </title>
            <aug>
               <au>
                  <snm>Zabala</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Vodkin</snm>
                  <fnm>LO</fnm>
               </au>
            </aug>
            <source>The Plant Genome</source>
            <pubdate>2007</pubdate>
            <inpress/>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Cloning of the pleiotropic <it>T </it>locus in soybean and two recessive alleles that differentially affect structure and expression of the encoded flavonoid 3' hydroxylase</p>
            </title>
            <aug>
               <au>
                  <snm>Zabala</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Vodkin</snm>
                  <fnm>LO</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <volume>163</volume>
            <fpage>295</fpage>
            <lpage>309</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462420</pubid>
                  <pubid idtype="pmpid" link="fulltext">12586717</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
