<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-5-80</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Correspondence</dochead>
      <bibl>
         <title>
            <p>Mistaken Identifiers: Gene name errors can be introduced inadvertently when using Excel in bioinformatics</p>
         </title>
         <aug>
            <au id="A1" ce="yes">
               <snm>Zeeberg</snm>
               <mi>R</mi>
               <fnm>Barry</fnm>
               <insr iid="I1"/>
               <email>barry@discover.nci.nih.gov</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Riss</snm>
               <fnm>Joseph</fnm>
               <insr iid="I2"/>
               <email>rissj@helix.nih.gov</email>
            </au>
            <au id="A3">
               <snm>Kane</snm>
               <mi>W</mi>
               <fnm>David</fnm>
               <insr iid="I3"/>
               <email>david_kane@sra.com</email>
            </au>
            <au id="A4">
               <snm>Bussey</snm>
               <mi>J</mi>
               <fnm>Kimberly</fnm>
               <insr iid="I1"/>
               <email>busseyk@mail.nih.gov</email>
            </au>
            <au id="A5">
               <snm>Uchio</snm>
               <fnm>Edward</fnm>
               <insr iid="I4"/>
               <email>eu8v@nih.gov</email>
            </au>
            <au id="A6">
               <snm>Linehan</snm>
               <fnm>W Marston</fnm>
               <insr iid="I4"/>
               <email>linehanm@mail.nih.gov</email>
            </au>
            <au id="A7">
               <snm>Barrett</snm>
               <fnm>J Carl</fnm>
               <insr iid="I2"/>
               <email>barrett@mail.nih.gov</email>
            </au>
            <au id="A8" ca="yes">
               <snm>Weinstein</snm>
               <mi>N</mi>
               <fnm>John</fnm>
               <insr iid="I1"/>
               <email>weinstein@dtpvx2.ncifcrf.gov</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Genomics &amp; Bioinformatics Group, Laboratory of Molecular Pharmacology, Center for Cancer Research (CCR), National Cancer Institute (NCI), National Institutes of Health (NIH), Bldg 37 Rm 5041, NIH, 9000 Rockville Pike, Bethesda, MD 20892 USA</p>
            </ins>
            <ins id="I2">
               <p>Laboratory of Biosystems and Cancer, CCR, Bldg 37 Rm 5032, NIH, 9000 Rockville Pike, Bethesda, MD 20892 USA</p>
            </ins>
            <ins id="I3">
               <p>SRA International, 4300 Fair Lakes CT, Fairfax, VA 22033 USA</p>
            </ins>
            <ins id="I4">
               <p>Urologic Oncology Branch, Bldg 10 Rm 2B47, National Institutes of Health, Bethesda, MD 20892 USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2004</pubdate>
         <volume>5</volume>
         <issue>1</issue>
         <fpage>80</fpage>
         <url>http://www.biomedcentral.com/1471-2105/5/80</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">15214961</pubid>
               <pubid idtype="doi">10.1186/1471-2105-5-80</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>05</day>
               <month>3</month>
               <year>2004</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>23</day>
               <month>6</month>
               <year>2004</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>23</day>
               <month>6</month>
               <year>2004</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2004</year>
         <collab>Zeeberg et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>When processing microarray data sets, we recently noticed that some gene names were being changed inadvertently to non-gene names.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>A little detective work traced the problem to default date format conversions and floating-point format conversions in the very useful Excel program package. The date conversions affect at least 30 gene names; the floating-point conversions affect at least 2,000 if Riken identifiers are included. These conversions are irreversible; the original gene names cannot be recovered.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>Users of Excel for analyses involving gene names should be aware of this problem, which can cause genes, including medically important ones, to be lost from view and which has contaminated even carefully curated public databases. We provide work-arounds and scripts for circumventing the problem.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Text</p>
         </st>
         <p>MatchMiner <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and GoMiner <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> are two bioinformatics program packages we published recently in another Biomed Central Journal, Genome Biology. When we were beta-testing those programs on microarray data, a frustrating problem occurred repeatedly: Some gene names kept bouncing back as "unknown." A little detective work revealed the reason: Use of one of the research community's most valuable and extensively applied tools for manipulation of genomic data can introduce erroneous names. A default date conversion feature in Excel (Microsoft Corp., Redmond, WA) was altering gene names that it considered to look like dates. For example, the tumor suppressor DEC1 [Deleted in Esophageal Cancer 1] <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> was being converted to '1-DEC.' Figure <figr fid="F1">1</figr> lists 30 gene names that suffer an analogous fate.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Screen shot of Microsoft Excel spreadsheet illustrating errors caused by default conversion of gene names to dates</p>
            </caption>
            <text>
               <p>Screen shot of Microsoft Excel spreadsheet illustrating errors caused by default conversion of gene names to dates. Columns A, E, and I contain the correct gene names. Columns B, F, and J contain the corresponding underlying internal Excel date representation resulting from the forced default date conversion. Columns C, G, and K contain the corresponding default format date conversions. To create this table, we prepared a tab-delimited text file in which each gene name was repeated three times side by side. The correct gene names in columns A, E, and I were retained by opening this text file with Excel, and selecting "text" mode for columns A, E, and I in the Text Import Wizard Step 3 of 3 that appears while opening a file in Excel. Subsequently, the format menu "number" option (with zero decimal places) was applied to columns B, F, and J to display the internal date format.</p>
            </text>
            <graphic file="1471-2105-5-80-1"/>
         </fig>
         <p>There is another default conversion problem for RIKEN <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> clone identifiers of the form nnnnnnnEnn, where n denotes a digit. These identifiers are comprised of the serial number of the plate that contains the library, information on plate status, and the address of the clone <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. A search (using the "DNA sequence length search" functionality at <url>http://fantom2.gsc.riken.go.jp/db/search/</url>) identified more than 2,000 such identifiers out of a total set of 60,770. For example, the RIKEN identifier "2310009E13" was converted irreversibly to the floating-point number "2.31E+13." A non-expert user might well fail to notice that approximately 3% of the identifiers on a microarray with tens of thousands of genes had been converted to an incorrect form, yet the potential for 2,000 identifiers to be transmogrified without notice is a considerable concern. Most important, these conversions to an internal date representation or floating-point number format are irreversible; the original gene name cannot be recovered. If one were dealing manually with small numbers of genes, these problems could be detected and then corrected by the tedious, convoluted process described in the legend of Figure <figr fid="F1">1</figr>. But with microarray or other high-throughput data, human proofreading and manual curation are impractical.</p>
         <p>The floating-point conversion is not restricted to RIKEN clone identifiers but will affect any clone designation derived from plate coordinates. Although the standard convention is to designate clones officially by library-plate-row-column identification, it is common practice to omit the library reference, particularly if all of the clones come from a single library. Without the library reference in the identifier, all clones from row E of any plate are converted to floating point numbers by Excel. If the library designation is numeric, as it is for RIKEN clones <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, then including it does not solve the problem. Since 96-well plates contain 8 rows and 12 columns, row E represents 12/96 or 12.5% of the clones on the plate; similarly, 6.25% of clones from 384-well plates would be affected. Most libraries contain hundreds of plates, each of which would be subject to this problem.</p>
         <p>DEC1, a possible target for cancer therapy, was incorrectly rendered, and it could potentially be missed in downstream data analysis. The same type of error can infect, and propagate through, the major public data resources. For example, this type of error occurs several times in even the immaculately curated LocusLink database (Figure <figr fid="F2">2</figr>). This error in LocusLink originated from the human-mouse homology map data (Figure <figr fid="F3">3</figr>).</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Screen shot of LocusLink from November 12, 2002 illustrating an error caused by default conversion of a gene name to date that had propagated from the human-mouse homology map data (Figure 3)</p>
            </caption>
            <text>
               <p>Screen shot of LocusLink from November 12, 2002 illustrating an error caused by default conversion of a gene name to date that had propagated from the human-mouse homology map data (Figure 3).</p>
            </text>
            <graphic file="1471-2105-5-80-2"/>
         </fig>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Screen shot of the human-mouse homology map from November 14, 2003 illustrating an error caused by default conversion of a gene name to date</p>
            </caption>
            <text>
               <p>Screen shot of the human-mouse homology map from November 14, 2003 illustrating an error caused by default conversion of a gene name to date.</p>
            </text>
            <graphic file="1471-2105-5-80-3"/>
         </fig>
         <p>There are a number of work-arounds to these behaviors in Excel, but all of them require continued attention on the part of the user to avoid introducing errors. The appropriate solutions depend on the context in which Excel is opened:</p>
         <p>If Excel is configured to open a test file automatically from another application, then the data must be pre-processed in the upstream application. For example, a space character or an apostrophe can be placed in front of the gene name. That is the solution implemented in the Excel output format option of MatchMiner <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and the primary approach recommended by Microsoft in their Knowledge Base Article on the issue <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>.</p>
         <p>If a text file is to be opened by Excel, open Excel first and then select the text file to read. Then select "text" mode for the column(s) containing potentially affected symbols in the Text Import Wizard Step 3 of 3.</p>
         <p>If text is to be copied from another application (such as a text processor) and pasted into a pre-opened Excel spreadsheet, the formatting must be set in the spreadsheet. Within the pre-opened spreadsheet, prior to pasting, use Format -> cells to specify which columns of the recipient spreadsheet are to be treated as text <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. That procedure works for copying from several text processors tested on Mac OS 10.2. However, changing the format of the spreadsheet column to text fails to solve the conversion problem when pasting data from a Microsoft Word file. In that case, in addition to the formatting, use the Paste Special -> Paste: As: Text command to insert the text.</p>
         <p>Despite the work-arounds, even the most vigilant investigator can inadvertently introduce conversion errors, and it is often necessary to screen data received from other sources. For that reason, we have provided the text of a Unix C shell program that detects possible gene-to-date and floating-point conversion problems (Figure <figr fid="F4">4</figr>).  A downloadable version is available in the Supplementary Materials (see <supplr sid="S1">Additional file 1</supplr>) and on our companion web site <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. We have implemented a version in MatchMiner <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and are releasing a version of GoMiner <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> that includes a quality check that uses the script.</p>
         <fig id="F4">
            <title>
               <p>Figure 4</p>
            </title>
            <caption>
               <p>Script to scan for SymbolMutation error</p>
            </caption>
            <text>
               <p>Script to scan for SymbolMutation error.</p>
            </text>
            <graphic file="1471-2105-5-80-4"/>
         </fig>
         <p>We hope the date and floating-point conversions will be made non-default options &#8211; in deference to the large bioinformatics and biotechnology communities if not for other users. Even if that is done, however, there will be a lag time before all researchers have the new program version and an even longer time of confusion before the existing errors and inconsistencies have been expunged from all public and private databases. In the meantime, it is important to be alert to the problem.</p>
         <p>The twin aims of this paper are (i) to minimize additional contamination of the public databases and literature, and (ii) to provide a cautionary note to the many researchers who put their microarray data through the exceptionally useful medium of an Excel spreadsheet, then map the clone identifiers or GenBank accession numbers into human-readable gene names. There is no way to know how many times and in how many laboratories the default date and floating point conversions to non-gene names have adversely affected an experiment or caused genes to "disappear" from view.</p>
         <suppl id="S1">
            <title>
               <p>Additional File 1</p>
            </title>
            <text>
               <p>Symbol Mutation Scan: Shell file in downloadable format Companion Website: <url>http://discover.nci.nih.gov/symbolmutation</url></p>
            </text>
            <file name="1471-2105-5-80-S1.sh">
               <p>Click here for file [NOTE: to correct an error, this file was updated on 20 August 2004]</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Dr. Keith Collins (Laboratory of Biosystems and Cancer, CCR, NCI, NIH) and Jocelyn Hsu (SRA International) for helpful comments and technical review of the manuscript.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>MatchMiner: a tool for batch navigation among gene and gene product identifiers</p>
            </title>
            <aug>
               <au>
                  <snm>Bussey</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Kane</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Sunshine</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Narasimhan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nishizuka</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Reinhold</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Zeeberg</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ajay</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Weinstein</snm>
                  <fnm>JN</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>R27</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2003-4-4-r27</pubid>
                  <pubid idtype="pmpid" link="fulltext">12702208</pubid>
                  <pubid idtype="pmcid">154578</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>GoMiner: a resource for biological interpretation of genomic and proteomic data</p>
            </title>
            <aug>
               <au>
                  <snm>Zeeberg</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Feng</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Fojo</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Sunshine</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Narasimhan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kane</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Reinhold</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Lababidi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bussey</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Riss</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Weinstein</snm>
                  <fnm>JN</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>R28</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2003-4-4-r28</pubid>
                  <pubid idtype="pmpid" link="fulltext">12702209</pubid>
                  <pubid idtype="pmcid">154579</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Inhibition of PPAR gamma 2 gene expression by the HIF-1-regulated gene DEC1/Stra13: a mechanism for regulation of adipogenesis by hypoxia</p>
            </title>
            <aug>
               <au>
                  <snm>Yun</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Maecker</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Giaccia</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Dev Cell</source>
            <pubdate>2002</pubdate>
            <volume>2</volume>
            <fpage>331</fpage>
            <lpage>341</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1534-5807(02)00131-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">11879638</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Okazaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Furuno</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kasukawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Adachi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bono</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kondo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nikaido</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Osato</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Saito</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yamanaka</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Kiyosawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yagi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tomaru</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nogami</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schonbach</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Baldarelli</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Bult</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hume</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Quackenbush</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schriml</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Kanapin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Matsuda</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Batalov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Beisel</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Bradt</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brusic</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Chothia</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Corbani</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Cousins</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dalla</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dragani</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Fletcher</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Forrest</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Frazer</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Gaasterland</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Gariboldi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gissi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Godzik</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gough</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Grimmond</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gustincich</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hirokawa</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>IJ</fnm>
               </au>
               <au>
                  <snm>Jarvis</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>Kanai</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kawaji</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kawasawa</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kedzierski</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Konagaya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kurochkin</snm>
                  <fnm>IV</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lyons</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Maglott</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Maltais</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Marchionni</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>McKenzie</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Miki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nagashima</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Numata</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Okido</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Pavan</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Pertea</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pesole</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Petrovsky</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Pillai</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Pontius</snm>
                  <fnm>JU</fnm>
               </au>
               <au>
                  <snm>Qi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ramachandran</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ravasi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Reed</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Reed</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Reid</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ring</snm>
                  <fnm>BZ</fnm>
               </au>
               <au>
                  <snm>Ringwald</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sandelin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Semple</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Setou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shimada</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sultana</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Takenaka</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Teasdale</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Tomita</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Verardo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wahlestedt</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Wells</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wilming</snm>
                  <fnm>LG</fnm>
               </au>
               <au>
                  <snm>Wynshaw-Boris</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yanagisawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yuan</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zavolan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zimmer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hayatsu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hirozane-Kishikawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Konno</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sakazume</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sato</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shiraki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Waki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kawai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Aizawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Arakawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Fukuda</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hara</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hashizume</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Imotani</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ishii</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kagawa</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Miyazaki</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sakai</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sasaki</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shinagawa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yasunishi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yoshino</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Waterston</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <cnm>FANTOM Consortium; RIKEN Genome Exploration Research Group Phase I &amp; II Team</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>420</volume>
            <fpage>563</fpage>
            <lpage>573</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01266</pubid>
                  <pubid idtype="pmpid" link="fulltext">12466851</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Computer-based methods for the mouse full-length cDNA encyclopedia: real-time sequence clustering for construction of a nonredundant cDNA library</p>
            </title>
            <aug>
               <au>
                  <snm>Konno</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fukunishi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sugahara</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>281</fpage>
            <lpage>289</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.GR-1457R</pubid>
                  <pubid idtype="pmpid" link="fulltext">11157791</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Microsoft Knowledge Base Article 21 XL: Text or Number Converted to Unintended Number Format</p>
            </title>
            <url>http://support.microsoft.com/default.aspx?scid=kb;EN-US;Q214233</url>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Mistaken Identifiers Companion</p>
            </title>
            <url>http://discover.nci.nih.gov/symbolmutation</url>
         </bibl>
      </refgrp>
   </bm>
</art>
