<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-5-19</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>The tissue microarray data exchange specification: implementation by the Cooperative Prostate Cancer Tissue Resource</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Berman</snm>
               <mi>J</mi>
               <fnm>Jules</fnm>
               <insr iid="I1"/>
               <email>bermanj@mail.nih.gov</email>
            </au>
            <au id="A2">
               <snm>Datta</snm>
               <fnm>Milton</fnm>
               <insr iid="I2"/>
               <email>mdatta@mcw.edu</email>
            </au>
            <au id="A3">
               <snm>Kajdacsy-Balla</snm>
               <fnm>Andre</fnm>
               <insr iid="I3"/>
               <email>aballa@uic.edu</email>
            </au>
            <au id="A4">
               <snm>Melamed</snm>
               <fnm>Jonathan</fnm>
               <insr iid="I4"/>
               <email>Jonathan.melamed@med.nyu.edu</email>
            </au>
            <au id="A5">
               <snm>Orenstein</snm>
               <fnm>Jan</fnm>
               <insr iid="I5"/>
               <email>jorenstein@mfa.gwu.edu</email>
            </au>
            <au id="A6">
               <snm>Dobbin</snm>
               <fnm>Kevin</fnm>
               <insr iid="I1"/>
               <email>dobbinke@mail.nih.gov</email>
            </au>
            <au id="A7">
               <snm>Patel</snm>
               <fnm>Ashok</fnm>
               <insr iid="I6"/>
               <email>patelaa@MSX.UPMC.EDU</email>
            </au>
            <au id="A8">
               <snm>Dhir</snm>
               <fnm>Rajiv</fnm>
               <insr iid="I6"/>
               <email>dhirr@MSX.UPMC.EDU</email>
            </au>
            <au id="A9">
               <snm>Becich</snm>
               <mi>J</mi>
               <fnm>Michael</fnm>
               <insr iid="I6"/>
               <email>becich@msx.upmc.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Cancer Diagnosis Program, National Cancer Institute, National Institutes of Health, Bethesda, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Pathology, University of Wisconsin, Milwaukee, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Pathology, University of Illinois Medical Center, Chicago, USA</p>
            </ins>
            <ins id="I4">
               <p>Department of Pathology, New York Medical Center, New York, USA</p>
            </ins>
            <ins id="I5">
               <p>Department of Pathology, George Washington University Medical Center, Washington, D.C., USA</p>
            </ins>
            <ins id="I6">
               <p>Center for Pathology Informatics and Benedum Oncology Informatics Center, University of Pittsburgh Medical Center, Pittsburgh, USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2004</pubdate>
         <volume>5</volume>
         <issue>1</issue>
         <fpage>19</fpage>
         <url>http://www.biomedcentral.com/1471-2105/5/19</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/1471-2105-5-19</pubid>
               <pubid idtype="pmpid">15040818</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>17</day>
               <month>12</month>
               <year>2003</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>27</day>
               <month>2</month>
               <year>2004</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>27</day>
               <month>2</month>
               <year>2004</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2004</year>
         <collab>Berman et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Tissue Microarrays (TMAs) have emerged as a powerful tool for examining the distribution of marker molecules in hundreds of different tissues displayed on a single slide. TMAs have been used successfully to validate candidate molecules discovered in gene array experiments. Like gene expression studies, TMA experiments are data intensive, requiring substantial information to interpret, replicate or validate. Recently, an open access Tissue Microarray Data Exchange Specification has been released that allows TMA data to be organized in a self-describing XML document annotated with well-defined common data elements. While this specification provides sufficient information for the reproduction of the experiment by outside research groups, its initial description did not contain instructions or examples of actual implementations, and no implementation studies have been published. The purpose of this paper is to demonstrate how the TMA Data Exchange Specification is implemented in a prostate cancer TMA.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>The Cooperative Prostate Cancer Tissue Resource (CPCTR) is funded by the National Cancer Institute to provide researchers with samples of prostate cancer annotated with demographic and clinical data. The CPCTR now offers prostate cancer TMAs and has implemented a TMA database conforming to the new open access Tissue Microarray Data Exchange Specification. The bulk of the TMA database consists of clinical and demographic data elements for 299 patient samples. These data elements were extracted from an Excel database using a transformative Perl script. The Perl script and the TMA database are open access documents distributed with this manuscript.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>TMA databases conforming to the Tissue Microarray Data Exchange Specification can be merged with other TMA files, expanded through the addition of data elements, or linked to data contained in external biological databases. This article describes an open access implementation of the TMA Data Exchange Specification and provides detailed guidance to researchers who wish to use the Specification.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>TMA technology was introduced in 1998 <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. A TMA fundamentally differs from a conventional glass slide only in the number of tissue samples included [see Figure <figr fid="F1">1</figr>]. Tissue microarrays typically contain between 100 and 1,000 core tissue samples. A single TMA block can be sectioned and distributed to dozens of laboratories, saving years of preparation time, hundreds of thousands of dollars in tissue collection costs, and conserving experimental reagents by measuring a marker's distribution on hundreds of specimens arrayed on a single glass slide <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Several studies have demonstrated the value of TMAs to validate the biologic relevance of candidate genes expressed in prostate cancers <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Prostate Cancer TMA slide</p>
            </caption>
            <text>
               <p><b>Prostate Cancer TMA slide. </b>Hematoxylin &amp; Eosin stained example of a TMA slide of prostate samples.</p>
            </text>
            <graphic file="1471-2105-5-19-1"/>
         </fig>
         <p>Because TMAs are designed to answer questions applicable to pathologic lesions with specific sets of attributes (e.g. stage or grade or diagnostic subtype), preparation of a TMA requires access to large archives of paraffin embedded tissues. Each TMA core tissue must be annotated with clinical, demographic or histopathologic information so that measurements on the TMA core samples can result in clinically useful correlations. To ensure inter-laboratory reproducibility, information describing the preparation of TMA blocks and slides need to be provided along with the TMA data records.</p>
         <p>The Cooperative Prostate Cancer Tissue Resource (CPCTR) is a multi-institutional virtual tissue bank funded by the U.S. National Cancer Institute (NCI) to provide researchers with samples of prostate cancer tissues <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The member institutions of the CPCTR are New York University, George Washington University, University of Pittsburgh and Medical College of Wisconsin. The CPCTR began service to the cancer research community on December 6, 2001. The CPCTR has over 5,000 prostate cancer specimens including radical prostatectomy cases (paraffin and fresh-frozen) and paraffinized needle biopsies. The CPCTR represents the largest repository of histologically-characterized and clinically annotated prostate cancer tissue in the USA. All accrued cases undergo pathology review and all clinical data is collected using methodology standardized across the participating institutions. CPCTR resources are available to all researchers, academic and commercial. Further information can be obtained from the CPCTR website <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <p>The CPCTR has constructed a prostate cancer TMA implemented in conformance with the new TMA Data Exchange Specification (herinafter designated "the Specification"). The Specification was developed through a series of open workshops sponsored by the National Cancer Institute and the Association for Pathology Informatics <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Tissue data included in the CPCTR TMA database is de-identified, and assembled in an open access database to permit data sharing, in compliance with current NIH policy on data sharing <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> and in concert with ongoing NIH initiatives to develop new methods for sharing research data <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <p>The TMA data exchange specification was designed to allow TMA database files to be totally self-describing. The properties of a self-describing database file would include:</p>
         <p>1. An informative header that explained the purpose of the file and provided all the information to understand the file (i.e., its organization).</p>
         <p>2. Information regarding the creation of the file (e.g., creator, date of creation)</p>
         <p>3. Rights of use (e.g. specifying any restrictions on use)</p>
         <p>4. Warranty information</p>
         <p>5. Methodology (e.g. how the data contained in the file was obtained)</p>
         <p>6. Data</p>
         <p>7. Metadata (the data that describes or defines the actual data)</p>
         <p>8. Metadata definitions (clear descriptions and definitions of the metadata)</p>
         <p>The typical database contains data (property #6) but nothing else in the way of self-descriptive annotation. The CPCTR implementation of the Specification has all eight properties and employs the following enhancements:</p>
         <p>1. Uses Uniform Resource Locators (URLs) to link the TMA database with web documents that provide detailed information supplementing the metadata tags. These external URLs are:</p>
         <p>a. A link to the Dublin Core Meta Data Elements used in the header section of the document <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>.</p>
         <p>b. A link to the ISO-11179-compliant listing of Common Data Elements (CDEs) provided in the Specification <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
         <p>c. A link to the CPCTR CDEs <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <p>d. Links to external documents that provide methodologies for preparing CPCTR TMA blocks and sectioning slides <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>.</p>
         <p>3. Supports complex TMAs within a single TMA file. In this case, a single TMA file contained four blocks, with cores from a single tissue samples appearing in multiple locations in more than one block.</p>
         <p>2. Protects patient privacy (by deidentifying all data)</p>
         <p>3. Allows data sharing (by permitting free distribution of the XML data document)</p>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Tissue microarrays allow for the high throughput analysis of tissue samples and their association with clinical or outcomes data. Yet these experiments require a large amount of information for the subsequent analysis and evaluation, in particular by interested second parties. The Specification provides an accurate and reproducible method for the transfer of this information as is required for inter-laboratory reproducibility. One of the most important problems with modern data specifications is the daunting technical expertise required for their implementation. The Specification was written to permit maximal flexibility and minimal implementation requirements <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. This study demonstrates that the Specification can be implemented using a simple Perl script that converts an Excel database into XML-tagged data elements. The resulting large section of core-related XML text can be simply inserted into a conformant document containing header, block and slide information. The resulting TMA database can be validated with a Perl script provided with the Specification document.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Human subjects protections</p>
            </st>
            <p>All institutions participating in the CPCTR have Institutional Review Board (IRB) approval for human subjects research. Each CPCTR institution develops its own local protocols to protect the confidentiality and privacy of human subjects and obtains local IRB approval for all CPCTR activities. The IRB assurance numbers for each cooperating institution are: New York University &#8211; M1177; Medical College of Wisconsin &#8211; M1061; University of Pittsburgh Medical Center &#8211; M1256; and George Washington University Medical Center &#8211; M1125. Tissue data records from the cooperating institutions are submitted to a central data manager (Information Management Services, Inc., contracted by the NCI) as de-identified records. All institutions assign an arbitrary number to each record before submitting the de-identified record to the central database. This ensures that the central database has no links connecting records to patients. In addition, HIPAA's proscribed set of 18 data elements are omitted from core sample records (so-called safe harbor approach to HIPAA-compliance) <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Tissue and data collection</p>
            </st>
            <p>The CPCTR maintains a publicly available Manual of Operations that describes its tissue collection procedures and policies <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>.</p>
            <p>Pathological characterization of specimens involves review of all cases by a CPCTR pathologist using diagnostic criteria explained in the publicly available CPCTR histologic atlas and manual <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>.</p>
            <p>Protocols for the construction the TMA block and TMA slide are publicly available documents available at the CPCTR web site and linked from the TMA Database <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>The TMA Data Exchange Specification</p>
            </st>
            <p>The Specification is an open access document that can be used without restriction <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
            <p>The Specification requires four general sections for each TMA file:</p>
            <p>1) Header, containing the specification Dublin Core identifiers, 2) Block, describing the paraffin-embedded array of tissues, 3) Slide, describing the glass slides produced from the Block, and 4) Core, containing all data related to the individual tissue samples contained in the array. The simplest possible structure for a conforming TMA file consists of nothing more than empty tags designating the four required sections [see Figure <figr fid="F2">2</figr>] <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Simplest conforming TMA file</p>
               </caption>
               <text>
                  <p><b>Simplest conforming TMA file. </b>Image displaying the simplest possible XML file conforming to the TMA Data Exchange Specification.</p>
               </text>
               <graphic file="1471-2105-5-19-2"/>
            </fig>
            <p>Common Data Elements (CDEs) are metadata tags that describe the data elements included in an XML database. To be of value, CDEs must be well-defined, uniquely identified and available for human review or computer access. Eighty CDEs, conforming to the ISO-11179 <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> specification for data elements constitute the XML tags provided in the Specification <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. CDE descriptors are publicly available <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. However, the only CDEs that must appear in any conforming TMA file are the section CDEs (header, block, slide and core), the root CDE (histo) and the tma CDE itself (tma). A set of six simple semantic rules describe the syntax for the data exchange specification <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
            <p>The Specification was designed for maximal flexibility. Flexibility in the first version of an XML specification permits the addition of greater structure in later versions built on tested implementations. A similar approach has been used for ANSI/HL7 Common Data Architecture (CDE) wherein the earliest version (Level One) is intentionally sparse <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. At this time, there is no DTD (Data Type Definition) or Schema included in the Specification. For those wishing to use a DTD, a Specification-compliant DTD has been prepared by David G. Nohle, Ohio State University Department of Pathology and the Mid-Region AIDS &amp; Cancer Specimen Resource (ACSR) <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Constructing the TMA Data file</p>
            </st>
            <p>Constructing a TMA Database consists of the following:</p>
            <p>1. Filling the four sections (header, block, slide and core)</p>
            <p>2. Assembling the four sections into a TMA file with a proper file declaration, root element and TMA CDE.</p>
            <p>3. Validating that the TMA file conforms to the specification</p>
            <p>The header, block and slide sections of the TMA will vary only slightly from project to project within a laboratory. The CPCTR header, block and slide sections were prepared "by hand" using the section-specific CDEs provided in the specification.</p>
            <p>The header section contains descriptive information about the file and its contents. With the exception of one CDE (filename), the header CDEs are the same CDEs used in the Dublin Core set of XML identifiers used by librarians. Detailed information describing the Dublin Core elements is available <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. A link to the Dublin Core elements is also included in CPCTR TMA database. The first few lines of the TMA database are shown [see Figure <figr fid="F3">3</figr>]. The block and slide headers of the TMA database are short and are also completed manually.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>TMA XML opening section</p>
               </caption>
               <text>
                  <p><b>TMA XML opening section. </b>Image displaying the first few lines of the TMA XML document.</p>
               </text>
               <graphic file="1471-2105-5-19-3"/>
            </fig>
            <p>The cores are distributed for each block in an array, with cores assigned to specific locations [see Figure <figr fid="F4">4</figr>], and all the cores in an array are assigned to a slide, which is a numbered section derived from a block [see Figure <figr fid="F5">5</figr>]. The core section contains annotated data for each core in the TMA. The central database for all CPCTR tissues is maintained as an Excel database by an NCI-contracted information management service (IMS, Rockville, MD). IMS extracts an Excel sub-file consisting of records pertaining to the tissues selected for the TMA block. CPCTR-specific data elements included in the IMS records are publicly available <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Core Array Mapping Image</p>
               </caption>
               <text>
                  <p><b>Core Array Mapping Image. </b>Schematic image showing the array locations of cores listed in the TMA doucment.</p>
               </text>
               <graphic file="1471-2105-5-19-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>TMA XML slide section information</p>
               </caption>
               <text>
                  <p><b>TMA XML slide section information. </b>Image displaying the data elements describing the glass slide sectioning information.</p>
               </text>
               <graphic file="1471-2105-5-19-5"/>
            </fig>
            <p>A Perl script was written that converts Excel files to XML, enclosing the data associated with the spreadsheet cells to XML CDEs corresponding to the column headings. This creates the "core" section of the TMA database. A sample of an XML-tagged extracted data record is shown [see Figure <figr fid="F6">6</figr>]. The Perl script is available as an open access file with this article [see <supplr sid="S1">Additional file 1</supplr>].</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>TMA XML core section information</p>
               </caption>
               <text>
                  <p><b>TMA XML core section information. </b>Image displaying the data elements comprising a record for a single tissue core.</p>
               </text>
               <graphic file="1471-2105-5-19-6"/>
            </fig>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p>Excel to XML converting script. Opener7.pl is a Perl script that converts an Excel file to XML-tagged data elements that can be easily inserted into an XML file. This script only works under Windows and requires an installed version of Excel. It is distributed as an open access plain-text file.</p>
               </text>
               <file name="1471-2105-5-19-S1.pl">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The CPCTR prostate cancer TMA consists of 299 core samples distributed in four blocks, each block having 300 arrayed cores. Each block contains about 150 core samples in two different locations in each block. The core duplicates are staggered in the array, to maximize the chance that a given core will be represented if an area of the slide section is lost in processing. The distribution of one set of core samples in multiple array locations in four blocks yields a complex TMA that cannot be adequately represented by separate descriptions of each block. The Specification permits multi-block TMA files. Within the block CDE are the nested sets of four blocks that compose the complex TMA. Each core CDE is nested within a specific block CDE, and one core may have two associated array locations [see Figure <figr fid="F6">6</figr>].</p>
            <p>The four sections are concatenated as a single XML database file. The CPCTR database file is provided with this manuscript [see <supplr sid="S2">Additional file 2</supplr>].</p>
            <suppl id="S2">
               <title>
                  <p>Additional File 2</p>
               </title>
               <text>
                  <p>CPCTR tissue microarray database file. Cpctrtma.xml is the XML representation of the CPCTR prostate cancer TMA. This file conforms to the TMA Data Exchange Specification and can be viewed as a formatted XML file on most web browsers. The file exceeds 700 KB in length.</p>
               </text>
               <file name="1471-2105-5-19-S2.xml">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Validating the TMA Data file</p>
            </st>
            <p>Once a TMA database is prepared, it needs to be validated to ensure conformance with the Specification. At this time, all TMA files should be validated using a software implementation written in Perl and distributed as an open access supplemental file with the Specification and with this publication [see <supplr sid="S3">Additional file 3</supplr>]. The validating script requires a Perl installation but should operate equally well on any operating system. The validation software has a simple command-line interface. When the file successfully validates, the Perl script outputs the encountered CDEs from the Specification, a statement that the file is valid, and a one-way hash value specific for the validated file [see Figure <figr fid="F7">7</figr>].</p>
            <suppl id="S3">
               <title>
                  <p>Additional File 3</p>
               </title>
               <text>
                  <p>TMA validating script. Validtma.pl is a validating Perl script distributed as a plain-text file. It parses through XML files and determines conformance to the TMA Data Exchange Specification <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. This is an open access Perl script, and is identical to the Perl script distributed with the Specification.</p>
               </text>
               <file name="1471-2105-5-19-S3.pl">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Validation script output</p>
               </caption>
               <text>
                  <p><b>Validation script output. </b>Image displaying the interaction between Perl validating script and user.</p>
               </text>
               <graphic file="1471-2105-5-19-7"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Availability and requirements</p>
            </st>
            <p>The Perl scripts and files for the production of TMA databases that meet the Specification are available with this publication. The example prostate cancer TMA database is available as a supplementary file with this article [see <supplr sid="S1">Additional file 1</supplr>]. The actual tissue microarray slides are available after an application process Although the CPCTR is a non-profit, government-sponsored resource, a surcharge is attached for glass slides, to help defray a portion of the costs of TMA production. The application process and charges are described at the CPCTR web site <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Questions regarding any aspect of the CPCTR can be directed to the CPCTR email query service [ask-cpctr-l@list.nih.gov].</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>All authors have reported no competing interests. See funding sources in Acknowledgements.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>Jules Berman was an author of the original TMA Data Exchange Specification, developed the Perl tools for implementing the CPCTR TMA in conformance with the Specification, and wrote the first draft of the manuscript. Milton Datta helped in the design and supervised the laboratory that constructed the physical TMA, provided the protocols for TMA construction, provided annotation data describing the core array grid. Andre Kajdacsy-Balla, Jonathan Melamed, Jan Orenstein, Ashok Patel, Rajiv Dhir and Michael J. Becich assisted in the design of the TMA and were responsible for the technical steps in TMA construction including block selection, core focus selection, core quality assurance and for the selection and extraction of all data annotations used in the TMA database. Kevin Dobbin was responsible for data review (for consistency and completeness) for the TMA database and was involved in the TMA design and case selection. All authors reviewed and commented on successive drafts of the manuscript and have provided the first author with approval of the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by four grants from the National Cancer Institute for the support of the Cooperative Prostate Cancer Tissue Resource: U01 CA86772, U01 CA86743, U01 CA86735, and U01 CA86739. With the exceptions of Jules Berman and Kevin Dobbin, the authors are funding recipients of these grants. Jules Berman and Kevin Dobbin performed this work as part of his regular activities as a U.S. government employee. Hang Liu, of the University of Wisconsin, is acknowledged for writing a Perl script that extracted the array locations for core samples.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Tissue microarrays for high-throughput molecular profiling of tumor specimens</p>
            </title>
            <aug>
               <au>
                  <snm>Kononen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bubendorf</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kallioniemi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barlund</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schraml</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Leighton</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Torhorst</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mihatsch</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Sauter</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kallioniemi</snm>
                  <fnm>O-P</fnm>
               </au>
            </aug>
            <source>Nat Med</source>
            <pubdate>1998</pubdate>
            <volume>4</volume>
            <fpage>844</fpage>
            <lpage>847</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9662379</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Clinical validation of candidate genes associated with prostate cancer progression in the CWR22 model system using tissue microarrays</p>
            </title>
            <aug>
               <au>
                  <snm>Mousses</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bubendorf</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Hostetter</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kononen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cornelison</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Goldberger</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Elkahloun</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Willi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Koivisto</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ferhle</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Raffeld</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sauter</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kallioniemi</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Cancer Research</source>
            <pubdate>2002</pubdate>
            <volume>62</volume>
            <fpage>1256</fpage>
            <lpage>1260</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11888886</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Tissue microarray assessment of prostate cancer tumor proliferation in African-American and white men</p>
            </title>
            <aug>
               <au>
                  <snm>Perrone</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Theoharis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mucci</snm>
                  <fnm>NR</fnm>
               </au>
               <au>
                  <snm>Hayasaka</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Cooney</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>J Natl Cancer Inst</source>
            <pubdate>2000</pubdate>
            <volume>92</volume>
            <fpage>937</fpage>
            <lpage>939</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/jnci/92.11.937</pubid>
                  <pubid idtype="pmpid" link="fulltext">10841830</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Combined Loss of PTEN and p27 expression is associated with tumor cell proliferation by Ki-67 and increased risk of recurrent disease in localized prostate cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Halvorsen</snm>
                  <fnm>OJ</fnm>
               </au>
               <au>
                  <snm>Haukaas</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Akslen</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>Clin Cancer Res</source>
            <pubdate>2003</pubdate>
            <volume>9</volume>
            <fpage>1474</fpage>
            <lpage>1479</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12684422</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Tissue micro-array: a cost and time-effective method for correlative studies by regional and national cancer study groups</p>
            </title>
            <aug>
               <au>
                  <snm>Milanes-Yearsley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Pajak</snm>
                  <fnm>TF</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Griffin</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Laramore</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pilepich</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mod Pathol</source>
            <pubdate>2002</pubdate>
            <volume>15</volume>
            <fpage>1366</fpage>
            <lpage>1373</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/01.MP.0000036345.18944.22</pubid>
                  <pubid idtype="pmpid">12481019</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Tissue microarray sampling strategy for prostate cancer biomarker analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Rubin</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Dunn</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Strawderman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pienta</snm>
                  <fnm>KJ</fnm>
               </au>
            </aug>
            <source>Am J Surg Pathol</source>
            <pubdate>2002</pubdate>
            <volume>26</volume>
            <fpage>312</fpage>
            <lpage>319</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/00000478-200203000-00004</pubid>
                  <pubid idtype="pmpid" link="fulltext">11859202</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Cooperative prostate cancer tissue resource: Release Date April 29,1999, RFA CA-99-012, National Cancer Institute</p>
            </title>
            <url>http://grants1.nih.gov/grants/guide/rfa-files/RFA-CA-99-012.html</url>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Cooperative Prostate Cancer Tissue Resource</p>
            </title>
            <url>http://cpctr.cancer.gov</url>
         </bibl>
         <bibl id="B9">
            <title>
               <p>The tissue microarray data exchange specification: A community-based, open source tool for sharing tissue microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Berman</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Edgerton</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>BMC Med Inform Decis Mak</source>
            <pubdate>2003</pubdate>
            <volume>5</volume>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Final NIH statement on sharing research data: Release date February 26, Notice NOT-OD-03-032, National Institutes of Health</p>
            </title>
            <url>http://grants1.nih.gov/grants/guide/notice-files/NOT-OD-03-032.html</url>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Infrastructure for data sharing and archiving: Release date October 17, RFA Number: RFA-HD-03-032, Department of Health and Human Services</p>
            </title>
            <url>http://grants.nih.gov/grants/guide/rfa-files/RFA-HD-03-032.html</url>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Tools for collaborations that involve data sharing: Release date June 4, PA Number: PAR-03-134, Department of Health and Human Services</p>
            </title>
            <url>http://grants.nih.gov/grants/guide/pa-files/PAR-03-134.html</url>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Dublin core metadata initiative</p>
            </title>
            <url>http://dublincore.org</url>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Association for Pathology Informatics tissue microarray common data elements</p>
            </title>
            <url>http://65.222.228.150/jjb/tma_cde.htm</url>
         </bibl>
         <bibl id="B15">
            <title>
               <p>CPCTR data elements for frozen tissue collection user instructions</p>
            </title>
            <url>http://www.pathology.pitt.edu/pdf/cpctr/cpctr-cde-v22.pdf</url>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Instructions for CPCTR TMA Block Production</p>
            </title>
            <url>http://www.pathology.pitt.edu/pdf/cpctr/block.htm</url>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Preparing CPCTR slides from paraffin TMA blocks</p>
            </title>
            <url>http://www.pathology.pitt.edu/pdf/cpctr/section.htm</url>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Department of Health and Human Services. 45 CFR (Code of Federal Regulations), 164.514(6)(2)(i). Standards for Privacy of Individually Identifiable Health Information (final)</p>
            </title>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B19">
            <title>
               <p>National Cancer Institute cooperative prostate cancer tissue resource manual of operations</p>
            </title>
            <url>http://www.pathology.pitt.edu/pdf/cpctr/cpctr-moo-110403jo.pdf</url>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Cooperative prostate cancer tissue resource histomanual</p>
            </title>
            <url>http://www.pathology.pitt.edu/pdf/cpctr/histomanual25.pdf</url>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Metadata and the reintegration of clinical information: ISO 11179</p>
            </title>
            <aug>
               <au>
                  <snm>Solbrig</snm>
                  <fnm>HR</fnm>
               </au>
            </aug>
            <source>MD Comput</source>
            <pubdate>2000</pubdate>
            <volume>3</volume>
            <fpage>25</fpage>
            <lpage>28</lpage>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The HL7 Clinical Document Architecture</p>
            </title>
            <aug>
               <au>
                  <snm>Dolin</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Alschuler</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Beebe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Biron</snm>
                  <fnm>PV</fnm>
               </au>
               <au>
                  <snm>Boyer</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Essin</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kimber</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lincoln</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Mattison</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>J Am Med Inform Assoc</source>
            <pubdate>2001</pubdate>
            <volume>8</volume>
            <fpage>552</fpage>
            <lpage>69</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11687563</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Mid-region AIDS and cancer specimen resource tissue micro-arrays</p>
            </title>
            <url>http://virtualmicroscope.osu.edu/tma/</url>
         </bibl>
      </refgrp>
   </bm>
</art>
