<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2004-5-9-r67</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Research</dochead>
		<bibl>
			<title>
				<p>An <it>Ambystoma mexicanum </it>EST sequencing project: analysis of 17,352 expressed sequence tags from embryonic and regenerating blastema cDNA libraries</p>
			</title>
			<aug>
				<au id="A1" ca="yes">
					<snm>Habermann</snm>
					<fnm>Bianca</fnm>
					<insr iid="I1"/>
					<email>habermann@mpi-cbg.de</email>
				</au>
				<au id="A2">
					<snm>Bebin</snm>
					<fnm>Anne-Gaelle</fnm>
					<insr iid="I2"/>
					<email>bebin@mpi-cbg.de</email>
				</au>
				<au id="A3">
					<snm>Herklotz</snm>
					<fnm>Stephan</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A4">
					<snm>Volkmer</snm>
					<fnm>Michael</fnm>
					<insr iid="I1"/>
					<email>volkmer@mpi-cbg.de</email>
				</au>
				<au id="A5">
					<snm>Eckelt</snm>
					<fnm>Kay</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A6">
					<snm>Pehlke</snm>
					<fnm>Kerstin</fnm>
					<insr iid="I3"/>
				</au>
				<au id="A7">
					<snm>Epperlein</snm>
					<mnm>Henning</mnm>
					<fnm>Hans</fnm>
					<insr iid="I3"/>
				</au>
				<au id="A8">
					<snm>Schackert</snm>
					<mnm>Konrad</mnm>
					<fnm>Hans</fnm>
					<insr iid="I4"/>
				</au>
				<au id="A9">
					<snm>Wiebe</snm>
					<fnm>Glenis</fnm>
					<insr iid="I2"/>
					<email>wiebe@mpi-cbg.de</email>
				</au>
				<au id="A10" ca="yes">
					<snm>Tanaka</snm>
					<mi>M</mi>
					<fnm>Elly</fnm>
					<insr iid="I2"/>
					<email>tanaka@mpi-cbg.de</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Scionics Computer Innovation GmbH, Pfotenhauerstrasse 110, Dresden 01307, Germany</p>
				</ins>
				<ins id="I2">
					<p>Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, Dresden 01307, Germany</p>
				</ins>
				<ins id="I3">
					<p>Institute of Anatomy, Medical Faculty of the Carl Gustav Carus Technical University, Dresden, Fetscherstrasse 74, Dresden 01307, Germany</p>
				</ins>
				<ins id="I4">
					<p>Department of Surgical Research, Medical Faculty of the Carl Gustav Carus Technical University, Dresden, Fetscherstrasse 74, Dresden 01307, Germany</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2004</pubdate>
			<volume>5</volume>
			<issue>9</issue>
			<fpage>R67</fpage>
			<url>http://genomebiology.com/2004/5/9/R67</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">15345051</pubid><pubid idtype="doi">10.1186/gb-2004-5-9-r67</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>17</day>
					<month>11</month>
					<year>2003</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>6</day>
					<month>5</month>
					<year>2004</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>29</day>
					<month>6</month>
					<year>2004</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>13</day>
					<month>8</month>
					<year>2004</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2004</year>
			<collab>Habermann et al.; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<shorttitle>
			<p>An <it>Ambystoma mexicanum </it>EST sequencing project: analysis of 17,352 expressed sequence tags from embryonic and regenerating blastema cDNA librariese</p>
		</shorttitle>
		<shortabs>
			<p>An EST database has been generated for the axolotl <it>Ambystoma mexicanum</it>. Analysis of this data has uncovered an unusual phylogenetic distribution of the cyclin dependent kinase inhibitor 1 gene family in amphibians.</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>The ambystomatid salamander, <it>Ambystoma mexicanum </it>(axolotl), is an important model organism in evolutionary and regeneration research but relatively little sequence information has so far been available. This is a major limitation for molecular studies on caudate development, regeneration and evolution. To address this lack of sequence information we have generated an expressed sequence tag (EST) database for <it>A. mexicanum</it>.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>Two cDNA libraries, one made from stage 18-22 embryos and the other from day-6 regenerating tail blastemas, generated 17,352 sequences. From the sequenced ESTs, 6,377 contigs were assembled that probably represent 25% of the expressed genes in this organism. Sequence comparison revealed significant homology to entries in the NCBI non-redundant database. Further examination of this gene set revealed the presence of genes involved in important cell and developmental processes, including cell proliferation, cell differentiation and cell-cell communication. On the basis of these data, we have performed phylogenetic analysis of key cell-cycle regulators. Interestingly, while cell-cycle proteins such as the cyclin B family display expected evolutionary relationships, the cyclin-dependent kinase inhibitor 1 gene family shows an unusual evolutionary behavior among the amphibians.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusions</p>
					</st>
					<p>Our analysis reveals the importance of a comprehensive sequence set from a representative of the Caudata and illustrates that the EST sequence database is a rich source of molecular, developmental and regeneration studies. To aid in data mining, the ESTs have been organized into an easily searchable database that is freely available online.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010005">Development</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>The Caudata (tailed amphibians such as salamanders) are a major focus of work in vertebrate evolution and speciation <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. The salamander is also an important vertebrate model organism for understanding regeneration, being one of the few vertebrates that is able to regenerate entire body structures such as the limb, tail and jaw as an adult. Despite the pivotal role of this animal order in research, comparatively little sequence information is available. In contrast, 458,413 nucleotide sequences exist for the Anura (frogs and toads). This high number is primarily attributable to large EST sequencing efforts for the model organisms for embryology - <it>Xenopus laevis </it>and <it>Silurana tropicalis.</it></p>
			<p>A salamander EST project is particularly important as these organisms have extremely large genomes, making a genome project unwieldy and unlikely without specialized approaches such as methylation filtration <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Genome sizes range from 8.5 billion base pairs for <it>Desmognathus monticola </it>(seal salamander) to nearly 70 billion base pairs for <it>Plethodon vandykei </it>(Van Dyke's salamander) <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. The ambystomatid <it>Ambystoma mexicanum</it>, a species important for studies in evolution, regeneration and development, has an estimated genome size between 21.9 billion and 48 billion base pairs <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp> and measurements of its genome in centimorgans (cM) has yielded the largest size reported for a living vertebrate so far (7,291 cM <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>). In maize, another organism with a large genome, 60,000 sequence reads were required before genome sequencing of methylation-filtered genomic libraries generated significantly more gene sequence information than the available maize EST sequences <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
			<p>Molecular evolution studies of salamanders have relied primarily on mitochondrial genes such as those for ribosomal RNAs and cytochrome <it>c </it><abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. The lack of sequence information among the Caudata hinders the ability to perform sequence comparison with other important gene families. Furthermore, because of the lack of clones, the number of molecular markers available to study salamander embryology and regeneration is low. To address this gap in sequence availability we have generated a large gene sequence set for <it>A. mexicanum</it>. We chose this species because of its role in evolutionary, developmental and regeneration studies. <it>A. mexicanum </it>is easily bred in the laboratory, and animals can be obtained from a large, NSF-funded colony <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. We have sequenced inserts from two cDNA libraries, one produced from dorsal regions of stage 18-22 embryos, consisting primarily of neural tube, somite and notochord. The second library was constructed from day-6 regenerating tail blastema tissue. By sequencing from these two sources, our goal was to obtain sequences of transcripts involved in organizing and regenerating the primary body axis. Here we describe the EST gene set, provide an example of molecular phylogenetic analysis of one gene from this collection, and describe the database created for organizing the <it>A. mexicanum </it>EST information. This database is also being implemented for EST sequences from a full-length <it>X. laevis </it>cDNA library, and for sequences from a <it>Canis familiaris </it>EST project.</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>Assessment of library and EST sequence quality</p>
				</st>
				<p>To generate a diverse set of sequences involved in organizing and regenerating the primary body axis, two independent cDNA libraries were used for sequencing. One was derived from dorsal regions of stage 18-22 embryos containing neural tube, somite and notochord - called the 'neural tube' library - the other from 6-day post-amputation regenerating tail blastema. From 18,432 sequencing attempts 17,522 high-quality sequences were obtained after Phred analysis <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. All sequences are 5' reads of the inserts. Of 17,522 high-quality, single-pass sequencing runs, 32 clones contained no insert and 137 sequences were below 32 base pairs (bp). These sequences were excluded from further analysis (32 bp representing the lower limit for assembly of a sequence using TIGR-assembler), yielding 17,352 clones for final analysis. The neural tube library was the origin of 7,469 sequences and the blastema library of 9,883 sequences (Table <tblr tid="T1">1</tblr>, and see Materials and methods). As shown in Figure <figr fid="F1">1a</figr>, the average sequence read length peaked between 500 and 600 nucleotides with an average length of 510 nucleotides and a maximum of 871.</p>
				<tbl id="T1" hint_layout="single">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Some characteristics of the <it>A. mexicanum </it>EST contigs</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="left">
								<p>Library</p>
							</c>
							<c ca="center">
								<p>Number of sequences</p>
							</c>
							<c ca="center">
								<p>Number of contigs (+ singlets)</p>
							</c>
							<c ca="center">
								<p>Number of clones in contigs</p>
							</c>
							<c ca="center">
								<p>Number of clones in singlets</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>St18-22 neural tube</p>
							</c>
							<c ca="center">
								<p>7,469</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>6D tail blastema</p>
							</c>
							<c ca="center">
								<p>9,883</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Combined total</p>
							</c>
							<c ca="center">
								<p>17,352</p>
							</c>
							<c ca="center">
								<p>6,377</p>
							</c>
							<c ca="center">
								<p>12,791</p>
							</c>
							<c ca="center">
								<p>4,561</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>The number of expressed sequence tags sequenced from the two libraries blastema and neural tube, as well as the number of contigs, the number of clones in contigs and the number of clones found in singlets is shown.</p>
					</tblfn>
				</tbl>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>Distribution of sequence length</p>
					</caption>
					<text>
						<p>Distribution of sequence length. <b>(a) </b>Distribution of read lengths of the sequenced ESTs after quality control. The average read length was 569 bp, corresponding to a peak of between 500 and 600 bp. <b>(b) </b>Distribution of sequence length of assembled contigs. The average length of contigs was 597 bp. <b>(c) </b>Distribution of the number of ESTs per assembled contig. Most of the contigs had one EST. The two largest contigs contained over 400 ESTs (cytochrome <it>c </it>oxidase subunit I and 12S rRNA, respectively).</p>
					</text>
					<graphic file="gb-2004-5-9-r67-1"/>
				</fig>
				<p>The blastema and neural tube libraries were unnormalized and unamplified. We assessed library quality and diversity on the basis of the number of redundant clones in the library. Redundancy was estimated by performing BLASTN searches <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> against all clones sequenced. After sequencing 10,752 clones of the blastema library 42% of the sequences were still unique, and 50% of clones were still singlets after sequencing 7,680 clones from neural tube, indicating that both libraries display high diversity.</p>
			</sec>
			<sec>
				<st>
					<p>EST assembly into contigs</p>
				</st>
				<p>To identify ESTs belonging to the same open reading frames (ORFs), sequences were assembled into contigs using TIGR-Assembler version 2 <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. The 17,353 sequences assembled into 6,594 contigs, of which 217 were less than 100 nucleotides long and excluded from further analysis. A total of 6,377 contigs was therefore left for final analysis (Table <tblr tid="T1">1</tblr>). Of these, 4,561 contigs contained a single clone. The average contig length of the remaining dataset was 616 nucleotides (Figure <figr fid="F1">1b</figr>). Other than singlets, most of the contigs consisted of two ESTs (884 contigs, Figure <figr fid="F1">1c</figr>). The largest contigs included cytochrome <it>c </it>oxidase subunit I (469 ESTs), 12S rRNA (445 ESTs), nuclear factor 7 Zn-binding protein A33 (332 ESTs), type II keratin (274 ESTs), keratin (211 ESTs) and cytoplasmic beta-actin (206 ESTs) (Table <tblr tid="T2">2</tblr>).</p>
				<tbl id="T2" hint_layout="single">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>Gene definition of the most abundant contigs in the <it>A. mexicanum </it>EST libraries</p>
					</caption>
					<tblbdy cols="2">
						<r>
							<c ca="left">
								<p>Gene definition</p>
							</c>
							<c ca="center">
								<p>Number of clones in contig</p>
							</c>
						</r>
						<r>
							<c cspan="2">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Cytochrome <it>c </it>oxidase subunit I</p>
							</c>
							<c ca="center">
								<p>469</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>12S rRNA</p>
							</c>
							<c ca="center">
								<p>445</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Nuclear factor 7</p>
							</c>
							<c ca="center">
								<p>332</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Keratin type II</p>
							</c>
							<c ca="center">
								<p>274</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Keratin</p>
							</c>
							<c ca="center">
								<p>211</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Cytoplasmic &#946;-actin</p>
							</c>
							<c ca="center">
								<p>206</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>The gene with the highest number of clones identified was cytochrome <it>c </it>oxidase subunit I (469 clones in contig), followed by 12S rRNA (445) and nuclear factor 7 (332 clones in contig).</p>
					</tblfn>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Comparison to existing <it>A. mexicanum </it>genes in NCBI: 6,000 new contig sequences</p>
				</st>
				<p>A total of 1,134 ESTs were available from <it>A. mexicanum </it>in the National Center for Biological Information (NCBI) EST databases prior to this work, most of which originate from a sequencing effort of the Voss laboratory (<abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and S.R. Voss, D. King, N. Maness, J.J. Smith, M. Rondet, S.V. Bryant, D.M. Gardiner, and D.M. Parichy, unpublished work (NCBI-accession numbers BI817205-BI818091); see also <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>). We examined to what extent our EST dataset overlapped with the sequences available to date. Only 600 of the ESTs in the public database identified one of our contigs in a BLASTN search as a homolog; in 85% of cases, the E-value was below 1E-50 and the sequences can be considered as potentially identical. Existing ESTs in the database largely originate from regenerating limb (S.R. Voss, D. King, N. Maness, J.J. Smith, M. Rondet, S.V. Bryant, D.M. Gardiner and D.M. Parichy, unpublished work). There was, however, only a slight bias of matching contigs to regenerating blastema (49%) as compared to neural tube (44%). Seven percent of identified contigs were found in both libraries. These results mean that our EST data enriches the existing sequence resource of <it>A. mexicanum </it>with approximately 6,000 new gene sequences.</p>
			</sec>
			<sec>
				<st>
					<p>BLAST analysis of <it>A. mexicanum </it>contigs to assign homologies</p>
				</st>
				<p>To identify putative homologies to known proteins, we subjected the contigs to BLASTX searches against the non-redundant protein database (NR, NCBI) where a cutoff E-value of 1e-05 was used for parsing output files. In our annotation, we used an E-value of 1e-20 as an upper limit to assign significant homology. We note that this does not imply that such sequences are true orthologs. In addition, in cases where no significant homology was found, we used an E-value limit of 1e-05 to designate weak homology. We find this additional category of 'weak homology' useful for data mining. As most contigs do not represent full-length sequences, it is possible that only a highly divergent region of a gene sequence is available in our collection. The category of weak homology allows us to find potential homologs in such situations. For example, the BLAST search for contig Am_4671 yielded the GenBank entry NP_004055, cyclin-dependent kinase inhibitor 1B (<it>Homo sapiens</it>), as the top hit with an E-value of 4e-07. This assignment was based on the carboxy-terminal 120 amino acids of the protein, which represents the less conserved region. When we isolated a full-length clone for Am_4671 from our library, we could confirm that it is indeed the axolotl ortholog of cyclin-dependent kinase inhibitor 1B (p27<sup>Kip1</sup>), as discussed later.</p>
				<p>Taken together, a total of 3,718 (58%) sequences shared homology with a protein from selected model organisms in the non-redundant database and could be assigned a putative identity. The E-value distribution of the top hits in the non-redundant database is shown in Figure <figr fid="F2">2a</figr>. Of the contigs, 11% matched a protein with an E-value below 1e-99 and are therefore likely to be true orthologs. Seventy percent of the contigs found a hit with an E-value between 1e-20 and 1e-99 and were assigned significant homology. Finally, 19% of contigs had a first hit with an E-value between 1e-19 and 1e-05 and were assigned weak homology to a protein from the non-redundant database. For annotating our database, these top hits from human, mouse (<it>Mus musculus</it>), rat (<it>Rattus norvegicus</it>), frog (<it>X. laevis</it>), zebrafish (<it>Danio rerio</it>), fugu (<it>Takifugu rubripes</it>), fruitfly (<it>Drosophila melanogaster</it>), mosquito (<it>Anopheles gambiae</it>), worm (<it>Caenorhabditis elegans</it>), newts and the yeast species <it>Saccharomyces cerevisiae</it>, <it>Schizosaccharomyces pombe </it>and <it>Candida albicans </it>were collected and the closest homolog from the above species was used to assign a putative identity.</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Homology of <it>A. mexicanum </it>contigs to protein and nucleotide sequences from other species</p>
					</caption>
					<text>
						<p>Homology of <it>A. mexicanum </it>contigs to protein and nucleotide sequences from other species. <b>(a) </b>Distribution of E-values from the first identified hit in the protein non-redundant database that was used to assign a putative identity to the contig. The majority of contigs identified a protein with an E-value between 1e-20 and 1e-99. In 11% of the cases, the E-value of the first hit was below 1e-100 and can therefore be considered a true ortholog. <b>(b) </b>Distribution of hits in the different sequence databases that were searched sequentially.</p>
					</text>
					<graphic file="gb-2004-5-9-r67-2"/>
				</fig>
				<p>To estimate how many of the clones are full length we examined the BLAST alignments for the position of the alignment in respect to the database sequence. Of the 3,718 sequences with homologs, 1,107 (29.8%) could be aligned in the amino terminus (with the alignment starting before position 10). As the library was poly(dT) primed, many of these clones are likely to represent full-length inserts. Of these 199 (5.4%) could be aligned from the amino terminus to the carboxy terminus and are potential full-length sequences.</p>
				<p>Forty percent of our EST sequences did not generate a significant hit in the non-redundant protein database. The availability of additional sequence databases including complete genome sequences from several organisms allowed us to expand our BLAST searches to identify all possible homologs to the <it>A. mexicanum </it>contigs. With the remaining set of contigs, we first performed BLASTN searches against the nucleotide non-redundant (NT) database and BLASTX searches against the EST database. Finally, we performed BLASTX searches against the fugu and human proteomes. In all cases, an E-value of 1e-05 was used to assign potentially homologous sequences. Sequences in the NT database identified an additional 134 contigs and a further 220 contigs found a hit in the EST databases. A homolog was found for 3,340 (52%) contigs in the fugu proteome and 3,698 (58%) contigs shared homology with a protein from the human proteome. In total, an additional 468 contigs identified a homolog in the selected databases beyond the original assignment from the non-redundant protein database (Figure <figr fid="F2">2b</figr>).</p>
			</sec>
			<sec>
				<st>
					<p>Gene sequences with no identifiable homology</p>
				</st>
				<p>No homologous sequence could be found for 2,191 (34%) contigs in any of the databases searched. Because the library was poly(dT) primed, many of these sequences could represent 3' untranslated regions (3' UTRs). We determined that 953 sequences (43% of non-homologous contigs) contained no ORF and were therefore potential untranslated regions. Thirty of the sequences shared homology to an existing <it>A. mexicanum </it>clone from the EST database (Table <tblr tid="T3">3</tblr>). The complete list of unique ESTs can be downloaded from <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
				<tbl id="T3" hint_layout="single">
					<title>
						<p>Table 3</p>
					</title>
					<caption>
						<p>Contig identities and GenBank identifiers of ESTs unique to <it>A. mexicanum</it></p>
					</caption>
					<tblbdy cols="3">
						<r>
							<c ca="left">
								<p>Contig ID</p>
							</c>
							<c ca="center">
								<p>GenBank identifier</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c cspan="3">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_1065</p>
							</c>
							<c ca="center">
								<p>BI817418.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_13</p>
							</c>
							<c ca="center">
								<p>BI817561.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_1868</p>
							</c>
							<c ca="center">
								<p>BI817299.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_1879</p>
							</c>
							<c ca="center">
								<p>BI817273.1</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_1986</p>
							</c>
							<c ca="center">
								<p>BI817397.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_2156</p>
							</c>
							<c ca="center">
								<p>BI817699.1</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_2280</p>
							</c>
							<c ca="center">
								<p>BI817354.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_242</p>
							</c>
							<c ca="center">
								<p>BI817917.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_2631</p>
							</c>
							<c ca="center">
								<p>BI817344.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>BI818040.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>BI817371.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_2695</p>
							</c>
							<c ca="center">
								<p>BI818066.1</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_2767</p>
							</c>
							<c ca="center">
								<p>BI817941.1</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_2952</p>
							</c>
							<c ca="center">
								<p>BI817736.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_3070</p>
							</c>
							<c ca="center">
								<p>BI817303.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_3486</p>
							</c>
							<c ca="center">
								<p>BI817478.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_3807</p>
							</c>
							<c ca="center">
								<p>BI817992.1</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_3828</p>
							</c>
							<c ca="center">
								<p>BI817981.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>BI817250.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_4598</p>
							</c>
							<c ca="center">
								<p>BI817704.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_4661</p>
							</c>
							<c ca="center">
								<p>BI817548.1</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_4720</p>
							</c>
							<c ca="center">
								<p>BI817653.1</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_5031</p>
							</c>
							<c ca="center">
								<p>BI817804.1</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_5579</p>
							</c>
							<c ca="center">
								<p>BI818004.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_5650</p>
							</c>
							<c ca="center">
								<p>BI817315.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_5742</p>
							</c>
							<c ca="center">
								<p>BI817525.1</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_5881</p>
							</c>
							<c ca="center">
								<p>BI818060.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_6107</p>
							</c>
							<c ca="center">
								<p>BI817553.1</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_6128</p>
							</c>
							<c ca="center">
								<p>BI817667.1</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_6198</p>
							</c>
							<c ca="center">
								<p>BI817866.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_646</p>
							</c>
							<c ca="center">
								<p>BI817520.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>BI817607.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>BI817743.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_6565</p>
							</c>
							<c ca="center">
								<p>BI817313.1</p>
							</c>
							<c ca="center">
								<p>UTR</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Am_901</p>
							</c>
							<c ca="center">
								<p>BI817984.1</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>The table shows contig identities and GenBank identifiers of existing <it>A. mexicanum </it>ESTs that do not share any homology to a known protein or nucleotide sequence and can therefore be considered unique.</p>
					</tblfn>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Assignment of the <it>A. mexicanum </it>dataset to common Gene Ontology terms</p>
				</st>
				<p>From the homologous proteins found, contigs were assigned a biological process, molecular function and cellular component from the Gene Ontology (GO) database <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. The closest annotated homolog in the GO database was used, using an E-value of 1e-20 as a cutoff, for assigning these categories. A biological process could be assigned to 2,156 contigs (34% of all contigs and 58% of those sharing a homolog in the non-redundant database); 2,186 contigs (34% and 59%, respectively) were assigned a molecular function; and 2,198 contigs (34% and 59%, respectively) could be assigned a cellular component. The most abundant molecular function assigned was 'death receptor interacting protein', followed by 'peptidase', the highest-ranking biological process were 'biological process unknown' and 'proteolysis/peptidolysis' and the most abundant cellular components assigned were the 'actin cytoskeleton' and 'transcriptional repressor complex'.</p>
				<p>The largest fraction of the contigs was assigned a cellular process in the GO category biological process (87% of annotated contigs) (Figure <figr fid="F3">3a</figr>). We split the biological processes further into different categories: the most abundant categories were 'protein metabolism/modification' (18% of assigned contigs); 'housekeeping functions/metabolism' (17%); 'intracellular transport' (15%); 'cell cycle/proliferation' (13%); 'RNA metabolism' (13%); 'intracellular signaling' (8%); and 'DNA metabolism/repair' (5%) (Figure <figr fid="F3">3a</figr>, Table <tblr tid="T4">4</tblr>). A list of annotated contigs is downloadable from <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Annotated GO terms and protein domains in the <it>A. mexicanum </it>EST libraries</p>
					</caption>
					<text>
						<p>Annotated GO terms and protein domains in the <it>A. mexicanum </it>EST libraries. <b>(a) </b>Gene Ontology electronic annotation in the category 'biological process' of contigs from <it>A. mexicanum</it>. The largest proportion of annotated contigs was assigned a 'cellular process' (87%). Of those, five large groups of cellular processes emerged, with 'cell cycle/proliferation' (13%), 'intracellular signaling' and 'intracellular transport' (8% and 15%), 'metabolism' (17%), 'protein metabolism/modification' (18%) and 'RNA metabolism' (13%). <b>(b) </b>Domains associated with cellular processes identified in the <it>A. mexicanum </it>contig sequence dataset. The largest fraction of contigs was associated with a domain function in 'intracellular transport', followed by 'RNA-binding and metabolism' and 'DNA-binding and transcriptional control'.</p>
					</text>
					<graphic file="gb-2004-5-9-r67-3"/>
				</fig>
				<tbl id="T4" hint_layout="double">
					<title>
						<p>Table 4</p>
					</title>
					<caption>
						<p>The most abundant biological processes assigned to the <it>A. mexicanum </it>contigs</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="left">
								<p>Biological process</p>
							</c>
							<c ca="center">
								<p>Total number of contigs</p>
							</c>
							<c ca="center">
								<p>% contigs</p>
							</c>
							<c ca="center">
								<p>BL/NT</p>
							</c>
							<c ca="center">
								<p>Fisher's exact (BL/NT)</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Protein metabolism</p>
							</c>
							<c ca="center">
								<p>324</p>
							</c>
							<c ca="center">
								<p>15</p>
							</c>
							<c ca="center">
								<p>116/132</p>
							</c>
							<c ca="center">
								<p>3/1</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Metabolism</p>
							</c>
							<c ca="center">
								<p>296</p>
							</c>
							<c ca="center">
								<p>13.7</p>
							</c>
							<c ca="center">
								<p>78/170</p>
							</c>
							<c ca="center">
								<p>0/3</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Intracellular transport</p>
							</c>
							<c ca="center">
								<p>268</p>
							</c>
							<c ca="center">
								<p>12.4</p>
							</c>
							<c ca="center">
								<p>59/53</p>
							</c>
							<c ca="center">
								<p>4/5</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>RNA metabolism</p>
							</c>
							<c ca="center">
								<p>227</p>
							</c>
							<c ca="center">
								<p>10.5</p>
							</c>
							<c ca="center">
								<p>127/45</p>
							</c>
							<c ca="center">
								<p>22/2</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Cell cycle</p>
							</c>
							<c ca="center">
								<p>194</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>95/52</p>
							</c>
							<c ca="center">
								<p>5/2</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Intracellular signaling</p>
							</c>
							<c ca="center">
								<p>148</p>
							</c>
							<c ca="center">
								<p>6.8</p>
							</c>
							<c ca="center">
								<p>95/65</p>
							</c>
							<c ca="center">
								<p>1/6</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>DNA metabolism/repair</p>
							</c>
							<c ca="center">
								<p>90</p>
							</c>
							<c ca="center">
								<p>4.1</p>
							</c>
							<c ca="center">
								<p>50/12</p>
							</c>
							<c ca="center">
								<p>3/0</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Development</p>
							</c>
							<c ca="center">
								<p>69</p>
							</c>
							<c ca="center">
								<p>3.2</p>
							</c>
							<c ca="center">
								<p>32/27</p>
							</c>
							<c ca="center">
								<p>0/2</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Cell-cell communication</p>
							</c>
							<c ca="center">
								<p>81</p>
							</c>
							<c ca="center">
								<p>3.7</p>
							</c>
							<c ca="center">
								<p>24/42</p>
							</c>
							<c ca="center">
								<p>0/6</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Differentiation</p>
							</c>
							<c ca="center">
								<p>27</p>
							</c>
							<c ca="center">
								<p>1.5</p>
							</c>
							<c ca="center">
								<p>13/7</p>
							</c>
							<c ca="center">
								<p>2/3</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>The highest-ranking biological process is 'protein metabolism/modification' with 15% of contigs assigned. 'Cellular metabolism', 'intracellular transport' and 'RNA metabolism' have all more than 10% of contigs assigned and represent the most abundant gene families in the two libraries. The percentage contigs refers to the number of contigs assigned a biological process. BL: Blastema; NT: Neural tube.</p>
					</tblfn>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Common SMART and PFAM domains in the <it>A. mexicanum </it>dataset</p>
				</st>
				<p>To identify potential domains in the axolotl contigs, we performed RPS-BLAST searches against the conserved domain database (CDD, NCBI) <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B18">18</abbr></abbrgrp> using the default cutoff E-value of 0.01. A total of 2,199 (34.5%) contigs had a known protein domain in either the CDD or the SMART or PFAM databases. A detailed list of common protein domains identified in our dataset is given in Table <tblr tid="T5">5</tblr>. Among the protein domains identified were homeobox domains such as HOX, PAX and Prox1, eight helix-loop-helix (HLH) domains, RNA-binding domains such as KH and RRM, 69 kinase domains, metal- and lipid binding domains and domains involved in cell-cycle control and ubiquitination (RING fingers, HECT domains, three cullin domains and 12 cyclin domains). Many of these domains were annotated for the first time in a sequence from <it>A. mexicanum</it>. We also compared the occurrence of those domains in other vertebrate species. For most of the common protein domains, only a fraction were found in our dataset; many of these are quite abundant compared to <it>X. laevis </it>or <it>Gallus gallus</it>. The RNA-binding domains KH and RRM especially showed high abundance in our contigs. A complete list of domains is downloadable from <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
				<tbl id="T5" hint_layout="double">
					<title>
						<p>Table 5</p>
					</title>
					<caption>
						<p>Common protein domains identified in the <it>A. mexicanum </it>contigs and comparison to domain occurrences in other vertebrate species</p>
					</caption>
					<tblbdy cols="7">
						<r>
							<c ca="left">
								<p>Domain</p>
							</c>
							<c ca="center">
								<p>
									<it>A. mexicanum</it>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>H. sapiens</it>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>M. musculus</it>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>X. laevis</it>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>G. gallus</it>
								</p>
							</c>
							<c ca="center">
								<p>
									<it>D. rerio</it>
								</p>
							</c>
						</r>
						<r>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>EF-hand</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
							<c ca="center">
								<p>319</p>
							</c>
							<c ca="center">
								<p>308</p>
							</c>
							<c ca="center">
								<p>36</p>
							</c>
							<c ca="center">
								<p>48</p>
							</c>
							<c ca="center">
								<p>38</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Cyclin</p>
							</c>
							<c ca="center">
								<p>12</p>
							</c>
							<c ca="center">
								<p>60</p>
							</c>
							<c ca="center">
								<p>58</p>
							</c>
							<c ca="center">
								<p>20</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>15</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Chromo</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>26</p>
							</c>
							<c ca="center">
								<p>26</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Prox1</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>HLH</p>
							</c>
							<c ca="center">
								<p>8 (1)</p>
							</c>
							<c ca="center">
								<p>167</p>
							</c>
							<c ca="center">
								<p>179</p>
							</c>
							<c ca="center">
								<p>83</p>
							</c>
							<c ca="center">
								<p>70</p>
							</c>
							<c ca="center">
								<p>75</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>HOX</p>
							</c>
							<c ca="center">
								<p>13 (19)</p>
							</c>
							<c ca="center">
								<p>280</p>
							</c>
							<c ca="center">
								<p>352</p>
							</c>
							<c ca="center">
								<p>196</p>
							</c>
							<c ca="center">
								<p>142</p>
							</c>
							<c ca="center">
								<p>250</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PAX</p>
							</c>
							<c ca="center">
								<p>1 (4)</p>
							</c>
							<c ca="center">
								<p>12</p>
							</c>
							<c ca="center">
								<p>31</p>
							</c>
							<c ca="center">
								<p>25</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>13</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>EGF</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
							<c ca="center">
								<p>310</p>
							</c>
							<c ca="center">
								<p>281</p>
							</c>
							<c ca="center">
								<p>26</p>
							</c>
							<c ca="center">
								<p>50</p>
							</c>
							<c ca="center">
								<p>32</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>SET</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>82</p>
							</c>
							<c ca="center">
								<p>64</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>RAS</p>
							</c>
							<c ca="center">
								<p>37</p>
							</c>
							<c ca="center">
								<p>220</p>
							</c>
							<c ca="center">
								<p>194</p>
							</c>
							<c ca="center">
								<p>34</p>
							</c>
							<c ca="center">
								<p>11</p>
							</c>
							<c ca="center">
								<p>27</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>RhoGEF</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>124</p>
							</c>
							<c ca="center">
								<p>98</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PH</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>453</p>
							</c>
							<c ca="center">
								<p>374</p>
							</c>
							<c ca="center">
								<p>14</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
							<c ca="center">
								<p>18</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PX</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>70</p>
							</c>
							<c ca="center">
								<p>74</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>WD40</p>
							</c>
							<c ca="center">
								<p>39</p>
							</c>
							<c ca="center">
								<p>547</p>
							</c>
							<c ca="center">
								<p>490</p>
							</c>
							<c ca="center">
								<p>63</p>
							</c>
							<c ca="center">
								<p>12</p>
							</c>
							<c ca="center">
								<p>50</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Cullin</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>20</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>F-box</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>119</p>
							</c>
							<c ca="center">
								<p>130</p>
							</c>
							<c ca="center">
								<p>11</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>HectC</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
							<c ca="center">
								<p>64</p>
							</c>
							<c ca="center">
								<p>66</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>RING</p>
							</c>
							<c ca="center">
								<p>17</p>
							</c>
							<c ca="center">
								<p>374</p>
							</c>
							<c ca="center">
								<p>325</p>
							</c>
							<c ca="center">
								<p>18</p>
							</c>
							<c ca="center">
								<p>16</p>
							</c>
							<c ca="center">
								<p>29</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>KH</p>
							</c>
							<c ca="center">
								<p>23 (1)</p>
							</c>
							<c ca="center">
								<p>71</p>
							</c>
							<c ca="center">
								<p>52</p>
							</c>
							<c ca="center">
								<p>20</p>
							</c>
							<c ca="center">
								<p>7</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>RRM</p>
							</c>
							<c ca="center">
								<p>101 (2)</p>
							</c>
							<c ca="center">
								<p>443</p>
							</c>
							<c ca="center">
								<p>438</p>
							</c>
							<c ca="center">
								<p>94</p>
							</c>
							<c ca="center">
								<p>23</p>
							</c>
							<c ca="center">
								<p>69</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PDZ</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>260</p>
							</c>
							<c ca="center">
								<p>252</p>
							</c>
							<c ca="center">
								<p>17</p>
							</c>
							<c ca="center">
								<p>11</p>
							</c>
							<c ca="center">
								<p>23</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Kinase</p>
							</c>
							<c ca="center">
								<p>69 (2)</p>
							</c>
							<c ca="center">
								<p>949</p>
							</c>
							<c ca="center">
								<p>954</p>
							</c>
							<c ca="center">
								<p>210</p>
							</c>
							<c ca="center">
								<p>122</p>
							</c>
							<c ca="center">
								<p>156</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>LIM</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>128</p>
							</c>
							<c ca="center">
								<p>125</p>
							</c>
							<c ca="center">
								<p>22</p>
							</c>
							<c ca="center">
								<p>19</p>
							</c>
							<c ca="center">
								<p>22</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PHD</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>164</p>
							</c>
							<c ca="center">
								<p>122</p>
							</c>
							<c ca="center">
								<p>13</p>
							</c>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>Numbers in parentheses indicate the number of domains that had been annotated to a protein sequence from <it>A. mexicanum </it>prior to this project.</p>
					</tblfn>
				</tbl>
				<p>We assigned cellular functions to the identified domains and analyzed the output according to the functional distribution of contigs (Figure <figr fid="F3">3b</figr>). The most abundant domains were found in the category 'intracellular transport'; this is due to redundant annotations of small GTPases. The second largest fraction belonged to 'RNA-binding and metabolism', followed by 'DNA-binding and transcriptional control'.</p>
			</sec>
			<sec>
				<st>
					<p><it>In silico </it>differential display of <it>A. mexicanum </it>contigs in blastema and neural tube</p>
				</st>
				<sec>
					<st>
						<p>Regeneration versus development</p>
					</st>
					<p>We were interested to see if there were strong differences in the sequence representation of the libraries that reflect the different biological processes taking place in each tissue. To this end, we compared the representation of ESTs in the two libraries. This type of <it>in silico </it>differential display has been performed for ESTs in the NCBI collection, and, as with the NCBI differential display data, we have assessed the statistical significance of the differences using Fisher's exact test. A total of 104 contigs met the cutoff value of 0.005 in Fisher's exact test and can therefore be considered differentially expressed.</p>
					<p>Table <tblr tid="T4">4</tblr> provides a detailed comparison of EST representation categorized according to their biological process annotation. Considering the biological properties of the blastema tissue versus the neural tube tissue, we were particularly interested in differential display results of gene sequences that had been assigned to the biological functions of RNA metabolism (as an indicator of an high proliferation index), cell cycle and proliferation and differentiation. The blastema library was produced from tail tissue that was in the process of forming the blastema progenitor cells for regeneration. Blastema formation involves dedifferentiation of mature cells, and entry into rapid cell cycles. In contrast, the neural tube library contains tissue undergoing cell specification and differentiation, such as neurogenesis and somitogenesis. Although these embryonic tissues are still proliferating, the proliferation index of the cells from neural tube should be lower than from blastema.</p>
				</sec>
				<sec>
					<st>
						<p>RNA metabolism</p>
					</st>
					<p>A total of 168 contigs annotated under RNA metabolism (127 when normalized to the ratio of sequenced ESTs from blastema and neural tube) were more frequently sequenced or uniquely sequenced in blastema (6% of assigned contigs, 2.6% of all contigs). This group included RNA metabolism, RNA processing, splicing, editing, nuclear export, binding, catabolism, cleavage, capping, rRNA modification, rRNA transcription and tRNA aminoacetylation. Forty-five contigs assigned a process in RNA metabolism were upregulated or unique in neural tube (2% of assigned and 0.7% of all contigs). After Fisher's exact test analysis, 24 of the clones were considered differentially regulated in the two libraries; 22 out of the 24 contigs were enriched or unique in blastema (Table <tblr tid="T4">4</tblr>).</p>
				</sec>
				<sec>
					<st>
						<p>Cell cycle and proliferation</p>
					</st>
					<p>126 contigs (95 when normalized to sequencing ratios) were assigned as cell-cycle genes (5% of assigned contigs and 1.5% of total contigs) and were more frequently sequenced or uniquely sequenced in the blastema library, compared with 52 in the neural tube library (2.5% and 0.8%, respectively). This category included regulation of mitosis, mitosis, cell-cycle regulation, regulation of cyclin-dependent kinase (CDK) activity, cell proliferation, DNA replication, M phase, mitotic spindle checkpoint, mitotic spindle assembly, chromosome segregation and cytokinesis. As an example, 10 different types of cyclins were found, from various stages of the cell cycle. Seven of the contigs found in cell-cycle regulation met the cutoff criteria of statistical significance in Fisher's exact test. Five out of the seven contigs were more highly represented or unique in blastema (Table <tblr tid="T4">4</tblr>).</p>
				</sec>
				<sec>
					<st>
						<p>Differentiation</p>
					</st>
					<p>Whereas proliferation-associated genes were found with a higher sequence representation in the blastema library, genes that had been electronically annotated as involved in 'cell differentiation' had a higher representation in the neural tube library. A total of 28 contigs were electronically assigned the biological process 'differentiation'. After Fisher's exact test, five contigs showed differential regulation in this group. Three out of the five contigs were found in neural tube (Table <tblr tid="T4">4</tblr>). Taken together, these results indicate that the two cDNA libraries have differences in sequence representation that appear to correlate with the physiological processes taking place in the two tissues.</p>
				</sec>
			</sec>
			<sec>
				<st>
					<p>Gene families involved in cell-cycle control and development in the <it>A. mexicanum </it>dataset</p>
				</st>
				<p>As mentioned earlier, the Mexican axolotl is an important model organism for a number of reasons. First, it is the premier vertebrate model for studying regeneration. Second some aspects of caudate development, for instance mesoderm involution and notochord formation, more closely resemble those found in higher vertebrates than do those in other amphibian embryological models such as <it>X. laevis </it><abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Finally, the axolotl has interesting developmental features, particularly in relation to metamorphosis. The axolotl undergoes 'cryptic metamorphosis', which is defined by its existence in a perrenibranchiate state and retaining some larval features into adulthood (for instance gills, larval skin morphology, caudal fins). The animals become sexually mature in this state, and develop only small rudimentary lungs. So far, very few markers are available to study these processes in this organism.</p>
				<p>We examined our dataset for genes that are potentially useful for studying regeneration features or developmental processes. To this end, we analyzed our data for genes that are either involved in regulating the cell cycle - as would be expected for the highly proliferative tissue of a regenerating body structure - or could play an essential role during development and metamorphosis from the larval to the adult stage. A list of genes that could be assigned to either cell-cycle regulation or development is shown in Table <tblr tid="T6">6</tblr>. Among the genes involved in cell-cycle regulation were A-, B- and E-type cyclins, cyclin-dependent kinase 4 (Cdk4), Polo kinase, the kinase inhibitor p27<sup>Kip1</sup>, the protein phosphatase Cdc25A, as well as the anaphase-promoting complex (APC) activator proteins Cdc20 and Cdh1. Representing genes involved in developmental processes, we found transcription factors such as HoxA2, B12, C4 and C8, Pax6, as well as Cdx1 and Cdx2. Furthermore we found several genes for proteins that are part of the transforming growth factor-beta (TGF-&#946;) signaling pathway, such as TGF-&#946;, bone morphogenetic protein 1 (BMP-1), BMP and activin membrane-bound inhibitor, activin receptor type II, as well as the transcription factors Smad5 and Smad8. Genes for proteins such as Smad8 and BMPs might be of especial interest to the research field of embryonic development, as they have been associated with mesoderm involution <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Other important developmental genes that could be found in our dataset include those for Wnt5 and Wnt8, Sonic hedgehog, retinoblastoma binding protein 2, beta-catenin, as well as Frizzled 2, 5 and 7. Finally, it has been shown that the thyroid hormone receptor pathway has an essential role in the timing of metamorphosis in <it>A. mexicanum </it><abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. We identified the protein TRIP12 (thyroid hormone receptor interacting protein 12), which is a HECT-domain-containing ubiquitin ligase and could have an essential role in regulating thyroid hormone response during development and/or metamorphosis.</p>
				<tbl id="T6" hint_layout="single">
					<title>
						<p>Table 6</p>
					</title>
					<caption>
						<p>Gene families identified that are either involved in cell-cycle control or developmental processes</p>
					</caption>
					<tblbdy cols="4">
						<r>
							<c ca="left">
								<p>Cellular process</p>
							</c>
							<c ca="left">
								<p>Putative ID of contig</p>
							</c>
							<c ca="left">
								<p>Contig</p>
							</c>
							<c ca="left">
								<p>Expression</p>
							</c>
						</r>
						<r>
							<c cspan="4">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Cell cycle</p>
							</c>
							<c ca="left">
								<p>Cyclin A2</p>
							</c>
							<c ca="left">
								<p>Am_20</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cyclin B1</p>
							</c>
							<c ca="left">
								<p>Am_1031</p>
							</c>
							<c ca="left">
								<p>BL 3x</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cyclin B2</p>
							</c>
							<c ca="left">
								<p>Am_4185</p>
							</c>
							<c ca="left">
								<p>NT unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cyclin B3</p>
							</c>
							<c ca="left">
								<p>Am_3173</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cyclin E1</p>
							</c>
							<c ca="left">
								<p>Am_38</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cyclin E2</p>
							</c>
							<c ca="left">
								<p>Am_91</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cdk4</p>
							</c>
							<c ca="left">
								<p>Am_3891</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Polo kinase</p>
							</c>
							<c ca="left">
								<p>Am_1717</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cdc25A</p>
							</c>
							<c ca="left">
								<p>Am_3678</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>p27/Kip1</p>
							</c>
							<c ca="left">
								<p>Am_4671</p>
							</c>
							<c ca="left">
								<p>NT unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cdc20</p>
							</c>
							<c ca="left">
								<p>Am_2213</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cdh1</p>
							</c>
							<c ca="left">
								<p>Am_1148</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Development</p>
							</c>
							<c ca="left">
								<p>Wnt8</p>
							</c>
							<c ca="left">
								<p>Am_384</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Wnt5</p>
							</c>
							<c ca="left">
								<p>Am_642</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>FGFR4a</p>
							</c>
							<c ca="left">
								<p>Am_1393</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Sonic hedgehog</p>
							</c>
							<c ca="left">
								<p>Am_3741</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Activin receptor type II</p>
							</c>
							<c ca="left">
								<p>Am_3590</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TGF-&#946;</p>
							</c>
							<c ca="left">
								<p>Am_4990</p>
							</c>
							<c ca="left">
								<p>NT unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>BMP-1</p>
							</c>
							<c ca="left">
								<p>Am_4639</p>
							</c>
							<c ca="left">
								<p>NT unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cdx1</p>
							</c>
							<c ca="left">
								<p>Am_875</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Cdx2</p>
							</c>
							<c ca="left">
								<p>Am_387</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>HoxA2</p>
							</c>
							<c ca="left">
								<p>Am_2387</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>HoxB13</p>
							</c>
							<c ca="left">
								<p>Am_4865</p>
							</c>
							<c ca="left">
								<p>NT unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>HoxC4</p>
							</c>
							<c ca="left">
								<p>Am_3998</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>HoxC8</p>
							</c>
							<c ca="left">
								<p>Am_2910</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Pax6</p>
							</c>
							<c ca="left">
								<p>Am_2945</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Smad5</p>
							</c>
							<c ca="left">
								<p>Am_1420</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Smad8</p>
							</c>
							<c ca="left">
								<p>Am_4665</p>
							</c>
							<c ca="left">
								<p>NT unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Retinoblastoma binding protein 2</p>
							</c>
							<c ca="left">
								<p>Am_2723</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Beta-catenin</p>
							</c>
							<c ca="left">
								<p>Am_699</p>
							</c>
							<c ca="left">
								<p>BL 3x</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Zic5</p>
							</c>
							<c ca="left">
								<p>Am_2068</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Frizzled 2</p>
							</c>
							<c ca="left">
								<p>Am_3243</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Frizzled 5</p>
							</c>
							<c ca="left">
								<p>Am_3451</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Frizzled 7</p>
							</c>
							<c ca="left">
								<p>Am_2334</p>
							</c>
							<c ca="left">
								<p>BL unique</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>TRIP12</p>
							</c>
							<c ca="left">
								<p>Am_6416</p>
							</c>
							<c ca="left">
								<p>NT unique</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>The identifier of the <it>A. mexicanum </it>contig is given in the third column. The expression pattern as determined by <it>in silico </it>differential display is shown in column 4.</p>
					</tblfn>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Phylogenetic analysis of the CDKN1 gene family in vertebrates: amphibians contain an unusual CDKN1 family member</p>
				</st>
				<p>The EST collection will provide rich data for the phylogenetic comparison of particular genes. Cell cycle and cell differentiation are cellular functions that have been modified in various organisms through evolution and it will be interesting to understand the evolutionary basis of such changes. Here we analyze a particularly interesting gene family, the CDKN1 family of cell-cycle regulators which inhibit cell-cycle progression by binding to and inactivating CDKs. As a starting point for phylogenetic analysis, the mitochondrial 12S ribosomal RNA gene from our collection resulted in the expected tree, with the anuran amphibian <it>X. laevis </it>and the caudate <it>A. mexicanum </it>grouping together compared to other vertebrates such as fish, birds and mammals (Figure <figr fid="F4">4a</figr>). Next, we constructed an unrooted phylogenetic tree to compare members of the cyclin B family - cyclins B1, B2 and B3. The sequences of each family member formed strictly separate groups, with the <it>A. mexicanum </it>and <it>X. laevis </it>cyclin B1, B2 and B3 genes grouping with their vertebrate orthologs (Figure <figr fid="F4">4b</figr>).</p>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>Phylogenetic analysis of the vertebrate cyclin-dependent kinase (CDK) inhibitors (CKIs) p21(Cip1), p27(Kip1) and p57(Kip2)</p>
					</caption>
					<text>
						<p>Phylogenetic analysis of the vertebrate cyclin-dependent kinase (CDK) inhibitors (CKIs) p21(Cip1), p27(Kip1) and p57(Kip2). <b>(a) </b>Reference phylogenetic tree of mitochondrial 12S rRNA. The Caudata and Salientia both branch out to build the amphibian group. <b>(b) </b>Unrooted phylogenetic tree of the cyclin B1 gene family. The amphibian cyclin B1 family members form a distinct group. <b>(c) </b>Unrooted phylogenetic tree of the amino-terminal CDK-inhibitory domain of vertebrate p21, p27, p28 and p57, which is conserved between the protein families. p27 of <it>A. mexicanum </it>clearly groups with the p27 proteins from other vertebrates. The amphibian-specific p28-family does not parse with any singe group. Note, however, that unlike the 12S rRNA tree, the <it>A. mexicanum </it>and <it>A. t. tigrinum </it>p27 branch out with that of <it>D. rerio</it>. <b>(d) </b>Unrooted, phylogenetic tree of the full-length kinase inhibitor sequences. Using the full-length protein sequences from the CKI families, the p28 family branches off between the p21 and p27 families. <b>(e) </b>Multiple sequence alignment of the amino-terminal, CDK-inhibitory region of the CKI families. The protein sequence of <it>A. mexicanum </it>p27 is clearly the ortholog of the p27 family, yet displays higher than expected divergence on the protein level. The same divergence is observed for the ambystomatid p57 proteins. The p28 family has extremely high sequence divergence compared to any other CDKN1 family member. Conserved residues between the three CDKN1 families are highlighted in green and the p28-family in light blue. Residues that differ between ambystomatid sequences and the other vertebrate species are highlighted in the ambystomatid sequences in red. Accession numbers are: NM_131513 (<it>D. rerio </it>ccnb1), NM_031966 (<it>H. sapiens </it>ccnb1), BC041302 (<it>X. laevis </it>ccnb1), NM_172301 (<it>M. musculus </it>ccnb1), NM_171991 (<it>R. norvegicus </it>ccnb1), P13351 (<it>X. leavis </it>ccnb2), XP_343420 (<it>R. norvegicus </it>ccnb2), P29332 (<it>G. gallus </it>ccnb2), NP_004692 (<it>H. sapiens </it>ccnb2), NP_031656 (<it>M. musculus </it>ccnb2), CAC24491 (<it>X. laevis </it>ccnb3), P39963 (<it>G. gallus </it>ccnb3), CAC94915 (<it>H. sapiens </it>ccnb3), NP_898836 (<it>M. musculus </it>ccnb3), AAH56746.1 (<it>D. rerio </it>p27A, Drp27A); AAK84219.1 (<it>D. rerio </it>p27, Drp27); CN056871.1 (<it>A. t. tigrinum </it>p27, Attp27); AAM22491.1 (<it>G. gallus </it>p27, Ggp27); NP_004055.1 (<it>H. sapiens </it>p27, Hsp27); P46414 (<it>M. musculus </it>p27, Mmp27); NP_113950.1 (<it>R. norvegicus </it>p27, Rnp27); NP_000067.1 (<it>H. sapiens </it>p57, Hsp57); P49919 (<it>M. musculus </it>p57, Mmp57); XP_341967.1 (<it>R. norvegicus </it>p57, Rnp57); CN039016.1 (<it>A. mexicanum </it>p57, Amp57); BM489375.1 (<it>G. gallus </it>p57, Ggp57); CK697132.1 (<it>D. rerio </it>p57, Drp57); AAH01935.1 (<it>H. sapiens </it>p21, Hsp21); NP_031695.1 (<it>M. musculus </it>p21, Mmp21); NP_542960.1 (<it>R. norvegicus </it>p21, Rnp21); AL639561.2 (<it>X. tropicalis </it>p21, Xtp21); BJ065460.1 (<it>X. laevis </it>p21, Xlp21); AAN63876.1 (<it>G. gallus </it>p21, Ggp21); I51683 (<it>X. laevis </it>Xic1, XlXic1); BX712320.1 (<it>X. tropicalis </it>p28, Xtp28); TNeu143i03.p1cSP6 (<it>X. tropicalis </it>p28A, Xtp28A); CN033557.1 (<it>A. mexicanum </it>p28, Amp28); CN035131.1 (<it>A. mexicanum </it>p28A, Amp28A); CN033708.1 (<it>A. mexicanum </it>p28B, Amp28B). The scale bar indicates substitutions per site.</p>
					</text>
					<graphic file="gb-2004-5-9-r67-4"/>
				</fig>
				<p>In contrast, we obtained a quite different picture when we examined the CDKN1 family. In most vertebrates, this family consists of three members: p21 (CDKN1A), p27<sup>Kip1 </sup>(CDKN1B) and p57 (CDKN1C). In <it>X. laevis</it>, however, only a single family member called p28<sup>Kix1 </sup>(also called p27<sup>Xic1</sup>), which shows unusual sequence features compared to the p27 sequences from any other vertebrate species, had been described in the literature <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. We wondered whether <it>A. mexicanum </it>harbored the 'canonical' p27<sup>Kip1 </sup>or a p28<sup>Kix1 </sup>similar to that of <it>Xenopus</it>. We initially searched our <it>A. mexicanum </it>data for CDKN1 orthologs and, in contrast to <it>Xenopus</it>, we found a <it>bona fide </it>p27<sup>Kip1 </sup>sequence that clusters closer to vertebrate p27<sup>Kip1 </sup>sequences compared to the <it>Xenopus </it>p28<sup>Kix1 </sup>(Figure <figr fid="F4">4c,d</figr>). Considering this interesting finding, we then undertook a more complete analysis of the CDKN1 family in vertebrates by searching for CDKN1 family members in several databases: the sequenced genomes from human, mouse, rat, fugu or zebrafish, the recently released genome sequence of <it>X. tropicalis</it>, the <it>X. laevis </it>EST collection, the zebrafish and fugu genomes, and a complementary <it>A. mexicanum </it>and <it>A. tigrinum </it>EST set generated by Putta <it>et al</it>. <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
				<p>This data mining revealed two striking features about the distribution of CDKN1 family members among vertebrates (Table <tblr tid="T7">7</tblr>). First, the p28<sup>Kix1 </sup>orthologs were only found in amphibians (<it>X. tropicalis</it>, <it>X. laevis</it>, <it>A. mexicanum</it>, <it>A. tigrinum tigrinum</it>). We were not able to identify a p28<sup>Kix1</sup>-like gene in any other database. These p28 orthologs group as a distinct branch in an unrooted phylogenetic tree (Figure <figr fid="F4">4c,d</figr>). These data so far suggest that the p28 family is a CDK inhibitor that is specific for amphibians. With new genome sequence data being released, it will be interesting to see whether the most closely related lineage of birds contains a p28-like gene or whether this gene family is found solely in amphibians.</p>
				<tbl id="T7">
					<title>
						<p>Table 7</p>
					</title>
					<caption>
						<p>Occurrence of CKI-family members in different vertebrate species</p>
					</caption>
					<tblbdy cols="6">
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Human</p>
							</c>
							<c ca="left">
								<p>Zebrafish</p>
							</c>
							<c ca="left">
								<p>Fugu</p>
							</c>
							<c ca="left">
								<p>
									<it>Xenopus tropicalis</it>
								</p>
							</c>
							<c ca="left">
								<p>
									<it>Ambystoma mexicanum</it>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CDKN1A (p21)</p>
							</c>
							<c ca="left">
								<p>+</p>
							</c>
							<c ca="left">
								<p>-*</p>
							</c>
							<c ca="left">
								<p>+</p>
							</c>
							<c ca="left">
								<p>+</p>
							</c>
							<c ca="left">
								<p>-*</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CDKN1B (p27<sup>Kip1</sup>)</p>
							</c>
							<c ca="left">
								<p>+</p>
							</c>
							<c ca="left">
								<p>+</p>
							</c>
							<c ca="left">
								<p>+</p>
							</c>
							<c ca="left">
								<p>-<sup>&#8224;</sup></p>
							</c>
							<c ca="left">
								<p>+</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CDKN1C (p57)</p>
							</c>
							<c ca="left">
								<p>+</p>
							</c>
							<c ca="left">
								<p>+</p>
							</c>
							<c ca="left">
								<p>+</p>
							</c>
							<c ca="left">
								<p>-<sup>&#8224;</sup></p>
							</c>
							<c ca="left">
								<p>+</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>p28<sup>Kix1</sup></p>
							</c>
							<c ca="left">
								<p>-</p>
							</c>
							<c ca="left">
								<p>-</p>
							</c>
							<c ca="left">
								<p>-</p>
							</c>
							<c ca="left">
								<p>+<sup>&#8225;</sup></p>
							</c>
							<c ca="left">
								<p>+<sup>&#8225;</sup></p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>* Genes most likely present, yet not identified due to limited sequence information; &#8224; genes not present in genomic sequence information; &#8225; genes so far only present in amphibian species. Databases searched were the human, mouse, rat, fugu, zebrafish and <it>X. tropicalis </it>genome databases, and the EST databases for <it>X. laevis</it>, <it>X. tropicalis</it>, zebrafish, <it>A. mexicanum </it>and <it>A. tigrinum</it>.</p>
					</tblfn>
				</tbl>
				<p>Second, CDKN1B (p27<sup>Kip1</sup>) and CDKN1C (p57) were present in the <it>A. mexicanum </it>databases but were not found in either <it>X. laevis </it>or <it>X. tropicalis</it>, which have far more EST and genome sequence information (Table <tblr tid="T7">7</tblr>, Figure <figr fid="F4">4c,d</figr>). While it is not possible to conclude definitively that <it>Xenopus </it>species lack these genes, the current data are highly suggestive of such a scenario.</p>
				<p>We examined in depth the phylogenetic relationships of the CDKN1 family members among vertebrates by constructing unrooted phylogenetic trees, either using the most conserved, amino-terminal 88-amino-acid domain, which includes the functionally important Cdk2-interaction region, or the entire coding sequence. Analysis of the amino terminus showed that while <it>A. mexicanum </it>p27 and p57 clearly grouped with their respective orthologs from other vertebrates, the p28<sup>Kix1 </sup>proteins from axolotl and the two <it>Xenopus </it>species clustered as a group distinct from any of the other CDKN1 families (Figure <figr fid="F4">4c</figr>). The p28<sup>Kix1 </sup>family showed a closer relationship to p57 than to other CDKN1 members, branching off close to the p57 family. Phylogenetic analysis using the entire coding sequence of the CDKN1 genes, which includes the Cdk2- and PCNA-binding site, resulted in a closer grouping of p28 with the p27 branch (Figure <figr fid="F4">4d</figr>). In both cases, however, the p28 family clearly formed a separate group from the other CDKN1 families.</p>
			</sec>
			<sec>
				<st>
					<p>The <it>Ambystoma mexicanum </it>EST database</p>
				</st>
				<p>A relational database with a web-based front end was created to store, navigate and annotate analyzed contigs. The main object of the database is the annotated sequence contig, which contains information about its length, putative identity, computationally calculated expression profile, GO annotation, homologous proteins and identified domains, as well as number and identity of ESTs that build the contig (Figure <figr fid="F5">5a</figr>). The Gene Identifier (GI) and GO annotation can be modified by the administrator. To circumvent the problem of split contigs, we introduced a super-contig, to which related contigs can be assigned. Furthermore, the administrator can modify the relationship of EST to contig manually. All protein and domain alignments, as well as the assembly of the EST sequences of a contig are stored and can be viewed by the user. On the contig main page, three homologs at most from selected species are shown, with a full list of homologs from selected species displayed on the protein information page (Figure <figr fid="F5">5c</figr>). To make use easier, an image of the identified domains with the beginning and end base pair of the alignment is shown on the contig page. Individual ESTs can be accessed via the contig page, including their length, storage information, quality information and available trimmed EST-sequence (Figure <figr fid="F5">5b</figr>).</p>
				<fig id="F5">
					<title>
						<p>Figure 5</p>
					</title>
					<caption>
						<p>The <it>Ambystoma mexicanum </it>EST database</p>
					</caption>
					<text>
						<p>The <it>Ambystoma mexicanum </it>EST database. A relational database was created as a sequence storage and annotation resource of the sequenced ESTs from <it>A. mexicanum</it>. <b>(a) </b>The main entry site of the EST resource is the contig page, where a subset of the information is available, including the identity of included ESTs, putative identity of the contig, GO annotation including cellular role, biochemical function and cellular component, a list of homologs from different model organisms, and identified conserved domains. Source data are available for all BLAST-based alignments, for external sequence or domain data, and for the complete contig sequence. <b>(b,c) </b>EST information and protein information pages, containing more detailed description of storage information, library source and read length (b). A complete list of homologs and identified conserved domains can be assessed on the protein information page (c). For a more detailed description of the database, see text.</p>
					</text>
					<graphic file="gb-2004-5-9-r67-5"/>
				</fig>
				<p>Some of the main advantages of this database are: first, the direct links to source databases such as the NCBI sequence database, GO database, CDD, and the Smart and Pfam databases for identified domains; second, direct visualization of source data such as sequence alignments of contigs to homologs and domains, as well as alignments of EST assemblies; third, easy retrieval of sequences for further analysis like BLAST-searching; fourth, user-specific annotation of contigs; and fifth, easy manipulation and editing of contig annotations. The database will be available from <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>The salamander, and in particular the species <it>A. mexicanum, </it>represents an important vertebrate organism for evolutionary, developmental and regeneration studies. The salamanders provide an essential amphibian counterpoint to the anurans such as <it>X. laevis</it>, displaying distinct embryology and other physiological features. For example, mesoderm involution during gastrulation and subsequent notochord formation is distinctive between <it>A. mexicanum </it>and <it>X. laevis</it>. The characteristics of mesoderm involution in <it>A. mexicanum </it>more closely resemble those found in other vertebrates <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. This and other evidence indicates that <it>A. mexicanum </it>and other urodele amphibians are likely to have retained more ancestral features in common with the 'primitive' tetrapod compared to <it>X. laevis</it>, which appears to be more derived. It is interesting that we observed such segregation on the sequence level of the CDKN1 family. <it>X. laevis </it>appears to have a highly unusual make-up of CDKN1 family members. So far, CDKN1A (p21) and the highly derived p28<sup>Kix1 </sup>are the only CDKN1 family members found in both <it>X. laevis </it>and <it>X. tropicalis</it>. In contrast, the ambystomatids appear to have all the members of the CDKN1-family - including p28<sup>Kix1 </sup>- assuming that the p21 gene is missing purely as a result of lack of sequence information. In addition, our data suggest that p28 is an amphibian-specific variant of the CDKN1 family. Two major questions arise from these data: first, does the amphibian-specific p28 fulfill a cellular function that is unique to this phylogenetic lineage; and second, does the genotypic difference in the gene set of the CDKN1 family in the two amphibian species account for the macroscopic differences observed in developmental mechanisms. The fact that the CDKN1 family is an essential regulator of the cell cycle opens new possibilities for experimental research along these lines.</p>
			<p>Given the estimates in the number of genes present in the human genome (20,000-50,000) <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, we estimate that our EST contig set (6,377) contains between 10 to 25% of the total number of genes in the axolotl. While the database is not yet complete, it represents a significant proportion of the axolotl transcriptome. Further sequencing efforts, including an NIH-funded EST sequencing project for the axolotl <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, will enlarge the current dataset to provide a comprehensive gene sequence resource for this organism. Our analysis indicates that the majority of <it>A. mexicanum </it>genes are homologous to genes present in other vertebrates. Sixty-six percent of contigs gave a significant match in either the non-redundant protein or nucleotide databases, the EST databases or the human and fugu protein databases. Thirty-four percent of contigs could not be assigned a homolog in any of the searched databases, and 44% of those could not be assigned a coding sequence and are therefore considered to be part of the UTR. Nineteen percent of the contigs seem to represent novel genes that have not been found in any other organism so far.</p>
			<p>The expressed sequence tags generated in this study also provide a large source of sequence information for developmental and regeneration studies. For example, an examination of the database yielded 194 genes involved in cell proliferation, including pivotal cell-cycle genes such as those for Cdc2, 10 different cyclin family members, Cdk4 and p27. A search for developmental molecules involved in intercellular communication yielded Wnt8, Wnt5B, FGF receptor 4a (FGFR4a), Sonic hedgehog, BMP receptor (BMPR) and BMP-1, while a search for homeodomain-containing proteins yielded 11 members, including Cdx1, Cdx2, HoxA2, HoxC8 and HoxB13.</p>
			<p>The ESTs were derived from two cDNA libraries, stage 18-22 embryonic neural tube/notochord/somite tissue, and day-6 regenerating tail tissue. The embryonic library represents a developmental stage where tissue specification is occurring, whereas the blastema library represents a tissue that is undergoing dedifferentiation, rapid proliferation and cell respecification. Accordingly, we find differences in transcript representation in the two libraries. The blastema library is particularly enriched in cell-cycle genes and RNA metabolism genes, presumably reflecting the high proliferative index of the early regenerating blastema.</p>
		</sec>
		<sec>
			<st>
				<p>Conclusions</p>
			</st>
			<p>This set of 17,352 ESTs from <it>A. mexicanum </it>was generated to provide a comprehensive sequence dataset for the community of biologists. Forty percent of genes could still be found in singlets, which reflects a high diversity of sequences in our cDNA set. Annotation of the assembled contigs revealed a substantial difference in gene representation in the two sequence libraries, reflecting their biological source - regenerating blastema being in a highly proliferative state and embryonic neural tube being a tissue undergoing differentiation. Sequence analysis of assembled contigs revealed that 64% of genes had a putative homolog in other species; 19.4% of the contigs contained a putative coding sequence and can be considered novel genes. From this, we conclude that <it>A. mexicanum </it>does not contain an unusually high number of organism-specific genes. The CDK inhibitor family CDKN1 was selected for comparative phylogenetic analysis. Unlike the frogs <it>X. laevis </it>and <it>X. tropicalis</it>, ambystomatids most probably contain all members of the CDKN1 family, including the amphibian-specific protein p28<sup>Kix1</sup>/p27<sup>Xic1</sup>, which shows unusual sequence divergence compared to CDKN1 members in other vertebrate species. Such data would support the contention that <it>A. mexicanum </it>is closer to a basal tetrapod compared to <it>X. laevis</it>. The EST sequences and annotated contigs presented in this paper will be a publicly available and useful resource for research in various fields.</p>
		</sec>
		<sec>
			<st>
				<p>Materials and methods</p>
			</st>
			<sec>
				<st>
					<p>Plasmid cDNA library construction</p>
				</st>
				<p>Total RNA was purified using Trizol (Invitrogen) from 6-day regenerating tail blastemas and from neural tube-somite-notochord-containing tissue dissected from stage 18-22 <it>A. mexicanum </it>embryos. Total RNA quality was assessed by determining the relative brightness of the 28S:18S rRNA bands (2:1). For library construction mRNA was purified and size fractionated, then poly(dT)-primed cDNA was synthesized and directionally cloned into the <it>Not</it>I-<it>Sal</it>I sites of the pCMVSport6 vector. DNA was transformed by electroporation into EMDH10B-TONA bacteria (library construction performed by Invitrogen). Two separate, unnormalized libraries were produced. The blastema library contained an average insert size of 1.67 kb and 2.67 &#215; 10<sup>7 </sup>independent transformants and the neural tube library had an average insert size of 1.5 kb and 1.9 &#215; 10<sup>7 </sup>transformants. From each library 100,000 clones were arrayed into 384-well plates (Resource Zentrum/Primary Database, Berlin, Germany).</p>
			</sec>
			<sec>
				<st>
					<p>Sequencing</p>
				</st>
				<p>For sequencing, single-pass reads from the 5' end of the library inserts were performed using a custom-designed SP6 primer: GCACATTAGGCCTATTTAGGTGACA. DNA from bacterial library clones was amplified using the Templiphi reaction, based on &#966;29 rolling-circle replication of DNA (AP Biotech). Briefly, approximately 0.5 &#956;l of bacterial glycerol stocks were picked up using 96-pin plastic replicators (Genetix) and centrifuged into 96-well PCR plates. Five microliters of denaturing buffer was added, and samples heated to 95&#176;C for 3 min. After cooling, 5 &#956;l Templiphi enzyme was added and samples incubated overnight in a 30&#176;C incubator. The Templiphi reaction provides two advantages for large-scale sequencing projects on capillary sequencers. First, the reaction proceeds to an endpoint where all nucleotide is incorporated, yielding uniform quantities of DNA from varying amounts of starting bacteria (or DNA). Second, the rolling-circle reaction results in large pieces of DNA that, in contrast to plasmid DNA, do not enter the capillary and interfere with the sequencing run.</p>
				<p>For sequencing reactions, the DNA preparation was diluted fivefold with distilled water. Sequencing reactions were performed using the DYEnamic ET Dye terminator kit diluted twofold with DYEnamic ET dilution buffer (AP Biotech). Five microliters of DNA was added to 5 &#956;l of sequencing reaction mix with primer and cycled 30 times under the following conditions: 95&#176;C 20 sec, 60&#176;C 1 min. Sequencing was performed on a MegaBACE 1000 (AP Biotech). Runs were either performed at injection: 3 kV 60 sec, run: 8 kV 120 min, or injection: 3 kV 60 sec injection, run: 3 kV 360 min.</p>
			</sec>
			<sec>
				<st>
					<p>Analysis of library quality</p>
				</st>
				<p>The redundancy of the arrayed libraries was tested by performing BLASTN searches <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> against all sequenced ESTs from the two libraries. Hits against clones other than the query with an E-value lower than 1e-50 were considered for clustering.</p>
			</sec>
			<sec>
				<st>
					<p>Submission of ESTs to NCBI GenBank</p>
				</st>
				<p>The sequences were submitted to GenBank. After quality control, individual ESTs were used to search the non-redundant protein database (release of July 2004) using the program BLASTX from the standalone NCBI-BLAST package <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. For annotation of sequenced ESTs, the top hit of the BLAST output was used, whereby an E-value of 1e-20 was used for significant similarity and an E-value of 1e-05 was used as a cutoff value for weak similarity.</p>
			</sec>
			<sec>
				<st>
					<p>Analysis and assembly of sequence data</p>
				</st>
				<p>Quality control of sequenced ESTs was performed using the program Phred <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> using a cutoff of 20 for trimming low-quality regions, and vector trimming was performed using the program cross-match <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. (We note here that the arbitrary Phred score reflects the likelihood of a false base. A Phred score of 20 indicates that in 1 out of 100 trials (10<sup>2</sup>), the base would be false, 30 would reflect a wrongly sequenced base in 1 of 1,000 trials (10<sup>3</sup>), and so forth.) Sequence and contig files can be downloaded at <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. The resulting high-quality sequences were assembled into sequence contigs with the program TIGR-Assembler version 2 <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Alignment of contigs was performed with the program ClustalW with the settings Gap Opening 5 and Gap Extension 85 <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> or Cap3 <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, when ClustalW could not correctly assemble the sequences. Assembled contigs were used to perform BLAST searches (BLASTX, BLASTN from NCBI-BLAST <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>) against the non-redundant protein sequence database (release of November 2003), human and fugu protein databases and the NCBI EST database, all downloaded from the NCBI. Domain searches were done with RPS-BLAST against the conserved domain database (CDD <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>) from the NCBI. BLAST and domain-search output files were parsed for homologous sequences, whereby an E-value of 1e-05 was used as a cutoff for BLASTN and BLASTX searches against the sequence databases and the default cut-off of 0.01 was considered to yield significant homology to conserved domains from CDD. A gene identifier was assigned to those contigs that showed reliable homology to a sequence in the non-redundant database (E-value cutoff of 1e-20 for significant similarity and 1e-05 for weak similarity). Potential untranslated regions were identified using the program ESTScan <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Electronic annotation of contigs</p>
				</st>
				<p>Based on the GO annotation of the closest annotated homolog, contigs were assigned a molecular function, biological process and cellular component from the GO database <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. To this end, the GenBank annotation files from the GO database were downloaded and parsed for the gene identifier (gi) numbers of previously identified homologs. The cutoff for annotating an <it>A. mexicanum </it>contig was an E-value of 1e-20.</p>
			</sec>
			<sec>
				<st>
					<p>Isolation of the full-length p27<sup>Kip1 </sup>gene from the EST sequence</p>
				</st>
				<p>Two EST sequences of the p27<sup>Kip1 </sup>gene were sequenced in the EST collection but neither were full-length sequences. To isolate the full-length sequence, 200,000 clones of our arrayed blastema and neural tube libraries were screened by PCR. Briefly, the bacterial library clones in each 384-well plate were pooled, mini-prepped and arrayed into 96-well plates (RZPD, Berlin), resulting in 576 DNA pools. These DNA pools were screened by PCR using the custom SP6 primer (GCACATTAGGCCTATTTAGGTGACA) as a forward primer and a gene-specific p27 reverse primer (TGATTTCCAATGGCTGGTTT). Fifty nanograms of DNA from each pool was used for PCR reactions and PCR cycling was performed at the following conditions: 94&#176;C 2 min, 30 cycles of 94&#176;C 15 sec, 65.5&#176;C 30 sec, 72&#176;C 90 sec, followed by 72&#176;C 7 min). The largest positive band (1.1 kb) was gel purified and sequenced on an ABI377 machine using the SP6 primer.</p>
			</sec>
			<sec>
				<st>
					<p>Phylogenetic analysis</p>
				</st>
				<p>Multiple sequence alignments were done with the program ClustalX <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> using standard parameters. Phylogenetic analysis of mitochondrial 12S rRNA was done using the programs dnadist, phylogenetic analysis of the cyclin B family and the CDK inhibitor family (CKI family) was done using protdist, both from the Phylip package <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Trees were calculated with the program fitch from the same software package, using 100 iterations. For the CKI family, only the amino-terminal, CDK-inhibitory domain or the full-length sequences were used for construction of a phylogenetic tree. For the cyclin-B family, only the region overlapping in <it>A. mexicanum </it>contigs was used for tree construction. Trees were displayed using the program nj-plot <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> for the mitochondrial 12S rRNA tree and unrooted <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> for the CKI- and cyclin B families.</p>
			</sec>
			<sec>
				<st>
					<p>Database design</p>
				</st>
				<p>A relational database was created using the open source software MySQL as the database server to store and navigate through resulting sequence contigs and annotations. Scripts connecting the web-based front end to the database were written in the programming language Python.</p>
			</sec>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>We thank Wolfgang Zachariae, Ralf Kittler and S Randal Voss for critical reading of the manuscript. We are grateful to Tony Hyman, Albert Poustka and David Drechsel for advice and support. This work was funded by the Max Planck Institute of Molecular Cell Biology and Genetics, and the MeDDrive program of the Medical Faculty, Technical University of Dresden.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Phylogeny, variation, and morphological Integration.</p>
				</title>
				<aug>
					<au>
						<snm>Shubin</snm>
						<fnm>NWD</fnm>
					</au>
				</aug>
				<source>Am Zool</source>
				<pubdate>1996</pubdate>
				<volume>36</volume>
				<fpage>51</fpage>
				<lpage>60</lpage>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Genome size, secondary simplification, and the evolution of the brain in salamanders.</p>
				</title>
				<aug>
					<au>
						<snm>Roth</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Nishikawa</snm>
						<fnm>KC</fnm>
					</au>
					<au>
						<snm>Wake</snm>
						<fnm>DB</fnm>
					</au>
				</aug>
				<source>Brain Behav Evol</source>
				<pubdate>1997</pubdate>
				<volume>50</volume>
				<fpage>50</fpage>
				<lpage>59</lpage>
				<xrefbib>
					<pubid idtype="pmpid">9209766</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Constructing gene-enriched plant genomic libraries using methylation filtration technology.</p>
				</title>
				<aug>
					<au>
						<snm>Rabinowicz</snm>
						<fnm>PD</fnm>
					</au>
				</aug>
				<source>Methods Mol Biol</source>
				<pubdate>2003</pubdate>
				<volume>236</volume>
				<fpage>21</fpage>
				<lpage>36</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1385/1-59259-413-1:21</pubid>
						<pubid idtype="pmpid" link="fulltext">14501056</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Animal Genome Size Database</p>
				</title>
				<url>http://www.genomesize.com</url>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Microchemical deoxyribonucleic acid determination in individual cells.</p>
				</title>
				<aug>
					<au>
						<snm>Edstrom</snm>
						<fnm>JE</fnm>
					</au>
					<au>
						<snm>Kawiak</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>J Biophys Biochem Cytol</source>
				<pubdate>1961</pubdate>
				<volume>9</volume>
				<fpage>619</fpage>
				<lpage>626</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1083/jcb.9.3.619</pubid>
						<pubid idtype="pmpid">13725762</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Cytofluorometric DNA base determination in vertebrate species with different genome sizes.</p>
				</title>
				<aug>
					<au>
						<snm>Capriglione</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Olmo</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Odierna</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Improta</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Morescalchi</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Basic Appl Histochem</source>
				<pubdate>1987</pubdate>
				<volume>31</volume>
				<fpage>119</fpage>
				<lpage>126</lpage>
				<xrefbib>
					<pubid idtype="pmpid">3115252</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Conserved vertebrate chromosome segments in the large salamander genome.</p>
				</title>
				<aug>
					<au>
						<snm>Voss</snm>
						<fnm>SR</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>JJ</fnm>
					</au>
					<au>
						<snm>Gardiner</snm>
						<fnm>DM</fnm>
					</au>
					<au>
						<snm>Parichy</snm>
						<fnm>DM</fnm>
					</au>
				</aug>
				<source>Genetics</source>
				<pubdate>2001</pubdate>
				<volume>158</volume>
				<fpage>735</fpage>
				<lpage>746</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11404337</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Maize genome sequencing by methylation filtration.</p>
				</title>
				<aug>
					<au>
						<snm>Palmer</snm>
						<fnm>LE</fnm>
					</au>
					<au>
						<snm>Rabinowicz</snm>
						<fnm>PD</fnm>
					</au>
					<au>
						<snm>O'Shaughnessy</snm>
						<fnm>AL</fnm>
					</au>
					<au>
						<snm>Balija</snm>
						<fnm>VS</fnm>
					</au>
					<au>
						<snm>Nascimento</snm>
						<fnm>LU</fnm>
					</au>
					<au>
						<snm>Dike</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>de la Bastide</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Martienssen</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>McCombie</snm>
						<fnm>WR</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2003</pubdate>
				<volume>302</volume>
				<fpage>2115</fpage>
				<lpage>2117</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1091265</pubid>
						<pubid idtype="pmpid" link="fulltext">14684820</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>A molecular phylogenetic perspective on the evolutionary radiation of the salamander family Salamandridae.</p>
				</title>
				<aug>
					<au>
						<snm>Titus</snm>
						<fnm>TA</fnm>
					</au>
					<au>
						<snm>Larson</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Syst Biol</source>
				<pubdate>1995</pubdate>
				<volume>44</volume>
				<fpage>125</fpage>
				<lpage>151</lpage>
			</bibl>
			<bibl id="B10">
				<title>
					<p>The Indiana University Axolotl Colony</p>
				</title>
				<url>http://www.indiana.edu/~axolotl</url>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Base-calling of automated sequencer traces using phred. I. Accuracy assessment.</p>
				</title>
				<aug>
					<au>
						<snm>Ewing</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Hillier</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Wendl</snm>
						<fnm>MC</fnm>
					</au>
					<au>
						<snm>Green</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>1998</pubdate>
				<volume>8</volume>
				<fpage>175</fpage>
				<lpage>185</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">9521921</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.</p>
				</title>
				<aug>
					<au>
						<snm>Altschul</snm>
						<fnm>SF</fnm>
					</au>
					<au>
						<snm>Madden</snm>
						<fnm>TL</fnm>
					</au>
					<au>
						<snm>Schaffer</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Miller</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Lipman</snm>
						<fnm>DJ</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1997</pubdate>
				<volume>25</volume>
				<fpage>3389</fpage>
				<lpage>3402</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
						<pubid idtype="pmpid" link="fulltext">9254694</pubid>
						<pubid idtype="pmcid">146917</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>TIGR Assembler: a new tool for assembling large shotgun sequencing projects.</p>
				</title>
				<aug>
					<au>
						<snm>Sutton</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>White</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Adams</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kerlavage</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Genome Sci Technol</source>
				<pubdate>1995</pubdate>
				<volume>1</volume>
				<fpage>9</fpage>
				<lpage>19</lpage>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Salamander Genome Project.</p>
				</title>
				<aug>
					<au>
						<snm>Voss</snm>
						<fnm>SR</fnm>
					</au>
					<au>
						<snm>Parichy</snm>
						<fnm>DM</fnm>
					</au>
				</aug>
				<source>Axolotl Newslett</source>
				<pubdate>2001</pubdate>
				<volume>29</volume>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Salamander Genome Project</p>
				</title>
				<url>http://salamander.uky.edu</url>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Supplementary data</p>
				</title>
				<url>http://www.mpi-cbg.de/~habermann</url>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.</p>
				</title>
				<aug>
					<au>
						<snm>Ashburner</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Ball</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Blake</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Butler</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Cherry</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Davis</snm>
						<fnm>AP</fnm>
					</au>
					<au>
						<snm>Dolinski</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Dwight</snm>
						<fnm>SS</fnm>
					</au>
					<au>
						<snm>Eppig</snm>
						<fnm>JT</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2000</pubdate>
				<volume>25</volume>
				<fpage>25</fpage>
				<lpage>29</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/75556</pubid>
						<pubid idtype="pmpid" link="fulltext">10802651</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>CDD: a database of conserved domain alignments with links to domain three-dimensional structure.</p>
				</title>
				<aug>
					<au>
						<snm>Marchler-Bauer</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Panchenko</snm>
						<fnm>AR</fnm>
					</au>
					<au>
						<snm>Shoemaker</snm>
						<fnm>BA</fnm>
					</au>
					<au>
						<snm>Thiessen</snm>
						<fnm>PA</fnm>
					</au>
					<au>
						<snm>Geer</snm>
						<fnm>LY</fnm>
					</au>
					<au>
						<snm>Bryant</snm>
						<fnm>SH</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2002</pubdate>
				<volume>30</volume>
				<fpage>281</fpage>
				<lpage>283</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/nar/30.1.281</pubid>
						<pubid idtype="pmpid" link="fulltext">11752315</pubid>
						<pubid idtype="pmcid">99109</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Evolution of predetermined germ cells in vertebrate embryos: implications for macroevolution.</p>
				</title>
				<aug>
					<au>
						<snm>Johnson</snm>
						<fnm>AD</fnm>
					</au>
					<au>
						<snm>Drum</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Bachvarova</snm>
						<fnm>RF</fnm>
					</au>
					<au>
						<snm>Masi</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>White</snm>
						<fnm>ME</fnm>
					</au>
					<au>
						<snm>Crother</snm>
						<fnm>BI</fnm>
					</au>
				</aug>
				<source>Evol Dev</source>
				<pubdate>2003</pubdate>
				<volume>5</volume>
				<fpage>414</fpage>
				<lpage>431</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1046/j.1525-142X.2003.03064.x</pubid>
						<pubid idtype="pmpid">12823457</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p><it>Xenopus </it>Smad8 acts downstream of BMP-4 to modulate its activity during vertebrate embryonic patterning.</p>
				</title>
				<aug>
					<au>
						<snm>Nakayama</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Snyder</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Grewal</snm>
						<fnm>SS</fnm>
					</au>
					<au>
						<snm>Tsuneizumi</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Tabata</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Christian</snm>
						<fnm>JL</fnm>
					</au>
				</aug>
				<source>Development</source>
				<pubdate>1998</pubdate>
				<volume>125</volume>
				<fpage>857</fpage>
				<lpage>867</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">9449668</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Candidate gene analysis of metamorphic timing in ambystomatid salamanders.</p>
				</title>
				<aug>
					<au>
						<snm>Voss</snm>
						<fnm>SR</fnm>
					</au>
					<au>
						<snm>Prudic</snm>
						<fnm>KL</fnm>
					</au>
					<au>
						<snm>Oliver</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Shaffer</snm>
						<fnm>HB</fnm>
					</au>
				</aug>
				<source>Mol Ecol</source>
				<pubdate>2003</pubdate>
				<volume>12</volume>
				<fpage>1217</fpage>
				<lpage>1223</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12694285</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Candidate gene analysis of thyroid hormone receptors in metamorphosing vs. nonmetamorphosing salamanders.</p>
				</title>
				<aug>
					<au>
						<snm>Voss</snm>
						<fnm>SR</fnm>
					</au>
					<au>
						<snm>Shaffer</snm>
						<fnm>HB</fnm>
					</au>
					<au>
						<snm>Taylor</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Safi</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Laudet</snm>
						<fnm>V</fnm>
					</au>
				</aug>
				<source>Heredity</source>
				<pubdate>2000</pubdate>
				<volume>85</volume>
				<fpage>107</fpage>
				<lpage>114</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1046/j.1365-2540.2000.00714.x</pubid>
						<pubid idtype="pmpid">11012711</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Tadpole competence and tissue-specific temporal regulation of amphibian metamorphosis: roles of thyroid hormone and its receptors.</p>
				</title>
				<aug>
					<au>
						<snm>Shi</snm>
						<fnm>YB</fnm>
					</au>
					<au>
						<snm>Wong</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Puzianowska-Kuznicka</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Stolow</snm>
						<fnm>MA</fnm>
					</au>
				</aug>
				<source>BioEssays</source>
				<pubdate>1996</pubdate>
				<volume>18</volume>
				<fpage>391</fpage>
				<lpage>399</lpage>
				<xrefbib>
					<pubid idtype="pmpid">8639162</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>Cloning and characterization of the <it>Xenopus </it>cyclin-dependent kinase inhibitor p27XIC1.</p>
				</title>
				<aug>
					<au>
						<snm>Su</snm>
						<fnm>JY</fnm>
					</au>
					<au>
						<snm>Rempel</snm>
						<fnm>RE</fnm>
					</au>
					<au>
						<snm>Erikson</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Maller</snm>
						<fnm>JL</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1995</pubdate>
				<volume>92</volume>
				<fpage>10187</fpage>
				<lpage>10191</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">40761</pubid>
						<pubid idtype="pmpid" link="fulltext">7479751</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Cell cycle control by <it>Xenopus </it>p28Kix1, a developmentally regulated inhibitor of cyclin-dependent kinases.</p>
				</title>
				<aug>
					<au>
						<snm>Shou</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Dunphy</snm>
						<fnm>WG</fnm>
					</au>
				</aug>
				<source>Mol Biol Cell</source>
				<pubdate>1996</pubdate>
				<volume>7</volume>
				<fpage>457</fpage>
				<lpage>469</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">275897</pubid>
						<pubid idtype="pmpid">8868473</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>From biomedicine to natural history research: EST resources for ambystomatid salamanders.</p>
				</title>
				<aug>
					<au>
						<snm>Putta</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>JJ</fnm>
					</au>
					<au>
						<snm>Walker</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Mathieu</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Weisrock</snm>
						<fnm>DW</fnm>
					</au>
					<au>
						<snm>Monaghan</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Samuels</snm>
						<fnm>AK</fnm>
					</au>
					<au>
						<snm>Kump</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>King</snm>
						<fnm>DC</fnm>
					</au>
					<au>
						<snm>Maness</snm>
						<fnm>NJ</fnm>
					</au>
					<etal/>
				</aug>
				<source>BMC Genomics</source>
				<pubdate>2004</pubdate>
				<volume>5</volume>
				<fpage>54</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">509418</pubid>
						<pubid idtype="pmpid" link="fulltext">15310388</pubid>
						<pubid idtype="doi">10.1186/1471-2164-5-54</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>The Axolotl EST database</p>
				</title>
				<url>https://intradb.mpi-cbg.de/axolotl</url>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Human genome. A low number wins the GeneSweep pool.</p>
				</title>
				<aug>
					<au>
						<snm>Pennisi</snm>
						<fnm>I</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2003</pubdate>
				<volume>300</volume>
				<fpage>1484</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.300.5625.1484b</pubid>
						<pubid idtype="pmpid" link="fulltext">12791949</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.</p>
				</title>
				<aug>
					<au>
						<snm>Thompson</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Higgins</snm>
						<fnm>DG</fnm>
					</au>
					<au>
						<snm>Gibson</snm>
						<fnm>TJ</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1994</pubdate>
				<volume>22</volume>
				<fpage>4673</fpage>
				<lpage>4680</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">308517</pubid>
						<pubid idtype="pmpid">7984417</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>CAP3: A DNA sequence assembly program.</p>
				</title>
				<aug>
					<au>
						<snm>Huang</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Madan</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>1999</pubdate>
				<volume>9</volume>
				<fpage>868</fpage>
				<lpage>877</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1101/gr.9.9.868</pubid>
						<pubid idtype="pmpid" link="fulltext">10508846</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences.</p>
				</title>
				<aug>
					<au>
						<snm>Iseli</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Jongeneel</snm>
						<fnm>CV</fnm>
					</au>
					<au>
						<snm>Bucher</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Proc Int Conf Intell Syst Mol Biol</source>
				<pubdate>1999</pubdate>
				<fpage>138</fpage>
				<lpage>148</lpage>
				<xrefbib>
					<pubid idtype="pmpid">10786296</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Multiple sequence alignment with Clustal X.</p>
				</title>
				<aug>
					<au>
						<snm>Jeanmougin</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Thompson</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Gouy</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Higgins</snm>
						<fnm>DG</fnm>
					</au>
					<au>
						<snm>Gibson</snm>
						<fnm>TJ</fnm>
					</au>
				</aug>
				<source>Trends Biochem Sci</source>
				<pubdate>1998</pubdate>
				<volume>23</volume>
				<fpage>403</fpage>
				<lpage>405</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0968-0004(98)01285-7</pubid>
						<pubid idtype="pmpid" link="fulltext">9810230</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>PHYLIP - phylogeny inference package (version 3.2).</p>
				</title>
				<aug>
					<au>
						<snm>Felsenstein</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Cladistics</source>
				<pubdate>1989</pubdate>
				<volume>5</volume>
				<fpage>164</fpage>
				<lpage>166</lpage>
			</bibl>
			<bibl id="B34">
				<title>
					<p>WWW-query: an on-line retrieval system for biological sequence banks.</p>
				</title>
				<aug>
					<au>
						<snm>Perriere</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Gouy</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Biochimie</source>
				<pubdate>1996</pubdate>
				<volume>78</volume>
				<fpage>364</fpage>
				<lpage>369</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0300-9084(96)84768-7</pubid>
						<pubid idtype="pmpid" link="fulltext">8905155</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>
