<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2006-7-1-102</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Opinion</dochead>
		<bibl>
			<title>
				<p>On the evolution of the standard amino-acid alphabet</p>
			</title>
			<aug>
				<au id="A1">
					<snm>Lu</snm>
					<fnm>Yi</fnm>
					<insr iid="I1"/>
				</au>
				<au id="A2" ca="yes">
					<snm>Freeland</snm>
					<fnm>Stephen</fnm>
					<insr iid="I1"/>
					<email>freeland@umbc.edu</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Department of Biological Sciences, University of Maryland, Baltimore County, Baltimore, MD 21250, USA</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2006</pubdate>
			<volume>7</volume>
			<issue>1</issue>
			<fpage>102</fpage>
			<url>http://genomebiology.com/2006/7/1/102</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">16515719</pubid><pubid idtype="doi">10.1186/gb-2006-7-1-102</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<pub>
				<date>
					<day>1</day>
					<month>2</month>
					<year>2006</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2006</year>
			<collab>BioMed Central Ltd</collab>
		</cpyrt>
		<shorttitle>
			<p>On the evolution of the standard amino-acid alphabet</p>
		</shorttitle>
		<shortabs>
			<p>Fresh insights into the origin of the amino-acid alphabet are now emerging from disciplines as diverse as astrobiology, biochemical engineering and bioinformatics.</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<p>Although one standard amino-acid 'alphabet' is used by most organisms on Earth, the evolutionary cause(s) and significance of this alphabet remain elusive. Fresh insights into the origin of the alphabet are now emerging from disciplines as diverse as astrobiology, biochemical engineering and bioinformatics.</p>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010001">Biochemistry and structural biology</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p/>
			</st>
			<p>At the root of biology there are a handful of biochemical standards, the ubiquity of which tempts us to take them for granted. One is the standard 'alphabet' of 20 encoded amino acids, shared by organisms that diverged as early as <it>Escherichia coli </it>and human beings. But numerous lines of evidence, from abiotic chemistry to protein engineering, combine to indicate that this alphabet could potentially have consisted of fewer, more, or just plain different amino acids. So why have these 20 become the standard alphabet?</p>
			<p>Extensive scientific research has explored both the order by which amino acids entered the primordial genetic code and the ways in which variations of the alphabet affect the structure and function of proteins. But knowing the history of the alphabet's formation and appreciating the high tolerance of protein structures for alternative constituents merely highlights the deeper question of the alphabet's cause. New research, from synthetic biology <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>, genomic analysis <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> and computational biochemistry <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>, is shedding new light on the question. Greater understanding in this area would potentially help scientific adventures as diverse as the search for extraterrestrial life and the drive to improve standard bioinformatic procedures such as homology detection and protein-structure prediction.</p>
		</sec>
		<sec>
			<st>
				<p>Why ask why?</p>
			</st>
			<p>Given the phenotypic diversity that has evolved, the revelation in the 20th century of a highly conserved biochemical framework beneath that diversity was remarkable. This uniformity - which goes from the structure of DNA, via the 'central dogma' of molecular biology that 'genes make RNA make proteins', to the codon assignments of the standard genetic code - spurred the scientific revolution that has carried us into the post-genomic era.</p>
			<p>But behind the biochemical canon lie the deeper questions of why life is built this way, including the question of why proteins are constructed using a standard alphabet of exactly these 20 amino acids. Although recent publications have considered similar questions for nucleic acids <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, nucleotides <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> and even ribose <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, the cause(s) of the amino-acid alphabet have not been fully and directly addressed in more than two decades <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Indeed, most authors have considered the amino-acid alphabet as a mere sub-component of a multifaceted phenomenon - the genetic code <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. But understanding whether the amino-acid alphabet reflects some independent logic of its own would provide valuable input on two very different research fronts.</p>
			<p>In one direction, as astrobiology turns skywards to search for extraterrestrial life <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, it behoves us to ask what exactly we are looking for. Should we anticipate a more or less universal biochemistry? Pace <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and Benner <it>et al</it>. <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> have each considered this question, only to reach opposite conclusions. Without a quantitative framework for these analyses, it is hard to evaluate who has the stronger argument. At worst, the current absence of such a framework seems to encourage the specter of pseudo-scientific claims of a mysterious 'external force' directing the natural world that continues to haunt American popular culture <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
			<p>With feet firmly back on Earth, a deeper understanding of amino-acid biochemistry is also of major importance to the emerging field of bioinformatics. In particular, protein sequence alignment (which underpins homology searching, phylogenetic reconstruction and even protein-structure prediction) is built up essentially from a quantitative model of amino-acid similarity. Increasingly, researchers are seeking further improvements here by replacing generalized, global models of observed amino-acid substitution patterns (models, such as PAM <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> and BLOSUM <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, that apply to all proteins in all organisms) with specialized models, such as those used for particular protein families <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp> or for genomes that have evolved under unusual mutation biases or selection regimes <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Discovering in detail how the amino-acid alphabet evolved (developing its 'quantitative etiology') could make it possible to unify such models into a common theoretical framework derived from biophysical considerations.</p>
			<p>In fact, these two seemingly very different research frontiers, exobiology and bioinformatics, meet at several unexpected junctures. For example, some researchers interpret recent insights into the variation and distribution of protein folds as clues that the particular protein families that we find populating our biosphere were as inevitable to evolution as inorganic crystal structures are to physics <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. This fascinating idea is of equal relevance to drug design and protein-structure prediction as it is to exobiology. Its proponents have so far, however, failed to consider the role of the amino-acid alphabet from which protein folds are constructed. If the standard alphabet were different, what would the impact be on protein evolution? Analysis of protein-space fold suggests that the answer is not trivial <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Encouragingly, emerging technologies such as chemoinformatics are opening up new approaches to the exploration of amino-acid etiology, more cheaply and rapidly than anything that has been done before. The time is ripe to reassess what we know and thus to highlight directions for future investigation.</p>
		</sec>
		<sec>
			<st>
				<p>Could alternative alphabets have been encoded?</p>
			</st>
			<p>In seeking a justification for the 20 amino acids we have, we imply that other alphabets were possible. Is this really the case? Early explanations for the size and content of the standard alphabet worked from the very premise that what we see today was somehow an inevitable outcome (see <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> for a review). But as scientific progress undermined these flawed ideas, only one argument against alternative alphabets retained its plausibility. This was the general evolutionary observation that as organisms evolve an increasing complexity, emerging characters can easily become 'locked in' by subsequent evolutionary innovations that are adaptive only in relation to these early characters. Perhaps, then, the first amino acids to enter the code, for whatever reason, were frozen into evolutionary history by a proteome (and hence metabolism) built from them?</p>
			<p>Until recently, it did indeed appear that the potential for proteomic disruption was preventing any natural turnover of the standard amino-acid alphabet. Even the discovery of a widely distributed, 21st 'encoded' amino acid - selenocysteine (Sec) - appeared to support this view, once it was realized that significant extra molecular machinery is required for selenocysteine translation. Specifically, there is no explicit selenocysteine aminoacyl-tRNA synthetase that charges an appropriate tRNA; rather, serine aminoacyl-tRNA synthetase charges tRNA<sup>Sec </sup>with (canonical) serine <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Enzymes then modify the serine into selenocysteine <it>in situ </it>while it is attached to the tRNA. Furthermore, a <it>cis</it>-encoded mRNA secondary structure downstream of the relevant codon is required to pause translation long enough for special elongation factors to supervise the incorporation of selenocysteine (reviewed in <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>). All in all, one might view this as <it>prime facie </it>evidence that that the standard amino-acid alphabet is hard to change.</p>
			<p>Biochemical engineering has, however, steadily built up a contrasting picture of flexibility that suggests that a rethink is in order. To start with, something close to 100 non-standard amino acids have been successfully incorporated into various 'natural' protein structures <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. The biochemistry of protein folds does not therefore tightly restrict the contents of the alphabet - although it remains to be seen whether different alphabets could enable fundamentally different folds. Nor is the alphabet directly and obviously limited by constraints of the translational machinery, as several studies have introduced 'unnatural' amino acids into the genetic code <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp> through rational modification of appropriate tRNAs and the aminoacyl-tRNA synthetase molecules that charge them (see <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> and references therein).</p>
			<p>Most directly of all, the discovery of a 22nd encoded amino acid, pyrrolysine, shows that the alphabet can grow and change naturally, not just in the laboratory. Like the 20 standard amino acids, pyrrolysine has its own aminoacyl-tRNA synthetase and its translation requires no unusual <it>cis </it>or <it>trans </it>elements (see <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> for an overview). Viewed in this light, the special decoding arrangements for selenocysteine, including its <it>in situ </it>modification from seryl-tRNA into selenocysteinyl-tRNA, can be interpreted as exactly the sort of evolutionary intermediate that might be expected to arise during alphabet expansion under natural selection, as a way of minimizing disruption to preexisting coded protein products. Indeed, the knowledge that <it>in situ </it>tRNA modification is exactly how two of the standard amino acids (glutamine and asparagine) are coded in many microorganisms adds credibility to this interpretation (see <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> and references therein) and sits well with theories for the origin of the standard alphabet.</p>
		</sec>
		<sec>
			<st>
				<p>Where did the standard alphabet come from?</p>
			</st>
			<p>The biggest single clue to understanding the origin of the standard amino-acid alphabet comes from our understanding of the prebiotic chemistry of Earth (see, for example <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>) and space (see, for example <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>), which suggests that amino acids were likely to have been obvious commodities that primordial life has exploited. The standard amino-acid alphabet is no mere passive reflection of chemistry, however: any correlation between an amino acid's likely prebiotic abundance and its presence within the standard alphabet is weak <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Moreover, even the most optimistic assessment admits that lysine, arginine and histidine have never been observed in simulation experiments or in meteorites <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. In other words, it is clear that not all prebiotically synthesized amino acids ended up in the standard alphabet, and equally clear that not all members of the standard alphabet were prebiotically synthesized (Figure <figr fid="F1">1</figr>).</p>
			<fig id="F1">
				<title>
					<p>Figure 1</p>
				</title>
				<caption>
					<p>A Venn diagram showing different categories of amino acids: abiotic, approximately 80 amino acids which were probably produced by abiotic synthesis before life evolved (see, for example, [53]); biosynthetic, approximately 900 amino acids which are produced by natural biosynthetic pathways [54,55]; and engineered, at least 118 amino acids which have been experimentally engineered and placed into proteins by biomedical research projects [56]</p>
				</caption>
				<text>
					<p>A Venn diagram showing different categories of amino acids: abiotic, approximately 80 amino acids which were probably produced by abiotic synthesis before life evolved (see, for example, [53]); biosynthetic, approximately 900 amino acids which are produced by natural biosynthetic pathways [54,55]; and engineered, at least 118 amino acids which have been experimentally engineered and placed into proteins by biomedical research projects [56]. The group of coded amino acids includes the standard amino-acid alphabet of 20 coded amino acids and the coded and biosynthetic amino acids selenocysteine [26] and pyrrolysine [29], as well as at least 30 engineered amino acids which have been cotranslationally incorporated into proteins [28]. One example is shown for each region of the Venn diagram. At least some of the 20 coded amino acids are thought to have originated as biosynthetic modifications of the others. The diagram shows that the 20 coded amino acids of the standard amino-acid alphabet are a small subset of what was chemically and/or biologically possible.</p>
				</text>
				<graphic file="gb-2006-7-1-102-1"/>
			</fig>
			<p>The latter observation has received the most attention to date, stimulating theories that at least some of the 20 standard amino acids originated as biosynthetic modifications of the others (Figure <figr fid="F1">1</figr>). In particular, Wong <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> extensively developed the idea that the order in which amino acids were added to the alphabet can be seen from the metabolic pathways by which amino acids are biosynthesized in present-day organisms.</p>
			<p>But even a 'consensus order' <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> derived from many different precise models of alphabet expansion cannot explain the current situation fully, because all organisms biosynthetically derive amino acids that are definitely not incorporated into the genetic code. Of these, ornithine, citrulline and homoserine are three of the most ubiquitous, although for many lineages the total number is undoubtedly in the hundreds, if not the thousands <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Moreover, post-translational modification introduces many further amino acids into proteins without them ever being 'coded' in any meaningful sense <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Of course the term 'amino acid' describes the infinite series of molecular structures that contain both an amino and a carboxyl (acid) group, many of which could plausibly be biosynthesized by the right protein machinery. And let us not forget that within the standard alphabet, proline does not meet even these minimal requirements because it is an amino acid in which a cyclic side chain binds back to the 'backbone' nitrogen, generating a C = NH group where the amino acids have the NH<sub>2 </sub>group.</p>
			<p>At a deeper level, it is not entirely clear why early evolutionary expansion of the alphabet should have occurred at all. Experimental and theoretical analyses of amino-acid alphabet size (see <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>, respectively, and references therein) suggest that a much smaller amino-acid alphabet might be sufficient to produce most of the fold structures that have been observed. Such hypothetical alphabets are much more plausible starting points, given the amino acids that are thought to have been generated by prebiotic chemistry. So we need to ask again, why have these 20 amino acids been used in the code?</p>
		</sec>
		<sec>
			<st>
				<p>Evolutionary causes for the size and contents of the alphabet</p>
			</st>
			<p>To date, only one publication from 1981 has offered detailed, case-by-case, feature-by-feature justifications for the members of the standard amino-acid alphabet <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, "... on the basis of the availability in the primitive ocean, function in proteins, the stability of the amino acid and its peptides, stability to racemization, and stability on the transfer RNA". The specific explanations given for individual members of the standard alphabet in this work <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> were all strictly qualitative, however, and they are hard to assess, beyond being plausible. At best, then, we have some good ideas for the themes involved in amino-acid alphabet selection. At worst, we have untestable explanations that critics could dismiss as 'adaptive storytelling'. One pointed example is that the dismissal of &#946;-amino acids on the grounds that they could not support stable secondary structures <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, turns out to be incorrect <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>.</p>
			<p>Contrasting with these specific arguments, others have certainly suggested general, adaptive criteria, although often only as brief comments within work of a different primary focus. Among the most common is that the amino-acid alphabet was somehow selected for its biochemical diversity: for example, Szathmary <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> suggests that "proteins provided a greater catalytic versatility than nucleic acids (20 versus 4 building blocks)". But simulations of protein evolution consistently indicate a high degree of functional redundancy in the standard alphabet (see, for example, <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>), suggesting that diversity alone is not a good explanation. Also, at an intuitive level, the presence of the very similar amino acids valine, leucine and isoleucine suggests that biochemical diversity is hardly maximized in the standard alphabet. Another possible explanatory factor derives from the observation that bulky amino acids, such as phenylalanine and tyrosine, are used much less within 'natural' proteins than simple and small alternatives <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>. Perhaps entry into the standard alphabet was restricted to the smallest and cheapest amino acids that could form a functional protein library following simple, economic principles?</p>
			<p>Of course, many other adaptive criteria can easily be formulated; the question is how we can render such speculations as testable science. In principle, statistical analysis would allow us to test whether the standard amino-acid alphabet forms a non-random collection against the background of plausible alternatives, provided we have reliable, quantitative metrics of important biophysical properties (for example, size, charge and hydrophobicity) for all the relevant molecules. A wealth of such data already exists for the 20 standard amino acids: indeed, the AAIndex database <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp> has collated many of these into a free online resource. These data do not, however, extend to non-standard amino acids, for the simple reason that synthesis of a molecule and analysis of its biophysical properties is a slow and expensive endeavor, even for a small molecule. For the hundreds of biosynthetically available alternatives, let alone the thousands that are biochemically plausible, such constraints are prohibitive.</p>
		</sec>
		<sec>
			<st>
				<p>New technologies to address old questions</p>
			</st>
			<p>It is in the analysis of the properties of hundreds of compounds that emerging technologies seem set to open new research possibilities. Specifically, the explosive growth in computational power and sophistication that biologists encounter through bioinformatics extends into chemical realms ('chemoinformatics'), particularly in the form of algorithms to predict the shape and properties of user-defined molecules (see, for example, <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>). Although accurate predictions remain elusive for macromolecules such as proteins <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, there have been steady improvements in the prediction of structure (see, for example, <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>) and biophysical properties (see, for example, <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>) of smaller molecules. This, then, offers a relatively quick and low-cost approach to exploring the chemically possible amino acids. Theoretical predictions must be developed with caution, under the guidance of empirical data; this challenge is easily met when considering amino acids, however, because the experimentally derived metrics of the 20 standard amino acids offer a natural 'control group' for testing the accuracy of computational predictions.</p>
			<p>Thus, the computational infrastructure of 21st-century biochemistry puts us within reach of asking what, if any, properties of the standard amino-acid alphabet distinguish its contents from the vast array of prebiotically and biosynthetically plausible alternatives - and for only a modest investment of time and money. It is possible that this cornerstone of biochemistry will defy all attempts at logical explanation, leaving us to conclude that the emergence of the standard amino-acid alphabet was an entirely arbitrary outcome. It would certainly match one school of evolutionary thinking <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> if it was discovered that the whole of life is in fact built upon meaningless accidents of chemistry and history.</p>
			<p>What is important is that we can now see ways to ask such questions with scientific rigor. Indeed, as this and other questions of biochemical etiology become amenable to rigorous scientific inquiry, the life sciences will be contributing directly to cosmology: there are few biological questions deeper than asking to what extent life (either our kind of life or indeed any kind of life) was implicit within the physics of this universe.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>This work was funded in part by NASA Exobiology award NNG04GJ72G and NSF award DBI 0317349. We thank Gang Wu, Blasej Bulka, Wen Zhu and Nick Keulmann for insights and comments that improved this article.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Manipulating proteins with chemistry: a cross-section of chemical biology.</p>
				</title>
				<aug>
					<au>
						<snm>Hahn</snm>
						<fnm>ME</fnm>
					</au>
					<au>
						<snm>Muir</snm>
						<fnm>TW</fnm>
					</au>
				</aug>
				<source>Trends Biochem Sci</source>
				<pubdate>2005</pubdate>
				<volume>30</volume>
				<fpage>26</fpage>
				<lpage>34</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.tibs.2004.10.010</pubid>
						<pubid idtype="pmpid" link="fulltext">15653323</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Synthetic biology.</p>
				</title>
				<aug>
					<au>
						<snm>Benner</snm>
						<fnm>SA</fnm>
					</au>
					<au>
						<snm>Sismour</snm>
						<fnm>AM</fnm>
					</au>
				</aug>
				<source>Nat Rev Genet</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<fpage>533</fpage>
				<lpage>543</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrg1637</pubid>
						<pubid idtype="pmpid" link="fulltext">15995697</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Protein structure and evolutionary history determine sequence space topology.</p>
				</title>
				<aug>
					<au>
						<snm>Shakhnovich</snm>
						<fnm>BE</fnm>
					</au>
					<au>
						<snm>Deeds</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Delisi</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Shakhnovich</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2005</pubdate>
				<volume>15</volume>
				<fpage>385</fpage>
				<lpage>392</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">551565</pubid>
						<pubid idtype="pmpid" link="fulltext">15741509</pubid>
						<pubid idtype="doi">10.1101/gr.3133605</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Neutral evolution of model proteins: diffusion in sequence space and overdispersion.</p>
				</title>
				<aug>
					<au>
						<snm>Bastolla</snm>
						<fnm>U</fnm>
					</au>
					<au>
						<snm>Roman</snm>
						<fnm>HE</fnm>
					</au>
					<au>
						<snm>Vendruscolo</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>J Theor Biol</source>
				<pubdate>1999</pubdate>
				<volume>200</volume>
				<fpage>49</fpage>
				<lpage>64</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jtbi.1999.0975</pubid>
						<pubid idtype="pmpid" link="fulltext">10479539</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Protein-folding landscapes in multichain systems.</p>
				</title>
				<aug>
					<au>
						<snm>Cellmer</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Bratko</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Prausnitz</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Blanch</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2005</pubdate>
				<volume>102</volume>
				<fpage>11692</fpage>
				<lpage>11697</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1073/pnas.0505342102</pubid>
						<pubid idtype="pmpid" link="fulltext">16081531</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Why are there four letters in the genetic alphabet?</p>
				</title>
				<aug>
					<au>
						<snm>Szathmary</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Nat Rev Genet</source>
				<pubdate>2003</pubdate>
				<volume>4</volume>
				<fpage>995</fpage>
				<lpage>1001</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrg1231</pubid>
						<pubid idtype="pmpid" link="fulltext">14631359</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Chemical etiology of nucleic acid structure.</p>
				</title>
				<aug>
					<au>
						<snm>Eschenmoser</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>1999</pubdate>
				<volume>284</volume>
				<fpage>2118</fpage>
				<lpage>2124</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.284.5423.2118</pubid>
						<pubid idtype="pmpid" link="fulltext">10381870</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Chemical etiology of nucleic acid structure: the alpha-threofuranosyl-(3'&#8594;2') oligonucleotide system.</p>
				</title>
				<aug>
					<au>
						<snm>Schoning</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Scholz</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Guntha</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Wu</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Krishnamurthy</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Eschenmoser</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2000</pubdate>
				<volume>290</volume>
				<fpage>1347</fpage>
				<lpage>1351</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.290.5495.1347</pubid>
						<pubid idtype="pmpid" link="fulltext">11082060</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Reasons for the occurrence of the twenty coded protein amino acids.</p>
				</title>
				<aug>
					<au>
						<snm>Weber</snm>
						<fnm>AL</fnm>
					</au>
					<au>
						<snm>Miller</snm>
						<fnm>SL</fnm>
					</au>
				</aug>
				<source>J Mol Evol</source>
				<pubdate>1981</pubdate>
				<volume>17</volume>
				<fpage>273</fpage>
				<lpage>284</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/BF01795749</pubid>
						<pubid idtype="pmpid">7277510</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Selection, history and chemistry: the three faces of the genetic code.</p>
				</title>
				<aug>
					<au>
						<snm>Knight</snm>
						<fnm>RD</fnm>
					</au>
					<au>
						<snm>Freeland</snm>
						<fnm>SJ</fnm>
					</au>
					<au>
						<snm>Landweber</snm>
						<fnm>LF</fnm>
					</au>
				</aug>
				<source>Trends Biochem Sci</source>
				<pubdate>1999</pubdate>
				<volume>24</volume>
				<fpage>241</fpage>
				<lpage>247</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0968-0004(99)01392-4</pubid>
						<pubid idtype="pmpid" link="fulltext">10366854</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Coevolution theory of the genetic code at age thirty.</p>
				</title>
				<aug>
					<au>
						<snm>Wong</snm>
						<fnm>JT-F</fnm>
					</au>
				</aug>
				<source>BioEssays</source>
				<pubdate>2005</pubdate>
				<volume>27</volume>
				<fpage>416</fpage>
				<lpage>25</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/bies.20208</pubid>
						<pubid idtype="pmpid" link="fulltext">15770677</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Astronomy: a field with a life of its own.</p>
				</title>
				<aug>
					<au>
						<snm>Bada</snm>
						<fnm>JL</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2005</pubdate>
				<volume>307</volume>
				<fpage>46</fpage>
				<xrefbib>
					<pubid idtype="doi">10.1126/science.1106678</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>The universal nature of biochemistry.</p>
				</title>
				<aug>
					<au>
						<snm>Pace</snm>
						<fnm>NR</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2001</pubdate>
				<volume>98</volume>
				<fpage>805</fpage>
				<lpage>808</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">33372</pubid>
						<pubid idtype="pmpid" link="fulltext">11158550</pubid>
						<pubid idtype="doi">10.1073/pnas.98.3.805</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Is there a common chemical model for life in the universe?</p>
				</title>
				<aug>
					<au>
						<snm>Benner</snm>
						<fnm>SA</fnm>
					</au>
					<au>
						<snm>Ricardo</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Carrigan</snm>
						<fnm>MA</fnm>
					</au>
				</aug>
				<source>Curr Opin Chem Biol</source>
				<pubdate>2004</pubdate>
				<volume>8</volume>
				<fpage>672</fpage>
				<lpage>689</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.cbpa.2004.10.003</pubid>
						<pubid idtype="pmpid" link="fulltext">15556414</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Simple evolutionary pathways to complex proteins.</p>
				</title>
				<aug>
					<au>
						<snm>Lynch</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Protein Sci</source>
				<pubdate>2005</pubdate>
				<volume>14</volume>
				<fpage>2217</fpage>
				<lpage>2225</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1110/ps.041171805</pubid>
						<pubid idtype="pmpid" link="fulltext">16131652</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<aug>
					<au>
						<snm>Dayhoff</snm>
						<fnm>MO</fnm>
					</au>
					<au>
						<snm>Schwartz</snm>
						<fnm>RM</fnm>
					</au>
					<au>
						<snm>Orcutt</snm>
						<fnm>BC</fnm>
					</au>
				</aug>
				<source>Atlas of Protein Sequence and Structure</source>
				<publisher>Washington DC: National Biomedical Research Foundation</publisher>
				<pubdate>1978</pubdate>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Amino acid substitution matrices from protein blocks.</p>
				</title>
				<aug>
					<au>
						<snm>Henikoff</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Henikoff</snm>
						<fnm>JG</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1992</pubdate>
				<volume>89</volume>
				<fpage>10915</fpage>
				<lpage>10919</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">50453</pubid>
						<pubid idtype="pmpid" link="fulltext">1438297</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Fold-specific substitution matrices for protein classification.</p>
				</title>
				<aug>
					<au>
						<snm>Vilim</snm>
						<fnm>RB</fnm>
					</au>
					<au>
						<snm>Cunningham</snm>
						<fnm>RM</fnm>
					</au>
					<au>
						<snm>Lu</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Kheradpour</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Stevens</snm>
						<fnm>FJ</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<fpage>847</fpage>
				<lpage>853</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/btg492</pubid>
						<pubid idtype="pmpid" link="fulltext">14764567</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Enriching the sequence substitution matrix by structural information.</p>
				</title>
				<aug>
					<au>
						<snm>Teodorescu</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Galor</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Pillardy</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Elber</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Proteins</source>
				<pubdate>2004</pubdate>
				<volume>54</volume>
				<fpage>41</fpage>
				<lpage>48</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/prot.10474</pubid>
						<pubid idtype="pmpid" link="fulltext">14705022</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions.</p>
				</title>
				<aug>
					<au>
						<snm>Yu</snm>
						<fnm>YK</fnm>
					</au>
					<au>
						<snm>Altschul</snm>
						<fnm>SF</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<fpage>902</fpage>
				<lpage>911</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti070</pubid>
						<pubid idtype="pmpid" link="fulltext">15509610</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Construction of non-symmetric substitution matrices derived from proteomes with biased amino acid distributions.</p>
				</title>
				<aug>
					<au>
						<snm>Bastien</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Roy</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Marechal</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>C R Biol</source>
				<pubdate>2005</pubdate>
				<volume>328</volume>
				<fpage>445</fpage>
				<lpage>453</lpage>
				<xrefbib>
					<pubid idtype="pmpid">15948633</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Analysis of differences in amino acid substitution patterns, using multilevel G-tests.</p>
				</title>
				<aug>
					<au>
						<snm>Pacholczyk</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kimmel</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>C R Biol</source>
				<pubdate>2005</pubdate>
				<volume>328</volume>
				<fpage>632</fpage>
				<lpage>641</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">15992746</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>The protein folds as platonic forms: new support for the pre-Darwinian conception of evolution by natural law.</p>
				</title>
				<aug>
					<au>
						<snm>Denton</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Marshall</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Legge</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>J Theor Biol</source>
				<pubdate>2002</pubdate>
				<volume>219</volume>
				<fpage>325</fpage>
				<lpage>342</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jtbi.2002.3128</pubid>
						<pubid idtype="pmpid" link="fulltext">12419661</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>The invention of the genetic code.</p>
				</title>
				<aug>
					<au>
						<snm>Hayes</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>Am Sci</source>
				<pubdate>1998</pubdate>
				<volume>86</volume>
				<fpage>8</fpage>
				<lpage>14</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1511/1998.1.8</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Unique features of selenocysteine incorporation function within the context of general eukaryotic translational processes.</p>
				</title>
				<aug>
					<au>
						<snm>Small-Howard</snm>
						<fnm>AL</fnm>
					</au>
					<au>
						<snm>Berry</snm>
						<fnm>MJ</fnm>
					</au>
				</aug>
				<source>Biochem Soc Trans</source>
				<pubdate>2005</pubdate>
				<volume>33</volume>
				<fpage>1493</fpage>
				<lpage>1497</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1042/BST20051493</pubid>
						<pubid idtype="pmpid" link="fulltext">16246153</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>How selenium has altered our understanding of the genetic code.</p>
				</title>
				<aug>
					<au>
						<snm>Hatfield</snm>
						<fnm>DL</fnm>
					</au>
					<au>
						<snm>Gladyshev</snm>
						<fnm>VN</fnm>
					</au>
				</aug>
				<source>Mol Cell Biol</source>
				<pubdate>2002</pubdate>
				<volume>22</volume>
				<fpage>3565</fpage>
				<lpage>3576</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">133838</pubid>
						<pubid idtype="pmpid" link="fulltext">11997494</pubid>
						<pubid idtype="doi">10.1128/MCB.22.11.3565-3576.2002</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Incorporation of nonnatural amino acids into proteins.</p>
				</title>
				<aug>
					<au>
						<snm>Hendrickson</snm>
						<fnm>TL</fnm>
					</au>
					<au>
						<snm>de Crecy-Lagard</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Schimmel</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Annu Rev Biochem</source>
				<pubdate>2004</pubdate>
				<volume>73</volume>
				<fpage>147</fpage>
				<lpage>176</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1146/annurev.biochem.73.012803.092429</pubid>
						<pubid idtype="pmpid" link="fulltext">15189139</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Adding amino acids to the genetic repertoire.</p>
				</title>
				<aug>
					<au>
						<snm>Xie</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Schultz</snm>
						<fnm>PG</fnm>
					</au>
				</aug>
				<source>Curr Opin Chem Biol</source>
				<pubdate>2005</pubdate>
				<volume>9</volume>
				<fpage>548</fpage>
				<lpage>554</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.cbpa.2005.10.011</pubid>
						<pubid idtype="pmpid" link="fulltext">16260173</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Pyrrolysine and selenocysteine use dissimilar decoding strategies.</p>
				</title>
				<aug>
					<au>
						<snm>Zhang</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Baranov</snm>
						<fnm>PV</fnm>
					</au>
					<au>
						<snm>Atkins</snm>
						<fnm>JF</fnm>
					</au>
					<au>
						<snm>Gladyshev</snm>
						<fnm>VN</fnm>
					</au>
				</aug>
				<source>J Biol Chem</source>
				<pubdate>2005</pubdate>
				<volume>280</volume>
				<fpage>20740</fpage>
				<lpage>20751</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1074/jbc.M501458200</pubid>
						<pubid idtype="pmpid" link="fulltext">15788401</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>On the formation of Asp-tRNA(Asn) by aspartyl-tRNA synthetases.</p>
				</title>
				<aug>
					<au>
						<snm>Wong</snm>
						<fnm>JT</fnm>
					</au>
				</aug>
				<source>Bioessays</source>
				<pubdate>2005</pubdate>
				<volume>27</volume>
				<fpage>1309</fpage>
				<xrefbib>
					<pubid idtype="doi">10.1002/bies.20334</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>A production of amino acids under possible primitive earth conditions.</p>
				</title>
				<aug>
					<au>
						<snm>Miller</snm>
						<fnm>SL</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>1953</pubdate>
				<volume>117</volume>
				<fpage>528</fpage>
				<lpage>529</lpage>
				<xrefbib>
					<pubid idtype="pmpid">13056598</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Astrobiology: seeds of life?</p>
				</title>
				<aug>
					<au>
						<snm>Shock</snm>
						<fnm>EL</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2002</pubdate>
				<volume>416</volume>
				<fpage>380</fpage>
				<lpage>381</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/416380a</pubid>
						<pubid idtype="pmpid" link="fulltext">11919613</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>Current status of the prebiotic synthesis of small molecules.</p>
				</title>
				<aug>
					<au>
						<snm>Miller</snm>
						<fnm>SL</fnm>
					</au>
				</aug>
				<source>Chem Scr</source>
				<pubdate>1986</pubdate>
				<volume>26B</volume>
				<fpage>5</fpage>
				<lpage>11</lpage>
				<xrefbib>
					<pubid idtype="pmpid">11542054</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>A co-evolution theory of the genetic code.</p>
				</title>
				<aug>
					<au>
						<snm>Wong</snm>
						<fnm>JT</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1975</pubdate>
				<volume>72</volume>
				<fpage>1909</fpage>
				<lpage>1912</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">432657</pubid>
						<pubid idtype="pmpid">1057181</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>Consensus temporal order of amino acids and evolution of the triplet code.</p>
				</title>
				<aug>
					<au>
						<snm>Trifonov</snm>
						<fnm>EN</fnm>
					</au>
				</aug>
				<source>Gene</source>
				<pubdate>2000</pubdate>
				<volume>261</volume>
				<fpage>139</fpage>
				<lpage>151</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0378-1119(00)00476-5</pubid>
						<pubid idtype="pmpid" link="fulltext">11164045</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B36">
				<title>
					<p>Amino acids.</p>
				</title>
				<aug>
					<au>
						<snm>Bell</snm>
						<fnm>EA</fnm>
					</au>
					<au>
						<snm>John</snm>
						<fnm>DI</fnm>
					</au>
				</aug>
				<source>Organic Chemistry. Series 2, Amino Acids, Peptides and Related Compounds</source>
				<publisher>London: Butterworths</publisher>
				<editor>Rydon HN</editor>
				<pubdate>1976</pubdate>
				<volume>6</volume>
				<fpage>1</fpage>
				<lpage>32</lpage>
			</bibl>
			<bibl id="B37">
				<title>
					<p>Techniques for studying protein heterogeneity and post-translational modifications.</p>
				</title>
				<aug>
					<au>
						<snm>Baumann</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Meri</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Expert Rev Proteomics</source>
				<pubdate>2004</pubdate>
				<volume>1</volume>
				<fpage>207</fpage>
				<lpage>217</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1586/14789450.1.2.207</pubid>
						<pubid idtype="pmpid" link="fulltext">15966815</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B38">
				<title>
					<p>An active enzyme constructed from a 9-amino acid alphabet.</p>
				</title>
				<aug>
					<au>
						<snm>Walter</snm>
						<fnm>KU</fnm>
					</au>
					<au>
						<snm>Vamvaca</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Hilvert</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>J Biol Chem</source>
				<pubdate>2005</pubdate>
				<volume>280</volume>
				<fpage>37742</fpage>
				<lpage>37746</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1074/jbc.M507210200</pubid>
						<pubid idtype="pmpid" link="fulltext">16144843</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>What is the minimum number of letters required to fold a protein?</p>
				</title>
				<aug>
					<au>
						<snm>Fan</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>2003</pubdate>
				<volume>328</volume>
				<fpage>921</fpage>
				<lpage>926</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0022-2836(03)00324-3</pubid>
						<pubid idtype="pmpid" link="fulltext">12729764</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>Design and synthesis of biologically active &#946;-peptides.</p>
				</title>
				<aug>
					<au>
						<snm>Koyak</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Cheng</snm>
						<fnm>RP</fnm>
					</au>
				</aug>
				<source>Meth Mol Biol</source>
				<inpress/>
			</bibl>
			<bibl id="B41">
				<title>
					<p>The origin of the genetic code: amino acids as cofactors in an RNA world.</p>
				</title>
				<aug>
					<au>
						<snm>Szathmary</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>1999</pubdate>
				<volume>15</volume>
				<fpage>223</fpage>
				<lpage>229</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0168-9525(99)01730-8</pubid>
						<pubid idtype="pmpid" link="fulltext">10354582</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B42">
				<title>
					<p>Why are proteins so robust to site mutations?</p>
				</title>
				<aug>
					<au>
						<snm>Taverna</snm>
						<fnm>DM</fnm>
					</au>
					<au>
						<snm>Goldstein</snm>
						<fnm>RA</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>2002</pubdate>
				<volume>315</volume>
				<fpage>479</fpage>
				<lpage>484</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.2001.5226</pubid>
						<pubid idtype="pmpid" link="fulltext">11786027</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B43">
				<title>
					<p>Divergence, recombination and retention of functionality during protein evolution.</p>
				</title>
				<aug>
					<au>
						<snm>Xu</snm>
						<fnm>YO</fnm>
					</au>
					<au>
						<snm>Hall</snm>
						<fnm>RW</fnm>
					</au>
					<au>
						<snm>Goldstein</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Pollock</snm>
						<fnm>DD</fnm>
					</au>
				</aug>
				<source>Hum Genomics</source>
				<pubdate>2005</pubdate>
				<volume>2</volume>
				<fpage>158</fpage>
				<lpage>167</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">16197733</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B44">
				<title>
					<p>Genetic code synonym quotas and amino acid complexity: cutting the cost of proteins?</p>
				</title>
				<aug>
					<au>
						<snm>Dufton</snm>
						<fnm>MJ</fnm>
					</au>
				</aug>
				<source>J Theor Biol</source>
				<pubdate>1997</pubdate>
				<volume>187</volume>
				<fpage>165</fpage>
				<lpage>173</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jtbi.1997.0443</pubid>
						<pubid idtype="pmpid" link="fulltext">9237887</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B45">
				<title>
					<p>Metabolic efficiency and amino acid composition in the proteomes of <it>Escherichia coli </it>and <it>Bacillus subtilis</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Akashi</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Gojobori</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2002</pubdate>
				<volume>99</volume>
				<fpage>3695</fpage>
				<lpage>3700</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">122586</pubid>
						<pubid idtype="pmpid" link="fulltext">11904428</pubid>
						<pubid idtype="doi">10.1073/pnas.062526999</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B46">
				<title>
					<p>AAindex</p>
				</title>
				<url>http://www.genome.jp/aaindex/</url>
			</bibl>
			<bibl id="B47">
				<title>
					<p>AAindex: amino acid index database.</p>
				</title>
				<aug>
					<au>
						<snm>Kawashima</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Ogata</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Kanehisa</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1999</pubdate>
				<volume>27</volume>
				<fpage>368</fpage>
				<lpage>369</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">148186</pubid>
						<pubid idtype="pmpid" link="fulltext">9847231</pubid>
						<pubid idtype="doi">10.1093/nar/27.1.368</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B48">
				<title>
					<p>Application of topological descriptors in QSAR and drug design: history and new trends.</p>
				</title>
				<aug>
					<au>
						<snm>Gonzalbes</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Doucet</snm>
						<fnm>JP</fnm>
					</au>
					<au>
						<snm>Derouin</snm>
						<fnm>F</fnm>
					</au>
				</aug>
				<source>Curr Drug Targets Infect Disord</source>
				<pubdate>2002</pubdate>
				<volume>2</volume>
				<fpage>93</fpage>
				<lpage>102</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.2174/1568005024605909</pubid>
						<pubid idtype="pmpid" link="fulltext">12462157</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B49">
				<title>
					<p>Practical lessons from protein structure prediction.</p>
				</title>
				<aug>
					<au>
						<snm>Ginalski</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Grishin</snm>
						<fnm>NV</fnm>
					</au>
					<au>
						<snm>Godzik</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Rychlewski</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2005</pubdate>
				<volume>33</volume>
				<fpage>1874</fpage>
				<lpage>1891</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1074308</pubid>
						<pubid idtype="pmpid" link="fulltext">15805122</pubid>
						<pubid idtype="doi">10.1093/nar/gki327</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B50">
				<title>
					<p>Linear indices of the 'macromolecular graph's nucleotide adjacency matrix' as a promising approach for bioinformatics studies. Part 1: prediction of paromycin's affinity constant with HIV-1 psi-RNA packaging region.</p>
				</title>
				<aug>
					<au>
						<snm>Ponce</snm>
						<fnm>MY</fnm>
					</au>
					<au>
						<snm>Castillo Garit</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Nodarse</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Bioorg Med Chem</source>
				<pubdate>2005</pubdate>
				<volume>13</volume>
				<fpage>3397</fpage>
				<lpage>3404</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.bmc.2005.03.010</pubid>
						<pubid idtype="pmpid" link="fulltext">15848751</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B51">
				<title>
					<p>Comparison of predictive ability of water solubility QSPR models generated by MLR, PLS and ANN methods.</p>
				</title>
				<aug>
					<au>
						<snm>Eros</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Keri</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Kovesdi</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Szantai-Kis</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Meszaros</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Orfi</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>Mini Rev Med Chem</source>
				<pubdate>2004</pubdate>
				<volume>4</volume>
				<fpage>167</fpage>
				<lpage>177</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.2174/1389557043487466</pubid>
						<pubid idtype="pmpid" link="fulltext">14965289</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B52">
				<aug>
					<au>
						<snm>Gould</snm>
						<fnm>SJ</fnm>
					</au>
				</aug>
				<source>Wonderful Life: The Burgess Shale and the Nature of History</source>
				<publisher>New York: WW Norton</publisher>
				<pubdate>1990</pubdate>
			</bibl>
			<bibl id="B53">
				<title>
					<p>Amino acids in meteorites.</p>
				</title>
				<aug>
					<au>
						<snm>Cronin</snm>
						<fnm>JR</fnm>
					</au>
					<au>
						<snm>Pizzarello</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Adv Space Res</source>
				<pubdate>1983</pubdate>
				<volume>3</volume>
				<fpage>5</fpage>
				<lpage>18</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0273-1177(83)90036-4</pubid>
						<pubid idtype="pmpid">11542462</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B54">
				<title>
					<p>Posttranslational covalent modification of proteins.</p>
				</title>
				<aug>
					<au>
						<snm>Uy</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Wold</snm>
						<fnm>F</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>1977</pubdate>
				<volume>198</volume>
				<fpage>890</fpage>
				<lpage>896</lpage>
				<xrefbib>
					<pubid idtype="pmpid">337487</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B55">
				<title>
					<p>Plant amino acid research in retrospect: from Chinball to Singh.</p>
				</title>
				<aug>
					<au>
						<snm>Fowden</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>Amino Acids</source>
				<pubdate>2001</pubdate>
				<volume>20</volume>
				<fpage>217</fpage>
				<lpage>224</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s007260170040</pubid>
						<pubid idtype="pmpid" link="fulltext">11354600</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B56">
				<title>
					<p>Structures and symbols for synthetic amino acids incorporated into synthetic polypeptides.</p>
				</title>
				<aug>
					<au>
						<snm>Khosla</snm>
						<fnm>MC</fnm>
					</au>
					<au>
						<snm>Cohn</snm>
						<fnm>WE</fnm>
					</au>
				</aug>
				<source>Handbook of Biochemistry and Molecular Biology</source>
				<publisher>Baton Rouge, FL: CRC</publisher>
				<editor>Fasman GD</editor>
				<pubdate>1976</pubdate>
				<volume>1</volume>
				<fpage>96</fpage>
				<lpage>108</lpage>
			</bibl>
		</refgrp>
	</bm>
</art>
