<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-12-S13-S2</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Proceedings</dochead>
      <bibl>
         <title>
            <p>Novel base triples in RNA structures revealed by graph theoretical searching methods</p>
         </title>
         <aug>
            <au id="A1"><snm>Firdaus-Raih</snm><fnm>Mohd</fnm><insr iid="I1"/><insr iid="I3"/><email>firdaus@mfrlab.org</email></au>
            <au id="A2"><snm>Harrison</snm><fnm>Anne-Marie</fnm><insr iid="I1"/><insr iid="I4"/><email>anne-marie.harrison@shu.ac.uk</email></au>
            <au id="A3"><snm>Willett</snm><fnm>Peter</fnm><insr iid="I2"/><email>p.willett@shef.ac.uk</email></au>
            <au ca="yes" id="A4"><snm>Artymiuk</snm><mi>J</mi><fnm>Peter</fnm><insr iid="I1"/><email>p.artymiuk@shef.ac.uk</email></au>
         </aug>
         <insg>
            <ins id="I1"><p>Department of Molecular Biology and Biotechnology, University of Sheffield, Sheffield S10 2TN, UK</p></ins>
            <ins id="I2"><p>Department of Information Studies, University of Sheffield, Sheffield S10 2TN, UK</p></ins>
            <ins id="I3"><p>School of Biosciences and Biotechnology, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Malaysia</p></ins>
            <ins id="I4"><p>Present address: Biomedical Research Centre, Sheffield Hallam University, Sheffield S1 1WB, UK</p></ins>
         </insg>
         <source>BMC Bioinformatics</source>
         
         
         <supplement><title><p>Tenth International Conference on Bioinformatics. First ISCB Asia Joint Conference 2011 (InCoB/ISCB-Asia 2011): Bioinformatics</p></title><editor>Shoba Ranganathan, Christian Schoenbach, Sheila Nathan and Tin Wee Tan</editor><note>Proceedings</note></supplement><conference><title><p>Asia Pacific Bioinformatics Network (APBioNet) Tenth International Conference on Bioinformatics. First ISCB Asia Joint Conference 2011 (InCoB2011/ISCB-Asia 2011)</p></title><location>Kuala Lumpur, Malaysia</location><date-range>30 November - 2 December 2011</date-range><url>http://incob.apbionet.org/incob11/</url></conference><issn>1471-2105</issn>
         <pubdate>2011</pubdate>
         <volume>12</volume>
         <issue>Suppl 13</issue>
         <fpage>S2</fpage>
         <url>http://www.biomedcentral.com/1471-2105/12/S13/S2</url>
         <xrefbib><pubidlist><pubid idtype="pmpid">22373013</pubid><pubid idtype="doi">10.1186/1471-2105-12-S13-S2</pubid></pubidlist></xrefbib>
      </bibl>
      <history><pub><date><day>30</day><month>11</month><year>2011</year></date></pub></history>
      <cpyrt><year>2011</year><collab>Firdaus-Raih et al; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Highly hydrogen bonded base interactions play a major part in stabilizing the tertiary structures of complex RNA molecules, such as transfer-RNAs, ribozymes and ribosomal RNAs.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We describe the graph theoretical identification and searching of highly hydrogen bonded base triples, where each base is involved in at least two hydrogen bonds with the other bases. Our approach correlates theoretically predicted base triples with literature-based compilations and other actual occurrences in crystal structures. The use of &#8216;fuzzy&#8217; search tolerances has enabled us to discover a number of triple interaction types that have been neither previously recorded nor predicted theoretically.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>Comparative analyses of different ribosomal RNA structures reveal several conserved base triple motifs in 50S rRNA structures, indicating an important role in structural stabilization and ultimately RNA function.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>It is now clear that three-dimensional (3D) structure is as fundamental to the functionality of complex RNA molecules as it is for proteins. Awareness of the extent of the complexity and the diversity of RNA tertiary structures has expanded due to the availability of high resolution structures of large assemblies such as both ribosomal subunits <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>, ribozymes <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>, and the P4-P6 domain of the group I intron <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, in addition to the early transfer-RNA structures <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Base triples can provide key interactions in assembling a tertiary structure by docking a base-pair in a helical region, which may be either Watson-Crick or non-canonical, to a single-stranded nucleotide distant in the polynucleotide chain <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. In addition, neighboring base triples can form stacks or triple helices: the first such example in the RNA shallow groove was observed in the crystal structure of a frame-shifting pseudoknot <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. The occurrence of four base triples stacked together has also been observed in the structure of the <it>Tetrahymena</it> ribozyme <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p>As with base pairs, the key interactions that stabilize base triples are hydrogen bonds. Hydrogen bonds are highly directional and capable of defining specific interactions. Many previous reports discussing base triples consider a wide variety of possible interactions including single hydrogen bond interactions and interactions to non-base components of the nucleotides <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B13">13</abbr></abbrgrp>. However, it is clear that multiply hydrogen bonded triples, which consist of at least two hydrogen bonding interactions per base, will be especially stable and the structural conservation of such triples is therefore likely to be significant. They are thus expected to be influential constituents of RNA structure and are therefore of primary interest in this work. Nevertheless, RNA structures are highly dynamic and even triple interactions may be lost as a result of conformational changes. As the number of complex RNA structures increases, the ability to computationally detect and track such changes presents an interesting challenge. A library of 840 theoretically computed base triples has been compiled and is accessible as part of the NAIL (Nucleic Acid Interaction Library) database <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Another resource, the NCIR database <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, consists of known non-canonical base interactions including base triples and is compiled through a literature search.</p>
         <p>Here, we present a survey that cross-references the patterns in the NAIL database <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, the Protein Data Bank (PDB) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and the records in NCIR <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, and delivers a catalogue of base triples of a specific type in existing crystal structures. We use a graph theoretical approach that is capable of documenting occurrences of NAIL patterns in the PDB and, in doing so, find interactions that are predicted by NAIL but not recorded in NCIR. NCIR is compiled by manual literature search; it is therefore limited to what has been reported in the available literature, and its coverage may be incomplete because the compilation process is manual and labor intensive. By employing high tolerances designed to give a &#8216;fuzzy&#8217; search, our method was also able to retrieve previously unrecorded occurrences of triples contained within the PDB. Surprisingly, we also show that there are multiply hydrogen bonded base triples that occur in the PDB but were not included in the NAIL dataset. Our investigation of these triples in the large ribosomal subunit structures also revealed conserved interactions that may be essential for the stabilization of rRNA structure.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>RNA search database and query pattern library</p>
            </st>
            <p>The search database was compiled from PDB-sourced RNA and RNA-associated structures solved by X-ray crystallography at a resolution of 3 &#197; or better. High resolution structures that became available at a later date were also included. Several separate searches using structures solved at resolutions poorer than 3 &#197;, such as the <it>E. coli</it> ribosome at 3.5 &#197; <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, were also done. A library of 942 pattern matrices was generated as queries for the search engine, consisting of (i) the 840 base triple patterns from the Nucleic Acid Interaction Library <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and (ii) 102 patterns generated by an alternative approach (Additional File <supplr sid="S1">1</supplr> Figure S1).</p>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <caption>
                  <p/>
               </caption>
               <text>
                  <p>
                     <b>Supplementary Figure S1 and Supplementary Tables S1-S3 in PDF format.</b>
                  </p>
               </text>
               <file name="1471-2105-12-S13-S2-S1.pdf">
   <p>Click here for file</p>
</file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Searching program and approach</p>
            </st>
            <p>The computer program NASSAM (Nucleic Acid Search for Substructures and Motifs) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, which uses a simplified vectorial representation of the nucleic acid bases, was used as the search engine and primary screening step. NASSAM implements the Ullmann subgraph isomorphism algorithm <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> for comparing pseudoatom representations of RNA base orientations. Each of the four RNA bases is represented by two pseudoatom vectors consisting of four pseudoatoms, where one pseudoatom is the start node and another pseudoatom serves as the end node (Figure <figr fid="F1">1</figr>). The search input base triple patterns were created either from pseudoatom distances that hypothetically represent a triple or by measuring the pseudoatom distances in an actually occurring formation. Additional pseudo-distances were incorporated into the matrices to ensure that the midpoints of the pseudoatom vectors for a base were close enough to each other that they were on the same base. These midpoint to midpoint distances internal to a base were constrained to 1 &#197;. NASSAM searches are also not dependent on sequence order. A distance tolerance parameter, which sets the permitted deviation from the distances supplied in the pattern matrices, is also incorporated into a search. This parameter can be supplied either as a discrete distance value, for example 1.7 &#197;, or as a percentage value. Very high distance tolerances (60% and above), which resulted in &#8216;fuzzy&#8217; searches, were used to ensure that all possible matches were recalled and to facilitate discovery through variations of the original queries. Hits not matching our criterion of at least two hydrogen bonds per base were eliminated by computing the hydrogen bonding between the three bases in each hit, using the program HBPLUS <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> at default parameters. HBPLUS outputs were screened for matches to this criterion, and the filtered results were visualized and cross referenced with NAIL, NCIR and available literature.</p>
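            <p>The distance tolerance described above can be illustrated with a minimal sketch. It assumes that a query pattern and a candidate hit are each given as an equal-length list of inter-pseudoatom distances in Angstroms; the function names and the flat-list representation are hypothetical and are not taken from NASSAM itself, whose internal matching is performed by subgraph isomorphism.</p>
            <p>
```python
def within_tolerance(query, observed, tol, percent=False):
    """Return True if `observed` deviates from `query` by no more than the tolerance.

    `tol` is an absolute distance (e.g. 1.7) or, when `percent` is True,
    a percentage of the query distance (e.g. 60).
    Hypothetical helper for illustration only.
    """
    allowed = query * tol / 100.0 if percent else tol
    return allowed >= abs(observed - query)


def matches_pattern(pattern, candidate, tol, percent=False):
    """Check every candidate distance against the corresponding query distance."""
    if len(pattern) != len(candidate):
        return False
    return all(within_tolerance(q, d, tol, percent)
               for q, d in zip(pattern, candidate))
```
            </p>
            <p>With a percentage tolerance the permitted deviation grows with the query distance, which is one reason why very high percentage values give a deliberately permissive, &#8216;fuzzy&#8217; match.</p>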
            <fig id="F1"><title><p>Figure 1</p></title><caption><p>The NASSAM pseudoatom vectors and pattern matrix system. (A) Example of a base triple composed of a guanine (Base 1), a cytosine (Base 2) and another guanine (Base 3). The pseudoatom nodes used to set the distances for the pattern matrix have been marked with <it>Sx</it>, <it>Ex</it> and <it>Ey</it> while the distances between these nodes have been marked with arrows. (B) An example set of vectors and their corresponding distances (in Angstroms) which define the GGC triple orientation in section (A). (C) The corresponding pattern matrix file built from the vectors (distances &#215;10) defined in section (B) for the triple pattern shown in section (A). As an example, the <it>SxSx</it> distance between Base 1 and Base 3 is 10 &#197; and is marked 100 under the SS column where Base 1, Node X and Base 3, Node X intersect.</p></caption><text>
   <p>The NASSAM pseudoatom vectors and pattern matrix system. (A) Example of a base triple composed of a guanine (Base 1), a cytosine (Base 2) and another guanine (Base 3). The pseudoatom nodes used to set the distances for the pattern matrix have been marked with <it>Sx</it>, <it>Ex</it> and <it>Ey</it> while the distances between these nodes have been marked with arrows. (B) An example set of vectors and their corresponding distances (in Angstroms) which define the GGC triple orientation in section (A). (C) The corresponding pattern matrix file built from the vectors (distances &#215;10) defined in section (B) for the triple pattern shown in section (A). As an example, the <it>SxSx</it> distance between Base 1 and Base 3 is 10 &#197; and is marked 100 under the SS column where Base 1, Node X and Base 3, Node X intersect.</p>
</text><graphic file="1471-2105-12-S13-S2-1"/></fig>
         </sec>
         <sec>
            <st>
               <p>Sequence alignments and structure superpositions</p>
            </st>
            <p>Sequence alignments were done using CLUSTALW <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> for 19 prokaryotic 23S rRNA sequences retrieved from selected completed genomes accessible from the NCBI website and four prokaryotic 23S rRNA sequences extracted from PDB structures (Table <tblr tid="T1">1</tblr>). The same 19 species were used for an alignment of 16S rRNA sequences against two prokaryotic 16S rRNA sequences from structures available in the PDB (Table <tblr tid="T1">1</tblr>). The species were selected to provide a simple yet diverse representative model of available prokaryotic rRNA sequences. The sequences used in the alignments and their database references are available in Table <tblr tid="T1">1</tblr>. Structural comparisons of <it>T. thermophilus</it> [PDB: 2j01], <it>E. coli</it> [PDB: 2awb] and <it>D. radiodurans</it> [PDB: 1nkw] to the <it>H. marismortui</it> [PDB: 1ffk] structure were carried out using least squares superposition on the phosphate backbone atoms by the program LSQKAB <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> from the CCP4 suite <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Geometric descriptions of the triples utilized the nomenclature proposed by Leontis and Westhof <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Redundant geometries were identified by sorting the geometric descriptions alphabetically, for example by placing all cis interactions before trans interactions and by ordering the interacting edges of each pair alphabetically. Triples with the same geometrical description were then cross checked for differences. The C-H edges of pyrimidines have also been labelled as Hoogsteen edges for uniformity.</p>
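            <p>The alphabetical ordering used to detect redundant geometries can be sketched as follows. The sketch assumes that each pairwise interaction in a triple is written as a tuple of an orientation label (C or T) and two edge labels (H, S or WC); this tuple encoding and the function name are hypothetical and serve only to illustrate the idea.</p>
            <p>
```python
def canonical_geometry(pairs):
    """Return an order-independent description of a triple's pairwise geometries.

    `pairs` is an iterable of (orientation, edge_a, edge_b) tuples, for example
    ("C", "WC", "H"). Edges within each pair are sorted alphabetically, then the
    pairs themselves are sorted, so cis ("C") entries come before trans ("T").
    Hypothetical encoding for illustration only.
    """
    normalised = [(orientation, *sorted((edge_a, edge_b)))
                  for orientation, edge_a, edge_b in pairs]
    return tuple(sorted(normalised))
```
            </p>
            <p>Two descriptions that reduce to the same canonical tuple are only candidates for redundancy: sorting the edges discards which base contributes which edge, so such triples would still need to be cross checked for differences.</p>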
            <tbl id="T1"><title><p>Table 1</p></title><caption><p>Source of sequences and structures used for sequence-structure comparisons.</p></caption><tblbdy cols="3">
      <r>
         <c ca="left">
            <p>
               <b>Species</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Taxonomy: (Super)Phylum</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Database code</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Bacteroides fragilis NCTC 9343</it>
            </p>
         </c>
         <c ca="left">
            <p>Bacteroidetes/Chlorobi group</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_003228</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Borrelia burgdorferi B31</it>
            </p>
         </c>
         <c ca="left">
            <p>Spirochaetes</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_001318</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Candidatus Protochlamydia amoebophila UWE25</it>
            </p>
         </c>
         <c ca="left">
            <p>Chlamydiae/Verrucomicrobia group</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_005861</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Rhodopirellula baltica SH 1</it>
            </p>
         </c>
         <c ca="left">
            <p>Planctomycetes</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_005027</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Thiomicrospira denitrificans ATCC 33889</it>
            </p>
         </c>
         <c ca="left">
            <p>Proteobacteria</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_007575</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Acidobacteria bacterium Ellin345</it>
            </p>
         </c>
         <c ca="left">
            <p>Fibrobacteres/Acidobacteria group</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_008009</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Escherichia coli</it>
            </p>
         </c>
         <c ca="left">
            <p>Proteobacteria</p>
         </c>
         <c ca="left">
            <p>PDB: 2awb</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Baumannia cicadellinicola str. Hc</it>
            </p>
         </c>
         <c ca="left">
            <p>Proteobacteria</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_007984</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Azoarcus sp. EbN1</it>
            </p>
         </c>
         <c ca="left">
            <p>Proteobacteria; Betaproteobacteria</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_006513</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Rhodobacter sphaeroides 2.4.1</it>
            </p>
         </c>
         <c ca="left">
            <p>Proteobacteria; Alphaproteobacteria</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_007493</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Anaeromyxobacter dehalogenans 2CP-C</it>
            </p>
         </c>
         <c ca="left">
            <p>Proteobacteria; delta/epsilon subdivisions</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_007760</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Anabaena variabilis ATCC 29413</it>
            </p>
         </c>
         <c ca="left">
            <p>Cyanobacteria</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_007413</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Dehalococcoides sp. CBDB1</it>
            </p>
         </c>
         <c ca="left">
            <p>Chloroflexi; Dehalococcoidetes</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_007356</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Bacillus subtilis subsp. subtilis str. 168</it>
            </p>
         </c>
         <c ca="left">
            <p>Firmicutes; Bacilli</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_000964</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Deinococcus radiodurans</it>
            </p>
         </c>
         <c ca="left">
            <p>Deinococcus-Thermus; Deinococci</p>
         </c>
         <c ca="left">
            <p>PDB: 1nkw</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Bifidobacterium longum NCC2705</it>
            </p>
         </c>
         <c ca="left">
            <p>Actinobacteria</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_004307</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Thermus thermophilus</it>
            </p>
         </c>
         <c ca="left">
            <p>Deinococcus-Thermus; Deinococci</p>
         </c>
         <c ca="left">
            <p>PDB: 2j00 / 2j01</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Thermotoga maritima MSB8</it>
            </p>
         </c>
         <c ca="left">
            <p>Thermotogae</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_000853</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Aquifex aeolicus VF5</it>
            </p>
         </c>
         <c ca="left">
            <p>Aquificae</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_000918</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Sulfolobus solfataricus P2</it>
            </p>
         </c>
         <c ca="left">
            <p>Crenarchaeota; Thermoprotei</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_002754</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Pyrobaculum aerophilum str. IM2</it>
            </p>
         </c>
         <c ca="left">
            <p>Crenarchaeota; Thermoprotei</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_003364</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Nanoarchaeum equitans Kin4-M</it>
            </p>
         </c>
         <c ca="left">
            <p>Nanoarchaeota</p>
         </c>
         <c ca="left">
            <p>GenBank: NC_005213</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <it>Haloarcula marismortui</it>
            </p>
         </c>
         <c ca="left">
            <p>Euryarchaeota; Halobacteria</p>
         </c>
         <c ca="left">
            <p>PDB: 1ffk</p>
         </c>
      </r>
   </tblbdy></tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Searching for structural patterns with graph theory</p>
            </st>
            <p>NASSAM <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, a graph theoretical search engine, was used to screen a nucleic acid database for matches to the 840 patterns derived from the NAIL database and a further 102 from our own procedure (see Additional File <supplr sid="S1">1</supplr> Figure S1). The results from this primary screen were then filtered for specified hydrogen bonding interactions. The output from this secondary screen, which represents actual occurrences in the PDB, was used to compare and contrast the content of base triple interactions in the NAIL and NCIR databases. The probable importance of particular base triples as conserved motifs was placed in context by comparing the structural conservation of the hits from our searches with their sequence conservation in a diverse set of sequences. The annotated triples for each structure were integrated with structure superposition and sequence alignment information to investigate the conservation of triple hits in the prokaryotic ribosomal subunits (Additional File <supplr sid="S1">1</supplr> Table S2).</p>
         </sec>
         <sec>
            <st>
               <p>Comparative overview of triples from the NASSAM search</p>
            </st>
            <p>This survey using NASSAM found matches for 59 of the total 942 search patterns. Fourteen, or approximately 24%, of these matched patterns correspond to multiply hydrogen bonded base triples that were either not currently represented in NCIR (Figure <figr fid="F2">2A</figr>), not represented in NAIL (Figure <figr fid="F2">2B</figr>) or not found in either resource (Figure <figr fid="F2">2C</figr>). Full details are given in Additional File <supplr sid="S1">1</supplr> Table S1. The backbone angles for the residues involved in the novel triples, and the residues before and after them in the sequences, were examined using the Amigos program <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. No unusual conformations were observed for them. The bulk of the triple hits retrieved were occurrences in rRNA structures. This is not unexpected, as these are the largest RNA 3D structures available and offer the most opportunity for a diverse array of interactions, including very long range inter-domain interactions. Of the interactions listed in Figure <figr fid="F2">2</figr>, four are currently found only in non-ribosomal structures (ACG1, AGG2, CGU1, GGG2). The majority of the discussion will therefore revolve around triples found in the prokaryotic ribosomal subunits.</p>
            <fig id="F2"><title><p>Figure 2</p></title><caption><p>(A) Base triple interactions that were not previously recorded in the NCIR database. (B) Base triple interactions which were not listed in the NAIL library of query patterns but which were found to be present in the NCIR database. (C) Novel triple interactions that were neither recorded in the NCIR database nor listed in the NAIL query dataset. Hydrogen bonds from possibly protonated bases are marked with arrows and + at the protonated donor position. The geometric orientation labels have been abbreviated as: C=Cis glycosidic bond orientation, H=Hoogsteen Edge, T=Trans glycosidic bond orientation, S=Sugar edge, WC=Watson-Crick edge.</p></caption><text>
   <p>(A) Base triple interactions that were not previously recorded in the NCIR database. (B) Base triple interactions which were not listed in the NAIL library of query patterns but which were found to be present in the NCIR database. (C) Novel triple interactions that were neither recorded in the NCIR database nor listed in the NAIL query dataset. Hydrogen bonds from possibly protonated bases are marked with arrows and + at the protonated donor position. The geometric orientation labels have been abbreviated as: C=Cis glycosidic bond orientation, H=Hoogsteen Edge, T=Trans glycosidic bond orientation, S=Sugar edge, WC=Watson-Crick edge.</p>
</text><graphic file="1471-2105-12-S13-S2-2"/></fig>
         </sec>
         <sec>
            <st>
               <p>Triple types found in high resolution RNA Structures that are unrecorded in NCIR</p>
            </st>
            <p>On examination of the new triple types in the 23S rRNA of the <it>H. marismortui</it> and <it>T. thermophilus</it> structures, we find that many triples had already been annotated on the secondary structure diagram <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. However, one of the hits, AGU1 (Figure <figr fid="F2">2A</figr>), appears to be completely unrecorded in this structure or any other, although it had been theoretically predicted by NAIL. This new triple comprises a GU (G924.U919) wobble pair in helix 37 in which the guanine also participates in an AG N3-amino, amino-N1 interaction with A166 [PDB: 1ffk_0], which is situated in a hairpin loop in helix 11. Thus this triple links bases in domains I and II. Domain II accounts for much of the &#8220;back&#8221; of the ribosomal particle and thus this base triple is far from the functional sites of 23S rRNA. The A166 and G924 positions are conserved in the alignment of the 23 prokaryotic sequences. However, in the sequence of <it>Nanoarchaeum equitans Kin4-M</it>, an archaeal species, the U919 position is replaced with a cytosine. The ACG1 triple (Figure <figr fid="F2">2A</figr>) was found only in the two structures of the hepatitis delta virus (HDV) ribozyme <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp> using a NAIL input pattern which involves a protonated cytosine. Its position between sites for catalysis and protein interaction may suggest that it contributes to stabilization of the helix backbone necessary for the correct folding of either or both of these sites. Another protonated NAIL pattern that finds a match in the database is the ACC1 triple (Figure <figr fid="F2">2A</figr>). This triple, which involves an adenosine protonated at N1, is found in two different locations in the 23S rRNA structures of <it>H. marismortui</it>. One of these, the A2485.C2104.C2536 triple, was found to be conserved in our alignments of 23S rRNA sequences. It is within the peptidyl transferase region and is discussed below. The GGG2 triple (Figure <figr fid="F2">2A</figr>), though not listed in NCIR, has been previously identified in a synthetic RNA-DNA hybrid molecule <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, but we find no other examples in naturally occurring structures. Because NCIR is the result of a manual literature search, it is limited to what is reported in the available literature, and its coverage may be incomplete owing to the manual and labor intensive nature of such a compilation process. Furthermore, to our knowledge, NCIR is not automatically updated as newer structures become available.</p>
         </sec>
         <sec>
            <st>
               <p>Previously recorded triple types not predicted in NAIL</p>
            </st>
            <p>Seven triple types were found to be recorded in NCIR, but were not predicted in NAIL. This may be because of limitations already discussed by Walberer <it>et al</it>. <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, namely the fixed length of the initial hydrogen bond and problems associated with modeling of non-planar geometries. Thus in the AAG1, AAG2 and AGU2 triples (Figure <figr fid="F2">2B</figr>) a completely planar triple geometry could result in close approaches and possible atomic clashes. The ACG2 triple is found in the <it>H. marismortui</it> 23S ribosomal RNA structure and involves a rather non-planar association of G2033 with the A1742.C2037 pair. This triple caps a short (4 bp) double helix which sweeps up from the end of the double helix to complete the triple. The AGU2 triple (Figure <figr fid="F2">2B</figr>), found in the structure of the <it>T. thermophilus</it> 16S rRNA [PDB: 1fjg_A, 1n32_A], consists of an AU reverse Watson-Crick pair and an AG N7 amino, amino N3 pair, in which different edges of the shared adenine participate in different non-canonical pairings to form the triple. This triple (A55.G357.U368) is in fact part of a tetrad, listed in NCIR, as G357 also forms a Watson-Crick base pair with C54. In addition to the inter-base hydrogen bonds there is also a hydrogen bond with good geometry between the ribose O2* of G357 and the O2 acceptor of U368. This close approach may be the reason for its exclusion from the NAIL predictions. Another type of triple not predicted in NAIL is CCG1, which occurs in the 5S subunit (Figure <figr fid="F2">2B</figr>) and consists of a Watson-Crick G66.C15 base pair with a bridging C113 [PDB: 1ffk_9]. This interaction forms a junction between three double helices. The GGU1 triple was found to be present in aspartate-tRNA structures as well as 23S rRNA structures (Figure <figr fid="F2">2B</figr>). In the tRNA structures, this triple caps the anticodon stem, stacking with the A24.U11 base pair which in turn stacks on a group of three triples (U12.A23.A9; &#936;13.G22.A46; A14.A21.U8), the triple being augmented by a hydrogen bond between the O2* of residue 45 and the O3* of residue 9. The other occurrence of GGU1 is in the 23S ribosomal RNA structures [G2092.G2093.U2652, PDB: 1ffk_0], where the two guanosines are sequential and form a platform-like structure. This triple caps a stack of two other triples: G2094.A2649.C2651 and A2095.A2612.U2650.</p>
         </sec>
         <sec>
            <st>
               <p>Completely novel base triple interactions</p>
            </st>
            <p>A further two triple types, CGU1 and GGG1 (Figure <figr fid="F2">2C</figr>), were neither predicted in NAIL nor listed in NCIR. The CGU1 triple consists of a reverse Watson-Crick GC base pair G515.C548 with a bridging O4 from U519. The triple caps a succession of base triples formed by residues 512 to 515 and 521 to 523 in <it>T. thermophilus</it> tRNA-Gln in complex with its cognate tRNA synthetase <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> [PDB: 1g59_B: C512.C509.G523; U513.A546.G542; A514.U508.A521]. The second novel triple type, GGG1 (Figure <figr fid="F2">2C</figr>), was found in two structures of <it>H. marismortui</it> 23S ribosomal subunits [PDB: 1k9m, 1kd1 <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, 1m90 <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>] but not in the other <it>H. marismortui</it> high resolution rRNA structures searched. The GGG1 triple type consists of a G.G N3-amino symmetric base pair between G512 and G487. The O6 atom of G512 accepts two hydrogen bonds from N1 and N2 of G504 to complete the triple. This triple can be further extended into a quadruple interaction, which is unlisted in NCIR, via a Watson-Crick pairing between G487 and C515. In some other 23S structures, G512 and G487 are further apart and do not hydrogen bond, although in all the structures hydrogen bonding occurs from G512 N2 to the O4* in the ribose of G487. The interaction is sandwiched between another triple, U488.G503.A513, and a tetrad, A485.A509.C505.U481.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Base triples as constituents of interactions between RNA secondary structures</p>
            </st>
            <p>Many of the base triples found contain a Watson-Crick pair. Approximately 55% of the triples found in <it>H. marismortui</it> 23S rRNA contain a Watson-Crick pair. Triples were observed as interactions involved in RNA helix packing, or at interfaces of RNA secondary or tertiary structure interactions. A listing of interactions between RNA secondary structure elements highlights the frequency of interactions involving at least one loop structure of any type interacting with either another loop or a helix (Additional File <supplr sid="S1">1</supplr> Table S2). These interactions outnumber helix-helix and intra-helix interactions. We further observed that the same triple types occur in different, larger structural motifs involving a variety of RNA secondary structures. For example, there are six AGC amino-N3, N1-amino; Watson-Crick triples in <it>H. marismortui</it> 23S rRNA, which are variously involved in helix to internal loop interactions, hairpin loop &#8211; hairpin loop &#8211; hairpin loop interactions, helix to hairpin loop interactions and helix to multi-branched loop interactions (Additional File <supplr sid="S1">1</supplr> Table S2). Furthermore, three of these six occurrences are also involved in inter-domain interactions. One of these six AGC triples interfaces domain IV and domain VI and is conserved in the sequence alignments, while the other five show varying degrees of conservation.</p>
            <p>Two occurrences of the common AGC amino-N3, N1-amino; Watson-Crick triples were also observed in <it>T. thermophilus</it> 16S rRNA, one of which is also involved in an inter-domain interaction. Of the two AGC amino-N3, N1-amino; Watson-Crick triples, one involves bases in helices H8 and H14 (G347.C342.A160) and the other involves bases in helices H12 and H21. The helices for the first triple end in a GAAA (GNRA) tetraloop and a UACG (UNCG) tetraloop, respectively. The interaction between these loops has been previously noted to be an unusual packing arrangement<abbrgrp><abbr bid=" B2">2</abbr></abbrgrp>. The second triple is between the second A (A608) of the GAAA loop and the C.G (C308.G292) base pair, which closes the UACG loop. Interestingly, in this second example in 16S rRNA, the adenine base, although not part of a GNRA tetraloop, is in a GAAAG internal loop sequence where the GAAA has a very GNRA-like conformation. The significance of this may simply be that this base in the GNRA conformation is accessible and has both faces available for tertiary binding. This position may be particularly important for the interaction of tetraloop structures with other features.</p>
         </sec>
         <sec>
            <st>
               <p>The prokaryotic ribosomal subunits: occurrences and conservation of base triples</p>
            </st>
            <p>Many of the interactions in the <it>H. marismortui</it> 23S and <it>T. thermophilus</it> 16S ribosomal RNA discussed here have been previously noted in the available structure-based secondary structure diagrams <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Not all the triples discussed here are novel observations, but our approach of cataloguing specific classes of triple puts the collective contribution of these structural formations into perspective (Additional File <supplr sid="S1">1</supplr> Table S2). We found 28 triples in <it>H. marismortui</it> 23S rRNA, and the 23S rRNA sequence alignments for the 23 prokaryotic species in Table <tblr tid="T1">1</tblr> showed that the base components of twelve of these triples are totally conserved across the aligned sequences (Additional File <supplr sid="S1">1</supplr> Table S2). Domain V, which contains the peptidyl transferase center, has the largest number of triples fitting our criteria, followed by domain II. A comparison of triples in the four available 23S rRNA structures (<it>H. marismortui</it>, <it>D. radiodurans</it>, <it>E. coli</it>, <it>T. thermophilus</it>; PDB: 1ffk, 1nkw, 2awb, 2j01 respectively), using the <it>H. marismortui</it> structure as a reference, shows that 21 of the 28 triples are also conserved in the other three structures. We were further able to observe that there appear to be stackings and clusterings of these triples (Figure <figr fid="F3">3</figr>). A similar clustering of the A-minor motif classes of nucleotide triples into &#8216;A patches&#8217; has been previously reported <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Our observations strengthen the indication that highly hydrogen bonded and stable interactions are clustered together, thereby implying a requirement for structural rigidity or integrity in the regions where they are most observed. 
For example, a novel observation of two stacked UAU Hoogsteen, Watson-Crick triple motifs is discussed below. Another stacked pair of base triples in the large ribosomal subunit (C959.C963.A1005 and A961.G958.C1008) may be involved in maintaining the structure of an internal loop which disrupts the middle of the long helix 38 and is conserved in all our 23 aligned sequences. However, in contrast to the high conservation of triples in the large subunit, our alignments for the 16S rRNA sequences, which are otherwise well known to be highly conserved in sequence <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, showed that only three of the fifteen triples in <it>T. thermophilus</it> [PDB: 1fjg] are conserved for all three base positions of a triple. Taking into account occurrences for both subunits, these observations suggest that triples may also be used as an opportunistic interaction mechanism for increasing sequence diversity while preserving the general fold of the ribosomal RNA assemblies; this is discussed further in the following sections.</p>
            <fig id="F3"><title><p>Figure 3</p></title><caption><p>Base triples conserved in four 23S rRNA structures. (A) Triples from domains I (red), II (green) and III (blue) highlighted in spacefilling mode and numbered using the <it>H. marismortui</it> numbering unless stated otherwise. (B) Triples in domain IV (red), domain V (green) and domain VI (blue). Triples not originally detected in the <it>H. marismortui</it> structure which were detected in the <it>T. thermophilus</it> search and were found to have conserved equivalent interactions in the other three structures are also highlighted in section (A) as orange spacefills for domain I and cyan spacefills for domain II, and in section (B) as magenta spacefills for domain V and cyan spacefills for domain VI. (C, D) A stacked triples motif located on a junction in domain V which joins three subdomains in four of the available structures compared. (E) The two stacked triples, shown separated for clarity, consist of two planar UAU Hoogsteen, Watson-Crick triples. The colors used for each of the 23S subunits compared are presented in section (F).</p></caption><text>
   <p>Base triples conserved in four 23S rRNA structures. (A) Triples from domains I (red), II (green) and III (blue) highlighted in spacefilling mode and numbered using the <it>H. marismortui</it> numbering unless stated otherwise. (B) Triples in domain IV (red), domain V (green) and domain VI (blue). Triples not originally detected in the <it>H. marismortui</it> structure which were detected in the <it>T. thermophilus</it> search and were found to have conserved equivalent interactions in the other three structures are also highlighted in section (A) as orange spacefills for domain I and cyan spacefills for domain II, and in section (B) as magenta spacefills for domain V and cyan spacefills for domain VI. (C, D) A stacked triples motif located on a junction in domain V which joins three subdomains in four of the available structures compared. (E) The two stacked triples, shown separated for clarity, consist of two planar UAU Hoogsteen, Watson-Crick triples. The colors used for each of the 23S subunits compared are presented in section (F).</p>
</text><graphic file="1471-2105-12-S13-S2-3"/></fig>
         </sec>
         <sec>
            <st>
               <p>Geometric families for 23S rRNA base triples</p>
            </st>
            <p>The conservation of geometric orientation at a triple position enables, to an extent, the conservation of 3D space for the triple and thus the preservation of the backbone conformation of an RNA molecule despite variations in the nucleotide sequence. An analysis of the interaction geometry for 23S rRNA triples was used to investigate recurring base geometries and the effect that such conservation may have on sequence variation while still preserving the general conformation of the sugar-phosphate backbone. Triples from the <it>H. marismortui</it> and <it>T. thermophilus</it> 23S rRNA structures were described using the nomenclature proposed by Leontis and Westhof <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Triples at equivalent positions in the <it>E. coli</it> and <it>D. radiodurans</it> structures, identified via the structural alignment, were also described using the same nomenclature. The sequence alignment data were then integrated with this information, allowing the variability of base sequence or content for these triples to be observed.</p>
            <p>All the triples observed in this survey contain at least one base which uses its Watson-Crick edge to interact with other bases. The most common triple in 23S rRNA is the AGC triple from the Cis S / WC &#8211; Cis WC / WC geometric family (Additional File <supplr sid="S1">1</supplr> Table S3). In the 23S rRNA structure, this geometry appears to be exclusive to the AGC triple. However, the most common geometric family in the 23S subunit is the Cis WC / WC &#8211; Trans H / WC family, which has seven examples. These seven examples occur in triples with three different base compositions: ACU, AUU and CGG (Additional File <supplr sid="S1">1</supplr> Table S3). The majority of triples occur as composite base pairs where one base interacts via two of its edges with the other two bases of the triple. However, four cases were observed where two of the bases each interact via two edges (ACG, CGG, GGU, GUU; Additional File <supplr sid="S1">1</supplr> Table S3). All four of these cases are unique and are not repeated in the 23S rRNA structure.</p>
         </sec>
         <sec>
            <st>
               <p>3D space conservation in triples with unconserved base content</p>
            </st>
            <p>Structure comparisons between the four 23S rRNA structures used showed that several triple locations have variable base content, although the equivalent base positions superpose well and occupy a similar structural space (Figure <figr fid="F4">4</figr>). In some cases, this observation can be attributed to unique interactions, such as inter-domain interactions, which may vary between different species while still maintaining the general fold of the phosphate backbone containing these bases (Figure <figr fid="F4">4A</figr>). For all three cases presented in Figure <figr fid="F4">4</figr>, the 3D space occupied is similar and the phosphate backbone does not appear to shift drastically despite the variety of base combinations. The geometric orientations of structurally equivalent triples with conserved 3D space are conserved in almost all cases (Figure <figr fid="F4">4</figr>). In some cases, the geometric orientation of an interaction is not conserved, either because the equivalent bases do not interact via hydrogen bonds as per the criteria of this survey (Figure <figr fid="F4">4A</figr>) or because a base has an additional interacting edge, as is the case for U2652 of the GGU triple (Additional File <supplr sid="S1">1</supplr> Table S3 and Figure <figr fid="F4">4C</figr>).</p>
            <fig id="F4"><title><p>Figure 4</p></title><caption><p>Least squares superpositions of <it>H. marismortui</it> triples that were not conserved in sequence with the three other structures (<it>D. radiodurans</it>, <it>E. coli</it>, <it>T. thermophilus</it>). These show maintenance of equivalent structural spaces where (A) documents possibly organism-specific domain interfacing interactions; (B) interactions where variation occurs as a whole triple; (C) interactions where most variations may have gone through a single base mutation step. The equivalent triples selected from the alignment of 23 different prokaryotic species from Table <tblr tid="T1">1</tblr> are shown to the right of each corresponding triple superposition.</p></caption><text>
   <p>Least squares superpositions of <it>H. marismortui</it> triples that were not conserved in sequence with the three other structures (<it>D. radiodurans</it>, <it>E. coli</it>, <it>T. thermophilus</it>). These show maintenance of equivalent structural spaces where (A) documents possibly organism-specific domain interfacing interactions; (B) interactions where variation occurs as a whole triple; (C) interactions where most variations may have gone through a single base mutation step. The equivalent triples selected from the alignment of 23 different prokaryotic species from Table <tblr tid="T1">1</tblr> are shown to the right of each corresponding triple superposition.</p>
</text><graphic file="1471-2105-12-S13-S2-4"/></fig>
            <p>Sequence variation has also been observed at points of inter-domain interactions. These triples are quite possibly the result of opportunistic interactions resulting from the placement of the component nucleotides by the conservation of the sugar-phosphate backbone fold. These opportunistic interactions are expected to vary between different species, and the geometric family of the interactions is therefore also not expected to be conserved. One such example is the A2841.C2087.G2657 triple in <it>H. marismortui</it> (Figure <figr fid="F4">4A</figr>), which has equivalently placed bases in the <it>E. coli</it>, <it>T. thermophilus</it> and <it>D. radiodurans</it> structures that nevertheless differ in sequence. This is an interaction between domains V and VI where the geometry of interaction for positions equivalent to A2841 (DVI) is not conserved. As previously noted, the formation of triples that are unconserved in sequence can be seen as an opportunistic mechanism for effecting sequence diversity while still preserving the backbone conformation of the structure in general.</p>
         </sec>
         <sec>
            <st>
               <p>A stacked UAU Hoogsteen, Watson-Crick triple motif in prokaryotic 23S rRNA</p>
            </st>
            <p>One interesting, highly conserved occurrence in prokaryotic 23S rRNA is the presence of two UAU Hoogsteen, Watson-Crick interactions stacked on each other. These triples [U2116.A2118.U2276 and U2115.A2470.U2277; PDB: 1ffk_0] have been previously recorded in NCIR. However, when viewed together the adenines appear stacked in opposite positions to each other (Figure <figr fid="F3">3C</figr>). This same stacked motif was also found in our survey of the <it>E. coli</it> [PDB: 2awb], <it>T. thermophilus</it> [PDB: 2j01] and <it>D. radiodurans</it> [PDB: 1nkw] 23S rRNA structures. The superposition of the formations in all four structures shows good structural conservation. To our knowledge, there have been no previous discussions or hypotheses with regard to the possible functions of this novel stacked UAU Hoogsteen, Watson-Crick triples motif. This motif is situated at a junction which joins the three subdomains of domain V (Figure <figr fid="F3">3C</figr>). One of these three subdomains forms the binding site for protein L1, another forms the majority of the central protuberance region and the third extends in the direction of domain VI and the putative peptidyl transferase active site <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. It has been previously observed that the U2115.A2470.U2277 triad is stacked below a UGCAG pentad<abbrgrp><abbr bid=" B33">33</abbr></abbrgrp> (U2278.G2471.C2114.A2633.G2630). Our results demonstrate that there are actually two structurally conserved UAU triples situated at a multi-loop junction and stacked on the UGCAG pentad. At present, the only other occurrence of this kind of UAU Hoogsteen, Watson-Crick triple in a non-23S rRNA structure is a lone triple in the structure of cysteinyl tRNA synthetase<abbrgrp><abbr bid=" B34">34</abbr></abbrgrp> [PDB: 1u0b].</p>
         </sec>
         <sec>
            <st>
               <p>Interactions between rRNA domains</p>
            </st>
            <p>More than 80% of the triples in both the large ribosomal subunit and the 16S rRNA mediate intra-domain interactions. However, inter-domain interactions involving bases very distant from each other in the polynucleotide sequence were also observed. Two interactions that interface three of the six domains, and three that interface two domains, were observed in the large subunit structure (Figure <figr fid="F3">3A</figr>, <figr fid="F2">2B</figr>). In the first three-domain triple, a G.C Watson-Crick and A.G N3-amino, amino-N1 triple (G2449.C418.A1921) forms an interaction between domains I, IV and V (Additional File <supplr sid="S1">1</supplr> Table S2). Domain IV constitutes much of the subunit interface in contact with the 30S particle, and helices 67 to 71 form an area around the putative active site cleft on the subunit interface side of the 23S rRNA<abbrgrp><abbr bid=" B1">1</abbr></abbrgrp>. The adenine in this triple is in helix 68 and appears to be located in a relatively variable region in our alignment. This is the only example we have observed which shows the interactions of three bases, each on a separate hairpin loop and on different domains. The second triple that interfaces three different domains is U1371.A2054.U2648. U1371 on a multi-branched loop in domain III hydrogen bonds with A2054 in a multi-branched loop on domain IV, which in turn is hydrogen bonded to U2648 on a small internal loop in domain V <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. An interesting difference in the structural organization of the two ribosomal subunits is that while the domains of the 16S rRNA each form distinct components, the domains of 23S rRNA are much more intricately linked together, which, it has been suggested, may reflect the lesser requirement for flexibility in this subunit<abbrgrp><abbr bid=" B1">1</abbr></abbrgrp>. 
This is also reflected in our work, where multiply hydrogen bonded triples which are components of inter-domain interactions are more numerous in the 23S rRNA than in the 16S rRNA. We did not observe any triples that involved all three domains of the <it>T. thermophilus</it> 16S rRNA structure [PDB: 1fjg], although two triples were observed to interface two different domains. One example, A608.C308.G292, has been previously discussed. The other appears to be a U13.U20 pair in a hairpin loop interacting with A915, where the Watson-Crick face of U20 forms hydrogen bonds with both U13 and A915.</p>
         </sec>
         <sec>
            <st>
               <p>Base triples and links to ribozymic activity</p>
            </st>
            <p>Domain V of the <it>H. marismortui</it> 23S rRNA has been investigated for links to its ribozymic peptidyl transferase activity <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. There are 11 interactions that involve base triples in this domain. One cluster of interactions involves the double UAU Hoogsteen, Watson-Crick motif discussed previously. Another set of base triples forms interactions on stems leading to or exiting the multi-branched loop (helices 73, 89, 90) where the A-loop <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> is located. These include a triple [C2104.A2485.C2536; PDB: 1ffk] <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> that was found to be conserved both in the structural alignments and in all the sequences aligned. This ACC triple is adjacent to A2486, which was originally hypothesized as the peptidyl transferase acid-base catalytic residue by Nissen <it>et al</it>.<abbrgrp><abbr bid=" B35">35</abbr></abbrgrp>, although subsequent work has shown that peptide bond formation in the peptidyl transferase centre does not involve this acid-base catalysis mechanism<abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>. The A2450.C2501 component of this triple (<it>E. coli</it> numbering used) is also involved in an A-minor interaction with A76 of P-site tRNA<abbrgrp><abbr bid=" B41">41</abbr></abbrgrp>.</p>
            <p>Triples in the vicinity of other ribozyme active sites have previously been observed; an example is the base triple sandwich in the structure of the <it>Tetrahymena</it> ribozyme<abbrgrp><abbr bid=" B12">12</abbr></abbrgrp>, where the 3&#8217;-terminal guanosine (&#969;G), which serves as the attacking group for RNA cleavage, participates in a triple interaction with the G264.C311 base pair. This triple fits the multiply hydrogen bonded criteria of this survey and is in turn sandwiched by three other base triples. Of the three triples in the sandwich, the A263.C262.G312 interaction above the &#969;G triple is also of the type covered in this survey. Both triples were found by a NASSAM search, although this structure was not part of the original search because its resolution [3.8&#197;, PDB: 1x8w] is lower than our cut-off point. The inter-domain interactions and helix stabilization functions carried out by base triples may be important, albeit collective, factors in determining the correct folding of the RNA molecule to enable its catalytic functions.</p>
         </sec>
         <sec>
            <st>
               <p>The ribosomal polypeptide exit tunnel</p>
            </st>
            <p>The polypeptide exit tunnel is the passage through which nascent proteins pass as they are synthesized. This tunnel begins immediately below the peptidyl transferase center, and is approximately 100&#197; in length<abbrgrp><abbr bid=" B42">42</abbr></abbrgrp>. We found that a total of 32 bases, which are the components of 11 triples, are in positions that were defined by Nissen <it>et al.</it><abbrgrp><abbr bid=" B35">35</abbr></abbrgrp> as approaching the tunnel (Figure <figr fid="F5">5</figr>). These include U1371.A2054.U2648 [PDB: 1ffk], which, as discussed earlier, links domains III, IV and V; the other ten triples do not participate in any inter-domain interactions. Almost one-third of the triples in the large ribosomal subunit structure of <it>H. marismortui</it> take part in interactions close to the tunnel, including the ACC triple adjacent to the hypothesized peptidyl transferase center discussed in the previous section. Multiply hydrogen bonded base interactions may contribute significantly towards the conformational integrity of the tunnel.</p>
            <fig id="F5"><title><p>Figure 5</p></title><caption><p>Stereo diagrams showing the eleven triples that approach the polypeptide exit tunnel in <it>H. marismortui</it> 23S rRNA (PDB: 1ffk). The triples are shown in space filling mode viewed looking down the polypeptide exit tunnel (above) and in an orthogonal view (below). To aid orientation, 5S rRNA has been colored in blue while domain V of 23S rRNA has been colored magenta.</p></caption><text>
   <p>Stereo diagrams showing the eleven triples that approach the polypeptide exit tunnel in <it>H. marismortui</it> 23S rRNA (PDB: 1ffk). The triples are shown in space filling mode viewed looking down the polypeptide exit tunnel (above) and in an orthogonal view (below). To aid orientation, 5S rRNA has been colored in blue while domain V of 23S rRNA has been colored magenta.</p>
</text><graphic file="1471-2105-12-S13-S2-5"/></fig>
         </sec>
         <sec>
            <st>
               <p>Possible exclusivity of interactions to particular structures</p>
            </st>
            <p>Several types of triples appear to occur more than once within one structure, but are not presently seen in any other class of RNA structure. For example, the new ACC1 triple (Figure <figr fid="F2">2A</figr>) occurs at two different positions in the large ribosomal subunit but does not occur in any other RNA structure used in our search. The two CCG triples found, C37.G43.C46 and C15.G66.C113 [PDB: 1ffk_9], were found only in the 5S rRNA of the large ribosomal subunit. Several of the interactions found have only been observed at the same position in a particular structure, and do not occur anywhere else in that structure or any other structure. However, the ribosomal structures are extremely large compared to the other RNA structures and are likely to contain a wider diversity of triples; these observations, although inconclusive, are therefore noted for the record.</p>
         </sec>
         <sec>
            <st>
               <p>Future extensions to the NASSAM program - roles of RNA backbone components, water molecules and metal ions</p>
            </st>
            <p>At present, the methodology is restricted to base-base interactions. Although this has revealed new types of interactions, it is clear that base to backbone interactions are also of great importance<abbrgrp><abbr bid=" B43">43</abbr></abbrgrp>. While inclusion of backbone information will pose some challenges for the graph theoretical methodology, as it greatly increases the number of nodes that can potentially be matched, this is nevertheless an important future enhancement. Another important extension that can be envisaged is the addition of information relating to water molecules and metal ions (especially Mg<sup>2+</sup> ions), which play important roles in the stabilization of interbase interactions<abbrgrp><abbr bid=" B44">44</abbr></abbrgrp>. A further problem here is that it is extremely difficult to accurately define waters and low atomic number metal ions (such as Mg<sup>2+</sup>) in crystal structures unless the resolution is 2.0&#197; or better; very few RNA structures match this criterion. This problem has been discussed, for example, by Banatao <it>et al</it>. (2003) <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, who observed that many metal sites are mislabelled or completely missing in RNA structures. Nevertheless, with ribosomal structures (such as PDB: 1vqs) already at 2.2 &#197;, this kind of analysis may soon be an accessible goal, and would clearly greatly increase the number of possible interaction patterns beyond those discussed here.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>The results of our survey reveal that multiply hydrogen bonded base triples have a high degree of conservation between comparable ribosomal structures, suggesting significant collective contributions towards the overall folding and stabilization of the RNA molecule. By annotating triple patterns and correlating theoretically predicted base triples from NAIL with the literature-based compilations in the NCIR database, we were able to discover a number of multiply hydrogen bonded base triple formations that had not been previously recorded and/or had not been predicted theoretically. The same annotation approach has enabled motif discovery by putting into context known patterns of interactions such as the stacked UAU Hoogsteen, Watson-Crick motif. The value of expertly curated databases such as NCIR <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and SCOR <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> is indisputable. Computationally generated libraries such as NAIL<abbrgrp><abbr bid=" B14">14</abbr></abbrgrp> have proven to be an important resource for the discovery of new base interactions. Computational structure annotation methods, which can quickly locate tertiary interactions of a given type, large or small, can be invaluable for the comparison of complex structures as well as for increasing the coverage and volume of manual curation. This capability provides the foundation for further experimental work investigating the specific contributions of possibly essential base interactions, which can in turn correlate RNA tertiary structure with its corresponding function. As an outcome of this work, we have made the NASSAM program available via a web-enabled interface at <url>http://mfrlab.org/grafss/nassam/</url>.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors&#8217; contributions</p>
         </st>
         <p>MFR built the search database, designed the research methodology, carried out the base triple searches, analysed the data and drafted the manuscript. AMH designed the alternative triple pattern approach and carried out the initial NASSAM optimizations. PW participated in the design and coordination of the work. PJA conceived the study, wrote the algorithm, and participated in the design and coordination of the work and the drafting of the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements and funding</p>
            </st>
            <p>We thank the Biotechnology and Biological Sciences Research Council, the Royal Society and the Wolfson Foundation for funding and computing facilities. MFR was funded by the Ministry of Higher Education Malaysia FRGS grant UKM-ST-06-FRGS0006-2009 and the Universiti Kebangsaan Malaysia grant UKM-GGPM-KPB-101-2010. The authors gratefully acknowledge assistance from Ms. Hazrina Yusof Hamdani for the setting up of the web server for NASSAM.</p>
            <p>This article has been published as part of <it>BMC Bioinformatics</it> Volume 12 Supplement 13, 2011: Tenth International Conference on Bioinformatics &#8211; First ISCB Asia Joint Conference 2011 (InCoB/ISCB-Asia 2011): Bioinformatics. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/1471-2105/12?issue=S13</url>.</p>
         </sec>
      </ack>
      <refgrp><bibl id="B1"><title><p>The complete atomic structure of the large ribosomal subunit at 2.4 Angstrom resolution</p></title><aug><au><snm>Ban</snm><fnm>N</fnm></au><au><snm>Nissen</snm><fnm>P</fnm></au><au><snm>Hansen</snm><fnm>J</fnm></au><au><snm>Moore</snm><fnm>PB</fnm></au><au><snm>Steitz</snm><fnm>TA</fnm></au></aug><source>Science</source><pubdate>2000</pubdate><volume>289</volume><fpage>905</fpage><lpage>920</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.289.5481.905</pubid><pubid idtype="pmpid" link="fulltext">10937989</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Structure of the 30S ribosomal subunit</p></title><aug><au><snm>Wimberly</snm><fnm>BT</fnm></au><au><snm>Brodersen</snm><fnm>DE</fnm></au><au><snm>Clemons</snm><fnm>WM</fnm></au><au><snm>Morgan-Warren</snm><fnm>RJ</fnm></au><au><snm>Carter</snm><fnm>AP</fnm></au><au><snm>Vonrhein</snm><fnm>C</fnm></au><au><snm>Hartsch</snm><fnm>T</fnm></au><au><snm>Ramakrishnan</snm><fnm>V</fnm></au></aug><source>Nature</source><pubdate>2000</pubdate><volume>407</volume><fpage>327</fpage><lpage>339</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/35030006</pubid><pubid idtype="pmpid" link="fulltext">11014182</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Structure of functionally activated small ribosomal subunit at 3.3 angstrom resolution</p></title><aug><au><snm>Schluenzen</snm><fnm>F</fnm></au><au><snm>Tocilj</snm><fnm>A</fnm></au><au><snm>Zarivach</snm><fnm>R</fnm></au><au><snm>Harms</snm><fnm>J</fnm></au><au><snm>Gluehmann</snm><fnm>M</fnm></au><au><snm>Janell</snm><fnm>D</fnm></au><au><snm>Bashan</snm><fnm>A</fnm></au><au><snm>Bartels</snm><fnm>H</fnm></au><au><snm>Agmon</snm><fnm>I</fnm></au><au><snm>Franceschi</snm><fnm>F</fnm></au><au><snm>Yonath</snm><fnm>A</fnm></au></aug><source>Cell</source><pubdate>2000</pubdate><volume>102</volume><fpage>615</fpage><lpage>623</lpage><xrefbib><pubidlist><pubid 
idtype="doi">10.1016/S0092-8674(00)00084-2</pubid><pubid idtype="pmpid" link="fulltext">11007480</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>3-dimensional structure of a hammerhead ribozyme</p></title><aug><au><snm>Pley</snm><fnm>HW</fnm></au><au><snm>Flaherty</snm><fnm>KM</fnm></au><au><snm>McKay</snm><fnm>DB</fnm></au></aug><source>Nature</source><pubdate>1994</pubdate><volume>372</volume><fpage>68</fpage><lpage>74</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/372068a0</pubid><pubid idtype="pmpid" link="fulltext">7969422</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>The crystal-structure of an all-RNA Hammerhead ribozyme - a proposed mechanism for RNA catalytic cleavage</p></title><aug><au><snm>Scott</snm><fnm>WG</fnm></au><au><snm>Finch</snm><fnm>JT</fnm></au><au><snm>Klug</snm><fnm>A</fnm></au></aug><source>Cell</source><pubdate>1995</pubdate><volume>81</volume><fpage>991</fpage><lpage>1002</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0092-8674(05)80004-2</pubid><pubid idtype="pmpid" link="fulltext">7541315</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Metal-binding sites in the major groove of a large ribozyme domain</p></title><aug><au><snm>Cate</snm><fnm>JH</fnm></au><au><snm>Doudna</snm><fnm>JA</fnm></au></aug><source>Structure</source><pubdate>1996</pubdate><volume>4</volume><fpage>1221</fpage><lpage>1229</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0969-2126(96)00129-3</pubid><pubid idtype="pmpid">8939748</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>3-dimensional tertiary structure of yeast phenylalanine 
transfer-RNA</p></title><aug><au><snm>Kim</snm><fnm>SH</fnm></au><au><snm>Suddath</snm><fnm>FL</fnm></au><au><snm>Quigley</snm><fnm>GJ</fnm></au><au><snm>McPherson</snm><fnm>A</fnm></au><au><snm>Sussman</snm><fnm>JL</fnm></au><au><snm>Wang</snm><fnm>AHJ</fnm></au><au><snm>Seeman</snm><fnm>NC</fnm></au><au><snm>Rich</snm><fnm>A</fnm></au></aug><source>Science</source><pubdate>1974</pubdate><volume>185</volume><fpage>435</fpage><lpage>440</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.185.4149.435</pubid><pubid idtype="pmpid" link="fulltext">4601792</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Structure of yeast phenylalanine transfer-RNA at 2.5 A resolution</p></title><aug><au><snm>Ladner</snm><fnm>JE</fnm></au><au><snm>Jack</snm><fnm>A</fnm></au><au><snm>Robertus</snm><fnm>JD</fnm></au><au><snm>Brown</snm><fnm>RS</fnm></au><au><snm>Rhodes</snm><fnm>D</fnm></au><au><snm>Clark</snm><fnm>BFC</fnm></au><au><snm>Klug</snm><fnm>A</fnm></au></aug><source>Proc Natl Acad Sci U S A</source><pubdate>1975</pubdate><volume>72</volume><fpage>4414</fpage><lpage>4418</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.72.11.4414</pubid><pubid idtype="pmcid">388732</pubid><pubid idtype="pmpid">1105583</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Modeling the 3-dimensional structure of RNA using discrete nucleotide conformational sets</p></title><aug><au><snm>Gautheret</snm><fnm>D</fnm></au><au><snm>Major</snm><fnm>F</fnm></au><au><snm>Cedergren</snm><fnm>R</fnm></au></aug><source>J Mol Biol</source><pubdate>1993</pubdate><volume>229</volume><fpage>1049</fpage><lpage>1064</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jmbi.1993.1104</pubid><pubid idtype="pmpid" link="fulltext">7680379</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Structural motifs in RNA</p></title><aug><au><snm>Moore</snm><fnm>PB</fnm></au></aug><source>Annu Rev 
Biochem</source><pubdate>1999</pubdate><volume>68</volume><fpage>287</fpage><lpage>300</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.biochem.68.1.287</pubid><pubid idtype="pmpid" link="fulltext">10872451</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Minor groove RNA triplex in the crystal structure of a ribosomal frameshifting viral pseudoknot</p></title><aug><au><snm>Su</snm><fnm>L</fnm></au><au><snm>Chen</snm><fnm>LQ</fnm></au><au><snm>Egli</snm><fnm>M</fnm></au><au><snm>Berger</snm><fnm>JM</fnm></au><au><snm>Rich</snm><fnm>A</fnm></au></aug><source>Nat Struct Biol</source><pubdate>1999</pubdate><volume>6</volume><fpage>285</fpage><lpage>292</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/6722</pubid><pubid idtype="pmpid" link="fulltext">10074948</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Structure of the tetrahymena ribozyme: base triple sandwich and metal ion at the active site</p></title><aug><au><snm>Guo</snm><fnm>F</fnm></au><au><snm>Gooding</snm><fnm>AR</fnm></au><au><snm>Cech</snm><fnm>TR</fnm></au></aug><source>Mol Cell</source><pubdate>2004</pubdate><volume>16</volume><fpage>351</fpage><lpage>362</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">15525509</pubid></xrefbib></bibl><bibl id="B13"><title><p>NCIR: a database of non-canonical interactions in known RNA structures</p></title><aug><au><snm>Nagaswamy</snm><fnm>U</fnm></au><au><snm>Larios-Sanz</snm><fnm>M</fnm></au><au><snm>Hury</snm><fnm>J</fnm></au><au><snm>Collins</snm><fnm>S</fnm></au><au><snm>Zhang</snm><fnm>ZD</fnm></au><au><snm>Zhao</snm><fnm>Q</fnm></au><au><snm>Fox</snm><fnm>GE</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2002</pubdate><volume>30</volume><fpage>395</fpage><lpage>397</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/30.1.395</pubid><pubid idtype="pmcid">99067</pubid><pubid idtype="pmpid" link="fulltext">11752347</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Structural diversity 
and isomorphism of hydrogen-bonded base interactions in nucleic acids</p></title><aug><au><snm>Walberer</snm><fnm>BJ</fnm></au><au><snm>Cheng</snm><fnm>AC</fnm></au><au><snm>Frankel</snm><fnm>AD</fnm></au></aug><source>J Mol Biol</source><pubdate>2003</pubdate><volume>327</volume><fpage>767</fpage><lpage>780</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0022-2836(03)00090-1</pubid><pubid idtype="pmpid" link="fulltext">12654262</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>The Protein Data Bank</p></title><aug><au><snm>Berman</snm><fnm>HM</fnm></au><au><snm>Westbrook</snm><fnm>J</fnm></au><au><snm>Feng</snm><fnm>Z</fnm></au><au><snm>Gilliland</snm><fnm>G</fnm></au><au><snm>Bhat</snm><fnm>TN</fnm></au><au><snm>Weissig</snm><fnm>H</fnm></au><au><snm>Shindyalov</snm><fnm>IN</fnm></au><au><snm>Bourne</snm><fnm>PE</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2000</pubdate><volume>28</volume><fpage>235</fpage><lpage>242</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/28.1.235</pubid><pubid idtype="pmcid">102472</pubid><pubid idtype="pmpid" link="fulltext">10592235</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Structures of the bacterial ribosome at 3.5 Angstrom resolution</p></title><aug><au><snm>Schuwirth</snm><fnm>BS</fnm></au><au><snm>Borovinskaya</snm><fnm>MA</fnm></au><au><snm>Hau</snm><fnm>CW</fnm></au><au><snm>Zhang</snm><fnm>W</fnm></au><au><snm>Vila-Sanjurjo</snm><fnm>A</fnm></au><au><snm>Holton</snm><fnm>JM</fnm></au><au><snm>Cate</snm><fnm>JHD</fnm></au></aug><source>Science</source><pubdate>2005</pubdate><volume>310</volume><fpage>827</fpage><lpage>834</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1117230</pubid><pubid idtype="pmpid" link="fulltext">16272117</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Representation, searching and discovery of patterns of bases in complex RNA 
structures</p></title><aug><au><snm>Harrison</snm><fnm>AM</fnm></au><au><snm>South</snm><fnm>DR</fnm></au><au><snm>Willett</snm><fnm>P</fnm></au><au><snm>Artymiuk</snm><fnm>PJ</fnm></au></aug><source>J Comput-Aided Mol Des</source><pubdate>2003</pubdate><volume>17</volume><fpage>537</fpage><lpage>549</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">14703124</pubid></xrefbib></bibl><bibl id="B18"><title><p>Algorithm for subgraph isomorphism</p></title><aug><au><snm>Ullmann</snm><fnm>JR</fnm></au></aug><source>J Acm</source><pubdate>1976</pubdate><volume>23</volume><fpage>31</fpage><lpage>42</lpage><xrefbib><pubid idtype="doi">10.1145/321921.321925</pubid></xrefbib></bibl><bibl id="B19"><title><p>Satisfying hydrogen-bonding potential in proteins</p></title><aug><au><snm>McDonald</snm><fnm>IK</fnm></au><au><snm>Thornton</snm><fnm>JM</fnm></au></aug><source>J Mol Biol</source><pubdate>1994</pubdate><volume>238</volume><fpage>777</fpage><lpage>793</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jmbi.1994.1334</pubid><pubid idtype="pmpid" link="fulltext">8182748</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Multiple sequence alignment with the CLUSTAL series of programs</p></title><aug><au><snm>Chenna</snm><fnm>R</fnm></au><au><snm>Sugawara</snm><fnm>H</fnm></au><au><snm>Koike</snm><fnm>T</fnm></au><au><snm>Lopez</snm><fnm>R</fnm></au><au><snm>Gibson</snm><fnm>TJ</fnm></au><au><snm>Higgins</snm><fnm>DG</fnm></au><au><snm>Thompson</snm><fnm>JD</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2003</pubdate><volume>31</volume><fpage>3497</fpage><lpage>3500</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkg500</pubid><pubid idtype="pmcid">168907</pubid><pubid idtype="pmpid" link="fulltext">12824352</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Solution for best rotation to relate 2 sets of vectors</p></title><aug><au><snm>Kabsch</snm><fnm>W</fnm></au></aug><source>Acta Crystallogr Sect 
A</source><pubdate>1976</pubdate><volume>32</volume><fpage>922</fpage><lpage>923</lpage><xrefbib><pubid idtype="doi">10.1107/S0567739476001873</pubid></xrefbib></bibl><bibl id="B22"><title><p>The CCP4 Suite - programs for protein crystallography</p></title><aug><au><snm>Bailey</snm><fnm>S</fnm></au></aug><source>Acta Crystallogr Sect D-Biol Crystallogr</source><pubdate>1994</pubdate><volume>50</volume><fpage>760</fpage><lpage>763</lpage><xrefbib><pubid idtype="doi">10.1107/S0907444994003112</pubid></xrefbib></bibl><bibl id="B23"><title><p>Geometric nomenclature and classification of RNA base pairs</p></title><aug><au><snm>Leontis</snm><fnm>NB</fnm></au><au><snm>Westhof</snm><fnm>E</fnm></au></aug><source>RNA-Publ RNA Soc</source><pubdate>2001</pubdate><volume>7</volume><fpage>499</fpage><lpage>512</lpage></bibl><bibl id="B24"><title><p>Stepping through an RNA structure: a novel approach to conformational analysis</p></title><aug><au><snm>Duarte</snm><fnm>CM</fnm></au><au><snm>Pyle</snm><fnm>AM</fnm></au></aug><source>J Mol Biol</source><pubdate>1998</pubdate><volume>284</volume><fpage>1465</fpage><lpage>1478</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jmbi.1998.2233</pubid><pubid idtype="pmpid" link="fulltext">9878364</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Crystal structure of a hepatitis delta virus ribozyme</p></title><aug><au><snm>Ferre-D'Amare</snm><fnm>AR</fnm></au><au><snm>Zhou</snm><fnm>K</fnm></au><au><snm>Doudna</snm><fnm>JA</fnm></au></aug><source>Nature</source><pubdate>1998</pubdate><volume>395</volume><fpage>567</fpage><lpage>574</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/26912</pubid><pubid idtype="pmpid" link="fulltext">9783582</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>A conformational switch controls hepatitis delta virus ribozyme 
catalysis</p></title><aug><au><snm>Ke</snm><fnm>AL</fnm></au><au><snm>Zhou</snm><fnm>KH</fnm></au><au><snm>Ding</snm><fnm>F</fnm></au><au><snm>Cate</snm><fnm>JHD</fnm></au><au><snm>Doudna</snm><fnm>JA</fnm></au></aug><source>Nature</source><pubdate>2004</pubdate><volume>429</volume><fpage>201</fpage><lpage>205</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature02522</pubid><pubid idtype="pmpid" link="fulltext">15141216</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Specific radiation damage can be used to solve macromolecular crystal structures</p></title><aug><au><snm>Ravelli</snm><fnm>RBG</fnm></au><au><snm>Leiros</snm><fnm>HKS</fnm></au><au><snm>Pan</snm><fnm>BC</fnm></au><au><snm>Caffrey</snm><fnm>M</fnm></au><au><snm>McSweeney</snm><fnm>S</fnm></au></aug><source>Structure</source><pubdate>2003</pubdate><volume>11</volume><fpage>217</fpage><lpage>224</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0969-2126(03)00006-6</pubid><pubid idtype="pmpid" link="fulltext">12575941</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Structural basis for anticodon recognition by discriminating glutamyl-tRNA synthetase</p></title><aug><au><snm>Sekine</snm><fnm>S</fnm></au><au><snm>Nureki</snm><fnm>O</fnm></au><au><snm>Shimada</snm><fnm>A</fnm></au><au><snm>Vassylyev</snm><fnm>DG</fnm></au><au><snm>Yokoyama</snm><fnm>S</fnm></au></aug><source>Nat Struct Biol</source><pubdate>2001</pubdate><volume>8</volume><fpage>203</fpage><lpage>206</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/84927</pubid><pubid idtype="pmpid" link="fulltext">11224561</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>The structures of four macrolide antibiotics bound to the large ribosomal 
subunit</p></title><aug><au><snm>Hansen</snm><fnm>JL</fnm></au><au><snm>Ippolito</snm><fnm>JA</fnm></au><au><snm>Ban</snm><fnm>N</fnm></au><au><snm>Nissen</snm><fnm>P</fnm></au><au><snm>Moore</snm><fnm>PB</fnm></au><au><snm>Steitz</snm><fnm>TA</fnm></au></aug><source>Mol Cell</source><pubdate>2002</pubdate><volume>10</volume><fpage>117</fpage><lpage>128</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1097-2765(02)00570-1</pubid><pubid idtype="pmpid" link="fulltext">12150912</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Structural insights into peptide bond formation</p></title><aug><au><snm>Hansen</snm><fnm>JL</fnm></au><au><snm>Schmeing</snm><fnm>TM</fnm></au><au><snm>Moore</snm><fnm>PB</fnm></au><au><snm>Steitz</snm><fnm>TA</fnm></au></aug><source>Proc Natl Acad Sci U S A</source><pubdate>2002</pubdate><volume>99</volume><fpage>11670</fpage><lpage>11675</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.172404099</pubid><pubid idtype="pmcid">129327</pubid><pubid idtype="pmpid" link="fulltext">12185246</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>RNA tertiary interactions in the large ribosomal subunit: The A-minor motif</p></title><aug><au><snm>Nissen</snm><fnm>P</fnm></au><au><snm>Ippolito</snm><fnm>JA</fnm></au><au><snm>Ban</snm><fnm>N</fnm></au><au><snm>Moore</snm><fnm>PB</fnm></au><au><snm>Steitz</snm><fnm>TA</fnm></au></aug><source>Proc Natl Acad Sci U S A</source><pubdate>2001</pubdate><volume>98</volume><fpage>4899</fpage><lpage>4903</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.081082398</pubid><pubid idtype="pmcid">33135</pubid><pubid idtype="pmpid" link="fulltext">11296253</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Conservation of primary structure in 16S 
ribosomal-RNA</p></title><aug><au><snm>Woese</snm><fnm>CR</fnm></au><au><snm>Fox</snm><fnm>GE</fnm></au><au><snm>Zablen</snm><fnm>L</fnm></au><au><snm>Uchida</snm><fnm>T</fnm></au><au><snm>Bonen</snm><fnm>L</fnm></au><au><snm>Pechman</snm><fnm>K</fnm></au><au><snm>Lewis</snm><fnm>BJ</fnm></au><au><snm>Stahl</snm><fnm>D</fnm></au></aug><source>Nature</source><pubdate>1975</pubdate><volume>254</volume><fpage>83</fpage><lpage>86</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/254083a0</pubid><pubid idtype="pmpid">1089909</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures</p></title><aug><au><snm>Lu</snm><fnm>XJ</fnm></au><au><snm>Olson</snm><fnm>WK</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2003</pubdate><volume>31</volume><fpage>5108</fpage><lpage>5121</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkg680</pubid><pubid idtype="pmcid">212791</pubid><pubid idtype="pmpid" link="fulltext">12930962</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Shape-selective RNA recognition by cysteinyl-tRNA synthetase</p></title><aug><au><snm>Hauenstein</snm><fnm>S</fnm></au><au><snm>Zhang</snm><fnm>CM</fnm></au><au><snm>Hou</snm><fnm>YM</fnm></au><au><snm>Perona</snm><fnm>JJ</fnm></au></aug><source>Nat Struct Mol Biol</source><pubdate>2004</pubdate><volume>11</volume><fpage>1134</fpage><lpage>1141</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb849</pubid><pubid idtype="pmpid" link="fulltext">15489861</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>The structural basis of ribosome activity in peptide bond 
synthesis</p></title><aug><au><snm>Nissen</snm><fnm>P</fnm></au><au><snm>Hansen</snm><fnm>J</fnm></au><au><snm>Ban</snm><fnm>N</fnm></au><au><snm>Moore</snm><fnm>PB</fnm></au><au><snm>Steitz</snm><fnm>TA</fnm></au></aug><source>Science</source><pubdate>2000</pubdate><volume>289</volume><fpage>920</fpage><lpage>930</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.289.5481.920</pubid><pubid idtype="pmpid" link="fulltext">10937990</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Ribosome-catalyzed peptide-bond formation with an a-site substrate covalently linked to 23S ribosomal RNA</p></title><aug><au><snm>Green</snm><fnm>R</fnm></au><au><snm>Switzer</snm><fnm>C</fnm></au><au><snm>Noller</snm><fnm>HF</fnm></au></aug><source>Science</source><pubdate>1998</pubdate><volume>280</volume><fpage>286</fpage><lpage>289</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.280.5361.286</pubid><pubid idtype="pmpid" link="fulltext">9535658</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Essential mechanisms in the catalysis of peptide bond formation on the ribosome</p></title><aug><au><snm>Beringer</snm><fnm>M</fnm></au><au><snm>Bruell</snm><fnm>C</fnm></au><au><snm>Xiong</snm><fnm>LQ</fnm></au><au><snm>Pfister</snm><fnm>P</fnm></au><au><snm>Bieling</snm><fnm>P</fnm></au><au><snm>Katunin</snm><fnm>VI</fnm></au><au><snm>Mankin</snm><fnm>AS</fnm></au><au><snm>Bottger</snm><fnm>EC</fnm></au><au><snm>Rodnina</snm><fnm>MV</fnm></au></aug><source>J Biol Chem</source><pubdate>2005</pubdate><volume>280</volume><fpage>36065</fpage><lpage>36072</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M507961200</pubid><pubid idtype="pmpid" link="fulltext">16129670</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Peptide bond formation does not involve acid-base catalysis by ribosomal 
residues</p></title><aug><au><snm>Bieling</snm><fnm>P</fnm></au><au><snm>Beringer</snm><fnm>M</fnm></au><au><snm>Adio</snm><fnm>S</fnm></au><au><snm>Rodnina</snm><fnm>MV</fnm></au></aug><source>Nat Struct Mol Biol</source><pubdate>2006</pubdate><volume>13</volume><fpage>423</fpage><lpage>428</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb1091</pubid><pubid idtype="pmpid" link="fulltext">16648860</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>Structural insights into the roles of water and the 2 ' hydroxyl of the P site tRNA in the peptidyl transferase reaction</p></title><aug><au><snm>Schmeing</snm><fnm>TM</fnm></au><au><snm>Huang</snm><fnm>KS</fnm></au><au><snm>Kitchen</snm><fnm>DE</fnm></au><au><snm>Strobel</snm><fnm>SA</fnm></au><au><snm>Steitz</snm><fnm>TA</fnm></au></aug><source>Mol Cell</source><pubdate>2005</pubdate><volume>20</volume><fpage>437</fpage><lpage>448</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2005.09.006</pubid><pubid idtype="pmpid" link="fulltext">16285925</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>An induced-fit mechanism to promote peptide bond formation and exclude hydrolysis of peptidyl-tRNA</p></title><aug><au><snm>Schmeing</snm><fnm>TM</fnm></au><au><snm>Huang</snm><fnm>KS</fnm></au><au><snm>Strobel</snm><fnm>SA</fnm></au><au><snm>Steitz</snm><fnm>TA</fnm></au></aug><source>Nature</source><pubdate>2005</pubdate><volume>438</volume><fpage>520</fpage><lpage>524</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature04152</pubid><pubid idtype="pmpid" link="fulltext">16306996</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>The ribosomal peptidyl transferase center: Structure, function, evolution, inhibition</p></title><aug><au><snm>Polacek</snm><fnm>N</fnm></au><au><snm>Mankin</snm><fnm>AS</fnm></au></aug><source>Crit Rev Biochem Mol Biol</source><pubdate>2005</pubdate><volume>40</volume><fpage>285</fpage><lpage>311</lpage><xrefbib><pubidlist><pubid 
idtype="doi">10.1080/10409230500326334</pubid><pubid idtype="pmpid">16257828</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>The structural basis of large ribosomal subunit function</p></title><aug><au><snm>Moore</snm><fnm>PB</fnm></au><au><snm>Steitz</snm><fnm>TA</fnm></au></aug><source>Annu Rev Biochem</source><pubdate>2003</pubdate><volume>72</volume><fpage>813</fpage><lpage>850</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.biochem.72.110601.135450</pubid><pubid idtype="pmpid" link="fulltext">14527328</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>BPS: a database of RNA base-pair structures</p></title><aug><au><snm>Xin</snm><fnm>Y</fnm></au><au><snm>Olson</snm><fnm>WK</fnm></au></aug><source>Nucl Acids Res</source><pubdate>2009</pubdate><volume>37</volume><fpage>D83</fpage><lpage>88</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn676</pubid><pubid idtype="pmcid">2686499</pubid><pubid idtype="pmpid" link="fulltext">18845572</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Microenvironment analysis and identification of magnesium binding sites in RNA</p></title><aug><au><snm>Banatao</snm><fnm>DR</fnm></au><au><snm>Altman</snm><fnm>RB</fnm></au><au><snm>Klein</snm><fnm>TE</fnm></au></aug><source>Nucl Acids Res</source><pubdate>2003</pubdate><volume>31</volume><fpage>4450</fpage><lpage>4460</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkg471</pubid><pubid idtype="pmcid">169872</pubid><pubid idtype="pmpid" link="fulltext">12888505</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>SCOR: a structural classification of RNA database</p></title><aug><au><snm>Klosterman</snm><fnm>PS</fnm></au><au><snm>Tamura</snm><fnm>M</fnm></au><au><snm>Holbrook</snm><fnm>SR</fnm></au><au><snm>Brenner</snm><fnm>SE</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2002</pubdate><volume>30</volume><fpage>392</fpage><lpage>394</lpage><xrefbib><pubidlist><pubid 
idtype="doi">10.1093/nar/30.1.392</pubid><pubid idtype="pmcid">99131</pubid><pubid idtype="pmpid" link="fulltext">11752346</pubid></pubidlist></xrefbib></bibl></refgrp>
   </bm>
</art>