Email updates

Keep up to date with the latest news and content from BMC Genomics and BioMed Central.

Open Access Database

ArachnoServer: a database of protein toxins from spiders

David LA Wood12*, Tomas Miljenović3, Shuzhi Cai12, Robert J Raven4, Quentin Kaas3, Pierre Escoubas5, Volker Herzig3, David Wilson3 and Glenn F King3*

Author Affiliations

1 Queensland Facility for Advanced Bioinformatics, The University of Queensland, St Lucia, QLD 4072, Australia

2 ARC Centre of Excellence in Bioinformatics, The University of Queensland, St Lucia, QLD 4072, Australia

3 Institute for Molecular Bioscience, The University of Queensland, St Lucia, QLD 4072, Australia

4 Queensland Museum, Brisbane QLD 4101, Australia

5 Institut de Pharmacologie Moléculaire et Cellulaire, CNRS, 06560 Valbonne, France

For all author emails, please log on.

BMC Genomics 2009, 10:375  doi:10.1186/1471-2164-10-375

The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2164/10/375


Received:13 March 2009
Accepted:13 August 2009
Published:13 August 2009

© 2009 Wood et al; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Venomous animals incapacitate their prey using complex venoms that can contain hundreds of unique protein toxins. The realisation that many of these toxins may have pharmaceutical and insecticidal potential due to their remarkable potency and selectivity against target receptors has led to an explosion in the number of new toxins being discovered and characterised. From an evolutionary perspective, spiders are the most successful venomous animals and they maintain by far the largest pool of toxic peptides. However, at present, there are no databases dedicated to spider toxins and hence it is difficult to realise their full potential as drugs, insecticides, and pharmacological probes.

Description

We have developed ArachnoServer, a manually curated database that provides detailed information about proteinaceous toxins from spiders. Key features of ArachnoServer include a new molecular target ontology designed especially for venom toxins, the most up-to-date taxonomic information available, and a powerful advanced search interface. Toxin information can be browsed through dynamic trees, and each toxin has a dedicated page summarising all available information about its sequence, structure, and biological activity. ArachnoServer currently manages 567 protein sequences, 334 nucleic acid sequences, and 51 protein structures.

Conclusion

ArachnoServer provides a single source of high-quality information about proteinaceous spider toxins that will be an invaluable resource for pharmacologists, neuroscientists, toxinologists, medicinal chemists, ion channel scientists, clinicians, and structural biologists. ArachnoServer is available online at http://www.arachnoserver.org webcite.

Background

The realisation that many venomous creatures possess a complex repertoire of protein toxins with pharmacological, pharmaceutical, and agrochemical applications has led to an exponential increase in the number of new toxins being discovered [1]. Several electronic databases have been established to capture information about subsets of these toxins, such as ConoServer [2] and SCORPION2 [3], which provide information about toxins from marine cone snails and scorpions, respectively. Other databases provide information about all groups of animal toxins, such as Tox-Prot [4] and the Animal Toxin Database (ATDB) [5]. These databases allow cross-comparison of toxins from different animal groups but the individual toxin records typically lack the rich information content of manually curated databases.

Spiders are by far the largest group of venomous animals, with more than 40,000 extant species [6], and they maintain the largest repertoire of pharmacologically active peptides [7]. Some spider toxins have been used for almost two decades as pharmacological tools to probe the structure and function of ion channels and cell-surface receptors [8,9]. Other spider toxins are being developed as bioinsecticides [10-12] or providing leads for the development of drugs to treat a diverse range of pathophysiological states such as inflammatory and neuropathic pain [13], cardiac arrhythmia [14], and erectile dysfunction [15]. Remarkably, despite the incredible chemical diversity of spider venoms, there are no curated databases that deal specifically with proteinaceous toxins from these animals. Here we introduce ArachnoServer, a database designed to provide detailed and easily accessible information on the sequence, three-dimensional structure, and biological activity of spider toxins.

Construction and content

Data sources and integration

A non-redundant set of publicly available spider toxin sequences was sourced from Swiss-Prot [16] and GenBank [17], while experimentally determined protein structures were obtained from the Protein Data Bank [18]. Taxonomic information on spiders (Araneae) was retrieved from the World Spider Catalog [6], which has the advantage of providing more up-to-date taxonomy than NCBI, as well as historic taxonomic information that can be used in database searches. As the World Spider Catalog is under constant revision, care was taken to ensure future compatibility of the ArachnoServer data model with this catalog. Each species in ArachnoServer is assigned a unique life science identifier (LSID) which is linked to the equivalent LSID in an electronic version of the World Spider Catalog. In this way, the ArachnoServer database is automatically linked to taxonomic updates within the World Spider Catalog. High-resolution images of spiders, which are provided without restriction for academic use, were obtained from private collectors and have been used only in cases where both the species and sex could be verified.

A key feature of ArachnoServer is the molecular target ontology. The majority of spider toxins act on ion channels and cell-surface receptors [19], and consequently we developed a new molecular target ontology based on the channel and receptor subtype definitions and nomenclature recommended by the International Union of Basic & Clinical Pharmacology [20]. Additional descriptors were added for toxins that target invertebrate ion channels and receptors, cytolytic toxins, lectins, toxins that target enzymes, and toxins with enzymatic activity. This new ontology allows for the first time the retrieval of spider toxin information based on a precise definition of toxin specificity. For example, this ontology makes it facile to search the database for toxins that target a specific subtype of vertebrate ion channel or receptor. Finally, spider toxins often present diverse posttranslational modifications, and these are described in ArachnoServer using an ontology derived from ConoServer [2].

Data curation

Where appropriate, extra information from the literature was added to each imported toxin record and additional citations were incorporated using the PubMed EFetch service. Year of discovery was carefully curated according to the date on which a manuscript describing the sequence was first submitted for publication, or the date when the toxin sequence was first submitted to GenBank or SWISS-PROT or appeared in the patent literature; this information is often incorrect in the source databases. Toxins were assigned recommended names according to the recently described rational nomenclature for spider toxins [1] and all available literature synonyms were also attached to each toxin record.

ArachnoServer provides the ability to curate information specific to a particular toxin as well as data that are generically applicable. For example, curators can create, edit, or delete symbols and names for posttranslational modifications. Data sets available for curation include lists of posttranslational modifications, biological activities, sequence features (e.g., signal and propeptide sequences), the molecular target ontology, common names and images of spiders, and even the units used to describe LD50, PD50, and IC50 values. The types of experimental evidence for disulfide bonds and toxin pharmacophores (e.g., experimentally determined, by homology, or predicted) can also be managed by the curation team. This inherent flexibility should allow the database to grow in both size and functionality as the knowledge base about spider toxins continues to expand.

Application architecture

ArachnoServer is a Java Spring Model View Controller (MVC) application that uses a Hibernate Object Relational Mapping (ORM) layer to a MySQL database. All database tables are mapped to Plain Old Java Object (POJO) serializable beans. Hibernate manages the writing, updating, and deleting of data from the database tables according to changes in the POJO data, instigated by events triggered from the view and propagated through to the Spring service layer. Using the Spring architecture with ORM mapping, the application and data model can be easily extended or modified, as changes to the data model do not require SQL changes (Hibernate transparently creates all SQL statements using mapped POJOs).

Utility and discussion

Interface and visualisation

ArachnoServer is presented using a graphically designed CSS skin that can be easily swapped for other skins in future releases if required. The front page provides statistics about the total number of toxins available, their taxonomic distribution, and year of discovery (Fig. 1). Individual toxin records can be accessed through basic or advanced searches or via the browse function, all of which are described below.

thumbnailFigure 1. ArachnoServer interface. The front page of ArachnoServer provides a brief introduction to the database, information about performing searches, and details of the total number of available toxins, their taxonomic distribution, and year of discovery. Note that basic searches can be performed from every page within the database using the search bar in the ArachnoServer banner.

The spider toxin record (STR) card summarises all information about a toxin in a clean and intuitive interface (Fig. 2). A description of the toxin's biological activity is provided at the top of the page followed by a dynamically generated image that displays the mature toxin sequence along with information about disulfide bond connectivities, pharmacophore residues, and post-translational modifications. Mousing over the image displays additional information, including the position in the sequence and full residue name. The expandable sections below the image were carefully designed to allow biologists working in different disciplines to find the desired information without being distracted by extraneous data. For example, pharmacologists interested in the molecular target of the toxin but not other details can click on the Biological Activity tab in order to find this specific information, whereas structural biologists and medicinal chemists may be more interested in accessing details of the structure via the Toxin Structure section, which allows the structure to be viewed in interactive 3D mode via a Jmol applet http://www.jmol.org webcite. For all toxins, a colour-coded image is displayed in the Protein Information section that provides a quick overview of the signal sequence, propeptide, and mature toxin regions. Other tabs provide information on the spider taxonomy (Taxonomy), links to external databases (Accessions), links to relevant literature (Literature References), sequences in FASTA format (Sequences), and alternative toxin names (Toxin Synonyms).

thumbnailFigure 2. Example of a Spider Toxin Record (STR). STR for δ-hexatoxin-Ar1a, the lethal toxin from the Sydney funnel-web spider Atrax robustus. The top section of the STR provides a summary of the toxin's activity, the source species, and year of discovery. The dynamically generated toxin image displays the mature toxin sequence and, where available, its disulfide-bond framework, pharmacophore residues, and posttranslational modifications. Below this image are additional user-expandable sections that provide information about the toxin's molecular target, phyletic specificity, and 3D structure, as well as literature references, toxin synonyms, and both current and historic taxonomy of the source species. Clicking the spider photo yields a downloadable high-resolution image.

Basic and advanced searches

ArachnoServer supports basic searches and advanced searches. Basic searches query toxin names (including synonyms), taxonomic information (including historic taxonomy), common names of spiders, and toxin families. Basic searches can be performed from every page using the search bar in the ArachnoServer banner.

A key feature that distinguishes ArachnoServer from other toxin databases is the advanced search feature (Fig. 3). Context-specific fields are dynamically arranged on the page, and search keys populated in drop-down lists of relevant data from the database. Search clauses can be grouped, then joined using boolean operators. Most database fields can be searched, including all basic search fields, as well as additional features such as biological activity, posttranslational modifications, literature references, and counts of various data fields (e.g., the number of disulfide bonds in a toxin). Results from both basic and advanced searches are displayed as paginated tables that can be exported in XML or PDF format. FASTA sequences for toxins from any set of search results can also be exported in plain text format.

thumbnailFigure 3. Advanced searches in ArachnoServer. An example of the advanced search feature in ArachnoServer, which can be used to search the database using a combination of criteria (see text for details). Only the first four toxins from the search results are displayed in (A) and (B) for the sake of brevity. Clicking on the toxin name (highlighted in blue) opens up the corresponding spider toxin record. Results can be exported in XML, PDF, or FASTA format. A help button is available in the top right corner of the search page.

The advanced search feature provides a convenient way for scientists to locate records of immediate interest. For example, a medicinal chemist or microbiologist focused on infectious diseases might be particularly interested in toxins that have antimicrobial activity. Using the advanced search feature to search for toxins with antimicrobial activity reveals that the database currently contains 39 such toxins (Fig. 3A). However, this search could be restricted according to many different criteria, such as the year of discovery, activity against specific bacteria or parasites, or the number of disulfide bonds and posttranslational modifications in the toxin. For example, adding a requirement for activity against Escherichia coli limits the search result to 11 toxins (Fig. 3B). The medicinal chemist might be particularly interested in the subset of these toxins for which a 3D structure has been determined. Adding this criterion limits the search result to two toxins (Fig. 3C), for which the 3D structure can then be viewed from the individual spider toxin records.

Browsing

Browsing of the toxin records is also possible through the browse page. Dynamic trees are populated with folders of toxins that match the grouping criteria. Users can browse the spider taxonomic tree and view all toxins for a given taxon. Similarly, the molecular target ontology and posttranslational modifications can be browsed. Fig. 4 shows an example where the molecular ontology browse function has been used to locate all toxins in the database that are known to target invertebrate voltage-gated calcium channels. Note that a complete list of FASTA sequences can be obtained from browse results.

thumbnailFigure 4. Browsing based on molecular ontology. An example of the browse molecular ontology function in ArachnoServer, which in this example has been used to locate all toxins in the database that are known to act on invertebrate voltage-gated calcium channels.

Sequence similarity searches

Similarity searches using BLAST [21] are available through the BLAST page and also from each of the sequence records in the STR page. Both protein and nucleic databases can be searched using the programs blastn, blastp, blastx, tblastn, and tblastx. Hits from toxins in the database are linked directly from the BLAST result page to the STR cards in ArachnoServer, and, where available, back to the original sources in GenBank or Swiss-Prot.

Conclusion

ArachnoServer is a web-based resource that provides comprehensive information about protein toxins from spider venoms. The combination of specific classification schemes and a rich user interface allows researchers to easily locate and view information on the sequence, structure, and biological activity of these toxins. This manually curated database will be a valuable resource for both basic researchers as well as those interested in potential pharmaceutical and agricultural applications of spider toxins.

Availability and requirements

ArachnoServer is freely available at http://www.arachnoserver.org webcite. ArachnoServer is platform independent and supports most browsers including Firefox (Version 3 and higher), Safari (Ver. 3.2 and higher), Google Chrome (Ver. 1 and higher), Internet Explorer (Ver. 7 and higher), and Opera (Ver. 9.6 and higher). A java plug-in is required in order to view 3D protein structures.

Authors' contributions

GFK and DLAW devised the ArachnoServer concept and wrote the manuscript. All authors were involved in formulating specifications, application design, and testing. DLAW was responsible for data integration, database design, and application implementation with assistance from TM and SC. SC also worked on application architecture. GFK and QK developed the molecular target and posttranslational modification ontologies, respectively. RR was responsible for integration of taxonomy from the World Spider Catalog. DW, VH, and GFK were responsible for database curation. PE was primary beta tester. All authors read the final manuscript

Acknowledgements

This work was supported by Discovery Grant DP0774245 from the Australian Research Council to G.F.K. and a Queensland State Government National and International Alliances Program Grant to QFAB. We thank Nick Peall for design of the ArachnoServer skin, Dominique Gorse for project assistance, and Bastian Rast for high-resolution spider images.

References

  1. King GF, Gentz MC, Escoubas P, Nicholson GM: A rational nomenclature for naming peptide toxins from spiders and other venomous animals.

    Toxicon 2008, 52:264-276. PubMed Abstract | Publisher Full Text OpenURL

  2. Kaas Q, Westermann JC, Halai R, Wang CK, Craik DJ: ConoServer, a database for conopeptide sequences and structures.

    Bioinformatics 2008, 24:445-446. PubMed Abstract | Publisher Full Text OpenURL

  3. Tan PT, Veeramani A, Srinivasan KN, Ranganathan S, Brusic V: SCORPION2: a database for structure-function analysis of scorpion toxins.

    Toxicon 2006, 47:356-363. PubMed Abstract | Publisher Full Text OpenURL

  4. Jungo F, Bairoch A: Tox-Prot, the toxin protein annotation program of the Swiss-Prot protein knowledgebase.

    Toxicon 2005, 45:293-301. PubMed Abstract | Publisher Full Text OpenURL

  5. He QY, He QZ, Deng XC, Yao L, Meng E, Liu ZH, Liang SP: ATDB: a uni-database platform for animal toxins.

    Nucleic Acids Res 2008, 36:D293-D297. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  6. Platnick NI: Advances in spider taxonomy, 1992–1995: with redescriptions 1940–1980. [http://research.amnh.org/entomology/spiders/catalog/] webcite

    New York: New York Entomological Society & The American Museum of Natural History; 1997.

  7. Escoubas P, Sollod BL, King GF: Venom landscapes: mining the complexity of spider venoms via a combined cDNA and mass spectrometric approach.

    Toxicon 2006, 47:650-663. PubMed Abstract | Publisher Full Text OpenURL

  8. Adams ME: Agatoxins: ion channel specific toxins from the American funnel web spider, Agelenopsis aperta.

    Toxicon 2004, 43:509-525. PubMed Abstract | Publisher Full Text OpenURL

  9. Ushkaryov YA, Volynski KE, Ashton AC: The multiple actions of black widow spider toxins and their selective use in neurosecretion studies.

    Toxicon 2004, 43:527-542. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  10. Tedford HW, Sollod BL, Maggio F, King GF: Australian funnel-web spiders: master insecticide chemists.

    Toxicon 2004, 43:601-618. PubMed Abstract | Publisher Full Text OpenURL

  11. Fitches E, Edwards MG, Mee C, Grishin E, Gatehouse AM, Edwards JP, Gatehouse JA: Fusion proteins containing insect-specific toxins as pest control agents: snowdrop lectin delivers fused insecticidal spider venom toxin to insect haemolymph following oral ingestion.

    J Insect Physiol 2004, 50:61-71. PubMed Abstract | Publisher Full Text OpenURL

  12. Khan SA, Zafar Y, Briddon RW, Malik KA, Mukhtar Z: Spider venom toxin protects plants from insect attack.

    Transgenic Res 2006, 15:349-357. PubMed Abstract | Publisher Full Text OpenURL

  13. Mazzuca M, Heurteaux C, Alloui A, Diochot S, Baron A, Voilley N, Blondeau N, Escoubas P, Gelot A, Cupo A, et al.: A tarantula peptide against pain via ASIC1a channels and opioid mechanisms.

    Nat Neurosci 2007, 10:943-945. PubMed Abstract | Publisher Full Text OpenURL

  14. Bode F, Sachs F, Franz MR: Tarantula peptide inhibits atrial fibrillation.

    Nature 2001, 409:35-36. PubMed Abstract | Publisher Full Text OpenURL

  15. Nunes KP, Costa-Gonçalves A, Lanza LF, Cortes SF, Cordeiro MN, Richardson M, Pimenta AM, Webb RC, Leite R, De Lima ME: Tx2-6 toxin of the Phoneutria nigriventer spider potentiates rat erectile function.

    Toxicon 2008, 51:1197-1206. PubMed Abstract | Publisher Full Text OpenURL

  16. Boeckmann B, Bairoch A, Apweiler R, Blatter MC, Estreicher A, Gasteiger E, Martin MJ, Michoud K, O'Donovan C, Phan I, et al.: The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003.

    Nucleic Acids Res 2003, 31:365-370. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  17. Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL: GenBank.

    Nucleic Acids Res 2008, 36:D25-D30. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  18. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank.

    Nucleic Acids Res 2000, 28:235-242. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  19. Sollod BL, Wilson D, Zhaxybayeva O, Gogarten JP, Drinkwater R, King GF: Were arachnids the first to use combinatorial peptide libraries?

    Peptides 2005, 26:131-139. PubMed Abstract | Publisher Full Text OpenURL

  20. Alexander SPH, Mathie A, Peters JA: Guide to receptors and channels (GRAC), (2008).

    Br J Pharmacol 3rd edition. 2008, 153(Suppl 2):S1-S209. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  21. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

    Nucleic Acids Res 1997, 25:3389-3402. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL