This article is part of the supplement: Semantic e-Science in Biomedicine
AlzPharm: integration of neurodegeneration data using RDF
1 Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
2 Center for Medical Informatics, Yale University, New Haven, CT, USA
3 Department of Anesthesiology, Yale University, New Haven, CT, USA
4 Department of Neurobiology, Yale University, New Haven, CT, USA
5 Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT, USA
6 Department of Genetics, Yale University, New Haven, CT, USA
7 Department of Computer Science, Yale University, New Haven, CT, USA
8 Initiative in Innovative Computing, Harvard University, Cambridge, MA, USA
9 Massachusetts General Hospital, Boston, MA, USA
10 Alzheimer Research Forum
11 Oracle, Burlington, MA, USA
BMC Bioinformatics 2007, 8(Suppl 3):S4. doi:10.1186/1471-2105-8-S3-S4. Published: 9 May 2007
Neuroscientists often need to access a wide range of data sets distributed over the Internet. These data sets, however, are typically neither integrated nor interoperable, resulting in a barrier to answering complex neuroscience research questions. Domain ontologies can enable the querying of heterogeneous data sets, but they are not sufficient for neuroscience since the data of interest commonly span multiple research domains. To this end, e-Neuroscience seeks to provide an integrated platform for neuroscientists to discover new knowledge through seamless integration of the very diverse types of neuroscience data. Here we present a Semantic Web approach to building this e-Neuroscience framework by using the Resource Description Framework (RDF) and its vocabulary description language, RDF Schema (RDFS), as a standard data model to facilitate both representation and integration of the data.
We have constructed a pilot ontology for BrainPharm (a subset of SenseLab) using RDFS and then converted a subset of the BrainPharm data into RDF according to the ontological structure. We have also integrated the converted BrainPharm data with existing RDF hypothesis and publication data from a pilot version of SWAN (Semantic Web Applications in Neuromedicine). Our implementation uses the RDF Data Model in Oracle Database 10g release 2 for data integration, query, and inference, while our Web interface allows users to query the data and retrieve the results in a convenient fashion.
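As a rough illustration of the idea described above (merging independently created triple sets, applying RDFS-style inference, and querying the result), the following minimal sketch uses plain Python sets rather than the Oracle RDF Data Model used in the actual implementation. Every identifier in it (the `ex:` prefix, `ex:Drug`, `ex:drugX`, `ex:hypothesis1`, and the helper functions `rdfs_closure` and `match`) is a hypothetical placeholder and not an actual BrainPharm or SWAN term; the toy triples are invented purely to make the example run.

```python
# Toy in-memory triple store. All names are made-up placeholders,
# not real BrainPharm or SWAN vocabulary or data.
RDF_TYPE = "rdf:type"
SUBCLASS = "rdfs:subClassOf"

# Hypothetical schema and instance triples from one source.
source_a = {
    ("ex:Drug", SUBCLASS, "ex:Agent"),
    ("ex:drugX", RDF_TYPE, "ex:Drug"),
    ("ex:drugX", "ex:actsOn", "ex:receptorY"),
}

# Hypothetical hypothesis/publication triples from a second source.
source_b = {
    ("ex:hypothesis1", "ex:mentions", "ex:receptorY"),
    ("ex:paper1", "ex:supports", "ex:hypothesis1"),
}


def rdfs_closure(triples):
    """Apply two RDFS entailment rules until no new triples appear:
    subclass transitivity (rdfs11) and type inheritance (rdfs9)."""
    inferred = set(triples)
    while True:
        new = set()
        for (a, p1, b) in inferred:
            if p1 != SUBCLASS:
                continue
            for (c, p2, d) in inferred:
                # rdfs11: a subClassOf b, b subClassOf d => a subClassOf d
                if p2 == SUBCLASS and c == b:
                    new.add((a, SUBCLASS, d))
                # rdfs9: a subClassOf b, c type a => c type b
                if p2 == RDF_TYPE and d == a:
                    new.add((c, RDF_TYPE, b))
        if new <= inferred:
            return inferred
        inferred |= new


def match(triples, s=None, p=None, o=None):
    """Return triples matching a pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


# Integration is a set union because both sources share one data model.
graph = rdfs_closure(source_a | source_b)

# Query using an inferred fact: drugX is an Agent only via the subclass rule.
agents = sorted(s for (s, _, _) in match(graph, p=RDF_TYPE, o="ex:Agent"))

# Cross-source join: hypotheses that mention a target of some agent.
linked = sorted(
    (agent, hyp)
    for agent in agents
    for (_, _, target) in match(graph, s=agent, p="ex:actsOn")
    for (hyp, _, _) in match(graph, p="ex:mentions", o=target)
)
print(linked)
```

The point of the sketch is that the cross-source join only succeeds because both triple sets refer to the same resource identifier (`ex:receptorY` here), and that the query over `ex:Agent` depends on a triple that was inferred rather than asserted.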
Accessing and integrating biomedical data that cut across multiple disciplines will be increasingly indispensable and beneficial to neuroscience researchers. The Semantic Web approach we undertook has demonstrated a promising way to semantically integrate independently created data sets. It also shows how advanced queries and inferences, both of which are hard to achieve using traditional data integration approaches, can be performed over the integrated data. Our pilot results suggest that our Semantic Web approach is suitable for realizing e-Neuroscience and generic enough to be applied in other biomedical fields.