<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1752-0509-5-S2-S10</ui><ji>1752-0509</ji><fm>
<dochead>Proceedings</dochead>
<bibl>
<title>
<p>SigCS base: an integrated genetic information resource for human cerebral stroke</p>
</title>
<aug>
<au id="A1"><snm>Park</snm><fnm>Young-Kyu</fnm><insr iid="I1"/><insr iid="I2"/><email>ykpark@kribb.re.kr</email></au>
<au id="A2"><snm>Bang</snm><mnm>Sun</mnm><fnm>Ok</fnm><insr iid="I3"/><email>osbang@kiom.re.kr</email></au>
<au id="A3"><snm>Cha</snm><fnm>Min-Ho</fnm><insr iid="I3"/><email>mhchamin@kiom.re.kr</email></au>
<au id="A4"><snm>Kim</snm><fnm>Jaeheup</fnm><insr iid="I4"/><email>jaeheup@gmail.com</email></au>
<au id="A5"><snm>Cole</snm><mi>W</mi><fnm>John</fnm><insr iid="I5"/><email>jcole@som.umaryland.edu</email></au>
<au ca="yes" id="A6"><snm>Lee</snm><fnm>Doheon</fnm><insr iid="I2"/><email>dhlee@biosoft.kaist.ac.kr</email></au>
<au ca="yes" id="A7"><snm>Kim</snm><mnm>Joo</mnm><fnm>Young</fnm><insr iid="I6"/><email>yjkim8@kribb.re.kr</email></au>
</aug>
<insg>
<ins id="I1"><p>Medical Genome Research Center, KRIBB, Daejeon 305-806, Korea</p></ins>
<ins id="I2"><p>Department of Bio and Brain Engineering, KAIST, Daejeon 305-701, Korea</p></ins>
<ins id="I3"><p>Department of Medical Research, KIOM, Daejeon 305-811, Korea</p></ins>
<ins id="I4"><p>Cogent Biotechnology Inc., Rockville, MD 20850, USA</p></ins>
<ins id="I5"><p>Maryland Stroke Center, Department of Neurology, Baltimore Veterans Affairs Medical Center and the University of Maryland School of Medicine, Baltimore MD 21201-1559, USA</p></ins>
<ins id="I6"><p>Genome Resource Center, KRIBB, Daejeon 305-806, Korea</p></ins>
</insg>
<source>BMC Systems Biology</source>


<supplement><title><p>22nd International Conference on Genome Informatics: Systems Biology</p></title><editor>Doheon Lee, Xuegong Zhang, and KiYoung Lee</editor><note>Proceedings</note><url>1752-0509-5-S2.pdf</url></supplement><conference><title><p>22nd International Conference on Genome Informatics</p></title><location>Busan, Korea</location><date-range>5-7 December 2011</date-range><url>http://www.kobic.re.kr/giw2011/</url></conference><issn>1752-0509</issn>
<pubdate>2011</pubdate>
<volume>5</volume>
<issue>Suppl 2</issue>
<fpage>S10</fpage>
<url>http://www.biomedcentral.com/1752-0509/5/S2/S10</url>
<xrefbib><pubid idtype="doi">10.1186/1752-0509-5-S2-S10</pubid></xrefbib>
</bibl>
<history><pub><date><day>14</day><month>12</month><year>2011</year></date></pub></history>
<cpyrt><year>2011</year><collab>Park et al; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>To understand how stroke risk factors mechanistically contribute to stroke, the genetic components regulating each risk factor need to be integrated and evaluated with respect to biological function and through pathway-based algorithms. This resource will provide information to researchers studying the molecular and genetic causes of stroke in terms of genomic variants, genes, and pathways.</p>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<p>Reported genetic variants, gene structure, phenotypes, and literature information regarding stroke were collected and extracted from publicly available databases describing variants, genome, proteome, functional annotation, and disease subtypes. Stroke related candidate pathways and etiologic genes that participate significantly in risk were analyzed in terms of canonical pathways in public biological pathway databases. These efforts resulted in a relational database of genetic signals of cerebral stroke, SigCS base, which implements an effective web retrieval system.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>The current version of SigCS base documents 1943 non-redundant genes with 11472 genetic variants and 165 non-redundant pathways. The web retrieval system of SigCS base consists of two principal search flows, including: 1) a gene-based variant search using gene table browsing or a keyword search, and, 2) a pathway-based variant search using pathway table browsing. SigCS base is freely accessible at <url>http://sysbio.kribb.re.kr/sigcs</url>.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>SigCS base is an effective tool that can assist researchers in the identification of the genetic factors associated with stroke by utilizing existing literature information, selecting candidate genes and variants for experimental studies, and examining the pathways that contribute to the pathophysiological mechanisms of stroke.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Introduction</p>
</st>
<p>Stroke is a heterogeneous complex disease that results from the interaction between genetic and environmental risk factors and has many well-established etiologies <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B5">5</abbr>
</abbrgrp>. It is the second most common cause of death worldwide and is a major cause of acquired disability in survivors <abbrgrp>
<abbr bid="B5">5</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. Environmental risk factors include smoking, alcohol intake, lack of physical activity, poor diet, and psychosocial stress/depression. Other established risk factors include hypertension, obesity, diabetes mellitus, and cardiovascular disease <abbrgrp>
<abbr bid="B2">2</abbr>
<abbr bid="B5">5</abbr>
</abbrgrp>. Several etiological risk factors including arteriovenous malformations, atherosclerosis, blood coagulation, sex hormone effects, hyperlipidemia, homocysteine, and inflammation have also been described <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B5">5</abbr>
</abbrgrp>.</p>
<p>Many genetic studies evaluating stroke pathophysiology and mechanisms have been conducted, resulting in hundreds of genes that appear to be associated with stroke. Most stroke-related genetic variants are causal markers derived using candidate gene approaches <abbrgrp>
<abbr bid="B7">7</abbr>
<abbr bid="B8">8</abbr>
</abbrgrp>. Stroke-related genetic loci have recently been identified implementing genome-wide association (GWA) studies <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
</abbrgrp>. Further, genetic variants related to other stroke etiologies, such as aneurysms <abbrgrp>
<abbr bid="B11">11</abbr>
<abbr bid="B12">12</abbr>
</abbrgrp> and diabetes <abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>, have also been reported.</p>
<p>To understand the contribution of the various risk factors to the mechanism of stroke, the genetic basis of each risk factor must be analyzed and integrated, in terms of biological function and pathway relationships. To this end, an efficient database system that integrates genetic variants and annotated information on stroke and its etiologies is required. Currently, there is no published or established database for this purpose, except the previous work by the author&#8217;s research group, StrokeBase <abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp>. In the previous work, cerebrovasular disease-related genes were collected from public databases, text-mining works with the goal to expand the functional information by using protein-protein interaction data and SNP-based genome-wide association study results. However, the system was substantially less informative due to an insufficient number of candidate genes, a lack of cross-links between the different types of the information, and inefficient user interfaces for the retrieval system.</p>
<p>Here we introduce SigCS base, an integrated genetic information resource for human cerebral stroke. SigCS base will allow researchers in molecular biology and genetics who study the causes and mechanisms of cerebral stroke to efficiently evaluate thousands of genomic variants and genes that are associated with stroke and its etiologies, as well as relevant annotated data from public molecular biology databases. By using the various retrieval features of SigCS base, researchers will be able to effectively refer to this information to select candidate genes and variants for their studies, to compare their genetic factors with previously reported results, and to examine the pathways that contribute to the pathological mechanism of stroke by comparing them between stroke and other etiologies.</p>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<sec>
<st>
<p>Data source and processing</p>
</st>
<p>Online Mendelian Inheritance in Man (OMIM) <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp> and Universal Protein Resource (UniProt) <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> were used to retrieve information on stroke- and etiology-related genetic variants; dbSNP <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp> for SNP information; UCSC genome <abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp> for gene structure information, including transcript, exon/intron, coding region, and functional element data on SNPs; HUGO Gene Nomenclature Committee (HGNC) database <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp> for standard gene names; Molecular Signatures Database (MSigDB) <abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp> for pathway and functional gene set information; and OMIM for literature information and PubMed links as raw data sources.</p>
<p>Eleven stroke etiologies/risk factors were selected: hypertension <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
<abbr bid="B21">21</abbr>
<abbr bid="B22">22</abbr>
</abbrgrp>, obesity <abbrgrp>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
<abbr bid="B23">23</abbr>
</abbrgrp>&#8212;known to contribute to both ischemic and hemorrhagic stroke <abbrgrp>
<abbr bid="B5">5</abbr>
</abbrgrp>&#8212;type 2 diabetes mellitus <abbrgrp>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
<abbr bid="B5">5</abbr>
<abbr bid="B22">22</abbr>
</abbrgrp>, hyperlipidemia <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B5">5</abbr>
<abbr bid="B22">22</abbr>
</abbrgrp>, atherosclerosis <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B24">24</abbr>
</abbrgrp>, blood coagulation <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B25">25</abbr>
</abbrgrp>, vascular inflammation <abbrgrp>
<abbr bid="B4">4</abbr>
<abbr bid="B26">26</abbr>
<abbr bid="B27">27</abbr>
</abbrgrp>, estrogen effect <abbrgrp>
<abbr bid="B28">28</abbr>
<abbr bid="B29">29</abbr>
</abbrgrp>, hyperhomocysteinemia <abbrgrp>
<abbr bid="B4">4</abbr>
<abbr bid="B25">25</abbr>
<abbr bid="B30">30</abbr>
</abbrgrp> for ischemic stroke, intracranial aneurysm <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B31">31</abbr>
</abbrgrp>, and arteriovenous malformations <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B31">31</abbr>
<abbr bid="B32">32</abbr>
</abbrgrp> for hemorrhagic stroke. Then, to construct a variant and gene set, we downloaded stroke- and stroke etiology-related records in XML format from the OMIM database through a refined keyword search (Table <tblr tid="T1">1</tblr>) and parsed them.</p>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>Keywords used to retrieve OMIM database for stroke and its etiology related records</p></caption><tblbdy cols="3">
      <r>
         <c ca="center">
            <p>Stroke or its etiology</p>
         </c>
         <c ca="center">
            <p>Keywords for OMIM search</p>
         </c>
         <c ca="center">
            <p>Etiology Code</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Atherosclerosis</p>
         </c>
         <c ca="center">
            <p>Atherosclerosis; Atheroscleroses; Fatty Streak, Arterial; Arterial Fatty Streak; Streak, Arterial Fatty; Atheroma; Atherogenesis</p>
         </c>
         <c ca="center">
            <p>ATSC</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Blood coagulation</p>
         </c>
         <c ca="center">
            <p>Blood Coagulation; Blood Clotting</p>
         </c>
         <c ca="center">
            <p>COAG</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Cardiovascular Disease</p>
         </c>
         <c ca="center">
            <p>Cardiovascular Disease</p>
         </c>
         <c ca="center">
            <p>CAVD</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Diabetes mellitus, type 2</p>
         </c>
         <c ca="center">
            <p>Ketosis-Resistant Diabetes Mellitus; Non-Insulin-Dependent Diabetes Mellitus; Non Insulin Dependent Diabetes Mellitus; Noninsulin Dependent Diabetes Mellitus; Type 2 Diabetes Mellitus; Slow-Onset Diabetes Mellitus; Slow Onset Diabetes Mellitus; Stable Diabetes Mellitus; Diabetes Mellitus, Type II; Maturity-Onset Diabetes Mellitus; Maturity Onset Diabetes Mellitus; MODY; NIDDM; Adult-Onset Diabetes Mellitus</p>
         </c>
         <c ca="center">
            <p>T2DM</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Estrogen effect</p>
         </c>
         <c ca="center">
            <p>Estrogen</p>
         </c>
         <c ca="center">
            <p>ESTR</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Homocysteine</p>
         </c>
         <c ca="center">
            <p>Homocysteine</p>
         </c>
         <c ca="center">
            <p>HCYS</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Hyperlipidemia</p>
         </c>
         <c ca="center">
            <p>Hyperlipidemia; Hyperlipemia; Hyperlipidemia; Lipidemia; Lipemia</p>
         </c>
         <c ca="center">
            <p>HLIP</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Hypertension</p>
         </c>
         <c ca="center">
            <p>Hypertension; High Blood Pressure</p>
         </c>
         <c ca="center">
            <p>HTNS</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Vascular inflammation/Vasculitis</p>
         </c>
         <c ca="center">
            <p>Inflammation</p>
         </c>
         <c ca="center">
            <p>INFL</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Intracranial arteriovenous malformations (AVM); Aneurysm</p>
         </c>
         <c ca="center">
            <p>Intracranial Aneurysm; Basilar Artery Aneurysm; Cerebral Artery Aneurysm; Berry Aneurysm; Brain Aneurysm; Cerebral Aneurysm; Intracranial Mycotic Aneurysm; Communicating Artery Aneurysm; Intracranial Arteriovenous Malformation; AVM Intracranial; Intracranial AVM; Cerebral Arteriovenous Malformation; Ruptured Intracranial Arteriovenous Malformation</p>
         </c>
         <c ca="center">
            <p>IAVM</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Obesity</p>
         </c>
         <c ca="center">
            <p>Obesity</p>
         </c>
         <c ca="center">
            <p>OBES</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Stroke</p>
         </c>
         <c ca="center">
            <p>Stroke; Cerebral Stroke; Brain Vascular Accident; Cerebrovascular Apoplexy; Cerebrovascular Stroke; CVA; Cerebrovascular Accident; Apoplexy</p>
         </c>
         <c ca="center">
            <p>STRK</p>
         </c>
      </r>
   </tblbdy></tbl>
<p>To enhance the variant information, we extracted variant, phenotype, and literature data from the downloaded flat files of UniProt. To add the functional significance of each variant, we extracted gene structural annotations and SNP functional annotations from a downloaded flat file from UCSC genome. We extracted the flank sequences of SNPs from the dbSNP flat file to help researchers' experimental studies.</p>
<p>All gene symbol data was converted to the standard gene symbols of HGNC to ensure compatibility between the datasets.</p>
</sec>
<sec>
<st>
<p>Pathway analysis</p>
</st>
<p>To attain insight into the biological functions and pathological mechanisms of stroke and its etiologies, we analyzed the biological pathways that significantly overlapped with the curated stroke and etiology gene sets. For this, we counted common genes between the stroke and etiology gene sets and the genes in known canonical pathways in MSigDB <abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp> and performed statistical testing to assess the significance of the overlaps. An one-tailed version of Fisher&#8217;s exact test (<url>http://en.wikipedia.org/wiki/Hypergeometric_distribution</url>) based on the hypergeometric distribution (k, K, n, N) of k overlapping genes in a user gene set of n genes and the pathway having K genes in a total gene space of N genes was used for the statistical test. An adjustment for multiple testing followed by false discovery rate evaluation was performed. For these analyses 639 pathways with including 5385 genes from BioCarta (<url>http://www.biocarta.com/</url>), KEGG (<url>http://www.genome.jp/kegg/</url>), and Reactome (<url>http://www.reactome.org/</url>) databases were implemented.</p>
</sec>
<sec>
<st>
<p>Database implementation</p>
</st>
<p>A relational database using MySQL was constructed implementing a web retrieval system in the PHP language. To manage and service all the derived information an Apache web server on a Linux platform (Dell PowerEdge Server with 2 2.66 GHz Intel Quad Core Xeon CPUs, 32 GB memory, and 6TB SAS hard drive) was implemented.</p>
</sec>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>The current version of SigCS base documents 1943 non-redundant genes with 11472 genetic variants and 165 non-redundant pathways (Table <tblr tid="T2">2</tblr>). The web retrieval system of SigCS base consists of two principal search flows: 1) a gene-based variant search using gene table browsing or keyword search, 2) a pathway-based variant search using pathway table browsing. SigCS base is freely accessible at <url>http://sysbio.kribb.re.kr/sigcs</url>.</p>
<tbl id="T2"><title><p>Table 2</p></title><caption><p>Statistics on the information in SigCS base</p></caption><tblbdy cols="5">
      <r>
         <c ca="center">
            <p>No.</p>
         </c>
         <c ca="center">
            <p>Stroke or Etiology</p>
         </c>
         <c ca="center">
            <p># of Genes</p>
         </c>
         <c ca="center">
            <p># of Variants</p>
         </c>
         <c ca="center">
            <p># of Sig. Pathways<sup>*</sup></p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>Stroke</p>
         </c>
         <c ca="center">
            <p>236</p>
         </c>
         <c ca="center">
            <p>1609</p>
         </c>
         <c ca="center">
            <p>18</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>Atherosclerosis</p>
         </c>
         <c ca="center">
            <p>200</p>
         </c>
         <c ca="center">
            <p>891</p>
         </c>
         <c ca="center">
            <p>19</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>Cardiovascular disease</p>
         </c>
         <c ca="center">
            <p>143</p>
         </c>
         <c ca="center">
            <p>501</p>
         </c>
         <c ca="center">
            <p>9</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>Coagulation, blood</p>
         </c>
         <c ca="center">
            <p>47</p>
         </c>
         <c ca="center">
            <p>732</p>
         </c>
         <c ca="center">
            <p>19</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>5</p>
         </c>
         <c ca="center">
            <p>Estrogen</p>
         </c>
         <c ca="center">
            <p>353</p>
         </c>
         <c ca="center">
            <p>851</p>
         </c>
         <c ca="center">
            <p>6</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>6</p>
         </c>
         <c ca="center">
            <p>Hypercysteinemia</p>
         </c>
         <c ca="center">
            <p>90</p>
         </c>
         <c ca="center">
            <p>118</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>7</p>
         </c>
         <c ca="center">
            <p>Hyperlipidemia</p>
         </c>
         <c ca="center">
            <p>102</p>
         </c>
         <c ca="center">
            <p>333</p>
         </c>
         <c ca="center">
            <p>10</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>8</p>
         </c>
         <c ca="center">
            <p>Hypertension</p>
         </c>
         <c ca="center">
            <p>521</p>
         </c>
         <c ca="center">
            <p>2177</p>
         </c>
         <c ca="center">
            <p>13</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>9</p>
         </c>
         <c ca="center">
            <p>Intracranial arteriovenous malformations (including intracranial aneurysm)</p>
         </c>
         <c ca="center">
            <p>60</p>
         </c>
         <c ca="center">
            <p>182</p>
         </c>
         <c ca="center">
            <p>11</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>10</p>
         </c>
         <c ca="center">
            <p>Inflammation</p>
         </c>
         <c ca="center">
            <p>696</p>
         </c>
         <c ca="center">
            <p>1354</p>
         </c>
         <c ca="center">
            <p>94</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>11</p>
         </c>
         <c ca="center">
            <p>Obesity</p>
         </c>
         <c ca="center">
            <p>412</p>
         </c>
         <c ca="center">
            <p>1121</p>
         </c>
         <c ca="center">
            <p>27</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>12</p>
         </c>
         <c ca="center">
            <p>Diabetes mellitus, type 2</p>
         </c>
         <c ca="center">
            <p>208</p>
         </c>
         <c ca="center">
            <p>1603</p>
         </c>
         <c ca="center">
            <p>19</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Total</p>
         </c>
         <c ca="center">
            <p>Stroke + All etiologies</p>
         </c>
         <c ca="center">
            <p>3068(1943)<sup>&#8224;</sup></p>
         </c>
         <c ca="center">
            <p>11472</p>
         </c>
         <c ca="center">
            <p>247(165)<sup>&#8224;</sup></p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>* Pathways with the FDR q-values of less than 0.05.</p>
      <p>&#8224; Numbers in ( ) are counts for non-redundant genes or pathways over the etiologies.</p>
   </tblfn></tbl>
<sec>
<st>
<p>User interface</p>
</st>
<p>Users can retrieve genetic variant and biological pathway information that are related with stroke and its etiologies in SigCS base through its two principal search flows including: a gene-based variant search (M1 or M2 &#8594;1&#8594;2&#8594;5, 6, 7, 8, 9 or 4 in Figure <figr fid="F1">1</figr>) and a pathway-based variant search (M3&#8594;3&#8594;4&#8594;10 or 2 in Figure <figr fid="F1">1</figr>). The former uses genes as an entry point of variant information through a gene table browse function or keyword search. The latter uses pathways as an entry point of variant information through a pathway table browse function. The two search flows connect in the middle of the search flows using cross-links between genes and pathways.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Two main search flows in SigCS base.</p></caption><text>
   <p><b>Two main search flows in SigCS base.</b> Users can retrieve stroke and its etiology related genetic variant and biological pathway information through the two principal search flows of SigCS base: a gene-based variant search (M1 or M2 &#8594;1&#8594;2&#8594;5, 6, 7, 8, 9 or 4) and a pathway-based variant search (M3&#8594;3&#8594;4&#8594;10 or 2).</p>
</text><graphic file="1752-0509-5-S2-S10-1"/></fig>
<sec>
<st>
<p>Gene-based variant search</p>
</st>
<p>In this search flow methodology, users can retrieve genetic variant information that is related to stroke and its etiologies through a table browse or keyword search function. To do so, at first users can peruse the resulting gene list in table format by choosing the &#8216;Browse&#8217; menu (M1 in Figure <figr fid="F1">1</figr>) or search the genes by keyword search on the gene symbols or gene descriptions stored in SigCS base at the keyword search page (A in Figure <figr fid="F2">2</figr>) by choosing the &#8216;Search&#8217; menu (M2 in Figure <figr fid="F1">1</figr>).</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Screenshots for the keyword search page and a gene and variant information page in SigCS base web interface.</p></caption><text>
   <p><b>Screenshots for the keyword search page and a gene and variant information page in SigCS base web interface.</b> The keyword search for the stroke and its etiology related genetic variant information in SigCS base is available at the keyword search page. Some useful user options including simple logical combination, limitation of search in terms of the existence of variants, batch search, and a multiple selection of etiologies to be searched are provided (Panel A). Gene and variant information page shows a gene information table, an OMIM-originated variant table, a UniProt-originated variant table, and a dbSNP-originated variant table (Panel B). In the figure, only the gene information table and an OMIM-originated variant table are shown.</p>
</text><graphic file="1752-0509-5-S2-S10-2"/></fig>
<p>After examining the gene list and description by the gene table browsing or keyword search, users can click a gene symbol of interest and retrieve the gene information page, comprising a gene information table, an OMIM-originated variant table, a UniProt-originated variant table, and a dbSNP-originated variant table (B in Figure <figr fid="F2">2</figr>).</p>
<p>The gene information table lists basic data on the gene, hyperlinks to functional annotation databases, and data on the biological pathway to which the gene is assigned. The hyperlinks connect to public annotation databases, including Entrez Gene <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>, HGNC, OMIM, Ensembl <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>, and HPRD <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>. Clicking the link on a pathway name generates a pathway information page that contains detailed gene pathways, hyperlinks to the source pathway annotation database, a list of other genes that are assigned to the same pathway, and p-values for pathway assignments (B in Figure <figr fid="F3">3</figr>).</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Screenshots for the pathway browse page and a detailed pathway information page.</p></caption><text>
   <p><b>Screenshots for the pathway browse page and a detailed pathway information page.</b> Users can retrieve pathways and gene sets contributable for stroke or each etiology at the pathway browse page (Panel A) and get the detailed pathway information (Panel B) by clicking the link on the p-value of a pathway.</p>
</text><graphic file="1752-0509-5-S2-S10-3"/></fig>
<p>The OMIM-originated variant table displays the variants in the selected gene and related phenotype information. Links on the variant name generates a page that contains detailed variant information and shows experimental data, phenotype data, and references on that variant.</p>
<p>The UniProt-originated variant table shows nonsynonymous variants, ordered by amino acid coordinates in the protein, for the selected gene, as well as amino acid pairs that correspond to the variants, related phenotypes, and PubMed links.</p>
<p>The dbSNP-originated variant table lists the SNPs, based on the genomic structure of the gene, ordered by coding strand. The structure of the gene includes the 2K upstream base pairs, exons, introns, 5&#8217; UTR, 3&#8217; UTR, and 500 base-pair downstream region. Each SNP information block is composed of a genomic coordinate (in base pairs) on the chromosome, the rsID, mutation function, and links to the dbSNP and HapMap databases <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>. Researchers can evaluate the function or usability of each SNP in experimental studies using the provided information on function, detailed SNP information from dbSNP, and linkage disequilibrium and tag SNP data from HapMap. A SNP flank sequence file can be downloaded selectively for further study.</p>
</sec>
<sec>
<st>
<p>Pathway-based variant search</p>
</st>
<p>In this search flow, users can retrieve pathways and gene sets that statistically analyzed to contribute functionally to stroke or each etiology through the &#8216;Pathways&#8217; menu (M3 in Figure <figr fid="F1">1</figr>). This flow is helpful in understanding the role of genes or variants when the relationships between the genes and stroke or its etiologies are obscure. By selecting the combo box at the top of the table at the pathway browse page, a table lists biological pathways that are statistically related to stroke or its etiologies, ordered by p-value and q-value (A in Figure <figr fid="F3">3</figr>). The link on the p-value generates a page that contains detailed pathway information, hyperlinks to the source pathway annotation database, and a list of other genes that are assigned to that pathway (B in Figure <figr fid="F3">3</figr>). The links on the gene symbols in the overlapping gene list provide crosslinks to gene information pages; thus, more efficient gene variant searches can be undertaken.</p>
</sec>
</sec>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>SigCS base is an effective user-friendly tool designed to assist researchers in the identification of the molecular and genetic factors associated with stroke. As described, SigCS base comprehensively implements numerous genetic databases, including OMIM <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>, UniProt <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp>, dbSNP <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>, UCSC genome <abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp>, HGNC database <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>, MSigDB <abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp>, providing detailed variant, gene and pathway information to maximize the accuracy of the results produced. These results are further enhanced by including data regarding established stroke etiologies and risk factors, including: hypertension, obesity, type 2 diabetes mellitus, hyperlipidemia, atherosclerosis, blood coagulation, vascular inflammation, estrogen effects, hyperhomocysteinemia, intracranial aneurysms, and arteriovenous malformations.</p>
<p>While highly useful in its current form, SigCS base will be updated periodically as the molecular and pathophysiological understanding of stroke grows with regard to function and as more database content becomes available. For example, in the next SigCS base update, PubMed will be used as the raw data source for variant information rather than OMIM as currently implemented. While OMIM is accurate, the information content is often delayed as compared to PubMed. We also plan to employ text-mining techniques and manual curation to extract gene and variant information, in an effort to retrieve greater and more precise content. Our planned SigCS base content updates will not only provide an improved list of stroke etiologies and risk factor data, but will also increase accuracy by merging certain etiologies that share pathological causalities and adding new etiologies that contribute to stroke. The genetic variants, genes and pathway relationships as derived from more reliable etiologies will provide a more accurate understanding and insight towards the mechanisms of stroke. Users' comments and suggestions for additional features of interest are welcomed.</p>
</sec>
<sec>
<st>
<p>Competing interests</p>
</st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>YP, DL, and YJK conceived the study and designed functions. YP and YJK implemented core programs and web interfaces. OSB, MC, JK, and JWC selected etiology, curated data and constructed database. DL and YJK directed the investigation. YP, DL, and YJK wrote the manuscript. All authors read and approved the final manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>This work was supported by a grant (K09200) from the Korea Institute of Oriental Medicine (KIOM), in part by a grant (2010-0012810) from the National Research Foundation of Korea (NRF). DL was supported by the World Class University program (R32-2008-000-10218-0) of the Ministry of Education, Science and Technology through the National Research Foundation of Korea.</p>
<p>This article has been published as part of <it>BMC Systems Biology</it> Volume 5 Supplement 2, 2011: 22nd International Conference on Genome Informatics: Systems Biology. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/1752-0509/5?issue=S2</url>.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Classification of stroke subtypes</p></title><aug><au><snm>Amarenco</snm><fnm>P</fnm></au><au><snm>Bogousslavsky</snm><fnm>J</fnm></au><au><snm>Caplan</snm><fnm>LR</fnm></au><au><snm>Donnan</snm><fnm>GA</fnm></au><au><snm>Hennerici</snm><fnm>MG</fnm></au></aug><source>Cerebrovasc Dis</source><pubdate>2009</pubdate><volume>27</volume><fpage>493</fpage><lpage>501</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1159/000210432</pubid><pubid idtype="pmpid" link="fulltext">19342825</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Risk factor management to prevent first stroke</p></title><aug><au><snm>Rundek</snm><fnm>T</fnm></au><au><snm>Sacco</snm><fnm>RL</fnm></au></aug><source>Neurol Clin</source><pubdate>2008</pubdate><volume>26</volume><fpage>1007</fpage><lpage>1045</lpage><note>ix</note><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ncl.2008.09.001</pubid><pubid idtype="pmcid">2666965</pubid><pubid idtype="pmpid" link="fulltext">19026901</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Etiology and pathophysiology of stroke as a complex trait</p></title><aug><au><snm>Rubattu</snm><fnm>S</fnm></au><au><snm>Giliberti</snm><fnm>R</fnm></au><au><snm>Volpe</snm><fnm>M</fnm></au></aug><source>Am J Hypertens</source><pubdate>2000</pubdate><volume>13</volume><fpage>1139</fpage><lpage>1148</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0895-7061(00)01249-8</pubid><pubid idtype="pmpid">11041170</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>A computerized algorithm for etiologic classification of ischemic stroke: the Causative Classification of Stroke System</p></title><aug><au><snm>Ay</snm><fnm>H</fnm></au><au><snm>Benner</snm><fnm>T</fnm></au><au><snm>Arsava</snm><fnm>EM</fnm></au><au><snm>Furie</snm><fnm>KL</fnm></au><au><snm>Singhal</snm><fnm>AB</fnm></au><au><snm>Jensen</snm><fnm>MB</fnm></au><au><snm>Ayata</snm><fnm>C</fnm></au><au><snm>Towfighi</snm><fnm>A</fnm></au><au><snm>Smith</snm><fnm>EE</fnm></au><au><snm>Chong</snm><fnm>JY</fnm></au><au><snm>Koroshetz</snm><fnm>WJ</fnm></au><au><snm>Sorensen</snm><fnm>AG</fnm></au></aug><source>Stroke</source><pubdate>2007</pubdate><volume>38</volume><fpage>2979</fpage><lpage>2984</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1161/STROKEAHA.107.490896</pubid><pubid idtype="pmpid" link="fulltext">17901381</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Risk factors for ischaemic and intracerebral haemorrhagic stroke in 22 countries (the INTERSTROKE study): a case-control study</p></title><aug><au><snm>O'Donnell</snm><fnm>MJ</fnm></au><au><snm>Xavier</snm><fnm>D</fnm></au><au><snm>Liu</snm><fnm>L</fnm></au><au><snm>Zhang</snm><fnm>H</fnm></au><au><snm>Chin</snm><fnm>SL</fnm></au><au><snm>Rao-Melacini</snm><fnm>P</fnm></au><au><snm>Rangarajan</snm><fnm>S</fnm></au><au><snm>Islam</snm><fnm>S</fnm></au><au><snm>Pais</snm><fnm>P</fnm></au><au><snm>McQueen</snm><fnm>MJ</fnm></au><au><snm>Mondo</snm><fnm>C</fnm></au><au><snm>Damasceno</snm><fnm>A</fnm></au><au><snm>Lopez-Jaramillo</snm><fnm>P</fnm></au><au><snm>Hankey</snm><fnm>GJ</fnm></au><au><snm>Dans</snm><fnm>AL</fnm></au><au><snm>Yusoff</snm><fnm>K</fnm></au><au><snm>Truelsen</snm><fnm>T</fnm></au><au><snm>Diener</snm><fnm>HC</fnm></au><au><snm>Sacco</snm><fnm>RL</fnm></au><au><snm>Ryglewicz</snm><fnm>D</fnm></au><au><snm>Czlonkowska</snm><fnm>A</fnm></au><au><snm>Weimar</snm><fnm>C</fnm></au><au><snm>Wang</snm><fnm>X</fnm></au><au><snm>Yusuf</snm><fnm>S</fnm></au><au><cnm>INTERSTROKE investigators</cnm></au></aug><source>Lancet</source><pubdate>2010</pubdate><volume>376</volume><fpage>112</fpage><lpage>123</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0140-6736(10)60834-3</pubid><pubid idtype="pmpid" link="fulltext">20561675</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Stroke in developing countries: can the epidemic be stopped and outcomes improved?</p></title><aug><au><snm>Feigin</snm><fnm>VL</fnm></au></aug><source>Lancet Neurol</source><pubdate>2007</pubdate><volume>6</volume><fpage>94</fpage><lpage>97</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1474-4422(07)70007-8</pubid><pubid idtype="pmpid" link="fulltext">17239789</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Genetics of stroke</p></title><aug><au><snm>Fornage</snm><fnm>M</fnm></au></aug><source>Curr Atheroscler Rep</source><pubdate>2009</pubdate><volume>11</volume><fpage>167</fpage><lpage>174</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s11883-009-0027-5</pubid><pubid idtype="pmpid">19361347</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>The genetics of ischaemic stroke</p></title><aug><au><snm>Matarin</snm><fnm>M</fnm></au><au><snm>Singleton</snm><fnm>A</fnm></au><au><snm>Hardy</snm><fnm>J</fnm></au><au><snm>Meschia</snm><fnm>J</fnm></au></aug><source>J Intern Med</source><pubdate>2010</pubdate><volume>267</volume><fpage>139</fpage><lpage>155</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-2796.2009.02202.x</pubid><pubid idtype="pmpid" link="fulltext">20175863</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Advances in genomic analysis of stroke: what have we learned and where are we headed?</p></title><aug><au><snm>Lanktree</snm><fnm>MB</fnm></au><au><snm>Dichgans</snm><fnm>M</fnm></au><au><snm>Hegele</snm><fnm>RA</fnm></au></aug><source>Stroke</source><pubdate>2010</pubdate><volume>41</volume><fpage>825</fpage><lpage>832</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1161/STROKEAHA.109.570523</pubid><pubid idtype="pmpid" link="fulltext">20167918</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>A genome-wide genotyping study in patients with ischaemic stroke: initial analysis and data release</p></title><aug><au><snm>Matar&#237;n</snm><fnm>M</fnm></au><au><snm>Brown</snm><fnm>WM</fnm></au><au><snm>Scholz</snm><fnm>S</fnm></au><au><snm>Sim&#243;n-S&#225;nchez</snm><fnm>J</fnm></au><au><snm>Fung</snm><fnm>HC</fnm></au><au><snm>Hernandez</snm><fnm>D</fnm></au><au><snm>Gibbs</snm><fnm>JR</fnm></au><au><snm>De Vrieze</snm><fnm>FW</fnm></au><au><snm>Crews</snm><fnm>C</fnm></au><au><snm>Britton</snm><fnm>A</fnm></au><au><snm>Langefeld</snm><fnm>CD</fnm></au><au><snm>Brott</snm><fnm>TG</fnm></au><au><snm>Brown</snm><fnm>RD</fnm><suf>Jr</suf></au><au><snm>Worrall</snm><fnm>BB</fnm></au><au><snm>Frankel</snm><fnm>M</fnm></au><au><snm>Silliman</snm><fnm>S</fnm></au><au><snm>Case</snm><fnm>LD</fnm></au><au><snm>Singleton</snm><fnm>A</fnm></au><au><snm>Hardy</snm><fnm>JA</fnm></au><au><snm>Rich</snm><fnm>SS</fnm></au><au><snm>Meschia</snm><fnm>JF</fnm></au></aug><source>Lancet Neurol</source><pubdate>2007</pubdate><volume>6</volume><fpage>414</fpage><lpage>420</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1474-4422(07)70081-9</pubid><pubid idtype="pmcid">2613843</pubid><pubid idtype="pmpid" link="fulltext">17434096</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Susceptibility loci for intracranial aneurysm in European and Japanese populations</p></title><aug><au><snm>Bilguvar</snm><fnm>K</fnm></au><au><snm>Yasuno</snm><fnm>K</fnm></au><au><snm>Niemel&#228;</snm><fnm>M</fnm></au><au><snm>Ruigrok</snm><fnm>YM</fnm></au><au><snm>von Und Zu Fraunberg</snm><fnm>M</fnm></au><au><snm>van Duijn</snm><fnm>CM</fnm></au><au><snm>van den Berg</snm><fnm>LH</fnm></au><au><snm>Mane</snm><fnm>S</fnm></au><au><snm>Mason</snm><fnm>CE</fnm></au><au><snm>Choi</snm><fnm>M</fnm></au><au><snm>Ga&#225;l</snm><fnm>E</fnm></au><au><snm>Bayri</snm><fnm>Y</fnm></au><au><snm>Kolb</snm><fnm>L</fnm></au><au><snm>Arlier</snm><fnm>Z</fnm></au><au><snm>Ravuri</snm><fnm>S</fnm></au><au><snm>Ronkainen</snm><fnm>A</fnm></au><au><snm>Tajima</snm><fnm>A</fnm></au><au><snm>Laakso</snm><fnm>A</fnm></au><au><snm>Hata</snm><fnm>A</fnm></au><au><snm>Kasuya</snm><fnm>H</fnm></au><au><snm>Koivisto</snm><fnm>T</fnm></au><au><snm>Rinne</snm><fnm>J</fnm></au><au><snm>Ohman</snm><fnm>J</fnm></au><au><snm>Breteler</snm><fnm>MM</fnm></au><au><snm>Wijmenga</snm><fnm>C</fnm></au><au><snm>State</snm><fnm>MW</fnm></au><au><snm>Rinkel</snm><fnm>GJ</fnm></au><au><snm>Hernesniemi</snm><fnm>J</fnm></au><au><snm>J&#228;&#228;skel&#228;inen</snm><fnm>JE</fnm></au><au><snm>Palotie</snm><fnm>A</fnm></au><au><snm>Inoue</snm><fnm>I</fnm></au><au><snm>Lifton</snm><fnm>RP</fnm></au><au><snm>G&#252;nel</snm><fnm>M</fnm></au></aug><source>Nat Genet</source><pubdate>2008</pubdate><volume>40</volume><fpage>1472</fpage><lpage>1477</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.240</pubid><pubid idtype="pmcid">2682433</pubid><pubid idtype="pmpid" link="fulltext">18997786</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>The genetics of primary haemorrhagic stroke, subarachnoid haemorrhage and ruptured intracranial aneurysms in adults</p></title><aug><au><snm>Peck</snm><fnm>G</fnm></au><au><snm>Smeeth</snm><fnm>L</fnm></au><au><snm>Whittaker</snm><fnm>J</fnm></au><au><snm>Casas</snm><fnm>JP</fnm></au><au><snm>Hingorani</snm><fnm>A</fnm></au><au><snm>Sharma</snm><fnm>P</fnm></au></aug><source>PLoS One</source><pubdate>2008</pubdate><volume>3</volume><fpage>e3691</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0003691</pubid><pubid idtype="pmcid">2579487</pubid><pubid idtype="pmpid" link="fulltext">19008959</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>A genome-wide association study identifies novel risk loci for type 2 diabetes</p></title><aug><au><snm>Sladek</snm><fnm>R</fnm></au><au><snm>Rocheleau</snm><fnm>G</fnm></au><au><snm>Rung</snm><fnm>J</fnm></au><au><snm>Dina</snm><fnm>C</fnm></au><au><snm>Shen</snm><fnm>L</fnm></au><au><snm>Serre</snm><fnm>D</fnm></au><au><snm>Boutin</snm><fnm>P</fnm></au><au><snm>Vincent</snm><fnm>D</fnm></au><au><snm>Belisle</snm><fnm>A</fnm></au><au><snm>Hadjadj</snm><fnm>S</fnm></au><au><snm>Balkau</snm><fnm>B</fnm></au><au><snm>Heude</snm><fnm>B</fnm></au><au><snm>Charpentier</snm><fnm>G</fnm></au><au><snm>Hudson</snm><fnm>TJ</fnm></au><au><snm>Montpetit</snm><fnm>A</fnm></au><au><snm>Pshezhetsky</snm><fnm>AV</fnm></au><au><snm>Prentki</snm><fnm>M</fnm></au><au><snm>Posner</snm><fnm>BI</fnm></au><au><snm>Balding</snm><fnm>DJ</fnm></au><au><snm>Meyre</snm><fnm>D</fnm></au><au><snm>Polychronakos</snm><fnm>C</fnm></au><au><snm>Froguel</snm><fnm>P</fnm></au></aug><source>Nature</source><pubdate>2007</pubdate><volume>445</volume><fpage>881</fpage><lpage>885</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature05616</pubid><pubid idtype="pmpid" link="fulltext">17293876</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>StrokeBase: a database of cerebrovascular disease-related candidate genes</p></title><aug><au><snm>Kim</snm><fnm>YU</fnm></au><au><snm>Kim</snm><fnm>IH</fnm></au><au><snm>Bang</snm><fnm>OS</fnm></au><au><snm>Kim</snm><fnm>YJ</fnm></au></aug><source>Genomics Informatics</source><pubdate>2008</pubdate><volume>6</volume><fpage>153</fpage><lpage>156</lpage><xrefbib><pubid idtype="doi">10.5808/GI.2008.6.3.153</pubid></xrefbib></bibl><bibl id="B15"><title><p>McKusick's Online Mendelian Inheritance in Man (OMIM)</p></title><aug><au><snm>Amberger</snm><fnm>J</fnm></au><au><snm>Bocchini</snm><fnm>CA</fnm></au><au><snm>Scott</snm><fnm>AF</fnm></au><au><snm>Hamosh</snm><fnm>A</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><fpage>D793</fpage><lpage>D796</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn665</pubid><pubid idtype="pmcid">2686440</pubid><pubid idtype="pmpid" link="fulltext">18842627</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>The Universal Protein Resource (UniProt) in 2010</p></title><aug><au><cnm>UniProt consortium</cnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><fpage>D142</fpage><lpage>D148</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2808944</pubid><pubid idtype="pmpid" link="fulltext">19843607</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>dbSNP: the NCBI database of genetic variation</p></title><aug><au><snm>Sherry</snm><fnm>ST</fnm></au><au><snm>Ward</snm><fnm>MH</fnm></au><au><snm>Kholodov</snm><fnm>M</fnm></au><au><snm>Baker</snm><fnm>J</fnm></au><au><snm>Phan</snm><fnm>L</fnm></au><au><snm>Smigielski</snm><fnm>EM</fnm></au><au><snm>Sirotkin</snm><fnm>K</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2001</pubdate><volume>29</volume><fpage>308</fpage><lpage>311</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/29.1.308</pubid><pubid idtype="pmcid">29783</pubid><pubid idtype="pmpid" link="fulltext">11125122</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>The UCSC Genome Browser database: update 2010</p></title><aug><au><snm>Rhead</snm><fnm>B</fnm></au><au><snm>Karolchik</snm><fnm>D</fnm></au><au><snm>Kuhn</snm><fnm>RM</fnm></au><au><snm>Hinrichs</snm><fnm>AS</fnm></au><au><snm>Zweig</snm><fnm>AS</fnm></au><au><snm>Fujita</snm><fnm>PA</fnm></au><au><snm>Diekhans</snm><fnm>M</fnm></au><au><snm>Smith</snm><fnm>KE</fnm></au><au><snm>Rosenbloom</snm><fnm>KR</fnm></au><au><snm>Raney</snm><fnm>BJ</fnm></au><au><snm>Pohl</snm><fnm>A</fnm></au><au><snm>Pheasant</snm><fnm>M</fnm></au><au><snm>Meyer</snm><fnm>LR</fnm></au><au><snm>Learned</snm><fnm>K</fnm></au><au><snm>Hsu</snm><fnm>F</fnm></au><au><snm>Hillman-Jackson</snm><fnm>J</fnm></au><au><snm>Harte</snm><fnm>RA</fnm></au><au><snm>Giardine</snm><fnm>B</fnm></au><au><snm>Dreszer</snm><fnm>TR</fnm></au><au><snm>Clawson</snm><fnm>H</fnm></au><au><snm>Barber</snm><fnm>GP</fnm></au><au><snm>Haussler</snm><fnm>D</fnm></au><au><snm>Kent</snm><fnm>WJ</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><fpage>D613</fpage><lpage>619</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkp939</pubid><pubid idtype="pmcid">2808870</pubid><pubid idtype="pmpid" link="fulltext">19906737</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>The HUGO Gene Nomenclature Committee (HGNC)</p></title><aug><au><snm>Povey</snm><fnm>S</fnm></au><au><snm>Lovering</snm><fnm>R</fnm></au><au><snm>Bruford</snm><fnm>E</fnm></au><au><snm>Wright</snm><fnm>M</fnm></au><au><snm>Lush</snm><fnm>M</fnm></au><au><snm>Wain</snm><fnm>H</fnm></au></aug><source>Hum Genet</source><pubdate>2001</pubdate><volume>109</volume><fpage>678</fpage><lpage>680</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00439-001-0615-0</pubid><pubid idtype="pmpid" link="fulltext">11810281</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles</p></title><aug><au><snm>Subramanian</snm><fnm>A</fnm></au><au><snm>Tamayo</snm><fnm>P</fnm></au><au><snm>Mootha</snm><fnm>VK</fnm></au><au><snm>Mukherjee</snm><fnm>S</fnm></au><au><snm>Ebert</snm><fnm>BL</fnm></au><au><snm>Gillette</snm><fnm>MA</fnm></au><au><snm>Paulovich</snm><fnm>A</fnm></au><au><snm>Pomeroy</snm><fnm>SL</fnm></au><au><snm>Golub</snm><fnm>TR</fnm></au><au><snm>Lander</snm><fnm>ES</fnm></au><au><snm>Mesirov</snm><fnm>JP</fnm></au></aug><source>Proc Natl Acad Sci U S A</source><pubdate>2005</pubdate><volume>102</volume><fpage>15545</fpage><lpage>15550</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0506580102</pubid><pubid idtype="pmcid">1239896</pubid><pubid idtype="pmpid" link="fulltext">16199517</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Hypertension and cerebrovascular damage</p></title><aug><au><snm>Veglio</snm><fnm>F</fnm></au><au><snm>Paglieri</snm><fnm>C</fnm></au><au><snm>Rabbia</snm><fnm>F</fnm></au><au><snm>Bisbocci</snm><fnm>D</fnm></au><au><snm>Bergui</snm><fnm>M</fnm></au><au><snm>Cerrato</snm><fnm>P</fnm></au></aug><source>Atherosclerosis</source><pubdate>2009</pubdate><volume>205</volume><fpage>331</fpage><lpage>341</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.atherosclerosis.2008.10.028</pubid><pubid idtype="pmpid" link="fulltext">19100549</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Hyperlipidemia and primary prevention of stroke: does risk factor identification and reduction really work?</p></title><aug><au><snm>Lewis</snm><fnm>A</fnm></au><au><snm>Segal</snm><fnm>A</fnm></au></aug><source>Curr Atheroscler Rep</source><pubdate>2010</pubdate><volume>12</volume><fpage>225</fpage><lpage>229</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s11883-010-0117-4</pubid><pubid idtype="pmpid" link="fulltext">20495893</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Weight of the obesity epidemic: rising stroke rates among middle-aged women in the United States</p></title><aug><au><snm>Towfighi</snm><fnm>A</fnm></au><au><snm>Zheng</snm><fnm>L</fnm></au><au><snm>Ovbiagele</snm><fnm>B</fnm></au></aug><source>Stroke</source><pubdate>2010</pubdate><volume>41</volume><fpage>1371</fpage><lpage>1375</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1161/STROKEAHA.109.577510</pubid><pubid idtype="pmpid" link="fulltext">20508193</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Incidence of ischemic stroke in relation to asymptomatic carotid artery atherosclerosis in subjects with normal blood pressure. A prospective cohort study</p></title><aug><au><snm>Li</snm><fnm>C</fnm></au><au><snm>Engstrom</snm><fnm>G</fnm></au><au><snm>Berglund</snm><fnm>G</fnm></au><au><snm>Janzon</snm><fnm>L</fnm></au><au><snm>Hedblad</snm><fnm>B</fnm></au></aug><source>Cerebrovasc Dis</source><pubdate>2008</pubdate><volume>26</volume><fpage>297</fpage><lpage>303</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1159/000149577</pubid><pubid idtype="pmpid" link="fulltext">18667810</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Genetic aspects of ischemic stroke: coagulation, homocysteine, and lipoprotein metabolism as potential risk factors</p></title><aug><au><snm>Stankovic</snm><fnm>S</fnm></au><au><snm>Majkic-Singh</snm><fnm>N</fnm></au></aug><source>Crit Rev Clin Lab Sci</source><pubdate>2010</pubdate><volume>47</volume><fpage>72</fpage><lpage>123</lpage><xrefbib><pubidlist><pubid idtype="doi">10.3109/10408361003791520</pubid><pubid idtype="pmpid" link="fulltext">20590502</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Carotid artery disease and stroke in patients with peripheral arterial disease. The role of inflammation</p></title><aug><au><snm>Sirico</snm><fnm>G</fnm></au><au><snm>Spadera</snm><fnm>L</fnm></au><au><snm>De Laurentis</snm><fnm>M</fnm></au><au><snm>Brevetti</snm><fnm>G</fnm></au></aug><source>Monaldi Arch Chest Dis</source><pubdate>2009</pubdate><volume>72</volume><fpage>10</fpage><lpage>17</lpage><xrefbib><pubid idtype="pmpid">19645207</pubid></xrefbib></bibl><bibl id="B27"><title><p>Inflammation, statin therapy, and risk of stroke after an acute coronary syndrome in the MIRACL study</p></title><aug><au><snm>Kinlay</snm><fnm>S</fnm></au><au><snm>Schwartz</snm><fnm>GG</fnm></au><au><snm>Olsson</snm><fnm>AG</fnm></au><au><snm>Rifai</snm><fnm>N</fnm></au><au><snm>Szarek</snm><fnm>M</fnm></au><au><snm>Waters</snm><fnm>DD</fnm></au><au><snm>Libby</snm><fnm>P</fnm></au><au><snm>Ganz</snm><fnm>P</fnm></au></aug><source>Arterioscler Thromb Vasc Biol</source><pubdate>2008</pubdate><volume>28</volume><fpage>142</fpage><lpage>147</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">17991875</pubid></xrefbib></bibl><bibl id="B28"><title><p>Estrogen and stroke: a review of the current literature</p></title><aug><au><snm>Carwile</snm><fnm>E</fnm></au><au><snm>Wagner</snm><fnm>AK</fnm></au><au><snm>Crago</snm><fnm>E</fnm></au><au><snm>Alexander</snm><fnm>SA</fnm></au></aug><source>J Neurosci Nurs</source><pubdate>2009</pubdate><volume>41</volume><fpage>18</fpage><lpage>25</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/JNN.0b013e31819345f8</pubid><pubid idtype="pmpid" link="fulltext">19368068</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Risk of ischemic stroke and lifetime estrogen exposure</p></title><aug><au><snm>de Lecin&#227;na</snm><fnm>MA</fnm></au><au><snm>Egido</snm><fnm>JA</fnm></au><au><snm>Fern&#225;ndez</snm><fnm>C</fnm></au><au><snm>Mart&#237;nez-Vila</snm><fnm>E</fnm></au><au><snm>Santos</snm><fnm>S</fnm></au><au><snm>Morales</snm><fnm>A</fnm></au><au><snm>Mart&#237;nez</snm><fnm>E</fnm></au><au><snm>Pareja</snm><fnm>A</fnm></au><au><snm>Alvarez-Sab&#237;n</snm><fnm>J</fnm></au><au><snm>Casado</snm><fnm>I</fnm></au><au><cnm>PIVE Study Investigators of the Stroke Project of the Spanish Cerebrovascular Diseases Study Group</cnm></au></aug><source>Neurology</source><pubdate>2007</pubdate><volume>68</volume><fpage>33</fpage><lpage>38</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1212/01.wnl.0000250238.69938.f5</pubid><pubid idtype="pmpid" link="fulltext">17200489</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Hyperhomocysteinemia as an independent risk factor for cardioembolic stroke in the Turkish population</p></title><aug><au><snm>Tascilar</snm><fnm>N</fnm></au><au><snm>Ekem</snm><fnm>S</fnm></au><au><snm>Aciman</snm><fnm>E</fnm></au><au><snm>Ankarali</snm><fnm>H</fnm></au><au><snm>Mungan</snm><fnm>G</fnm></au><au><snm>Ozen</snm><fnm>B</fnm></au><au><snm>Unal</snm><fnm>A</fnm></au></aug><source>Tohoku J Exp Med</source><pubdate>2009</pubdate><volume>218</volume><fpage>293</fpage><lpage>300</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1620/tjem.218.293</pubid><pubid idtype="pmpid" link="fulltext">19638733</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Intracranial arterial and arteriovenous malformations presenting with infarction. Lausanne Stroke Registry study</p></title><aug><au><snm>Herzig</snm><fnm>R</fnm></au><au><snm>Bogousslavsky</snm><fnm>J</fnm></au><au><snm>Maeder</snm><fnm>P</fnm></au><au><snm>Maeder-Ingvar</snm><fnm>M</fnm></au><au><snm>Reichhart</snm><fnm>M</fnm></au><au><snm>Urbano</snm><fnm>LA</fnm></au><au><snm>Leemann</snm><fnm>B</fnm></au></aug><source>Eur J Neurol</source><pubdate>2005</pubdate><volume>12</volume><fpage>93</fpage><lpage>102</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1468-1331.2004.00954.x</pubid><pubid idtype="pmpid" link="fulltext">15679696</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Arterial aneurysms associated with cerebral arteriovenous malformations: classification, incidence, and risk of hemorrhage</p></title><aug><au><snm>Redekop</snm><fnm>G</fnm></au><au><snm>TerBrugge</snm><fnm>K</fnm></au><au><snm>Montanera</snm><fnm>W</fnm></au><au><snm>Willinsky</snm><fnm>R</fnm></au></aug><source>J Neurosurg</source><pubdate>1998</pubdate><volume>89</volume><fpage>539</fpage><lpage>546</lpage><xrefbib><pubidlist><pubid idtype="doi">10.3171/jns.1998.89.4.0539</pubid><pubid idtype="pmpid" link="fulltext">9761046</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Entrez Gene: gene-centered information at NCBI</p></title><aug><au><snm>Maglott</snm><fnm>D</fnm></au><au><snm>Ostell</snm><fnm>J</fnm></au><au><snm>Pruitt</snm><fnm>KD</fnm></au><au><snm>Tatusova</snm><fnm>T</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2007</pubdate><volume>35</volume><fpage>D26</fpage><lpage>31</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkl993</pubid><pubid idtype="pmcid">1761442</pubid><pubid idtype="pmpid" link="fulltext">17148475</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Touring Ensembl: a practical guide to genome browsing</p></title><aug><au><snm>Spudich</snm><fnm>GM</fnm></au><au><snm>Fernandez-Suarez</snm><fnm>XM</fnm></au></aug><source>BMC Genomics</source><pubdate>2010</pubdate><volume>11</volume><fpage>295</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-11-295</pubid><pubid idtype="pmcid">2894802</pubid><pubid idtype="pmpid" link="fulltext">20459808</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Human Protein Reference Database--2009 update</p></title><aug><au><snm>Keshava Prasad</snm><fnm>TS</fnm></au><au><snm>Goel</snm><fnm>R</fnm></au><au><snm>Kandasamy</snm><fnm>K</fnm></au><au><snm>Keerthikumar</snm><fnm>S</fnm></au><au><snm>Kumar</snm><fnm>S</fnm></au><au><snm>Mathivanan</snm><fnm>S</fnm></au><au><snm>Telikicherla</snm><fnm>D</fnm></au><au><snm>Raju</snm><fnm>R</fnm></au><au><snm>Shafreen</snm><fnm>B</fnm></au><au><snm>Venugopal</snm><fnm>A</fnm></au><au><snm>Balakrishnan</snm><fnm>L</fnm></au><au><snm>Marimuthu</snm><fnm>A</fnm></au><au><snm>Banerjee</snm><fnm>S</fnm></au><au><snm>Somanathan</snm><fnm>DS</fnm></au><au><snm>Sebastian</snm><fnm>A</fnm></au><au><snm>Rani</snm><fnm>S</fnm></au><au><snm>Ray</snm><fnm>S</fnm></au><au><snm>Harrys Kishore</snm><fnm>CJ</fnm></au><au><snm>Kanth</snm><fnm>S</fnm></au><au><snm>Ahmed</snm><fnm>M</fnm></au><au><snm>Kashyap</snm><fnm>MK</fnm></au><au><snm>Mohmood</snm><fnm>R</fnm></au><au><snm>Ramachandra</snm><fnm>YL</fnm></au><au><snm>Krishna</snm><fnm>V</fnm></au><au><snm>Rahiman</snm><fnm>BA</fnm></au><au><snm>Mohan</snm><fnm>S</fnm></au><au><snm>Ranganathan</snm><fnm>P</fnm></au><au><snm>Ramabadran</snm><fnm>S</fnm></au><au><snm>Chaerkady</snm><fnm>R</fnm></au><au><snm>Pandey</snm><fnm>A</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><fpage>D767</fpage><lpage>772</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn892</pubid><pubid idtype="pmcid">2686490</pubid><pubid idtype="pmpid" link="fulltext">18988627</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>A fine-scale map of recombination rates and hotspots across the human genome</p></title><aug><au><snm>Myers</snm><fnm>S</fnm></au><au><snm>Bottolo</snm><fnm>L</fnm></au><au><snm>Freeman</snm><fnm>C</fnm></au><au><snm>McVean</snm><fnm>G</fnm></au><au><snm>Donnelly</snm><fnm>P</fnm></au></aug><source>Science</source><pubdate>2005</pubdate><volume>310</volume><fpage>321</fpage><lpage>324</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1117196</pubid><pubid idtype="pmpid" link="fulltext">16224025</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>