This article is part of the supplement: Selected articles from the IEEE International Conference on Bioinformatics and Biomedicine 2012: Medical Genomics
Integrating fMRI and SNP data for biomarker identification for schizophrenia with a sparse representation based variable selection method
1 Unit on Statistical Genomics, NIMH/NIH, Bethesda, MD, USA
2 Department of Biomedical Engineering, Tulane University, New Orleans, LA, USA
3 Department of Biostatistics & Bioinformatics, Tulane University, New Orleans, LA, USA
4 The Mind Research Network, Albuquerque, NM, USA
5 Department of Electrical and Computer Engineering at the University of New Mexico, both in Albuquerque, NM, USA
BMC Medical Genomics 2013, 6(Suppl 3):S2 doi:10.1186/1755-8794-6-S3-S2Published: 11 November 2013
In recent years, both single-nucleotide polymorphism (SNP) array and functional magnetic resonance imaging (fMRI) have been widely used for the study of schizophrenia (SCZ). In addition, a few studies have been reported integrating both SNPs data and fMRI data for comprehensive analysis.
In this study, a novel sparse representation based variable selection (SRVS) method has been proposed and tested on a simulation data set to demonstrate its multi-resolution properties. Then the SRVS method was applied to an integrative analysis of two different SCZ data sets, a Single-nucleotide polymorphism (SNP) data set and a functional resonance imaging (fMRI) data set, including 92 cases and 116 controls. Biomarkers for the disease were identified and validated with a multivariate classification approach followed by a leave one out (LOO) cross-validation. Then we compared the results with that of a previously reported sparse representation based feature selection method.
Results showed that biomarkers from our proposed SRVS method gave significantly higher classification accuracy in discriminating SCZ patients from healthy controls than that of the previous reported sparse representation method. Furthermore, using biomarkers from both data sets led to better classification accuracy than using single type of biomarkers, which suggests the advantage of integrative analysis of different types of data.
The proposed SRVS algorithm is effective in identifying significant biomarkers for complicated disease as SCZ. Integrating different types of data (e.g. SNP and fMRI data) may identify complementary biomarkers benefitting the diagnosis accuracy of the disease.