Logical Analysis of Data (LAD) model for the early diagnosis of acute ischemic stroke
- Equal contributors
1 Rutgers Center for Operations Research, RUTCOR, 640 Bartholomew Road, Piscataway, NJ 08854, USA
2 Molecular Imaging Laboratory, Clinical Center, National Institutes of Health, Bethesda, MD 20892, USA
3 National Institute of Neurological Disorders and Stroke, Bethesda, MD 20892, USA
4 Critical Care Medicine Department, Clinical Center, National Institutes of Health, Bethesda, MD 20892, USA
5 Department of Radiology, The Methodist Hospital, 6565 Fannin Street, Houston, TX 77030, USA
BMC Medical Informatics and Decision Making 2008, 8:30 doi:10.1186/1472-6947-8-30Published: 10 July 2008
Strokes are a leading cause of morbidity and the first cause of adult disability in the United States. Currently, no biomarkers are being used clinically to diagnose acute ischemic stroke. A diagnostic test using a blood sample from a patient would potentially be beneficial in treating the disease.
A classification approach is described for differentiating between proteomic samples of stroke patients and controls, and a second novel predictive model is developed for predicting the severity of stroke as measured by the National Institutes of Health Stroke Scale (NIHSS). The models were constructed by applying the Logical Analysis of Data (LAD) methodology to the mass peak profiles of 48 stroke patients and 32 controls. The classification model was shown to have an accuracy of 75% when tested on an independent validation set of 35 stroke patients and 25 controls, while the predictive model exhibited superior performance when compared to alternative algorithms. In spite of their high accuracy, both models are extremely simple and were developed using a common set consisting of only 3 peaks.
We have successfully identified 3 biomarkers that can detect ischemic stroke with an accuracy of 75%. The performance of the classification model on the validation set and on cross-validation does not deteriorate significantly when compared to that on the training set, indicating the robustness of the model. As in the case of the LAD classification model, the results of the predictive model validate the function constructed on our support-set for approximating the severity scores of stroke patients. The correlation and root mean absolute error of the LAD predictive model are consistently superior to those of the other algorithms used (Support vector machines, C4.5 decision trees, Logistic regression and Multilayer perceptron).