Email updates

Keep up to date with the latest news and content from BMC Medical Informatics and Decision Making and BioMed Central.

Open Access Research article

A genetic fuzzy system for unstable angina risk assessment

Wei Dong1, Zhengxing Huang2*, Lei Ji3 and Huilong Duan2

Author Affiliations

1 Cardiology Department of Chinese PLA General Hospital, Beijing, China

2 College of Biomedical Engineering and Instrument Science, Zhejiang University, 310008, Zhou Yiqing Building 510, Zheda road 38#, Hangzhou, Zhejiang, China

3 IT Department of Chinese PLA General Hospital, Beijing, China

For all author emails, please log on.

BMC Medical Informatics and Decision Making 2014, 14:12  doi:10.1186/1472-6947-14-12


The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1472-6947/14/12


Received:10 June 2013
Accepted:13 February 2014
Published:18 February 2014

© 2014 Dong et al.; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.

Abstract

Background

Unstable Angina (UA) is widely accepted as a critical phase of coronary heart disease with patients exhibiting widely varying risks. Early risk assessment of UA is at the center of the management program, which allows physicians to categorize patients according to the clinical characteristics and stratification of risk and different prognosis. Although many prognostic models have been widely used for UA risk assessment in clinical practice, a number of studies have highlighted possible shortcomings. One serious drawback is that existing models lack the ability to deal with the intrinsic uncertainty about the variables utilized.

Methods

In order to help physicians refine knowledge for the stratification of UA risk with respect to vagueness in information, this paper develops an intelligent system combining genetic algorithm and fuzzy association rule mining. In detail, it models the input information’s vagueness through fuzzy sets, and then applies a genetic fuzzy system on the acquired fuzzy sets to extract the fuzzy rule set for the problem of UA risk assessment.

Results

The proposed system is evaluated using a real data-set collected from the cardiology department of a Chinese hospital, which consists of 54 patient cases. 9 numerical patient features and 17 categorical patient features that appear in the data-set are selected in the experiments. The proposed system made the same decisions as the physician in 46 (out of a total of 54) tested cases (85.2%).

Conclusions

By comparing the results that are obtained through the proposed system with those resulting from the physician’s decision, it has been found that the developed model is highly reflective of reality. The proposed system could be used for educational purposes, and with further improvements, could assist and guide young physicians in their daily work.

Keywords:
Unstable angina risk assessment; Fuzzy association rule mining; Genetic algorithm

Background

Unstable Angina (UA) is a kind of chest discomfort or pain that occurs in a continuous and unpredictable way [1,2]. The unstable pain can result from the disruption of an atherosclerotic plaque in narrowed coronary vessels with lessened flexibility, embolization and vasospasm. As a major type of Cardiovascular Disease (CVD), UA lays its symptoms between stable angina and acute myocardium infarction and a further sudden death [2]. While the risk of UA is high, the population of UA is huge, especially for aged people and those with associated disease such as hypertension and diabetes [3]. To this end, reliable assessment of risk levels for individual UA patients will be of significant value and interest.

A number of models for UA risk assessment have been proposed in literature. Most of these models are derived from databases of clinical trials, e.g., the Thrombolysis in Myocardial Infarction (TIMI) [4], platelet glycoprotein IIb/IIIa in unstable angina: Receptor Suppression Using Integrilin (PURSUIT) [5], and the Global Registry of Acute Coronary Events (GRACE) [6], etc. They use standard patient features that are part of the routine medical evaluation of UA patients, and lead to a score to define prognostic groups [2]. Although there are many benefits related to the design and use of these prognostic models, a number of studies have highlighted possible shortcomings [2,4]. One serious drawback is that existing models lack the ability to deal with the intrinsic uncertainty about patient features utilized in UA risk assessment. Note that vagueness is fundamental and indispensable aspects of knowledge, so as in many practical problems, the experts face vagueness in feature vectors. According to Bellman and Zadeh "much of the decision making in the real world takes place in an environment in which the goals, the constraints, and consequences of possible actions are not known precisely" [7]. Regarding UA risk assessment, many patient features are vague, and not easy to be handled by existing models. It is, therefore, necessary to develop a new UA risk assessment model to deal with vague information.

In this paper, a novel UA risk assessment model has been developed using fuzzy set theories. The proposed model represents patient features with fuzzy sets and then extracts useful information with a descriptive rule induction approach based on fuzzy systems. To derive fuzzy rules from data, the proposed model employs genetic algorithms (GAs) to learn rule base from the collected data-set. GAs are search algorithms based on natural genetics that provide robust search capabilities in complex spaces [8]. The hybridization between fuzzy systems and GAs, called genetic fuzzy system (GFS), has attracted considerable attention in the computational intelligence community [9-12]. Our main goal is to develop a novel GFS such that we derive from clinical data-set a set of assessment rules, which has good interpretability before determining an efficient assessment model in order to get high accuracy of UA risk stratification. Since the accuracy of an assessment model can be largely affected by processing vague patient features, this paper also discusses a clustering-based method for patient feature partitioning.

This paper is organized as follows. Section ‘Preliminary’ presents preliminary knowledge used in this paper. Section ‘Method’ describes the development of the genetic-fuzzy system for UA risk assessment. Experimental studies of the performance of the proposed approach are presented in Section ‘Results and discussion’. Section ‘Conclusion’ concludes the paper.

Preliminary

Let D = {σ1,⋯,σn} be a patient data-set consisting of a finite set of UA patient cases. Let A = {a1,⋯,an} represent all patient features that appear in D and Class = {low-risk, medium-risk, high-risk} be a set of UA risk levels. Each feature a may have a categorical or numerical underlying domain, denoted dom(a). Each patient case σ (σ ∈ D) contains values of some patient features from A. Let σ(a) (σ(a) ∈ dom(a)) be the target feature value for the patient case σ for feature a.

For example, Table 1 shows an example patient data-set, which consists of five patient cases. Each case contains 1 numerical patient features (i.e., age) and 3 categorical patient features (i.e., sex, smoking, and has event recently).

Table 1. An example patient data-set

For numerical patient feature a (a ∈ A), let <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M1','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M1">View MathML</a> be a set of linguistic terms defined over a. Let μa,j(σ(a)) be the membership degree on the value of a feature a of the patient case σ to the fuzzy set corresponding to the linguistic label <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M2','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M2">View MathML</a> for this feature a. Note that the degree of membership of each value of a in any of the fuzzy sets specified for a is directly based on the evaluation of the membership function of the particular fuzzy set with the value of a as input. The fuzzy partition of dom(a) is composed of <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M3','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M3">View MathML</a> that satisfies <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M4','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M4">View MathML</a>.

In this study, we employ fuzzy rules of the following form [9,12]:

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M5','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M5">View MathML</a>

(1)

where Cond is the antecedent part of the rule, and Class is the consequent part of the rule. For example, a fuzzy rule can be expressed as:

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M6','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M6">View MathML</a>

(2)

It must be noted that any subset of the complete set of patient features, with any combination of linguistic labels related to the operators and and or, can take part in the rule antecedent. For this kind of fuzzy rule, we say that a patient case σ supports the antecedent part of a rule r if

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M7','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M7">View MathML</a>

(3)

where <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M8','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M8">View MathML</a> is the membership degree of patient feature ai for σ to the fuzzy set corresponding to the linguistic label <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M9','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M9">View MathML</a> for ai; and APC is the antecedent part compatibility between a patient case and the antecedent part of a fuzzy rule. For the categorical features, the degrees of membership are zero or one.

For a patient case σ, the support degree of σ by a specific rule r is calculated as follows:

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M10','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M10">View MathML</a>

(4)

In general, a fuzzy rule can be considered to be a classification rule if the antecedent contains fuzzy item sets, and the consequent part contains only one class label such as low-risk, medium-risk, or high-risk in this study. A fuzzy rule r : Cond → Class could be measured directly in terms of support and confidence as follows:

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M11','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M11">View MathML</a>

(5)

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M12','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M12">View MathML</a>

(6)

Method

In this section, we describe the process of utilizing GFS to develop an intelligent system for the problem of UA risk assessment. As shown in Figure 1, the proposed method consists of three steps. At first, all the numerical patient features of the data-set are given as input for the fuzzy clustering module for calculating the membership functions. Then, calculated function values are given to the rule generation module for obtaining UA risk assessment rules. Based on the derived rules, a classification model for UA risk assessment is generated.

thumbnailFigure 1. The main steps of the proposed UA risk assessment model.

The case study was performed in the Cardiology Department at the Chinese PLA General hospital. Prior approval was obtained from the data protection committee of the hospital to conduct the study. We state that the patient data was anonymized in this study and in the Method section of this paper.

Fuzzy clustering for numerical feature discretization

One of the most important steps in UA risk assessment is to deal with the intrinsic uncertainty about the variables utilized. As described in [13], fuzzy set is a common tool for facilitating the interpretation of rules in linguistic terms, and avoiding unnatural boundaries in the partitioning of the variable domains. It is especially useful in clinical settings where the boundaries of a piece of information used may not be clearly defined. Regarding our task of UA risk assessment, the quality of the results produced relies quite crucially on the appropriateness of fuzzy sets to the given patient features. So, fuzzy sets must be consistent with the values of the corresponding feature.

Fuzzy sets can be provided by physicians. However, the provided fuzzy sets by physicians may not be suitable for mining fuzzy association rules from data-set. Also, it is extremely difficult for physicians to estimate the most appropriate fuzzy sets. In order to cope with these problems, we first concentrate on how fuzzy sets of the given features are determined automatically from the collected data-set. Clustering techniques are usually employed as a preprocessing step to partition numerical features [14]. In this study, we employed a hierarchical agglomerative clustering [15-17] algorithm to partition numerical features.

As shown in algorithm 1, hierarchical agglomerative clustering begins with each value as a separate cluster and merges them into successively larger clusters. The process is repeated until the similarity between any pair of clusters is less than a threshold value ε. Consequently, the algorithm builds a structure called dendogram, i.e., a tree illustrating the merging process and intermediate clusters. Similarity between two clusters c1 and c2 can be measured as follows:

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M13','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M13">View MathML</a>

(7)

where |c1| and |c2| are the number of clusters c1 and c2, respectively.

Algorithm 1 Hierarchical Agglomerative Clustering

This way, the values of each patient feature in the data-set are distributed over a set of derived clusters using Algorithm 1. For each patient feature, the centroids of the clusters are the set of midpoints of the fuzzy sets. To illustrate the process, suppose we want to find fuzzy sets for a specific patient feature a, which is quantitative with a range from min(dom(a)) to max(dom(a)). Let <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M15','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M15">View MathML</a> be the set of mid-points of the fuzzy sets for a. As a result, the derived fuzzy sets will have the following ranges: <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M16','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M16">View MathML</a> and <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M17','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M17">View MathML</a>, where <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M18','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M18">View MathML</a>, and <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M19','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M19">View MathML</a>.

After the fuzzy sets of each numerical feature are obtained, the corresponding membership function can be generated for each fuzzy set. In this study, we used membership functions of both semi-trapezoidal shape and triangular shape because they are in general the most appropriate shapes and the most widely used in fuzzy systems. For example, for the fuzzy set with a range from <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M20','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M20">View MathML</a> to <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M21','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M21">View MathML</a>, the membership function is given by

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M22','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M22">View MathML</a>

(8)

For each fuzzy set with midpoint <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M23','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M23">View MathML</a>, where 1 ≤ j ≤ m, the membership function is given by

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M24','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M24">View MathML</a>

(9)

And for the fuzzy set with a range from <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M25','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M25">View MathML</a> to <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M26','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M26">View MathML</a>, the membership function is given by

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M27','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M27">View MathML</a>

(10)

For example, given a numerical feature, say age with three different ranges, i.e., [30, 56], [56, 74], and [74, 87]. The values of age range from 30 to 87, and can be classified into four fuzzy sets, as shown in Figure 2.

thumbnailFigure 2. The membership functions for patient feature, age.

Fuzzy association rule mining

This subsection presents a GFS for mining fuzzy association rules from a data-set. The proposed system uses fuzzy rule format defined in Equation (2), which offers a flexible structure to the rules, allowing each patient feature to take more than one value and facilitating the extraction of general UA risk assessment rules.

Chromosome representation

As mentioned above, a US risk assessment rule r consists of an antecedent part Cond and a consequent part Class. In this study, we code the antecedent part Cond of r as one chromosome consisting of a set of segments. Each segment corresponds to a specific patient feature. The set of possible values for the categorical features is that indicated by the problem, and for numerical features, it is the set of linguistic terms determined by the clustering method presented above. The consequent part Class of r is prefixed to one of the possible values of risk levels, i.e., high-risk, medium-risk, and low-risk.

Table 2 describes a representation for a rule with numerical and categorical features for the values of a specific risk level high-risk. Note that a bit for each one of the possible values of each feature is stored. In this way, if the value of the corresponding element is 0, it indicates that the value is not used in the rule. Otherwise, if the value is 1, it indicates that the corresponding value is included. If a rule contains all the elements corresponding to a feature of the value 1, or all of them contain the value 0, this indicates that this feature has no relevance for the information contributed in the rule, and so this feature is ignored. In these cases, the feature does not take part in the rule. For example, as shown in Table 2, the rule is represented by a binary string 〈(0011)(00)(10)(10) : high-risk〉, where parentheses are to separate segments, and ":" is to separate the IF part and the THEN part of the rule. In this example, a1 has four possible values and a2, a3, and a4 have two possible values. Note that a2 does not take part in the rule as a2 takes none of its values, and thus a2 is irrelevant for the rule. This binary string can be interpreted as the following rule: "IF age is (old or very old) and smoking is true and has heart events recently is true THEN UA risk is high".

Table 2. Representation of a fuzzy rule with numerical and categorical features in UA risk assessment

Fitness function and selection process

The objective of this step is to find the accurate and general rules for UA risk assessment. Thus, given a specific rule r, the GA method uses the composite fitness function consisting of support and confidence in the following way:

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M28','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M28">View MathML</a>

(11)

The objective of the fitness function is defined as the composite measure of support and confidence. This composite measurement provides an effective selection environment which balances the accuracy and generality of the rules.

Three operators, i.e., selection, crossover, and mutation, are applied in the proposed GA method to generate the offspring population, which are illustrated as follows:

• The selection procedure is used for evolution where two individuals are selected randomly from the current population and used for crossover and mutation operators. During each generation, individuals with higher fitness values survive while those with lower fitness values are destroyed.

• Two parents are selected and recombined according to the predefined crossover probability during crossover. In this work, one point crossover is applied due to its simplicity, which can randomly select different cutoff points for each parent to generate offspring rule sets.

• Each element in the chromosome is applied to mutation with a predefined mutation probability. The value of a randomly selected element is converted to 0 if its value is 1, and vice versa. Elimination of existing rules and addition of new rules can also be used as mutation operations. As a result, the number of rules in the rule sets string can be changed accordingly.

During GA operations, redundant rules might be produced. For example, we say a rule "If Age is (Young) and ST is (Low) then Risk is (Low)" is redundant w.r.t the other rule "If Age is (Young OR Meddle-ages) and ST is (Low) then Risk is (Low)", if both rules have the same support degree on a data-set. Thus, the proposed algorithm must check the rule sets and maintains single among all the rules, to guarantee the consistency of fuzzy systems. The stopping criterion of the proposed algorithm is the number of generations. The scheme of the proposed algorithm is shown in Algorithm 2.

Algorithm 2 The GA-based rule mining algorithm for UA risk assessment.

Taking the data set shown in Table 1 as an example, and assuming the population size as 50, the number of generation as 1000, the crossover rate as 0.5, and the mutation rate as 0.2 in the proposed genetic algorithm, we can obtain an example rule-set, as shown in Table 3.

Table 3. Rules derived from the example patient data-set shown in Table1

UA risk assessment model

Based on the derived rule set, we can generate a classification model for UA risk assessment. Formally, let ℜ be the set of derived UA risk assessment rules. For each r : Cond → Class (r ∈ ℜ), the score value of the target class (class ∈ {low-risk,medium-risk,high-risk}) of r with respect to a given patient case σ can be assessed, by using the following equation:

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M30','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M30">View MathML</a>

(12)

where Confidence(r) is the confidence value of rule r, and β(r,σ) is the firing strength of the input patient case σ on the antecedent part of rule r.

The firing strength β(r,σ) is defined as

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M31','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M31">View MathML</a>

(13)

where μa,j represents the fuzzy membership function for the pair of a and the linguistic term <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M32','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M32">View MathML</a> in the <a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M33','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M33">View MathML</a> in the antecedent part Cond of rule r.

With respect to the target values of risk levels, i.e., low-risk, medium-risk, high-risk, the corresponding scores vl,vm,vh can be generated based on Equation (12). And the risk level with the top score in the scoring vector v will be the predicted risk level for the patient case σ.

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M34','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M34">View MathML</a>

(14)

Taking σ1 shown in Table 1 as an example, and using the derived rule-set shown in Table 3, the score values for σ1 are calculated as vl = 0.4, vm = 2.088, and vh = 1.745, by Equation (12). Thus, the predicted risk level for σ1 is medium-risk.

Results and discussion

To evaluate the feasibility of the presented methods, a clinical case study is conducted through the cooperation with the Cardiology Department of the Chinese PLA General Hospital. The data-set collected from the hospital consists of 54 patient cases. The target classes of UA risk levels include: low-risk, medium-risk, and high-risk. Physicians that evaluated these cases are experienced clinicians working for the hospital with 10 years of working experience on average. As a result, 16 cases are classified into the low-risk group, 33 cases are classified into the medium-risk group, and 5 cases are classified into the high-risk group, respectively. Patient features (9 numerical features and 17 categorical features) that appear in the data-set are shown in Table 4. These features are regularly recorded in UA treatment practice.

Table 4. Patient features utilized in UA risk assessment

All experiments were performed on a Lenevo Compatible PC with an Intel Pentium IV CPU 2.8 GHz, 4G byte main memory running on Microsoft Windows 7. The algorithms were implemented using Microsoft C#. A 10-fold cross-validation was performed to evaluate the proposed method, by using a 90% of the data-set as the training set, and the remaining 10% as the validation set. To reduce variability, 10 rounds of this validation process were performed by using different partitions.

For the first step of patient feature discretization, we applied Hierarchical Agglomerative Clustering method to each numerical feature to generate a set of fuzzy sets. The derived fuzzy sets of input numerical patient features are shown in Figure 3.

thumbnailFigure 3. Membership function of input numerical patient features.

To mine fuzzy association rules, we have taken the population size as 100, the number of generation as 1000, the crossover rate as 1.0, and the mutation rate as 0.2 in the proposed genetic algorithm. By using these parameters, we run our genetic algorithm for each target class of UA risk level, one by one, to obtain a set of fuzzy rules for a given class.

Table 5 shows the rules obtained, which have the best fitness values for the target classes of UA risk level (i.e., low-risk, medium-risk, and high-risk). In this table, the number of patient features involved in each rule (# of Feature.), and the Support and Confidence of each rule are shown. The values of Support and Confidence are between zero and one. High values in support means that the rule covers most of patient cases which are categorized into the class, and high values in confidence means that the rule has few negative patient cases [12]. Note that the knowledge discovered for each target value of risk level is understandable by physicians due to the use of fuzzy logic and the low number of rules and conditions in the rule antecedents (below 40% of 26 patient features). Tables 6, 7 and 8 show the rules obtained which have the best fitness values corresponding to the target classes of risk level.

Table 5. Results for low-risk, medium-risk , and high-risk

Table 6. Rules for low-risk

Table 7. Rules for medium-risk

Table 8. Rules for high-risk

Now, acquiring fuzzy rule base, it is possible to complete UA risk assessment through the proposed classification model. As we mentioned above, the ensemble of fuzzy rules and the proposed classification model perform the role of a mathematical function to obtain the system output. This output is the stratification of UA risk. This way, for each patient whose data are informed as system inputs, the most likely risk level for that patient is generated.

The comparison between the proposed model and physicians’ decisions is done for each partition of data-set. The comparison is given in Figure 4. The proposed system made the same decisions as the physician in 46 (out of a total of 54) tested cases (85.2%). From Figure 4, the proposed model did not predict the risk well in the case of the fifth (the physician assessment is medium-risk while the proposed model assessment is low-risk), the twenty third (the physician assessment is low-risk while the proposed model assessment is medium-risk), the twenty ninth (the physician assessment is medium-risk while the proposed model assessment is low-risk), the thirty third (the physician assessment is low-risk while the proposed model assessment is medium-risk), the thirty fourth (the physician assessment is medium-risk while the proposed model assessment is low-risk), the forty (the physician assessment is medium-risk while the proposed model assessment is low-risk), the forty second (the physician assessment is medium-risk while the proposed model assessment is low-risk), and the forty seventh (the physician assessment is medium-risk while the proposed model assessment is low-risk).

thumbnailFigure 4. Comparison between the proposed model and physicians’ decisions.

Furthermore, we measure the accuracy of the proposed approach using the sum of two performance measures: sensitivity (probability that the test correctly classifies a case with a specific risk level) and specificity (probability of correctly classifying a case without a specific risk level).

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M35','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M35">View MathML</a>

(15)

<a onClick="popup('http://www.biomedcentral.com/1472-6947/14/12/mathml/M36','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1472-6947/14/12/mathml/M36">View MathML</a>

(16)

where vclass ∈ {low-risk, medium-risk, high-risk}; TP is the set True Positive, patient cases with the specific risk level vclass classified correctly; FN is the set False Negative, patient cases with the specific risk level vclass classified as other risk levels; TN is the set True Negative, patient cases without the specific risk level vclass classified; FP is the set False Positive, patient cases without the specific risk level vclass classified as vclass. In Table 9 the sensitivity and specificity obtained for each risk level are presented. The experimental results indicate that the proposed method is feasible for predicting risk levels of unstable angina patients.

Table 9. Sensitivity and specificity with different risk levels

Conclusion

In this paper, we have presented an intelligent system for UA risk assessment by combining genetic algorithm and fuzzy association rule mining. The developed approach has been tested on a data-set consisting of 54 UA patient cases from the Cardiology department of Chinese PLA General hospital. The experimental results show that considerable agreement is achieved between the proposed approach and physicians’ problem solving knowledge.

The main novelty of the developed model is that it represents a valuable objective tool for UA risk assessment. In medical literature, physicians are in discrepancies about the risk factors highlighted. This research has focused on the application of computational intelligence. In particular, a genetic-fuzzy system, to identify the key factors behind UA, is proposed, which could be used for educational purposes, and with further improvements, could assist and guide young physicians in their daily work.

For future studies, there may be a comparison of effectiveness in terms of the proposed system with traditional UA risk assessment models, such as TIMI, GRACE, etc. The application of the proposed system to other kinds of CVD, such as heart failure, will also be investigated. Furthermore, other computational intelligence techniques can be associated with the developed system.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

WD: conception, design, analysis of data, development, interpretation and evaluation of results, and drafting of the manuscript. ZH: conception, design, and development of methods, drafting of the manuscript, and critical revision of manuscript. LJ: conception, design, extraction of data, interpretation and evaluation of results. HD: design of methods, analysis of data, and critical revision of manuscript. All authors read and approved the final manuscript.

Acknowledgements

This work was supported by the National Nature Science Foundation of China under Grant No 81101126, and the National Hi-Tech R&D Plan of China under Grant No 2012AA02A601. The author would like to give special thanks to all experts who cooperated in the evaluation of the proposed method.

References

  1. Yeghiazarians Y, Braunstein JB, Askari A, Stone PH: Unstable angina pectoris.

    N Engl J Hum Serv 2000, 342(2):101-114. OpenURL

  2. Graham CA, Tsay SX, Rotheray KR, Rainer TH: Validation of the TIMI risk score in Chinese patients presenting to the emergency department with chest pain.

    Int J Cardiol 2013, 168:597-598. PubMed Abstract | Publisher Full Text OpenURL

  3. 2012 Writing Committee Members, Jneid H, Anderson JL, Wright RS, Adams CD, Bridges CR, Casey DE Jr, Ettinger SM, Fesmire FM, Ganiats TG, Lincoff AM, Peterson ED, Philippides GJ, Theroux P, Wenger NK, Zidar JP, Anderson JL: 2012 ACCF/AHA focused update of the guideline for the management of patients with Unstable Angina/Non-ST-Elevation myocardial infarction (updating the 2007 guideline and replacing the 2011 focused update).

    Circulation 2012, 126(7):875-910. PubMed Abstract | Publisher Full Text OpenURL

  4. Kozieradzka A, Kamiǹski KA, Maciorkowska D, Olszewska M, Dobrzycki S, Nowak K, Kralisz P, Prokopczuk P, Musial WJ: GRACE, TIMI, Zwolle and CADILLAC risk scores - do they predict 5-year outcomes after ST-elevation myocardial infarction treated invasively?

    Int J Cardiol 2011, 148:70-75. PubMed Abstract | Publisher Full Text OpenURL

  5. Boersma E, Pieper KS, Steyerberg EW, Wilcox RG, Chang WC, Lee KL, Akkerhuis KM, Harrington RA, Deckers JW, Armstrong PW, Lincoff AM, Califf RM, Topol EJ, Simoons ML: For the PURSUIT Investigators. Predictors of outcome in patients with acute coronary syndromes without persistent ST-segment elevation. Results from an international trial of 9461 patients.

    Circulation 2000, 101:2557-2567. PubMed Abstract | Publisher Full Text OpenURL

  6. Granger CB, Goldberg RJ, Dabbous O, Pieper KS, Eagle KA, Cannon CP, Van De Werf F, Avezum A, Goodman SG, Flather MD, Fox KA, Global Registry of Acute Coronary Events Investigators: Predictors of hospital mortality in the global registry of acute coronary events.

    Arch Intern Med 2003, 163:2345-2353. PubMed Abstract | Publisher Full Text OpenURL

  7. Bellman R, Zadeh L: Decision-making in a fuzzy environment.

    Manag Sci 1970, 17:141-164. OpenURL

  8. Vidal F, Villard PF, Lutton E: Tuning of patient-specific deformable models using an adaptive evolutionary optimization strategy.

    Biomed Eng IEEE Trans 2012, 59(10):2942-2949. OpenURL

  9. Buczak A, Koshute P, Babin S, Feighner B, Lewis S: A data-driven epidemiological prediction method for dengue outbreaks using local and remote sensing data.

    BMC Med Inform Decis Mak 2012, 12:124.

    [ http://dx.doi.org/10.1186/1472-6947-12-124 webcite]

    PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  10. Mago V, Mehta R, Woolrych R, Papageorgiou E: Supporting meningitis diagnosis amongst infants and children through the use of fuzzy cognitive mapping.

    BMC Med Inform Decis Making 2012, 12:98.

    [ http://dx.doi.org/10.1186/1472-6947-12-98 webcite]

    BioMed Central Full Text OpenURL

  11. Carmona C, Gonzalez P, del Jesus M, Herrera F: NMEEF-SD: non-dominated multiobjective evolutionary algorithm for extracting fuzzy rules in subgroup discovery.

    Fuzzy Syst IEEE Trans 2010, 18(5):958-970. OpenURL

  12. del Jesus M, Gonzalez P, Herrera F, Mesonero M: Evolutionary fuzzy rule induction process for subgroup discovery: a case study in marketing.

    Fuzzy Syst IEEE Trans 2007, 15(4):578-592. OpenURL

  13. Casillas J, Carse B, Bull L: Fuzzy-XCS: a michigan genetic fuzzy system.

    Fuzzy Syst IEEE Trans 2007, 15(4):536-550. OpenURL

  14. Dubois D, Hülermeier E, Prade H: A systematic approach to the assessment of fuzzy association rules.

    Data Min Knowl Discov 2006, 13:167-192. Publisher Full Text OpenURL

  15. Bang YK, Lee CH: Fuzzy time series prediction using hierarchical clustering algorithms.

    Expert Syst Appl 2011, 38(4):4312-4325. Publisher Full Text OpenURL

  16. Horng SC, Yang FY, Lin SS: Hierarchical fuzzy clustering decision tree for classifying recipes of ion implanter.

    Expert Syst Appl 2011, 38:933-940. Publisher Full Text OpenURL

  17. West G, McArthur S, Towle D: Industrial implementation of intelligent system techniques for nuclear power plant condition monitoring.

    Expert Syst Appl 2012, 39(8):7432-7440. Publisher Full Text OpenURL

Pre-publication history

The pre-publication history for this paper can be accessed here:

http://www.biomedcentral.com/1472-6947/14/12/prepub