Open Access Research article

Fuzzy association rules for biological data analysis: A case study on yeast

Francisco J Lopez1*, Armando Blanco1, Fernando Garcia1, Carlos Cano1 and Antonio Marin2

Author Affiliations

1 Department of Computer Science and AI, University of Granada, 18071, Granada, Spain

2 Department of Genetics, University of Seville, 41012, Seville, Spain

For all author emails, please log on.

BMC Bioinformatics 2008, 9:107 doi:10.1186/1471-2105-9-107

Published: 19 February 2008

Abstract

Background

Last years' mapping of diverse genomes has generated huge amounts of biological data which are currently dispersed through many databases. Integration of the information available in the various databases is required to unveil possible associations relating already known data. Biological data are often imprecise and noisy. Fuzzy set theory is specially suitable to model imprecise data while association rules are very appropriate to integrate heterogeneous data.

Results

In this work we propose a novel fuzzy methodology based on a fuzzy association rule mining method for biological knowledge extraction. We apply this methodology over a yeast genome dataset containing heterogeneous information regarding structural and functional genome features. A number of association rules have been found, many of them agreeing with previous research in the area. In addition, a comparison between crisp and fuzzy results proves the fuzzy associations to be more reliable than crisp ones.

Conclusion

An integrative approach as the one carried out in this work can unveil significant knowledge which is currently hidden and dispersed through the existing biological databases. It is shown that fuzzy association rules can model this knowledge in an intuitive way by using linguistic labels and few easy-understandable parameters.