Using a literature-based NMF model for discovering gene functional relationships

Tjioe, Elina; Berry, Michael; Homayouni, Ramin; Heinrich, Kevin

doi:10.1186/1471-2105-9-S7-P1

Volume 9 Supplement 7

UT-ORNL-KBRIN Bioinformatics Summit 2008

Poster presentation
Open access
Published: 08 July 2008

Elina Tjioe¹, Michael Berry², Ramin Homayouni³ & Kevin Heinrich⁴

BMC Bioinformatics volume 9, Article number: P1 (2008)

Background

The rapid growth of the biomedical literature and of genomic information presents a major challenge for determining the functional relationships among genes. Several bioinformatics tools have been developed to extract and identify gene relationships from various biological databases. In this study, we develop a Web-based bioinformatics tool called Feature Annotation Using Nonnegative matrix factorization (FAUN) to facilitate both the discovery and classification of functional relationships among genes. FAUN uses the nonnegative matrix factorization (NMF) algorithms described in [1]. Both the computational complexity and the parameterization of NMF for processing gene sets are discussed. FAUN is first tested on a small, manually constructed 50-gene (50TG) collection that we, as well as others, have previously used [2]. Screenshots of FAUN feature classification and gene-to-gene correlation for the 50TG collection are shown in Figures 1 and 2. We then apply FAUN to analyze several microarray-derived gene sets obtained from studies of the developing cerebellum in normal and mutant mice. FAUN provides utilities for collaborative knowledge discovery and identification of new gene relationships from text streams and repositories (e.g., MEDLINE). It is particularly useful for the validation and analysis of gene associations suggested by microarray experimentation. The FAUN tool is publicly available at https://shad.eecs.utk.edu/faun.
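To make the factorization step concrete, the sketch below applies a generic multiplicative-update NMF to a toy nonnegative matrix. It is an illustration only: the abstract does not say which of the algorithms in [1] FAUN uses, and the term-by-gene layout of `A`, the function name `nmf_multiplicative`, the rank `k`, and the iteration count are all assumptions made for this example.

```python
import numpy as np

def nmf_multiplicative(A, k, n_iter=200, eps=1e-9, seed=0):
    """Approximate a nonnegative matrix A (m x n) as W @ H.

    W is m x k and H is k x n; both stay nonnegative because every
    update multiplies the current entries by a nonnegative ratio.
    """
    rng = np.random.default_rng(seed)
    m, n = A.shape
    W = rng.random((m, k)) + eps
    H = rng.random((k, n)) + eps
    for _ in range(n_iter):
        # Update H with W fixed, then W with H fixed
        # (Frobenius-norm objective; eps guards against division by zero).
        H *= (W.T @ A) / (W.T @ W @ H + eps)
        W *= (A @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Hypothetical toy data: rows are terms, columns are genes (assumed layout).
A = np.array([
    [3.0, 2.0, 0.0, 0.0],
    [2.0, 3.0, 0.0, 0.0],
    [0.0, 0.0, 4.0, 1.0],
    [0.0, 0.0, 1.0, 4.0],
])

W, H = nmf_multiplicative(A, k=2)
rel_error = np.linalg.norm(A - W @ H) / np.linalg.norm(A)
print("relative reconstruction error:", rel_error)
```

Under this assumed layout, each column of `W` would be read as a feature (a weighted group of terms) and each column of `H` as one gene's weights over those features. The rank `k` is the main parameter to tune, and the cost of each iteration grows with `k` and with the dimensions of `A`.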

Discussion

For a preliminary assessment of FAUN feature classification, each gene in the 50TG collection was classified either by its most dominant annotated feature or by a feature weight threshold. The FAUN classification using the strongest feature (per gene) yielded 90% accuracy. A FAUN-based analysis of a new cerebellum gene set also revealed that the gene set contains a large component of transcription factors.
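The two classification rules described above can be sketched as follows. This is a minimal illustration, not FAUN's implementation: the feature-by-gene weight matrix `H`, the labels, the threshold value, and the expected classes are hypothetical toy values, and the helper names are invented for this example.

```python
import numpy as np

def classify_by_dominant_feature(H, feature_labels):
    """Give each gene (column of H) the label of its largest-weight feature."""
    return [feature_labels[i] for i in np.argmax(H, axis=0)]

def classify_by_threshold(H, feature_labels, threshold):
    """Give each gene every label whose feature weight is at least `threshold`."""
    return [
        [feature_labels[i] for i in np.flatnonzero(H[:, j] >= threshold)]
        for j in range(H.shape[1])
    ]

def accuracy(predicted, expected):
    """Fraction of genes whose predicted label matches the expected label."""
    return sum(p == e for p, e in zip(predicted, expected)) / len(expected)

# Hypothetical weights: 2 features (rows) x 4 genes (columns).
H = np.array([
    [0.9, 0.7, 0.1, 0.4],
    [0.1, 0.2, 0.8, 0.5],
])
labels = ["feature A", "feature B"]  # placeholder annotations

dominant = classify_by_dominant_feature(H, labels)
multi = classify_by_threshold(H, labels, threshold=0.4)
acc = accuracy(dominant, ["feature A", "feature A", "feature B", "feature A"])

print(dominant)  # ['feature A', 'feature A', 'feature B', 'feature B']
print(multi)     # [['feature A'], ['feature A'], ['feature B'], ['feature A', 'feature B']]
print(acc)       # 0.75
```

The dominant-feature rule always assigns exactly one class per gene, while the threshold rule can assign several classes or none, which may suit genes with more than one function.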

References

1. Berry MW, Browne M, Langville AN, Pauca VP, Plemmons RJ: Algorithms and Applications for Approximate Nonnegative Matrix Factorization. Computational Statistics & Data Analysis 2007, 52(1):155–173. doi:10.1016/j.csda.2006.11.006
2. Homayouni R, Heinrich K, Wei L, Berry MW: Gene clustering by latent semantic indexing of MEDLINE abstracts. Bioinformatics 2005, 21(1):104–115. doi:10.1093/bioinformatics/bth464

Acknowledgements

This work is supported by an NIH subcontract (HD052472) involving the University of Tennessee, University of Memphis, Oak Ridge National Laboratory, and the University of British Columbia.

Author information

Authors and Affiliations

Genome Science and Technology Graduate School, University of Tennessee, Knoxville, TN, 37996, USA
Elina Tjioe
Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, TN, 37996, USA
Michael Berry
Bioinformatics Program, Department of Biology, University of Memphis, Memphis, TN, 38152, USA
Ramin Homayouni
Computable Genomix LLC, Bartlett, TN, 38133, USA
Kevin Heinrich

Corresponding author

Correspondence to Michael Berry.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Tjioe, E., Berry, M., Homayouni, R. et al. Using a literature-based NMF model for discovering gene functional relationships. BMC Bioinformatics 9 (Suppl 7), P1 (2008). https://doi.org/10.1186/1471-2105-9-S7-P1

Published: 08 July 2008
DOI: https://doi.org/10.1186/1471-2105-9-S7-P1
