Log on / register
Feedback | Support | My details
Open AccessHighly AccessDatabase

Towards systems genetic analyses in barley: Integration of phenotypic, expression and genotype data into GeneNetwork

Arnis Druka1 email, Ilze Druka1,2 email, Arthur G Centeno3 email, Hongqiang Li3 email, Zhaohui Sun3 email, William TB Thomas1 email, Nicola Bonar1 email, Brian J Steffenson4 email, Steven E Ullrich5 email, Andris Kleinhofs5 email, Roger P Wise6,7 email, Timothy J Close8 email, Elena Potokina9 email, Zewei Luo9 email, Carola Wagner10 email, Günther F Schweizer11 email, David F Marshall1 email, Michael J Kearsey9 email, Robert W Williams3 email and Robbie Waugh1 email

Genetics Programme, Scottish Crop Research Institute, Invergowrie, Dundee DD2 5DA, UK

School of Computing and Creative Technologies, University of Abertay, Dundee, DD1 1HG, UK

Department of Anatomy and Neurobiology, University of Tennessee, Memphis TN 38163, USA

Department of Plant Pathology, University of Minnesota, St. Paul, MN 55108, USA

Department of Crop and Soil Sciences, Washington State University, Pullman WA99164, USA

Corn Insects and Crop Genetics Research, USDA-ARS, Iowa State University, Ames IA 50011, USA

Department of Plant Pathology, Iowa State University, Ames IA 50011, USA

Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA

School of Biosciences, University of Birmingham, Birmingham B15 2TT, UK

10  Department of Plant Breeding, Justus Liebig University Giessen, Heinrich-Buff-Ring 26-32, 35392 Giessen, Germany

11  Institute for Crop Production and Plant Breeding, Dep. Genome Analysis, Bavarian State Research Center for Agriculture, Am Gereuth 6, 85354 Freising-Weihenstephan, Germany

author email corresponding author email

BMC Genetics 2008, 9:73doi:10.1186/1471-2156-9-73

Published: 18 November 2008

Abstract

Background

A typical genetical genomics experiment results in four separate data sets; genotype, gene expression, higher-order phenotypic data and metadata that describe the protocols, processing and the array platform. Used in concert, these data sets provide the opportunity to perform genetic analysis at a systems level. Their predictive power is largely determined by the gene expression dataset where tens of millions of data points can be generated using currently available mRNA profiling technologies. Such large, multidimensional data sets often have value beyond that extracted during their initial analysis and interpretation, particularly if conducted on widely distributed reference genetic materials. Besides quality and scale, access to the data is of primary importance as accessibility potentially allows the extraction of considerable added value from the same primary dataset by the wider research community. Although the number of genetical genomics experiments in different plant species is rapidly increasing, none to date has been presented in a form that allows quick and efficient on-line testing for possible associations between genes, loci and traits of interest by an entire research community.

Description

Using a reference population of 150 recombinant doubled haploid barley lines we generated novel phenotypic, mRNA abundance and SNP-based genotyping data sets, added them to a considerable volume of legacy trait data and entered them into the GeneNetwork http://www.genenetwork.org webcite. GeneNetwork is a unified on-line analytical environment that enables the user to test genetic hypotheses about how component traits, such as mRNA abundance, may interact to condition more complex biological phenotypes (higher-order traits). Here we describe these barley data sets and demonstrate some of the functionalities GeneNetwork provides as an easily accessible and integrated analytical environment for exploring them.

Conclusion

By integrating barley genotypic, phenotypic and mRNA abundance data sets directly within GeneNetwork's analytical environment we provide simple web access to the data for the research community. In this environment, a combination of correlation analysis and linkage mapping provides the potential to identify and substantiate gene targets for saturation mapping and positional cloning. By integrating datasets from an unsequenced crop plant (barley) in a database that has been designed for an animal model species (mouse) with a well established genome sequence, we prove the importance of the concept and practice of modular development and interoperability of software engineering for biological data sets.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.