Towards systems genetic analyses in barley: Integration of phenotypic, expression and genotype data into GeneNetwork

Druka, Arnis; Druka, Ilze; Centeno, Arthur G; Li, Hongqiang; Sun, Zhaohui; Thomas, William TB; Bonar, Nicola; Steffenson, Brian J; Ullrich, Steven E; Kleinhofs, Andris; Wise, Roger P; Close, Timothy J; Potokina, Elena; Luo, Zewei; Wagner, Carola; Schweizer, Günther F; Marshall, David F; Kearsey, Michael J; Williams, Robert W; Waugh, Robbie

doi:10.1186/1471-2156-9-73

Database
Open access
Published: 18 November 2008

Towards systems genetic analyses in barley: Integration of phenotypic, expression and genotype data into GeneNetwork

Arnis Druka¹,
Ilze Druka^1,2,
Arthur G Centeno³,
Hongqiang Li³,
Zhaohui Sun³,
William TB Thomas¹,
Nicola Bonar¹,
Brian J Steffenson⁴,
Steven E Ullrich⁵,
Andris Kleinhofs⁵,
Roger P Wise^6,7,
Timothy J Close⁸,
Elena Potokina⁹,
Zewei Luo⁹,
Carola Wagner¹⁰,
Günther F Schweizer¹¹,
David F Marshall¹,
Michael J Kearsey⁹,
Robert W Williams³ &
…
Robbie Waugh¹

BMC Genetics volume 9, Article number: 73 (2008) Cite this article

8406 Accesses
19 Citations
3 Altmetric
Metrics details

Abstract

Background

A typical genetical genomics experiment results in four separate data sets; genotype, gene expression, higher-order phenotypic data and metadata that describe the protocols, processing and the array platform. Used in concert, these data sets provide the opportunity to perform genetic analysis at a systems level. Their predictive power is largely determined by the gene expression dataset where tens of millions of data points can be generated using currently available mRNA profiling technologies. Such large, multidimensional data sets often have value beyond that extracted during their initial analysis and interpretation, particularly if conducted on widely distributed reference genetic materials. Besides quality and scale, access to the data is of primary importance as accessibility potentially allows the extraction of considerable added value from the same primary dataset by the wider research community. Although the number of genetical genomics experiments in different plant species is rapidly increasing, none to date has been presented in a form that allows quick and efficient on-line testing for possible associations between genes, loci and traits of interest by an entire research community.

Description

Using a reference population of 150 recombinant doubled haploid barley lines we generated novel phenotypic, mRNA abundance and SNP-based genotyping data sets, added them to a considerable volume of legacy trait data and entered them into the GeneNetwork http://www.genenetwork.org. GeneNetwork is a unified on-line analytical environment that enables the user to test genetic hypotheses about how component traits, such as mRNA abundance, may interact to condition more complex biological phenotypes (higher-order traits). Here we describe these barley data sets and demonstrate some of the functionalities GeneNetwork provides as an easily accessible and integrated analytical environment for exploring them.

Conclusion

By integrating barley genotypic, phenotypic and mRNA abundance data sets directly within GeneNetwork's analytical environment we provide simple web access to the data for the research community. In this environment, a combination of correlation analysis and linkage mapping provides the potential to identify and substantiate gene targets for saturation mapping and positional cloning. By integrating datasets from an unsequenced crop plant (barley) in a database that has been designed for an animal model species (mouse) with a well established genome sequence, we prove the importance of the concept and practice of modular development and interoperability of software engineering for biological data sets.

Background

The systems genetics approach coined 'genetical genomics' aims to decompose phenotypic variation into a series of individual components by simultaneously analysing both 'trait' and 'molecular phenotype' data across genetically defined populations. The approach was originally tested by Damerval et al. in 1994 who applied protein profiling to an F2 population of maize [1]. More recently, genetical genomics has been applied to a range of species using microarray derived mRNA abundance phenotypes [2, 3]. In mouse, such analyses have been used to understand how regulatory networks controlling transcription relate to higher-order phenotypic traits at the genome-wide scale [4, 5]. Analogous genetical genomics experiments in plants have been reported for maize [3, 6], Arabidopsis [7, 8], eucalyptus [9, 10], poplar [11], wheat [12] and barley [13]. These experiments demonstrate that the control of gene expression is complex. However, they also can provide insight into the relationships between gene expression and phenotypic traits.

Genetical genomics experiments typically incorporate four separate data sets for each individual in a segregating population; genotype, mRNA abundance, phenotype and associated metadata. When the genetic materials are 'reference strains' that have been analysed by a broad community, there is an opportunity to incorporate legacy phenotypic and genotypic information. While the scale of the mRNA abundance datasets largely determine the predictive power of the approach, a key point is that these large, multidimensional datasets have considerable value beyond that extracted during their initial analysis. This was recognized early by the scientific community and is formally reflected in regulations specifying raw data quality and availability (archiving) by many funding agencies and journals [14]. However, easy access to the data, either raw or processed, is an equally important criterion that may significantly extend its potential usefulness and value [15, 16]. The sheer volume of the genetical genomics data components, if deposited in an open access but unprocessed and in a format designed for archiving, is likely to be of limited value, particularly if only a subset of the data is required for a specific analytical query.

We conducted a genetical genomics experiment in barley using a population of 150 doubled haploid lines [17]. The outcomes of this experiment included two mRNA profiling data sets, a Transcript Derived Marker (TDM)-based barley genetic linkage map and a set of new trait data obtained from over 4 years of field and glasshouse experiments. We also compiled publicly available trait segregation data that has been collected on this reference population by the barley genetics community over the last 15 years. Here we provide open access and availability to these data by integrating them into the GeneNetwork, a web-based analytical tool that has been designed for multiscale integration of networks of genes, transcripts and traits and optimized for on-line analysis of traits controlled by a combination of allelic variants and environmental factors. GeneNetwork with its central module WebQTL facilitates the exploitation of permanent genetic reference populations that are accompanied by genotypic, phenotypic and mRNA abundance datasets. Algorithms for both quantitative trait locus (QTL) mapping and genetic correlation analysis, supported by highly efficient graphical displays facilitate the identification of QTL controlling mRNA transcript abundance (expression-QTL or eQTL) and higher-order phenotypes. Consequently, GeneNetwork is an unique on-line environment for 'trait analysis' at the systems biology level [18, 19].

One of our long term goals is to construct integrated regulatory and structural gene association networks that explain relationships between component gene expression measures and traditional phenotypic traits. We have started this by constructing a trait association network to establish connections and to provide a framework for the identification and mapping of key regulatory genes. Here we describe these barley data sets and demonstrate how GeneNetwork's integrated analytical environment can be exploited to infer map positions of the barley genes and to construct barley trait association networks.

Methods

Database schema

Construction of the database underlying GeneNetwork for mouse data sets has been described previously [18, 19]. Database schema and description is available from [20].

The current barley data set in GeneNetwork

A population of 150 doubled haploid lines (DHLs) derived from a cross between cultivars (cvs.) Steptoe and Morex (St/Mx) was used to generate the mRNA transcript abundance, trait and genotypic data sets. These parents were selected because of their diversity for agronomic traits [21]. Steptoe is a high yielding, broadly adapted six-rowed feed-type barley from the Western United States (US), whereas Morex is a six-rowed malting cultivar from the Midwestern US.

Phenotypic traits

We have compiled and integrated into GeneNetwork data corresponding to 23 phenotypic traits, fifteen of them not published previously (Table 1). For the phenotypic data obtained from plants grown in the east of Scotland from 2002–2005, we maintained individual field trial data scores as separate entries. Similarly, for the published set of 8 traits [22], measured in 9–16 locations across the US and Canada, we kept the data from each location as a separate entry. For the rest of the traits that have replicate measurements, arithmetic mean, standard deviation and the number of replications were entered into GeneNetwork, thus enabling the use of variance for weighted regression analyses. The total count of individual higher-order phenotypic barley trait entries in GeneNetwork is 211.

Table 1 Condensed list of barley traits that have been measured using the Steptoe × Morex DHL population and are available for analysis through GeneNetwork.

Full size table

mRNA transcript abundance data

There are two barley transcript abundance data sets available for analysis in GeneNetwork – a set of 139 lines of embryo-derived tissues, and a set of 30 seedling leaf samples. The raw data (Affymetrix' CEL files) and all 22,840 Barley1 GeneChip signal values calculated using either RMA or MAS5.0 algorithms [23] using Genespring 7.3 (Agilent Technologies, Inc.) were incorporated into GeneNetwork (Table 2). Originally, profiling of embryo-derived tissues was done using 150 lines and seedling leaf using 35 lines. However, 11 lines had ambiguous genotypes, suggesting mishandling at some stage, and therefore were removed from the dataset [17].

Table 2 Barley expression data sets available for analysis in GeneNetwork.

Full size table

Genotypes

The linkage map presented here was generated as part of two barley association mapping projects in the United Kingdom (UK) [24] and US [25] (also [26, 27]). To create the genotype file, we used data from a pilot barley Illumina Oligo Pool Assay (POPA1) that employs GoldenGate BeadArray technology (Illumina, SanDiego CA) and tested 1,536 barley SNP markers in each of the 150 St/Mx DHLs. 471 high quality polymorphic SNPs were integrated into the existing St/Mx RFLP map [21] using Map Manager QTX (ver. 0.27) software [28]. A final map was generated by removing co-segregating markers (leaving a single marker per locus) and manually checking and correcting the relatively rare single marker double recombination events visible in graphical genotypes of the individuals in the population.

Discussion

Using GeneNetwork for barley

The framework for analysis using GeneNetwork for barley is shown in Figure 1A. Associations between transcript abundance, phenotypic traits and genotype can be established either using correlation or genetic linkage mapping functions [29, 30]. The main page of GeneNetwork at http://www.genenetwork.org provides access to subsets of data through pull-down menus that allow specific data sets to be queried. The datasets can be further restricted using a single text box for specific database entries to query probe set or trait ID, or annotations associated with the database entries. Once the resulting record set of the query is returned, it can be further restricted by selecting relevant records based on attached annotations before forwarding it for further analysis.

To map genetic loci associated with mRNA abundance or trait phenotypes, any one of the three QTL mapping functions currently employed by GeneNetwork's WebQTL module can be used. These are 1. interval mapping, 2. single-marker regression, or 3. composite mapping [29, 30]. A thousand permutations are used to calculate upper and lower Likelihood Ratio Statistic (LRS) thresholds for each trait [31], and 1000 bootstrap tests [32, 33] can be employed to determine the confidence intervals (Figure 1B).

The correlation analysis module performs either Pearson product-moment correlation or Spearman rank correlation. Different trait and transcript abundance values (either as integrated or individual probe signals) as well as genotypes can be used to correlate against other data sets of choice. Results of the correlation analyses can be displayed as a table showing correlation coefficients and p-values. The covariates can then be visualized pair-wise as scatter plots (Figure 1C), mapped using the QTL Cluster function (Figure 1D) or combined into association networks [34, 35] (Figure 1E).

Predicting gene position

One of the basic, but arguably most relevant applications of GeneNetwork for barley is to predict the map location of a gene. Until its genome is sequenced or all known barley genes are mapped as genetic markers (e.g. SNPs), the ability to infer a gene's chromosomal position (with a given degree of certainty) by mapping the genetic interval that controls the abundance of its mRNA (as an eQTL) provides valuable information about location of the gene itself. This is easily achieved in the GeneNetwork using its integrated QTL mapping functions.

When an eQTL is described by a single peak that coincides with the gene's location, then variation in cis-regulatory elements that control the expression of the associated gene is the most likely explanation. Alternatively, if the structural gene is located distantly from its eQTL peak, then the eQTL may represent the location of a regulatory factor, which affects the abundance of the monitored mRNA (i.e. a trans-regulator). One possible approach to inferring cis- vs. trans- regulation, and hence the gene's approximate position is based on the experimentally tested observation that strong eQTL (LRS > 30–40) are typically cis- regulated [3]. The scattergram in Figure 2A partitions 345 previously mapped genes into cis- and trans- eQTLs according to co-location of their structural genes and eQTLs (see also additional file 1). It shows that most eQTLs with an LRS>30 (~20% on the scattergram) are likely to be regulated as cis- (Figure 2B). It also shows that the prediction of trans- regulated genes can not be made using this approach because many cis- regulated genes are in the same LRS value range as trans- regulated genes.

Support for this simple designation of a gene's map location comes from an analysis of conserved synteny between the rice genome sequence and the barley gene map. The rationale is that an eQTL will more likely reflect the true position of its underlying gene if its rice ortholog is located in the conserved syntenic position. We sub-divided all the probe sets that reported significant eQTLs into the high (LRS > 30) and low (LRS < 30) LRS groups and plotted their barley eQTL peak positions against the physical positions of their putative rice orthologs (Additional file 2). For 9 out of 12 rice chromosomes, clear blocks of conserved synteny were revealed with eQTLs with high LRS values, whereas many low LRS value eQTLs were homogenously distributed across the rice genome (for example rice chromosome 1 in Figure 2B). Conservation of synteny provides additional support for the principle of mapping a barley gene based on QTL mapping of mRNA abundance values.

Constructing trait association networks

An association network for a given set of traits is a graphical display of all pair-wise correlations that are above an arbitrarily assigned correlation threshold value [36]. GeneNetwork has a function that constructs such association networks using either phenotype or transcript abundance, or indeed both simultaneously. It provides a visualization of the relative positions and numbers of possible interacting partners, how they interact (positive or negative correlation) and in some situations, based on prior knowledge, it may suggest the directionality of the interaction.

An association network using principal component scores calculated using a selected set of malting quality and yield-related trait data as variables provides an overview of the key barley traits that segregate in the St/Mx population (Figure 3, Additional File 3). The cumulative variation explained by the first four principle components ranged from around 90% for heading date to 40% for grain size (Figure 3A), suggesting a strong genetic component for the former, and a more complex situation for the latter. The derived association network (Figure 3B) revealed some known and obvious relationships. For example, the main yield component 'yield-c1' (c1 = principle component 1) is negatively correlated with 'plant height-c1' and 'lodging-c1' and 'lodging-c2'. In contrast, there is a positive correlation between 'lodging-c1' and -c2 with 'height-c1'. This is entirely consistent with taller plants lodging more which results in grain loss during harvest. The St/Mx population was originally designed to dissect two contrasting barley traits, yield and malting quality [21]. The trait association network in Figure 3B shows links only between the minor components of these traits (malting-c1 to yield-c3 and malting-c2 to yield-c2) suggesting complex underlying genetics.

Since association networks are based on correlation, they differentiate neither causal from reactive traits, nor genetic from environmental factors. Genetic linkage mapping, of course, can provide this distinction if a mapping population with sufficiently high resolution is used and sufficient replication is incorporated in the experimental design. Furthermore, in the case of transcript abundance traits, the integration of data from 'classical' or 'treatment-response' type profiling experiments as well as fine scale haplotype map information may clarify the difference between causal and reactive traits [5]. However we note that there is an extra layer of complexity when dealing with an unsequenced genome. Without knowing the regulatory genes underlying key phenotypic traits, and without having precise map positions for the majority of the genes, it is critical that any mRNA abundance based association network analysis is conducted with caution and stringent validation strategies deployed to support any putative links.

Future developments

The GeneNetwork is an acknowledged and widely used integrated platform designed primarily for analysis of data from mouse genetical genomics experiments [18, 19, 36]. In the future we intend to integrate mRNA profiling, phenotypic and genotypic data from alternative populations that have a different genetic architecture along with molecular profiling data, such as proteins or metabolites, together with access to gene and pathway models and annotations from model plant genomes.

Incorporating algorithms and data handling functions for mapping dynamic traits, also known as functional mapping [38, 39] is also a priority. The approach has been applied to diverse range of species, including humans, animals and plants, to uncover novel information [38, 40–46]. However, to our knowledge, there are no available barley data sets that are suitable for dynamic trait mapping. Preliminary experiments on grain development [47] and interactions with pathogens [48–51] provide examples and methodologies for obtaining trait values that could be easily applied to an expanded sample population, however, this hasn't been done yet. Functional mapping of data relating to classical traits such as height, flowering time and malting quality could also reveal novel QTL or relationships between existing QTL. However, this knowledge will only improve our understanding of the causal biological process if the genes underlying the QTL are cloned.

The collection of precise phenotypic data across a population and over time would reveal more significant QTL and provide a better link to 'surrogates' such as mRNA abundance, especially if the latter was derived from specific and relevant cell types. As an example, endosperm modification is a key barley quality trait central to both malting and distilling. We mapped endosperm modification as the area ratio of endosperm stained with calcuflor to the unstained area. Calcuflor stains polymeric 1,3–1,4 -beta glucans which are important barley cell wall constituents and their amount decreases when the cell walls are broken down by cellulytic enzymes. The collection of calcuflor staining data on a population of plants over time is an eminently feasible experiment and would allow endosperm modification to be considered as a dynamic trait with the obvious potential of revealing novel QTL controlling biochemical processes activated during germination.

The object models underlying GeneNetwork have been designed for handling data linked to a well established, stable sequencing data that for the mouse have been available for years. For barley and other less thoroughly researched species this is still in a distant future. This is viewed as a major hindrance for high level genetical genomics analysis by many researchers. However, we were able to integrate barley data in the software designed for mouse without any changes to the software itself and just minor adjustments to the existing barley data. This suggests that software that is designed according to the nature of the biological object can be easily adopted to work with objects of the same kind but lacking some essential property values. Therefore the lack of sequence shouldn't be an obstacle for genetical genomics analysis. By integrating datasets from an unsequenced crop plant (barley) in a database that has been designed for an animal model species (mouse) with well established genome sequence, we prove the importance of the concept and practice of modular development and interoperability of software engineering for biological data sets.

Linking barley data in the GeneNetwork to other relevant genomic resources, such as the Barley SNP Database (SNPDb) [52], Harvest [53], BarleyBase (within PLEXdb) [54], GrainGenes [55] and Gramene [56] will significantly enhance the interpretation of the molecular basis of higher order phenotypes in barley. The success of this implementation largely depends on the development of flexible and streamlined data processing and submission procedures that can handle heterogeneous data types and provide efficient cross-referencing. XML-based technologies seem well suited to handle this [57].

Conclusion

By integrating barley genotypic, phenotypic and mRNA abundance data sets directly within GeneNetwork's analytical environment we provide simple web access to the data for the research community. In this environment, a combination of correlation analysis and linkage mapping provides the potential to identify and substantiate gene targets for saturation mapping and positional cloning. By integrating datasets from an unsequenced crop plant (barley) in a database that has been designed for an animal model species (mouse) with well established genome sequence, we prove the importance of the concept and practice of modular development and interoperability of software engineering for biological data sets.

Availability and requirements

GeneNetwork usage conditions and limitations are available from here [58]. Online tutorial accompanying this manuscript can be either viewed or downloaded from the [59].

References

Damerval C, Maurice A, Josse JM, De Vienne D: Quantitative trait loci underlying gene product variation: a novel perspective for analyzing regulation of genome expression. Genetics. 1994, 137: 289-301.
PubMed Central CAS PubMed Google Scholar
Brem RB, Yvert G, Clinton R, Kruglyak L: Genetic dissection of transcriptional regulation in budding yeast. Science. 2002, 296: 752-755. 10.1126/science.1069516.
Article CAS PubMed Google Scholar
Schadt EE, Monks SA, Drake TA, Lusis AJ, Che N, Colinayo V, Ruff TG, Milligan SB, Lamb JR, Cavet G, Linsley PS, Mao M, Stoughton RB, Friend SH: Genetics of gene expression surveyed in maize, mouse and man. Nature. 2003, 422: 297-302. 10.1038/nature01434.
Article CAS PubMed Google Scholar
Chesler EJ, Wang J, Lu L, Qu Y, Manly KF, Williams RW: Genetic correlates of gene expression in recombinant inbred strains: a relational model system to explore neurobehavioral phenotypes. Neuroinformatics. 2003, 1: 343-357. 10.1385/NI:1:4:343.
Article PubMed Google Scholar
Schadt EE, Lamb J, Yang X, Zhu J, Edwards S, Guhathakurta D, Sieberts SK, Monks S, Reitman M, Zhang C, Lum PY, Leonardson A, Thieringer R, Metzger JM, Yang L, Castle J, Zhu H, Kash SF, Drake TA, Sachs A, Lusis AJ: An integrative genomics approach to infer causal associations between gene expression and disease. Nat Genet. 2005, 37: 710-717. 10.1038/ng1589.
Article PubMed Central CAS PubMed Google Scholar
Shi C, Uzarowska A, Ouzunova M, Landbeck M, Wenzel G, Lubberstedt T: Identification of candidate genes associated with cell wall digestibility and eQTL (expression quantitative trait loci) analysis in a Flint × Flint maize recombinant inbred line population. BMC Genomics. 2007, 8: 22-10.1186/1471-2164-8-22.
Article PubMed Central PubMed Google Scholar
Decook R, Lall S, Nettleton D, Howell SH: Genetic regulation of gene expression during shoot development in Arabidopsis. Genetics. 2005, 172: 1155-1164. 10.1534/genetics.105.042275.
Article PubMed Google Scholar
West MA, Kim K, Kliebenstein DJ, van Leeuwen H, Michelmore RW, Doerge RW, St Clair DA: Global eQTL Mapping Reveals the Complex Genetic Architecture of Transcript Level Variation in Arabidopsis. Genetics. 2006, 175: 1441-1450. 10.1534/genetics.106.064972.
Article PubMed Google Scholar
Kirst M, Myburg AA, De Leon JP, Kirst ME, Scott J, Sederoff R: Coordinated genetic regulation of growth and lignin revealed by quantitative trait locus analysis of cDNA microarray data in an interspecific backcross of eucalyptus. Plant Physiol. 2004, 135: 2368-2378. 10.1104/pp.103.037960.
Article PubMed Central CAS PubMed Google Scholar
Kirst M, Basten CJ, Myburg AA, Zeng ZB, Sederoff RR: Genetic architecture of transcript-level variation in differentiating xylem of a eucalyptus hybrid. Genetics. 2005, 169: 2295-2303. 10.1534/genetics.104.039198.
Article PubMed Central CAS PubMed Google Scholar
Street NR, Skogstrom O, Sjodin A, Tucker J, Rodriguez-Acosta M, Nilsson P, Jansson S, Taylor G: The genetics and genomics of the drought response in Populus. Plant J. 2006, 48: 321-341. 10.1111/j.1365-313X.2006.02864.x.
Article CAS PubMed Google Scholar
Jordan MC, Somers DJ, Banks TW: Identifying regions of the wheat genome controlling seed development by mapping expression quantitative trait loci. Plant Biotechnol J. 2007, 5: 442-453. 10.1111/j.1467-7652.2007.00253.x.
Article CAS PubMed Google Scholar
Potokina E, Druka A, Luo ZW, Wise R, Waugh R, Kearsey M: eQTL analysis of 16,000 barley genes reveals a complex pattern of genome wide transcriptional regulation. Plant J. 2007, 53: 90-101.
Article PubMed Google Scholar
Donohue TJ, Thomas CM: Policy proposal for publication of papers with data sets from genome-wide studies. Microbiology. 2004, 150: 3521-3522. 10.1099/mic.0.27635-0.
Article PubMed Central CAS PubMed Google Scholar
Bhalla R, Narasimhan K, Swarup S: Metabolomics and its role in understanding cellular responses in plants. Plant Cell Rep. 2005, 24: 562-571. 10.1007/s00299-005-0054-9.
Article CAS PubMed Google Scholar
Dalma-Weiszhausz DD, Chicurel ME, Gingeras TR: Microarrays and genetic epidemiology: a multipurpose tool for a multifaceted field. Genet Epidemiol. 2002, 23: 4-20. 10.1002/gepi.216.
Article PubMed Google Scholar
Luo ZW, Potokina E, Druka A, Wise R, Waugh R, Kearsey M: SFP genotyping from Affymetrix arrays is robust but largely detects cis-acting expression regulators. Genetics. 2007, 176: 789-800. 10.1534/genetics.106.067843.
Article PubMed Central CAS PubMed Google Scholar
Chesler EJ, Lu L, Wang J, Williams RW, Manly KF: WebQTL: rapid exploratory analysis of gene expression and genetic networks for brain and behaviour. Nat Neurosci. 2004, 7: 485-486. 10.1038/nn0504-485.
Article CAS PubMed Google Scholar
Wang J, Williams RW, Manly KF: WebQTL: web-based complex trait analysis. Neuroinformatics. 2003, 1: 299-308. 10.1385/NI:1:4:299.
Article PubMed Google Scholar
Schema and description of the database underlying GeneNetwork. [http://genenetwork.org/cgi-bin/schema.py]
Kleinhofs A, Kilian A, Saghai Maroof MA, Biyashev RM, Hayes P, Chen FQ, Lapitan N, Fenwick A, Blake TK, Kanazin V, Ananiev E, Dahleen L, Kudrna D, Bollinger J, Knapp SJ, Liu B, Sorrells M, Heun M, Franckowiak JD, Hoffman D, Skadsen R, Steffenson BJ: A molecular, isozyme and morphological map of the barley (Hordeum vulgare) genome. Theor Appl Genet. 1993, 86: 705-712. 10.1007/BF00222660.
Article CAS PubMed Google Scholar
Hayes P, Liu BH, Knapp SJ, Chen F, Jones B, Blake T, Franckowiak JD, Rasmusson D, Sorrells M, Ullrich SE, Wesenberg DM, Kleinhofs A: Quantitative trait locus effects and environmental interaction in a sample of North American barley germplasm. Theor Appl Genet. 1993, 87: 392-401. 10.1007/BF01184929.
Article CAS PubMed Google Scholar
Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, Speed TP: Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 2003, 31: e15-10.1093/nar/gng015.
Article PubMed Central PubMed Google Scholar
Association Genetics of UK Elite Barley (AGOUEB). [http://www.agoueb.org]
Coordinated Agricultural Project (CAP). [http://barleycap.coafes.umn.edu]
Hayes P, Szucs P: Disequilibrium and association in barley: thinking outside the glass. Proc Natl Acad Sci USA. 2006, 103: 18385-18386. 10.1073/pnas.0609405103.
Article PubMed Central CAS PubMed Google Scholar
Rostoks N, Ramsay L, MacKenzie K, Cardle L, Bhat PR, Roose ML, Svensson JT, Stein N, Varshney RK, Marshall DF, Graner A, Close TJ, Waugh R: Recent history of artificial outcrossing facilitates whole-genome association mapping in elite inbred crop varieties. Proc Natl Acad Sci USA. 2006, 103: 18656-18661. 10.1073/pnas.0606133103.
Article PubMed Central CAS PubMed Google Scholar
Manly KF, Olson JM: Overview of QTL mapping software and introduction to map manager QT. Mamm Genome. 1999, 10: 327-334. 10.1007/s003359900997.
Article CAS PubMed Google Scholar
Wu RL, Ma CX, Casella G: Statistical Genetics of Quantitative Traits: Linkage, Maps, and QTL. 2007, Springer-Verlag, New York
Google Scholar
Doerge RW: Mapping and analysis of quantitative trait loci in experimental populations. Nat Rev Genet. 2002, 3: 43-52. 10.1038/nrg703.
Article CAS PubMed Google Scholar
Churchill GA, Doerge RW: Empirical threshold values for quantitative trait mapping. Genetics. 1994, 138: 963-971.
PubMed Central CAS PubMed Google Scholar
Bennewitz J, Reinsch N, Kalm E: Improved confidence intervals in quantitative trait loci mapping by permutation bootstrapping. Genetics. 2002, 160: 1673-1686.
PubMed Central CAS PubMed Google Scholar
Visscher PM, Thompson R, Haley CS: Confidence intervals in QTL mapping by bootstrapping. Genetics. 1996, 143: 1013-1020.
PubMed Central CAS PubMed Google Scholar
Magwene PM, Kim J: Estimating genomic coexpression networks using first-order conditional independence. Genome Biol. 2004, 5: R100-10.1186/gb-2004-5-12-r100.
Article PubMed Central PubMed Google Scholar
Yu T, Sun W, Yuan S, Li KC: Study of coordinative gene expression at the biological process level. Bioinformatics. 2005, 21: 3651-3657. 10.1093/bioinformatics/bti599.
Article CAS PubMed Google Scholar
Myers CL, Robson D, Wible A, Hibbs MA, Chiriac C, Theesfeld CL, Dolinski K, Troyanskaya OG: Discovery of biological networks from diverse functional genomic data. Genome Biol. 2005, 6: R114-10.1186/gb-2005-6-13-r114.
Article PubMed Central PubMed Google Scholar
List of papers describing use or referencing GeneNetwork. [http://genenetwork.org/reference.html]
Ma CX, Casella G, Wu RL: Functional mapping of quantitative trait loci underlying the character process: A theoretical framework. Genetics. 2002, 161: 1751-1762.
PubMed Central PubMed Google Scholar
Wu RL, Lin M: Functional mapping – how to map and study the genetic architecture of dynamic complex traits. Nat Rev Genet. 2006, 7: 229-237. 10.1038/nrg1804.
Article CAS PubMed Google Scholar
Wu RL, Ma CX, Hou W, Corva P, Medrano JF: Functional mapping of quantitative trait loci that interact with the hg gene to regulate growth trajectories in mice. Genetics. 2005, 171: 239-249. 10.1534/genetics.104.040162.
Article PubMed Central CAS PubMed Google Scholar
Mauricio R: Mapping quantitative trait loci in plants: uses and caveats for evolutionary biology. Nature Rev Genet. 2001, 2: 370-381. 10.1038/35072085.
Article CAS PubMed Google Scholar
Anholt RR, Mackay TFC: Quantitative genetic analyses of complex behaviours in Drosophila. Nature Rev Genet. 2004, 5: 838-849. 10.1038/nrg1472.
Article CAS PubMed Google Scholar
Ambros V: Control of developmental timing in Caenorhabditis elegans. Curr Opin Genet Dev. 2000, 10: 428-33. 10.1016/S0959-437X(00)00108-8.
Article CAS PubMed Google Scholar
Rougvie AE: Control of developmental timing in animals. Nature Rev Genet. 2001, 2: 690-701. 10.1038/35088566.
Article CAS PubMed Google Scholar
Wang ZH, Wu RL: A statistical model for high resolution mapping of quantitative trait loci determining human HIV-1 dynamics. Stat Med. 2004, 23: 3033-3051. 10.1002/sim.1870.
Article PubMed Google Scholar
Perelson AS, Neumann AU, Markowitz M, Leonard JM, Ho DD: HIV-1 dynamics in vivo: Virion clearance rate, infected cell life-span, and viral generation time. Science. 1996, 271: 1582-1586. 10.1126/science.271.5255.1582.
Article CAS PubMed Google Scholar
Druka A, Muehlbauer G, Druka I, Caldo R, Baumann U, Rostoks N, Schreiber A, Wise R, Close T, Kleinhofs A, Graner A, Schulman A, Langridge P, Sato K, Hayes P, McNicol J, Marshall D, Waugh R: An atlas of gene expression from seed to seed through barley development. Functional Integrative Genomics. 2006, 6: 202-211. 10.1007/s10142-006-0025-4.
Article CAS PubMed Google Scholar
Druka A, Potokina E, Luo Z, Bonar N, Druka I, Zhang L, Marshall DF, Steffenson BJ, Close TJ, Wise RP, Kleinhofs A, Williams RW, Kearsey MJ, Waugh R: Exploiting regulatory variation to identify genes underlying quantitative resistance to the wheat stem rust pathogen Puccinia graminis f. sp. tritici in barley. Theoretical and Applied Genetics. 2008, 117: 261-72. 10.1007/s00122-008-0771-x.
Article CAS PubMed Google Scholar
Caldo RA, Nettleton D, Peng J, Wise RP: Stage-specific suppression of basal defense discriminates barley plants containing fast- and delayed-acting Mla powdery mildew resistance alleles. Mol Plant Microbe Interact. 2006, 19: 939-47. 10.1094/MPMI-19-0939.
Article CAS PubMed Google Scholar
Caldo RA, Nettleton D, Wise RP: Interaction-dependent gene expression in Mla-specified response to barley powdery mildew. Plant Cell. 2004, 16: 2514-28. 10.1105/tpc.104.023382.
Article PubMed Central CAS PubMed Google Scholar
Zhang L, Castell-Miller C, Dahl S, Steffenson B, Kleinhofs A: Parallel expression profiling of barley-stem rust interactions. Funct Integr Genomics. 2008, 8: 187-98. 10.1007/s10142-007-0069-0.
Article CAS PubMed Google Scholar
Rostoks N, Mudie S, Cardle L, Russell J, Ramsay L, Booth A, Svensson JT, Wanamaker SI, Walia H, Rodriguez EM, Hedley PE, Liu H, Morris J, Close TJ, Marshall DF, Waugh R: Genome-wide SNP discovery and linkage analysis in barley based on genes responsive to abiotic stress. Mol Genet Genomics. 2005, 274: 515-527. 10.1007/s00438-005-0046-z.
Article CAS PubMed Google Scholar
Zheng J, Close TJ, Jiang T, Lonardi S: Efficient selection of unique and popular oligos for large EST databases. Bioinformatics. 2004, 20: 2101-2112. 10.1093/bioinformatics/bth210.
Article CAS PubMed Google Scholar
Shen L, Gong J, Caldo RA, Nettleton D, Cook D, Wise RP, Dickerson JA: BarleyBase–an expression profiling database for plant genomics. Nucleic Acids Res. 2005, 33: D614-D618. 10.1093/nar/gki123.
Article PubMed Central CAS PubMed Google Scholar
Matthews DE, Carollo VL, Lazo GR, Anderson OD: GrainGenes, the genome database for small-grain crops. Nucleic Acids Res. 2003, 31: 183-186. 10.1093/nar/gkg058.
Article PubMed Central CAS PubMed Google Scholar
Ware D, Jaiswal P, Ni J, Pan X, Chang K, Clark K, Teytelman L, Schmidt S, Zhao W, Cartinhour S, McCouch S, Stein L: Gramene: a resource for comparative grass genomics. Nucleic Acids Res. 2002, 30: 103-105. 10.1093/nar/30.1.103.
Article PubMed Central CAS PubMed Google Scholar
Druka I: Molecular Biology Data Exchange and Visualization with XML Technology. MSc Thesis. 2007, University of Abertay, School of Computing and Creative Technologies
Google Scholar
GeneNetwork Usage Conditions and Limitations. [http://genenetwork.org/conditionsofUse.html]
Online tutorial written specifically for barley component of the GeneNetwork. [http://barleygenetics.net/GN_barley_tutorial.html]
Marcel TC, Varshney RK, Barbieri M, Jafary H, de Kock MJ, Graner A, Niks RE: A high-density consensus map of barley to compare the distribution of QTL for partial resistance to Puccinia hordei and of defence gene homologues. Theor Appl Genet. 2007, 114: 487-500. 10.1007/s00122-006-0448-2.
Article CAS PubMed Google Scholar
Friesen TL, Faris JD, Lai Z, Steffenson BJ: Identification and chromosomal location of major genes for resistance to Pyrenophora teres in a doubled-haploid barley population. Genome. 2006, 49: 855-859. 10.1139/G06-024.
Article CAS PubMed Google Scholar
Bilgic H, Steffenson BJ, Hayes PM: Comprehensive genetic analyses reveal differential expression of spot blotch resistance in four populations of barley. Theor Appl Genet. 2005, 111: 1238-1250. 10.1007/s00122-005-0023-2.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by the BBSRC/SEERAD response mode grant SCR/910/04 to Robbie Waugh and Mike Kearsey. GeneNetwork is funded by the NIH (U01AA13499, P20-DA 21131, U01CA105417, and U24 RR021760). Genotyping was funded by the NSF Plant Genome Research Program grant DBI-0321756.

Thanks to Kazuhiro Sato at the Barley Germplasm Center, Okayama University for providing the Steptoe × Morex population for the grain morphometric analysis. Beth Tacke and Howard Casper are acknowledged for the DON tests. The authors are also thankful to an anonymous reviewer for illustrating the potential of functional mapping for efficiently establishing associations between existing QTL, as well as for novel QTL discovery.

Author information

Authors and Affiliations

Genetics Programme, Scottish Crop Research Institute, Invergowrie, Dundee, DD2 5DA, UK
Arnis Druka, Ilze Druka, William TB Thomas, Nicola Bonar, David F Marshall & Robbie Waugh
School of Computing and Creative Technologies, University of Abertay, Dundee, DD1 1HG, UK
Ilze Druka
Department of Anatomy and Neurobiology, University of Tennessee, Memphis, TN, 38163, USA
Arthur G Centeno, Hongqiang Li, Zhaohui Sun & Robert W Williams
Department of Plant Pathology, University of Minnesota, St. Paul, MN, 55108, USA
Brian J Steffenson
Department of Crop and Soil Sciences, Washington State University, Pullman, WA99164, USA
Steven E Ullrich & Andris Kleinhofs
Corn Insects and Crop Genetics Research, USDA-ARS, Iowa State University, Ames, IA, 50011, USA
Roger P Wise
Department of Plant Pathology, Iowa State University, Ames, IA, 50011, USA
Roger P Wise
Department of Botany and Plant Sciences, University of California, Riverside, CA, 92521, USA
Timothy J Close
School of Biosciences, University of Birmingham, Birmingham, B15 2TT, UK
Elena Potokina, Zewei Luo & Michael J Kearsey
Department of Plant Breeding, Justus Liebig University Giessen, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
Carola Wagner
Institute for Crop Production and Plant Breeding, Dep. Genome Analysis, Bavarian State Research Center for Agriculture, Am Gereuth 6, 85354, Freising-Weihenstephan, Germany
Günther F Schweizer

Authors

Arnis Druka
View author publications
You can also search for this author in PubMed Google Scholar
Ilze Druka
View author publications
You can also search for this author in PubMed Google Scholar
Arthur G Centeno
View author publications
You can also search for this author in PubMed Google Scholar
Hongqiang Li
View author publications
You can also search for this author in PubMed Google Scholar
Zhaohui Sun
View author publications
You can also search for this author in PubMed Google Scholar
William TB Thomas
View author publications
You can also search for this author in PubMed Google Scholar
Nicola Bonar
View author publications
You can also search for this author in PubMed Google Scholar
Brian J Steffenson
View author publications
You can also search for this author in PubMed Google Scholar
Steven E Ullrich
View author publications
You can also search for this author in PubMed Google Scholar
Andris Kleinhofs
View author publications
You can also search for this author in PubMed Google Scholar
Roger P Wise
View author publications
You can also search for this author in PubMed Google Scholar
Timothy J Close
View author publications
You can also search for this author in PubMed Google Scholar
Elena Potokina
View author publications
You can also search for this author in PubMed Google Scholar
Zewei Luo
View author publications
You can also search for this author in PubMed Google Scholar
Carola Wagner
View author publications
You can also search for this author in PubMed Google Scholar
Günther F Schweizer
View author publications
You can also search for this author in PubMed Google Scholar
David F Marshall
View author publications
You can also search for this author in PubMed Google Scholar
Michael J Kearsey
View author publications
You can also search for this author in PubMed Google Scholar
Robert W Williams
View author publications
You can also search for this author in PubMed Google Scholar
Robbie Waugh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Robert W Williams or Robbie Waugh.

Additional information

Authors' contributions

AD concept, phenotyping, data entry, drafting the manuscript; ID concept, data processing, data entry, image analysis, writing; AGC data processing, data entry; HL data processing, data entry; ZS data processing, data entry; WTT phenotyping; NB SNP mapping, phenotyping; BJS phenotyping; SEU phenotyping; AK RFLP mapping; RPW RNA labelling, GeneChip processing; TJC SNP Illumina genotyping; EP QTL mapping; ZL SFP detection; CW phenotyping; GFS phenotyping; DFM rice-barley comparison; MJK project PI; concept; RWW concept, data processing, data entry; RW project PI, concept, writing.

Electronic supplementary material

12863_2008_639_MOESM1_ESM.txt

Additional file 1: Table S1. Inference of mRNA abundance regulation by cis-elements or trans-factors. This is a tab delimited table, the first row contains column headings. 'Cosegregating marker' – DNA marker ID that co-segregates with the gene underlying the 'Probe set'. 'Probe Set' – Affymetrix' Barley1 GeneChip probe set ID. 'DNA marker chromosome' and 'DNA marker position' – 'Cosegregating marker' locus parameters. 'mRNA abundance QTL chromosome', 'mRNA abundance QTL position' and 'LRS' – mRNA abundance QTL parameters of the gene underlying 'Probe set'. LRS – Likelihood Ratio Statistic. 'cis/trans' – c – cis, t – trans, inference on cis- or trans regulation of the gene underlying 'Probe set'. (TXT 16 KB)

12863_2008_639_MOESM2_ESM.txt

Additional file 2: Table S2. The Barley1 GeneChip probe sets that report significant mRNA abundance QTL and have rice homologs for the underlying genes. This is a tab delimited table, the first row represents column headings. 'ProbeSet' – Affymetrix' Barley1 GeneChip probe set IDs. 'LRS' – LRS (Likelihood Ratio Statistic) of the mRNA abundance QTL reported by the 'ProbeSet'. 'LRS_range' – subdivision of the LRS into 'low' and 'high' groups. 'Locus' and 'barley chromosome/marker' – location parameters of the mRNA abundance QTL. 'Rice Chr' and 'Rice 5" – location parameters of the rice homologs. (TXT 543 KB)

12863_2008_639_MOESM3_ESM.txt

Additional file 3: Table S3. Principal Components scores of the key traits that segregate in the SM population. This is a tab delimited file, first row represents column headings. F1–F4 represent factors of individual traits. (TXT 25 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Druka, A., Druka, I., Centeno, A.G. et al. Towards systems genetic analyses in barley: Integration of phenotypic, expression and genotype data into GeneNetwork. BMC Genet 9, 73 (2008). https://doi.org/10.1186/1471-2156-9-73

Download citation

Received: 25 April 2008
Accepted: 18 November 2008
Published: 18 November 2008
DOI: https://doi.org/10.1186/1471-2156-9-73

Towards systems genetic analyses in barley: Integration of phenotypic, expression and genotype data into GeneNetwork