Table 1
|
|
Glossary of WGCNA Terminology.
|
|
Term
|
Definition
|
|
|
Co-expression network
|
We define co-expression networks as undirected, weighted gene networks. The nodes
of such a network correspond to gene expression profiles, and edges between genes
are determined by the pairwise correlations between gene expressions. By raising the
absolute value of the correlation to a power β ≥ 1 (soft thresholding), the weighted gene co-expression network construction emphasizes
high correlations at the expense of low correlations. Specifically, aij = |cor(xi, xj)|β represents the adjacency of an unsigned network. Optionally, the user can also specify
a signed co-expression network where the adjacency is defined as aij = |(1 + cor(xi, xj))/2|β.
|
|
|
Module
|
Modules are clusters of highly interconnected genes. In an unsigned co-expression
network, modules correspond to clusters of genes with high absolute correlations.
In a signed network, modules correspond to positively correlated genes.
|
|
|
Connectivity
|
For each gene, the connectivity (also known as degree) is defined as the sum of connection
strengths with the other network genes: ki = ∑u≠iaui. In co-expression networks, the connectivity measures how correlated a gene is with
all other network genes.
|
|
|
Intramodular connectivity kIM
|
Intramodular connectivity measures how connected, or co-expressed, a given gene is
with respect to the genes of a particular module. The intramodular connectivity may
be interpreted as a measure of module membership.
|
|
|
Module eigengene E
|
The module eigengene E is defined as the first principal component of a given module. It can be considered
a representative of the gene expression profiles in a module.
|
|
|
Eigengene significance
|
When a microarray sample trait y is available (e.g. case control status or body weight), one can correlate the module
eigengenes with this outcome. The correlation coefficient is referred to as eigengene
significance.
|
|
|
Module Membership, also known as eigengene-based connectivity kME
|
For each gene, we define a "fuzzy" measure of module membership by correlating its
gene expression profile with the module eigengene of a given module. For example,
MMblue(i) = = cor(xi, Eblue) measures how correlated gene i is to the blue module eigengene. MMblue(i) measures the membership of the i-th gene with respect to the blue module. If MMblue(i) is close to 0, the i-th gene is not part of the blue module. On the other hand, if MMblue(i) is close to 1 or -1, it is highly connected to the blue module genes. The sign of
module membership encodes whether the gene has a positive or a negative relationship
with the blue module eigengene. The module membership measure can be defined for all
input genes (irrespective of their original module membership). It turns out that
the module membership measure is highly related to the intramodular connectivity kIM. Highly connected intramodular hub genes tend to have high module membership values
to the respective module.
|
|
|
Hub gene
|
This loosely defined term is used as an abbreviation of "highly connected gene." By
definition, genes inside co-expression modules tend to have high connectivity.
|
|
|
Gene significance GS
|
To incorporate external information into the co-expression network, we make use of
gene significance measures. Abstractly speaking, the higher the absolute value of
GSi, the more biologically significant is the i-th gene. For example, GSi could encode pathway membership (e.g. 1 if the gene is a known apoptosis gene and
0 otherwise), knockout essentiality, or the correlation with an external microarray
sample trait. A gene significance measure could also be defined by minus log of a
p-value. The only requirement is that gene significance of 0 indicates that the gene
is not significant with regard to the biological question of interest. The gene significance
can take on positive or negative values.
|
|
|
Module significance
|
Module significance is determined as the average absolute gene significance measure
for all genes in a given module. When gene significance is defined as the correlation
of gene expression profiles with an external trait y, this measure tends to be highly related to the correlation between the module eigengene
and y.
|
|
|
Langfelder and Horvath BMC Bioinformatics 2008 9:559 doi:10.1186/1471-2105-9-559
|