Abstract
Background
A key challenge in metabonomics is to uncover quantitative associations between multidimensional spectroscopic data and biochemical measures used for disease risk assessment and diagnostics. Here we focus on clinically relevant estimation of lipoprotein lipids by ^{1}H NMR spectroscopy of serum.
Results
A Bayesian methodology, with a biochemical motivation, is presented for a real ^{1}H NMR metabonomics data set of 75 serum samples. Lipoprotein lipid concentrations were independently obtained for these samples via ultracentrifugation and specific biochemical assays. The Bayesian models were constructed by Markov chain Monte Carlo (MCMC) and they showed remarkably good quantitative performance, the predictive Rvalues being 0.985 for the very low density lipoprotein triglycerides (VLDLTG), 0.787 for the intermediate, 0.943 for the low, and 0.933 for the high density lipoprotein cholesterol (IDLC, LDLC and HDLC, respectively). The modelling produced a kernelbased reformulation of the data, the parameters of which coincided with the wellknown biochemical characteristics of the ^{1}H NMR spectra; particularly for VLDLTG and HDLC the Bayesian methodology was able to clearly identify the most characteristic resonances within the heavily overlapping information in the spectra. For IDLC and LDLC the resulting model kernels were more complex than those for VLDLTG and HDLC, probably reflecting the severe overlap of the IDL and LDL resonances in the ^{1}H NMR spectra.
Conclusion
The systematic use of Bayesian MCMC analysis is computationally demanding. Nevertheless, the combination of highquality quantification and the biochemical rationale of the resulting models is expected to be useful in the field of metabonomics.
Background
Genomics is increasingly complemented by metabonomics – the quantitative measurement of the timerelated multiparametric metabolic responses of multicellular systems to (patho)physiological stimuli or genetic modification [1]. Mass spectrometry and nuclear magnetic resonance (NMR) spectroscopy have become the two key technologies in the metabonomic field [2]. An appealing feature of NMR spectroscopy for metabonomic applications is its specific yet nonselective nature: proton (^{1}H) NMR can efficiently produce information on a large number of metabolites in biological samples like human serum. The abundance of protons and the inherently narrow as well as heterogeneous chemical shift range of ^{1}H NMR results in highly informative spectra that contain heavily overlapping resonances [3].
Recently, a call for applying ^{1}H NMR metabonomics to facilitate disease risk assessment and clinical diagnostics has emerged [1,2,48]. A key issue in bringing metabonomics for clinical use will be to bridge the gap between biochemistry – as revealed by ^{1}H NMR spectroscopy – and the relevant measures of current clinical practice. In a ^{1}H NMR spectrum, one metabolite can manifest several peaks, and the spectral intensities are both biochemically and (patho)physiologically related. Furthermore, the data sets are extensive but redundant: one measurement can yield tens of thousands of data points, but the effective dimensionality is much less due to a smaller number of NMRvisible compounds. Consequently, there are methodological challenges in trying to quantitatively associate ^{1}H NMR metabonomics data to relevant biochemical variables as well as to understand and visualise the underlying metabolic features that relate to various biomedical applications [8].
A key clinical application of ^{1}H NMR spectroscopy is to quantify lipoprotein lipids directly from plasma or serum samples [3,7,913]. One of the strategic reasons to use ^{1}H NMR to study lipoproteins is the avoidance of their tedious physical isolation from plasma via repetitive ultracentrifugations and thus the consequent potential to analyse extensive clinical data sets beyond current biochemical methodologies. Various ^{1}H NMR spectroscopy applications have focused on the main lipoprotein fractions, namely very low, intermediate, low and high density lipoproteins (VLDL, IDL, LDL and HDL, respectively), since these relate to general clinical guidelines to assess an individual's risk for atherosclerosis [3,6,12]. Interestingly, one of the advanced methods, already in clinical use, to determine plasma lipoproteins is a commercial ^{1}H NMR based assay named NMR LipoProfile^{® }by LipoScience Inc [13]. Thus, ^{1}H NMR spectroscopy and metabonomics of serum provides an extensively studied and demonstrative case of complex overlapping resonances with wellknown biochemical rationale and spectral characteristics [3,6,7,913].
Biomedical research relies heavily on the statistical analysis of empirical findings and extrapolation from limited sample sets to larger populations. Currently, hypothesis testing with preselected parametric formulations is the prevailing technique and statistical uncertainty is expressed indirectly by comparing the observations to a given null hypothesis. In multidimensional applications such as ^{1}H NMR metabonomics the null hypothesis is obtainable only for the simplest formulations, which are often inadequate to describe the data efficiently. In contrast, Bayesian theory [14,15] explicitly incorporates uncertainty in the form of probability distributions, hence the null hypothesis is no longer required as the reference point. Furthermore, the parametric formulations need not be preselected heuristically, but can be included in the modelling process itself. Hence, the analysis becomes more dependent on the data and prior knowledge, and less dependent on arbitrary practical restrictions such as analytical tractability. However, applications of Bayesian methodology in NMR spectroscopy are sparse [1618], perhaps due to the lack of computing power until recent years. A Bayesian spectral decomposition has produced promising results for metabonomic NMR data [19] but, to our knowledge, this is the first biomedical application of Bayesian inference on spectral quantification with special modelling emphasis on the metabolic rationale.
Thus, this work has two key objectives to establish. First, to quantify broad overlapping resonances from ^{1}H NMR spectra of serum using specific Bayesian models, and, second, to relate the resulting model kernels to the known biochemical characteristics of the spectra. This study focuses on a clinically significant application of ^{1}H NMR spectroscopy of serum for quantifying lipoprotein lipid concentrations used for the assessment of individuals' risk for coronary heart disease. A set of biochemically characterised serum samples, for which VLDL and IDL triglycerides (VLDLTG and IDLTG, respectively) as well as IDL, LDL and HDL cholesterol (IDLC, LDLC and HDLC, respectively) concentrations are independently measured, is the origin for the ^{1}H NMR spectra. A Markov chain Monte Carlo (MCMC) in Bayesian inference is used to set up quantitative models based on these ^{1}H NMR spectra and to automatically define the number and locations of Gaussian kernels to indicate the spectral features corresponding to each biochemical variable.
Methods
Serum samples and biochemical lipoprotein lipid analysis
The serum samples and the biochemical lipoprotein lipid analyses were available from 75 individuals, representing a wide range of plasma lipoprotein lipid concentrations. The blood samples were drawn after an overnight fast of 12 hours into EDTAcontaining tubes or tubes without anticoagulant for serum separation. Blood samples for serum separation were incubated at room temperature for 30 min prior to centrifugation. Serum and plasma were separated by centrifugation at 1200 g–1500 g for 10–15 min at 4°C. The main lipoprotein fractions were isolated from plasma by sequential ultracentrifugation using density ranges of ≤ 1.006 g/ml, 1.006–1.019 g/ml, 1.019–1.063 g/ml, and 1.063–1.210 g/ml for VLDL, IDL, LDL, and HDL, respectively [20]. Cholesterol and triglyceride concentrations in lipoproteins were determined with Specific Chemistry Analyser (Kone, Finland) using enzymatic colorimetric methods (kits by Boehringer Diagnostica, Mannheim GmbH, FRG) and expressed as mmol/l plasma.
^{1}H NMR spectroscopy
The ^{1}H NMR data were recorded at the physiological temperature of 310 K on a Bruker AVANCE spectrometer operating at 500.13 MHz equipped with a 5 mm BBI probehead. A double tube system facilitating absolute metabolite quantification was used [8,10]. The external reference tube (o.d. 2 mm, supported by a Teflon adapter) containing the reference substance (sodium 3trimethylsilyl[2,2,3,3d_{4}]propionate (TSP) 40 mmol/l, MnSO_{4 }0.6 mmol/l in 99.8% D_{2}O) was placed coaxially into the NMR sample tube (o.d. 5 mm) containing 430 μl of each sample. No water suppression was used and 128 transients were collected with a 90 degree flip angle using a spectral width of 5252 Hz and 64 k data points. Acquisition time of 6.2 s and a relaxation delay of 0.1 s were used. Prior to Fourier transformation, the measured free induction decays were zero filled and multiplied by an exponential window function with a linebroadening of 1.0 Hz. The PERCH NMR software was used for preprocessing the data [21]. The metabolite intensities in each spectrum were scaled according to the corresponding TSP reference signal before the Bayesian analyses.
Bayesian spectral analysis
The aliphatic regions of the ^{1}H NMR spectra (from 0.40 to 3.30 ppm; 18 093 data points) were analysed from the serum samples of those individuals that had the lipoprotein lipid concentrations for VLDLTG, IDLTG, IDLC, LDLC and HDLC available. The biochemical assays for these lipid variables and the ^{1}H NMR spectra are physically independent. Thus, by modelling the quantitative relation between the ^{1}H NMR metabonomics data and the clinical variables, the concentrations of these lipid fractions can be estimated from the serum spectra alone. A separate Bayesian model was constructed for VLDLTG, IDLTG, IDLC, LDLC and HDLC.
One could assume that all the data points are independent, but clearly this is not true just by looking at the smooth spectral curves. In addition, such assumption would lead to unnecessary methodological problems [14]. Here, the spectroscopic fact that adjacent data points are strongly correlated is not ignored but an unknown and nonconstant correlation length is allowed. This is achieved by representing adjacent points collectively through a Gaussian kernel with a given width and location. Specifically, the dot product between the spectral intensity vector of a sample and the Gaussian density function (truncated at 3σ) represents the value of the corresponding kernel. The minimum width was constrained to fulfil the known molecular characteristics in the NMR spectra [3,8], that is, kernel widths larger than 4 Hz were favoured.
Based on the application specific knowledge, it was reasonable to assume that a linear model of the target variables and kernel outputs was appropriate [7,10]. This does not imply a fully linear model, since the mapping from the raw spectra to the kernel space is nonlinear, especially since the kernel number is among the targets of the inference. In addition to the kernels, the mean level of each spectrum was used as a covariate. Student's tdistribution was preferred over the Gaussian distribution as a more robust residual model and the posterior inference was made by Markov chain Monte Carlo (MCMC) [14]. A useful property of our model specification is that marginal likelihoods, obtained by analytically integrating over the linear model weights, can be used to significantly improve the sampling quality [15]. Kernel locations and widths, and the degree of freedom for the residual model were inferred by slice sampling. In addition, the number of kernels was sampled by reversible jump MCMC in which the proposal distributions for new parameters were the corresponding prior distributions. The rest of the model parameters were updated using Gibbs' sampling with conditional distributions [14]. Interestingly, by allowing the selection of input variables to be among the targets of modelling, the effect of prior assumptions can be reduced if compared to conventional statistics. A Bayesian rationale and a brief mathematical formulation of the Bayesian modelling for the ^{1}H NMR spectra of serum are given in Additional file 1.
Additional File 1. The Bayesian rationale and mathematical formulation. The Bayesian rationale and a brief mathematical formulation of the quantitative Bayesian modelling for the ^{1}H NMR spectra of serum.
Format: PDF Size: 32KB Download file
This file can be viewed with: Adobe Acrobat Reader
An intuitive and practical consideration of the Bayesian methodology used here is as follows. First, the kernel outputs are computed, as specified by the locations and widths, i.e., the dot products of every kernel vector and every spectrum are computed. This generates a new n × k input matrix ϕ, where n is the number of spectra and k the number of kernels. These kernel features can now be connected to the target variable y through the (simplified) linear regression equation y = wϕ + ε, where w is the weight vector and ε represents the noise. To incorporate the uncertainties to the model, a sample of w is drawn according to the analytical posterior distribution (given current ϕ and noise distribution) instead of finding the algebraically optimal weight vector. Next, the shape parameters of the noise distribution are sampled in a similar fashion, given the current w and ϕ. Finally, the number of kernels k is changed (given current w, ϕ and noise distribution) to try if another number could produce better results. The above cycle is repeated until convergence seems stable and enough samples of the parameters have been obtained to construct histograms that serve as approximations of the posterior parameter distributions. Note that this is only a simplified account of the algorithm used (see Additional file 1 for a more detailed methodology).
In general, the distributions of all parameters converged fast and mixed well, although the number of kernels turned out to be somewhat slow in mixing. For a single MCMC run, reliable results are obtainable at 10000 iterations or one hour on a 1 GHz Alpha EV6.8CB processor. Before inference, the first 2000 iterations were discarded and only every 20th of the rest were included. Convergence was verified by comparing 10 independent chains. Replicates from the predictive distribution of the model were computed to serve as a test set in estimating the predictive performance. In a preliminary phase, a 10fold crossvalidation was used as a more robust strategy to check that the predictive replicate approach produced meaningful results. After the posterior distribution has been constructed, predictions for new spectra can be computed almost instantaneously.
The computations were performed using the MCMCstuff toolbox [22], which is a collection of Matlab functions for Bayesian inference by Markov chain Monte Carlo methods.
Results and discussion
Quantitative models
Figure 1 shows the results of the separate quantitative Bayesian models for the VLDLTG, IDLC, LDLC and HDLC. In Figure 1 correlation coefficients (R) between predictions and observations are shown to allow consistent comparison to previous studies ([3,12] and refs. therein). Since the same observations are used to update the posterior and to compute R, these correlation values are optimistic with regard to the predictive performance of the models for future data. Thus, we also estimated the predictive correlation coefficients (predictive R) by integrating over the uncertainty related to future observations; these predictive Rvalues then describe how well the models can predict the corresponding lipoprotein concentrations in the case of ^{1}H NMR spectra of new individuals. The predictive Rvalues/Rvalues are 0.985/0.990 (n = 75), 0.787/0.900 (n = 72), 0.943/0.983 (n = 72) and 0.933/0.959 (n = 67) for the VLDLTG, IDLC, LDLC and HDLC, respectively. It is notable that the predictive R calculated for IDLC clearly reflects the rather large uncertainties related to the experimental determinations of low concentrations. In general, these values represent excellent quantitative correspondence in this extensively studied complex application [3,6,7,913]. Particularly, in the case of IDLC and HDLC the current results appear slightly better than those previously reported (see [12] for a comparison for the quantitative performance of different methods used). Thus, these results verify the highquality quantification ability of the presented Bayesian MCMC approach in the case of broad overlapping resonances in the ^{1}H NMR spectra of serum. This is an important prerequisite to facilitate the assessment of the biochemical rationale of the Gaussian model kernels in relation to each lipoprotein fraction and the ^{1}H NMR spectroscopic characteristics of serum. The general quantification aspects of lipoprotein lipids using ^{1}H NMR spectroscopy of serum (or plasma) have been extensively handled in the literature [[3,6,7,913] and refs. therein] and will not be discussed in detail here.
Figure 1. The quantitative performance of the Bayesian models. The results of the Bayesian models for the VLDLTG (n = 75) (orange), IDLC (n = 72) (lime), LDLC (n = 72) (sherry) and HDLC (n = 67) (olive). The correlation coefficients (R) shown are between predictions and observations (for predictive Rvalues see Results and discussion). The straight black lines show a 1:1relationship and are drawn only to guide the eye.
We also tried to set up an equivalent Bayesian model to quantify IDLTG. However, a properly quantitative model was not achieved (a predictive Rvalue of only 0.608, n = 72; data not shown). Concerning quantification of IDLTG and IDLC somewhat varying results have been published [7,9,12,13]. The IDL fraction is included in the recently improved NMR LipoProfile^{® }method [23] and previously the application of neural network analysis have resulted in reasonable (semi)quantitative models [9,12]. It is also notable that all the IDLC and IDLTG concentrations for the current sample set are below 0.4 mmol/l. Thus, a likely reason to affect the modelling is the fact that biochemically measured concentrations below 0.5 mmol/l can contain experimental inaccuracies of several tens of percent [12]. Why the quantitative Bayesian model here for IDLC appears rather good but the model for IDLTG leads to more inaccuracies is not currently clear. Nevertheless, the resulting quantitative model for IDLC can be used here to assess how the Gaussian model kernels relate to the known ^{1}H NMR spectroscopic characteristics of the IDL particles.
Since kernel selection was part of our modelling, it is possible to estimate the marginal distributions for the number of kernels, as depicted in Figure 2. Before any interpretation, though, the coefficient prior of the linear regression model that connects the kernel outputs and the target variable has an effect on kernel number. Fortunately the predictions are not sensitive to this since the kernel number is integrated out, but the phenomenon may discriminate models with only a few kernels and large coefficients. In any case, IDLC and LDLC are clearly more dispersed than VLDLTG and HDLC and this also translates to Figure 3, where both VLDLTG and HDLC are dominated by a few kernels, but IDLC and LDLC produce clear associations at numerous locations. Note, however, that for a pair of highly correlated but nonadjacent spectral regions you might get two strong associations, but during the MCMC simulation you might get only one of them at a time. In this respect, the number of kernels provides an additional insight to the nature of the multivariate dependencies within the kernels.
Figure 2. The kernel distributions in the Bayesian models. The marginal posterior distribution for the number of kernels in the Bayesian models for the VLDLTG (orange), IDLC (lime), LDLC (sherry) and HDLC (olive).
Figure 3. A representative ^{1}H NMR spectrum of serum and the spectroscopic characteristics of the Bayesian model kernels. Illustration of the aliphatic spectral region of a representative experimental ^{1}H NMR spectrum (black) together with the main Bayesian model kernels for the VLDLTG (orange), IDLC (lime), LDLC (sherry) and HDLC (olive). The assignments for the resonances refer to fatty acids in triglycerides, cholesterol compounds and phospholipids in various lipoprotein particles, the cholesterol backbone C(18)
 H
 H
 H
 H
The biochemical rationale of the Bayesian model kernels
Typical aliphatic resonances in an experimental ^{1}H NMR spectrum of human serum are illustrated in Figure 3. The characteristic spectral features include broad overlapping resonances originating mainly from different lipid molecules in lipoprotein particles, for example, the C
 H
 H
 H
 H
 H
 H
The resulting Gaussian kernel models for each lipoprotein fraction are illustrated in Figure 3 with colour coding: orange for VLDLTG, lime for IDLC, sherry for LDLC and olive for HDLC. The highest intensity kernel for each lipoprotein fraction is scaled to 1.0. It is evident from Figure 3, that the most influential kernel for both VLDLTG and HDLC is located exactly at the frequency position expected by the aforementioned wellknown biochemical background and characteristics of the ^{1}H NMR spectra of serum. Thus, the high frequency side of the (C
 H
 H
 H
 H
The kernel models for VLDLTG and HDLC contain fewer kernels and are much simpler than the corresponding models for IDLC and LDLC. This result and these differences are likely to represent the overlap of the lipoprotein resonances in the ^{1}H NMR spectra. The molecular signals arising from the VLDL and HDL particles situate at the high and low frequency sides of the lipid resonances, respectively, while the contributions from the IDL and LDL particles are in the middle of the corresponding resonances [3,9,13]. This seems to have a marked effect on the Gaussian kernel models. Some of the individual kernels can be interpreted on the basis of the known characteristics of the ^{1}H NMR spectra, for instance, the influential kernels for LDLC at 0.6 ppm and at 0.8 ppm (cf., the discussion above) and the expected frequency positioning of all the lipoprotein kernels within the (C
 H
The quantification accuracy achieved via the Bayesian modelling is excellent also for LDLC and good for IDLC. The resulting more complex kernel models for IDLC and LDLC than for VLDLTG and HDLC are thus likely to represent the more severe signal overlap for IDLC and LDLC than for VLDLTG and HDLC. Since clear resonances identifiable to IDLC or LDLC are nonresolvable in the ^{1}H NMR spectra of serum, the Bayesian logic seems to give rise to quite balanced combinations of several kernels at frequency locations where some information on the modelled biochemical measure is (known to be) available. In the case of severe signal overlap, however, the optimisation of the quantification and the resulting kernel models seems to take place at the expense of the biochemical interpretability.
The analysis of ^{1}H NMR spectra seems to benefit from several characteristics of the Bayesian approach. First, feature extraction and selection from the highdimensional raw data can be included as parts of the model. Second, any prior knowledge about the parameters can be explicitly incorporated into the framework. Third, no null hypothesis needs to be constructed. Additionally, in contrast to standard computational methods used in the area of NMR metabonomics [8], the Bayesian approach produces results that are tightly connected to the statistics and yet rather easy to interpret biochemically.
Conclusion
A set of ^{1}H NMR spectra of serum samples, for which clinically relevant lipoprotein lipid concentrations were biochemically characterised, were analysed using an automated MCMC Bayesian inference. This real metabonomic case of ^{1}H NMR spectroscopy to quantify lipoprotein lipids directly from serum represents a biomedically relevant application with a wellknown biochemical background and spectroscopic characteristics. To the best of our knowledge this is the first Bayesian application to quantify biomedical ^{1}H NMR spectra and to relate the resulting model kernels to the known biochemical characteristics of the spectra. The results illustrate a highquality quantification ability of the presented Bayesian MCMC approach in the case of broad overlapping ^{1}H NMR resonances. If the signal overlap is severe, the resulting kernel models seem to form at the cost of the biochemical justification. In the case of more clearly resolvable resonances, the biochemical rationale of the uncomplicated kernel models appeared fully consistent with the known spectroscopic characteristics of the application. Hence, even though the Bayesian MCMC analysis is computationally demanding, it is anticipated to provide an advantageous complement to the currently used data analysis methods in the NMR metabonomics arena, not only in quantitative modelling but also in revealing metabolic rationale of the models and biomedical applications.
Authors' contributions
AV, VPM, KK and MAK conceived and designed the study; SMM, MJS and MLH collected the serum samples, isolated the lipoprotein fractions and measured the biochemical data; PS and PI measured the NMR data; AV wrote the computer code and performed the Bayesian analyses; AV, VPM and MAK wrote the manuscript. All authors contributed to, read and approved the final manuscript.
Acknowledgements
This work has been supported by the Academy of Finland Centre of Excellence programme for 2006–2011.
This article has been published as part of BMC Bioinformatics Volume 8, Supplement 2, 2007: Probabilistic Modeling and Machine Learning in Structural and Systems Biology. The full contents of the supplement are available online at http://www.biomedcentral.com/14712105/8?issue=S2.
References

Nicholson JK, Wilson ID: Understanding 'global' systems biology: metabonomics and the continuum of metabolism.
Nat Rev Drug Discov 2003, 2:668676. PubMed Abstract  Publisher Full Text

Fernie AF, Trethewey RN, Krotzky AJ, Willmitzer L: Metabolite profiling: from diagnostics to systems biology.
Nat Rev Mol Cell Biol 2004, 5:763769. PubMed Abstract  Publisher Full Text

AlaKorpela M: ^{1}H NMR spectroscopy of human blood plasma.
Progr Nucl Magn Reson Spectr 1995, 27:475554. Publisher Full Text

Clayton TA, Lindon JC, Cloarec O, Antti H, Charuel C, Hanton G, Provost JP, Le Net JL, Baker D, Walley RJ, Everett JR, Nicholson JK: Pharmacometabonomic phenotyping and personalized drug treatment.
Nature 2006, 440:10731077. PubMed Abstract  Publisher Full Text

Kirschenlohr HL, Griffin JL, Clarke SC, Rhydwen R, Grace AA, Schofield PM, Brindle KM, Metcalfe JC: Proton NMR analysis of plasma is a weak predictor of coronary artery disease.
Nat Med 2006, 12:705710. PubMed Abstract  Publisher Full Text

AlaKorpela M, Sipola P, Kaski K: Characterization and molecular detection of atherothrombosis by magnetic resonance – potential tools for individual risk assessment and diagnostics.
Ann Med 2006, 38:322336. PubMed Abstract  Publisher Full Text

AlaKorpela M, Lankinen N, Salminen A, Suna T, Soininen P, Laatikainen R, Ingman P, Jauhiainen M, Taskinen MR, Héberger K, Kaski K: The inherent accuracy of ^{1}H NMR spectroscopy to quantify plasma lipoproteins is subclass dependent.
Atherosclerosis 2006, 190:352358. PubMed Abstract  Publisher Full Text

Mäkinen VP, Soininen P, Forsblom C, Parkkonen M, Ingman P, Kaski K, Groop PH, AlaKorpela M, (on behalf of the FinnDiane Study Group): Diagnosing diabetic nephropathy by ^{1}H NMR metabonomics of serum.
MAGMA 2006, 19:281296. PubMed Abstract  Publisher Full Text

AlaKorpela M, Hiltunen Y, Bell JD: Quantification of biomedical NMR data using artificial neural network analysis: lipoprotein lipid profiles from ^{1}H NMR data of human plasma.
NMR Biomed 1995, 8:235244. PubMed Abstract  Publisher Full Text

AlaKorpela M, Korhonen A, Keisala J, Hörkkö S, Korpi P, Ingman LP, Jokisaari J, Savolainen MJ, Kesäniemi YA: ^{1}H NMRbased absolute quantitation of human lipoproteins and their lipid contents directly from plasma.
J Lipid Res 1994, 35:22922304. PubMed Abstract  Publisher Full Text

Otvos JD, Jeyarajah EJ, Hayes LW, Freedman DS, Janjan NA, Anderson T: Relationships between the proton nuclear magnetic resonance properties of plasma lipoproteins and cancer.
Clin Chem 1991, 37:369376. PubMed Abstract  Publisher Full Text

Bathen TF, Krane J, Engan T, Bjerve KS, Axelson D: Quantification of plasma lipids and apolipoproteins by use of proton NMR spectroscopy, multivariate and neural network analysis.
NMR Biomed 2000, 13:271288. PubMed Abstract  Publisher Full Text

Otvos JD: Measurement of lipoprotein subclass profiles by nuclear magnetic resonance spectroscopy.
Clin Lab 2002, 48:171180. PubMed Abstract

Gelman A, Carlin JB, Stern HS, Rubin DR: Bayesian Data Analysis. Second edition. Chapman & Hall; 2003.

O'Hagan A, Forster J: Kendalls's Advanced Theory of Statistics, Volume 2B, Bayesian Inference. Second edition. Arnold; 2004.

Ochs MF, Stoyanova RS, AriasMendoza F, Brown TR: A new method for spectral decomposition using a bilinear Bayesian approach.
J Magn Reson 1999, 137:161176. PubMed Abstract  Publisher Full Text

Andrec M, Inman KG, Weber DJ, Levy RM, Montelione GT: A Bayesian statistical method for the detection and quantification of rotational diffusion anisotropy from NMR relaxation data.
J Magn Reson 2000, 146:6680. PubMed Abstract  Publisher Full Text

Elster C, Schubert F, Link A, Walzel M, Seifert F, Rinneberg H: Quantitative magnetic resonance spectroscopy: semiparametric modeling and determination of uncertainties.
Magn Reson Med 2005, 53:12881296. PubMed Abstract  Publisher Full Text

Stoyanova R, Nicholson JK, Lindon JC, Brown TR: Sample classification based on Bayesian spectral decomposition of metabonomic NMR data sets.
Anal Chem 2004, 76:36663674. PubMed Abstract  Publisher Full Text

Hannuksela M, Marcel YL, Kesäniemi YA, Savolainen MJ: Reduction in the concentration and activity of plasma cholesteryl ester transfer protein by alcohol.
J Lipid Res 1992, 33:737744. PubMed Abstract  Publisher Full Text

Laatikainen R, Niemitz M, Malaisse WJ, Biesemans M, Willem RA: Computational strategy for the deconvolution of NMR spectra with multiplet structures and constraints: Analysis of overlapping C13H2 multiplets of C13 enriched metabolites from cell suspensions incubated in deuterated media.
Magn Reson Med 1996, 36:359365. PubMed Abstract  Publisher Full Text

MCMC Methods for MLP and GP and Stuff (for Matlab) V2.1 [http://www.lce.hut.fi/research/mm/mcmcstuff/] webcite

Festa A, Williams K, Hanley AJG, Otvos JD, Goff DC, Wagenknecht LE, Haffner SM: Nuclear magnetic resonance lipoprotein abnormalities in prediabetic subjects in the insulin resistance atherosclerosis study.
Circulation 2005, 111:34653472. PubMed Abstract  Publisher Full Text

Lounila J, AlaKorpela M, Jokisaari J, Savolainen MJ, Kesaniemi YA: Effects of orientational order and particle size on the NMR line positions of lipoproteins.
Phys Rev Lett 1994, 72:40494052. PubMed Abstract  Publisher Full Text