Email updates

Keep up to date with the latest news and content from BMC Medicine and BioMed Central.

Journal App

google play app store
Open Access Research article

Reduced levels of hydroxylated, polyunsaturated ultra long-chain fatty acids in the serum of colorectal cancer patients: implications for early screening and detection

Shawn A Ritchie1*, Pearson WK Ahiahonu1, Dushmanthi Jayasinghe1, Doug Heath1, Jun Liu1, Yingshen Lu1, Wei Jin1, Amir Kavianpour1, Yasuyo Yamazaki1, Amin M Khan1, Mohammad Hossain1, Khine Khine Su-Myat1, Paul L Wood1, Kevin Krenitsky2, Ichiro Takemasa3, Masakazu Miyake3, Mitsugu Sekimoto3, Morito Monden3, Hisahiro Matsubara4, Fumio Nomura5 and Dayan B Goodenowe1

Author Affiliations

1 Phenomenome Discoveries Inc, Saskatoon, SK, Canada

2 Bioserve Biotechnologies Inc, Laurel, MD, USA

3 Department of Surgery, Graduate School of Medicine, Osaka University, Osaka, Japan

4 Department of Frontier Surgery, Graduate School of Medicine, Chiba University, Chiba, Japan

5 Department of Molecular Diagnosis, Graduate School of Medicine, Chiba University, Chiba, Japan

For all author emails, please log on.

BMC Medicine 2010, 8:13  doi:10.1186/1741-7015-8-13

The electronic version of this article is the complete one and can be found online at:

Received:19 January 2010
Accepted:15 February 2010
Published:15 February 2010

© 2010 Ritchie et al; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.



There are currently no accurate serum markers for detecting early risk of colorectal cancer (CRC). We therefore developed a non-targeted metabolomics technology to analyse the serum of pre-treatment CRC patients in order to discover putative metabolic markers associated with CRC. Using tandem-mass spectrometry (MS/MS) high throughput MS technology we evaluated the utility of selected markers and this technology for discriminating between CRC and healthy subjects.


Biomarker discovery was performed using Fourier transform ion cyclotron resonance mass spectrometry (FTICR-MS). Comprehensive metabolic profiles of CRC patients and controls from three independent populations from different continents (USA and Japan; total n = 222) were obtained and the best inter-study biomarkers determined. The structural characterization of these and related markers was performed using liquid chromatography (LC) MS/MS and nuclear magnetic resonance technologies. Clinical utility evaluations were performed using a targeted high-throughput triple-quadrupole multiple reaction monitoring (TQ-MRM) method for three biomarkers in two further independent populations from the USA and Japan (total n = 220).


Comprehensive metabolomic analyses revealed significantly reduced levels of 28-36 carbon-containing hydroxylated polyunsaturated ultra long-chain fatty-acids in all three independent cohorts of CRC patient samples relative to controls. Structure elucidation studies on the C28 molecules revealed two families harbouring specifically two or three hydroxyl substitutions and varying degrees of unsaturation. The TQ-MRM method successfully validated the FTICR-MS results in two further independent studies. In total, biomarkers in five independent populations across two continental regions were evaluated (three populations by FTICR-MS and two by TQ-MRM). The resultant receiver-operator characteristic curve AUCs ranged from 0.85 to 0.98 (average = 0.91 ± 0.04).


A novel comprehensive metabolomics technology was used to identify a systemic metabolic dysregulation comprising previously unknown hydroxylated polyunsaturated ultra-long chain fatty acid metabolites in CRC patients. These metabolites are easily measurable in serum and a decrease in their concentration appears to be highly sensitive and specific for the presence of CRC, regardless of ethnic or geographic background. The measurement of these metabolites may represent an additional tool for the early detection and screening of CRC.


Colorectal cancer (CRC) mortality remains one of the highest among all cancers, second to only lung cancer (Canadian Cancer Statistics, 2008). Despite the known benefits of early detection, screening programmes based on colonoscopy and fecal occult blood testing have been plagued with challenges such as public acceptance, cost, limited resources, accuracy and standardization. There is consensus in the field that the use of colonoscopy alone for CRC screening is not practical [1], and that a minimally-invasive serum-based test capable of accurately identifying subjects who are high risk for the development of CRC would result in a higher screening compliance than current approaches and better utilization of existing endoscopy resources [1-3]. Although there have been multiple reports of altered transcript levels [4-11], aberrantly methylated gene products [12-14] and proteomic patterns [15-18] associated with biological samples from CRC patients, few if any have advanced into clinically useful tests. This may be due to a number of reasons including technical hurdles in assay design, challenges obtaining reproducible results, costs and lengthy regulatory processes. Furthermore, most of the tests currently used or in development are based upon the detection of tumour-specific markers and have poor sensitivity for identifying subjects who are either very early stage, or are predisposed to risk but show no clinical presentation of disease.

Although causal genetic alterations for CRC have been well characterized, the number of cases due to adenomatous (APC) and hereditary nonpolyposs colorectal cancer are less than 5% of the total, with approximately 15% claimed to be attributable to inheritable family risk likely due to complex patterns of low penetrance mutations which have yet to be delineated [19]. The fact remains that approximately 80% of CRC cases are thought to arise sporadically, with diet and lifestyle as key risk factors [20,21]. In addition, an individual's microbiome is intricately linked to their gastrointestinal physiological status and may itself be involved as a risk factor [22]. Given that metabolism is heavily influenced by both diet and lifestyle and that the microbiome contributes its own metabolic processes, it is surprising that there has been little effort aimed identifying metabolic markers as risk indicators of CRC. This may, in part, have been due to the lack of platform technologies and informatics approaches capable of comprehensively characterizing metabolites in a similar way that DNA microarrays or surface-enhanced laser desorption ionization can characterize transcripts or proteins, respectively.

Recently, however, there have been rapid advances made in mass spectrometric-based systems which can identify large numbers of metabolic components within samples in a parallel manner [23-25]. Fourier transform ion cyclotron resonance mass spectrometry (FTICR-MS) is based upon the principle that charged particles exhibit cyclotron motion in a magnetic field, where the spin frequency is proportional the mass [26]. FTICR-MS is known for its high resolving power and capability of detecting ions with mass accuracy below 1 part per million (ppm). Liquid sample extracts can be directly infused using electrospray ionization (ESI) and atmospheric pressure chemical ionization (APCI) without chromatographic separation [23], where ions with differing mass to charge (m/z) ratios can be simultaneously resolved using a Fourier transformation. Using informatics approaches, spectral files from multiple samples can be accurately aligned and peak intensities across the samples compared [23]. High resolution also enables the prediction of elemental composition of all ions detected in a sample, providing a solid foundation for metabolite classification and identification, as well as the ability to construct de novo metabolic networks [23,27]. The combination of liquid extraction, flow injection, high resolution and informatics affords a unique opportunity to broadly characterize the biochemical composition of samples, with no a priori knowledge about the sample itself, to a degree which was not previously possible. This 'non-targeted' approach has the advantage of detecting novel compounds and is therefore ideally suited for biomarker-driven discovery applications. Using a MS-based discovery platform for metabolic biomarker identification also has the added advantage of straightforward translation into a quantitative method based upon triple-quadruple multiple-reaction-monitoring (TQ-MRM), similar to the clinical methods used to screen for inborn errors of metabolism [28].

Here we report on the application of this technology for characterizing the serum metabolomes of treatment-naive CRC patients and healthy asymptomatic subjects. A specific metabolic perturbation was discovered in the serum of CRC patients compared to controls in three independent and unrelated sets of samples (total n of 222). We further verify the perturbation using a tandem MS (MS/MS) approach in two additional independent case-control populations totalling 220 subjects. Implications of the findings for CRC screening are discussed.


Patient sample selection

Clinical samples used for the first discovery project were obtained from Genomics Collaborative, Inc (GCI, MA, USA), while samples for the second discovery project and one validation project were obtained from Seracare Lifesciences (MA, USA). These companies specialize in the collection and storage of serum and tissue samples specifically for research purposes. Samples were collected, processed and stored in a consistent manner by teams of physicians as part of a global initiative using standardized protocols and operating procedures. Collection protocols for GCI and Seracare Lifesciences were approved by the Western Institutional Review Board and all samples were properly consented. The inclusion criterion for patient sample selection from the GCI and Seracare biobanks for both the discovery and validation cohorts was that the serum be taken prior to any form of treatment, including surgery, chemo or radiation therapies. All samples were accompanied by detailed pathology reports which were independently verified by certified pathologists at GCI and Seracare. The GCI discovery sample set included serum samples from 40 pre-treatment CRC patients and matched 50 controls; the Seracare discovery set included samples from 26 pre-treatment CRC and matched 25 controls, and the validation Seracare set included 70 pretreatment CRC and matched 70 controls. The discovery samples provided by Osaka Medical University (Osaka, Japan) included 46 pre-surgery CRC patients matched 35 controls which were prospectively collected according to the standard collection protocol of the institution and were properly consented. Study protocols were performed according to the ethical guidelines set by the committee of the three ministries of the Japanese Government. The samples for the Chiba, Japan, validation population, which included 40 pre-surgery CRC patients and 40 matched controls, were also prospectively consented and collected under an ethics reviewed protocol approved by the Institutional Review Board of Graduate School of Medicine, Chiba University. A summary of the populations including disease staging is shown in Table 1. All samples were processed and analysed in a randomized manner and the results unblinded following analysis.

Table 1. Summary of case-control populations used in this study

Sample extraction

Serum samples were stored at -80°C until thawed for analysis and were only thawed once. All extractions were performed on ice. Serum samples were prepared for FTICR-MS analysis by first sequentially extracting equal volumes of serum with 1% ammonium hydroxide and ethyl acetate (EtOAc) three times. Samples were centrifuged between extractions at 4°C for 10 min at 3500 rpm and the organic layer removed and transferred to a new tube (extract A). A 1:5 ratio of EtOAc (extract A) to butanol (BuOH) was then evaporated under nitrogen to the original BuOH starting volume (extract B). All extracts were stored at -80°C until FTICR-MS analysis.

FTICR-MS analysis

For analysis under negative ESI conditions, sample extract B was diluted 10-fold in methanol:0.1% (v/v) ammonium hydroxide (50:50, v/v) prior to direct infusion. For APCI, extract A was directly injected without diluting. All analyses were performed on a Bruker Daltonics APEX III FTICR-MS equipped with a 7.0 T actively shielded superconducting magnet (Bruker Daltonics, MA, USA). Samples were directly injected using ESI and APCI at a flow rate of 600 μL per hour. Ion transfer/detection parameters were optimized using a standard mix of serine, tetra-alanine, reserpine, Hewlett-Packard tuning mix and the adrenocorticotrophic hormone fragment 4-10. In addition, the instrument conditions were tuned to optimize ion intensity and broad-band accumulation over the mass range of 100-1000 atomic mass units (amu) according to the instrument manufacturer's recommendations. A mixture of the above mentioned standards was used to internally calibrate each sample spectrum for mass accuracy over the acquisition range of 100-1000 amu. FTICR data were analysed using a linear least-squares regression line, mass axis values were calibrated such that each internal standard mass peak had a mass error of < 1 part ppm compared with its theoretical mass. Using XMASS software from Bruker Daltonics Inc (CA, USA), data file sizes of one megaword were acquired and zero-filled to two megawords. A SINm data transformation was performed prior to Fourier transform and magnitude calculations. The mass spectra from each analysis were integrated, creating a peak list that contained the accurate mass and absolute intensity of each peak. Compounds in the range of 100-1000 mz were analysed. In order to compare and summarize the data, all detected mass peaks were converted to their corresponding neutral masses, assuming hydrogen adduct formation. A self-generated two-dimensional (mass versus sample intensity) array was then created using DISCOVAmetrics™ software (Phenomenome Discoveries Inc, Saskatoon, Canada). The data from multiple files were integrated and this combined file was then processed in order to determine all of the unique masses. The average of each unique mass was determined, representing the y-axis. A column was created for each file that was originally selected to be analysed, representing the x-axis. The intensity for each mass found in each of the files selected was then filled into its representative x,y coordinate. Coordinates that did not contain an intensity value were left blank. Each of the spectra was then peak-picked in order to obtain the mass and intensity of all metabolites detected. The data from all modes were then merged to create one data file per sample. The data from all 90 discovery serum samples were then merged and aligned to create a two-dimensional metabolite array in which each sample is represented by a column, each unique metabolite is represented by a single row and each cell in the array corresponds to a metabolite intensity for a given sample. The array tables were then used for statistical analysis described in 'statistical analyses' (see Additional File 1).

Additional file 1. Fourier transform ion cyclotron resonance mass spectrometry feature data.

Format: XLSX Size: 915KB Download fileOpen Data

Full-scan quadruple time-of-flight (Q-TOF) and high performance liquid chromatography (HPLC)-coupled MS/MS spectrometry

Ethyl acetate extracts from five CRC and five normal samples were evaporated under nitrogen gas and reconstituted in 70 μL of isopropanol:methanol:formic acid (10:90:0.1). Ten microlitres of the reconstituted sample was subjected to HPLC (HP 1100 with Hypersil ODS 5 μm, 125 × 4 mm column; Agilent Technologies, CA, USA) for full scan and 30 μL for MS/MS at a flow rate of 1 mL/min. Eluate from the HPLC was analysed using an ABI QSTAR® XL mass spectrometer fitted with an APCI source in negative mode. The scan type in full scan mode was TOF with an accumulation time of 1.0000 s, mass range between 50 and 1500 Da and duration time of 55 min. Source parameters were as follows: ion source gas (GS) 1 80; ion GS2 10; curtain gas (CUR) 30; nebulizer current (NC) -3.0; temperature 400°C; declustering potential (DP) -60; focusing potential (FP) -265; DP2 -15. In MS/MS mode, scan type was product ion, accumulation time was 1.0000 s, scan range between 50 and 650 Da and duration time 55 min. All source parameters are the same as above, with collision energy (CE) of -35 V and collision gas (CID, nitrogen) of 5 psi. For MS3 work, the excitation energy was set at 180 V.

Preliminary isolation of CRC biomarkers and NMR analysis

For the thin layer chromatographic methods, all chemicals and media were purchased from Sigma-Aldrich Canada Ltd (ON, Canada). All solvents were HPLC grade. Analytical thin layer chromatography (TLC) was carried out on pre-coated silica gel TLC aluminum sheets (EM Science, NJ, USA; Kieselgel 60 F254, 5 × 2 cm × 0.2 mm). Compounds were visualized under ultraviolet light (254/366 nm) or placed in an iodine vapour tank and by dipping the plates in a 5% aqueous (w/v) phosphomolybdic acid solution containing 1% (w/v) ceric sulphate and 4% (v/v) H2SO4, followed by heating. NMR spectra were recorded on Bruker Avance spectrometers; for 1H (500 MHz), δ values were referenced to CDCl3 (CHCl3 at 7.24 ppm) and for 13C NMR (125.8 MHz) referenced to CDCl3 (77.23 ppm).

Ethyl acetate extracts of commercial serum (180 mL serum, 500 mg extract) was subjected to reverse phase flash column chromatography (FCC) with a step gradient elution; acetonitrile - water 25:75 to 100% acetonitrile. The fractions collected were analysed by LC/MS and MS/MS. The fractions containing the CRC biomarkers were pooled (12.5 mg). This procedure was repeated several times to obtain about 60 mg of CRC biomarker rich fraction. This combined sample was then subjected to FCC with a step gradient elution; hexane-chloroform-methanol and the fractions collected subjected to LC/MS and MS/MS analysis. The biomarker rich fraction labelled sample A (5.4 mg, about 65%) was analysed by NMR. Sample A (3 mg) was then treated with excess ethereal diazomethane and kept overnight at room temperature. After the removal of solvent, the sample was analyzed by NMR.

TQ-MRM methodology

Serum samples were extracted as described for non-targeted FTICR-MS analysis, with the addition of 10 ug/mL [13C1]cholic acid to the serum prior to extraction (resulting in a final ethyl acetate concentration of [13C1]cholic acid of 36 nM. The ethyl acetate organic fraction was used for the analysis of each sample. A series of [13C1]cholic acid dilutions in ethyl acetate from Randox serum extracts was used to generate a standard curve ranging between 0.00022 μg/mL and 0.222 μg/mL. 100 μL of sample were injected by flow-injection analysis into the 4000QTRAP™ equipped with a TurboV™ source with an APCI probe. The carrier solvent was 90% methanol:10% ethyl acetate, with a flow rate of 360 μL/min into the APCI source. The source gas parameters were as follows: CUR: 10.0, CAD: 6, NC: -3.0, TEM: 400, GS1: 15, interface heater on. 'Compound' settings were as follows: entrance potential (EP): -10, and collision cell exit potential (CXP): -20.0. The method is based on the MRM of one parent ion transition for each of the C28 molecules (445.3-383.4 Da, 447.4-385.4 Da, and 449.4-405.4 Da) and a single transition for the internal standard (408.3-343.4 Da). Each of the transitions was monitored for 250 ms for a total cycle time of 2.3 s. The total acquisition time per sample was approximately 1 min. All accepted analyses showed R2 correlation coefficients for the linear regression equation of >0.98. [13C1]cholic acid equivalents for each of the three C28 molecules were calculated by determining the percent recovery of [13C1]cholic acid in each sample by dividing the extrapolated concentration by 0.0148 ug/ml (36 nM, the theoretical amount present in the ethyl acetate extract of each sample). Metabolite concentrations represented as [13C1]cholic acid equivalents were then extrapolated, normalized by dividing by the percent recovery and multiplied by appropriate extraction dilution factors to yield a final serum concentration.

Statistical analysis

FTICR-MS accurate mass array alignments were performed using DISCOVAmetrics™ version 3.0 (Phenomenome Discoveries Inc, Saskatoon, Canada). Statistical analysis and graphs of FTICR-MS data was carried out using Microsoft Office Excel 2007 and distribution analysis of TQ-MRM data and was analysed using JMP version 8.0.1. Meta Analysis (Fisher's inverse chi-square method) was carried out using SAS 9.2 and R 2.9.0. Two-tailed unpaired Student's t-tests were used for determination of significance between CRC and controls. P-values of less than 0.05 were considered significant. Receiver operating curve (ROC) curves were generated using the continuous data mode of JROCFIT webcite.


FTICR metabolomic profiling

The experimental workflow for the studies described in this paper is summarized in Figure 1. Non-targeted metabolomic profiles of sera from three independent populations of treatment-naive CRC patients and geographically and ethnically matched healthy controls (summarized in Table 1) were generated over a 24-month period (that is, each study was separated by approximately 12 months). The first study comprised 40 CRC patients and 50 control subjects acquired from Genomics Collaborative, Inc (GCI); the second study comprised 26 CRC subjects and 25 controls acquired from Seracare Lifesciences Inc; and the third study included 46 CRC and 35 controls prospectively collected in Osaka, Japan. In all cases, serum metabolites were captured through a liquid extraction process (see Methods), followed by direct infusion of the extracts using negative electrospray ionization (nESI) and negative atmospheric pressure chemical ionization (nAPCI) on an FTICR mass spectrometer. The resulting spectral data of all the subjects for each study was aligned within 1 ppm mass accuracy, background peaks were subtracted, and a two-dimensional array table comprising the intensities each of the sample-specific spectral peaks was created using custom informatics software (see Methods). Metabolic differences between CRC patient and control profiles for the three independent studies were visualized by plotting the control mean-normalized log ratio peak intensities across the detected mass range as shown in Figures 2A to 2C. In each independent study, a region of spectra between approximately 440 and 600 Da showed peaks consistently reduced in intensity in CRC patients relative to controls (green, yellow, orange and red points in Figure 2). On average, this cluster of masses showed between 50% and 75% reduction in CRC patient serum compared to the respective controls, with p-values of 1 × 10-5 or lower in each study.

thumbnailFigure 1. Study design. The study comprised three phases: Fourier transform ion cyclotron resonance mass spectrometry metabolomic discovery in three independent sample sets, structural investigation and determination of metabolic biomarkers as hydroxylated polyunsaturated ultra long-chain fatty acids and validation using a triple-quadrupole multiple reaction monitoring targeted assay.

thumbnailFigure 2. Scatter plots of average sample peak intensity fold change between colorectal cancer (CRC) and normal patient sera in three independent studies. Sample-specific peaks for all subjects were log2 normalized to the mean of the control population, and plotted according to mass (Da). Points are coloured according to significance based on an unpaired Students't-test (see legend). (A) Genomics Collaborative Inc discovery population, (B) Seracare 1 discovery population, (C) Osaka discovery population. The region boxed in grey represents the cluster of masses between 440 and 600 Da consistently reduced in the CRC patient population compared to controls in all three cohorts.

The overlap between each of the discovery studies was further investigated by ranking the top 50 masses based upon P-value from each study and comparing them with masses showing a significant difference (P < 0.05 between CRC and controls) in the other studies as shown in Table 2. For example, 46 of the top 50 metabolites (92%) with the lowest p-values in the GCI discovery set were also significant (P < 0.05) in the Seracare 1 dataset, while 31 out of the 50 GCI masses were significant (P < 0.05) in the Osaka dataset. Likewise, the top 50 metabolites in the Osaka study showed 88% and 94% redundancy with metabolites showing P < 0.05 in the GCI and Seracare 1 studies, respectively. These results indicated a very high degree of commonality among significantly differentiated masses across the three studies, and in fact, 63% of the top 50 masses in each study were also present within the top 50 of at least one of the other two studies (see Additional File 2). Of the top 50 rank-ordered masses, only those identified in more than one study were found to exist within the 440 to 600 Da mass range highlighted above and there was not a single peak detected outside this region which was significantly different between CRCs and controls in any two of the studies. Filtering for metabolic differences detected exclusively in all three studies (as well as removal of C13 isotopic peaks and redundant masses detected in both ESI and APCI), resulted in 13 masses representing individual 12C metabolites as shown in Table 3. These represented the most statistically significant and robust discriminators among the three studies. Subsequent molecular formula assignments, as discussed further below, as well as related expression profiles, suggested that the metabolites belonged to a related chemical family.

Additional file 2. Top 50 discriminating masses (based on student's t-test) of each discovery project. Masses shaded grey were detected in the top 50 in two of the three studies. Indicated are the detected accurate mass, the computationally predicted molecular formula (for masses shaded in grey), the mass difference between the detected mass and mass of the predicted molecular formula in part per million (ppm), the mode of analysis (electrospray ionization; atmospheric pressure chemical ionization), the P-value (based on an unpaired student's t-test) between the average peak intensity of control subjects versus colorectal cancer (CRC) patients and the average peak intensity ratio between CRC patients and controls.

Format: PNG Size: 243KB Download fileOpen Data

Table 2. Percent overlap between top 50 most discriminating masses (based on student's t-test) of each discovery project and masses showing P < 0.05 in the remaining cohorts

The relative intensities of the two lowest molecular weight molecules with nominal masses of 446 and 448 are shown in Figure 3A. We observed little to no correlation between the reduction of the metabolites and disease stage (Figures 3A and 3B), and ROC curve analysis resulted in an average area under-the-curve (AUC) of 0.91 ± 0.03 (Figure 3C; individual AUCs shown) across all three studies for all stages combined.

thumbnailFigure 3. Relative intensities of metabolites 446 and 448 by disease stage and the area under the curves for each discovery dataset. (A) Bar charts of relative intensity versus disease stage in each sample set; (B) summary of P-value comparisons between disease stages and controls for metabolites 446 and 448; (C) receiver operating curve analysis based on markers 446 and 448 and all CRCs versus all controls in each discovery set.

Computational assignments of reasonable molecular formulas were then carried out for the 13 masses identified above, as well as the top 50 for each discovery set shown in Additional File 2. The assignments were based on a series of mathematical and chemometric rules as described previously [23], which are reliant on high mass accuracy for precise prediction. The algorithm computes the number of carbons, hydrogens, oxygens and other elements, based on their exact mass, which can be assigned to a detected accurate mass within defined constraints. Logical putative molecular formulas were computed for masses in Table 3 (and Additional File 2), resulting in elemental compositions containing either 28, 30, 32 or 36 carbons and four to six oxygen. We used this information in the subsequent section to select appropriate molecules for structural comparison studies. Collectively, the results indicated a consistent 50% to 75% reduction of organically soluble oxygenated metabolites ranging between 28 and 36 carbons in length, in the serum of CRC patients compared to controls.

Table 3. List of 13 masses detected among the top 50 masses inclusive to all three discovery projects

Structural elucidation

Selected ethyl acetate extracts of serum from the GCI cohort used in the FTICR-MS work described above were re-analysed using HPLC coupled to a quadrupole time-of-flight (Q-TOF) MS in full-scan APCI negative ion mode. Consistent with the FTICR-MS results, a cluster of peaks between approximately 440 and 600 Da at a retention time of between 16 and 18 min following reverse-phase HPLC was detected in asymptomatic control sera, but was absent from CRC patient serum (Figure 4). Molecular ions from all six C28 biomarkers (m/z 446, m/z 448, m/z 450, m/z 464, m/z 466 and m/z 468) as well as many of the remaining C32 and C36 markers were easily detectable within normal serum. Extracted masses up to 400 Da within the 16-18 min retention time showed similar peak intensities in both populations (Figure 4, region to the right of the box), as did extracted mass spectra at other retention times (not shown), reinforcing the specificity of this depleted metabolic region for CRC patient serum.

thumbnailFigure 4. Extracted mass spectrum of serum from normal subjects and colorectal cancer (CRC) patients. Extracts from five representative CRC and five control samples from the Genomics Collaborative discovery set were subject to high performance liquid chromatography followed by full-scan detection on an Applied Biosystems QSTAR XL™ mass spectrometer in atmospheric pressure chemical ionization negative mode. The average intensities of all ions within the mass range 100 to 700 Da eluting between 16 and 18 min are shown for each cohort. The boxed region indicates spectral features present in normal patients but absent from CRC-positive serum.

Tandem mass spectrometric fragmentation fingerprints were next generated for the six C28 biomarkers (Table 4, see Additional Files 345678) and for the higher C32 and C36 biomarkers (see Additional File 9). The MS/MS and MS3 fragmentation data of the six C28 biomarkers were dominated by peaks resulting from losses of H2O (m/z 427, 429, 431, 445, 447 and 449), losses of two molecules of H2O (m/z 409, 411, 413, 427, 429, 431), losses of CO2 (m/z 401, 403, 405, 419, 421, 423) and losses of CO2 and H2O (m/z 383, 385, 387, 401, 403, 405), indicating the presence of carboxylic acid functionality and two or more hydroxyl groups. The molecular formulae, organic properties of the molecules and the tandem MS data suggested that the metabolites were derivatives or analogues of one or more possible classes of molecules including fat soluble vitamins such as retinol and retinoic acid (vitamin A), calciferols (vitamin D), tocopherols (vitamin E), phylloquinones (vitamin K), steroids or bile acids, or long chain polyunsaturated hydroxy fatty acids. Tandem mass spectrometric fragmentation fingerprints of standards for each of these metabolic classes were therefore generated including 5S,6S-(7E,9E,11Z,14Z)-dihydroxyeicosatetraenoic acid (1), 15S-Hydroxy-(5Z,8Z,11Z,13E)-eicosatetraenoic acid (2) and 8R-Hydroxy-(5Z,9E,11Z,14Z)-eicosatetraenoic acid (3), α-tocopherol (4) γ-tocopherol (5), 13-(6-hydroxy-2,7,8-trimethylchroman-2-yl)-2,6,10-trimethyltridecanoic acid (6), 16-(4,5-dimethyl-3,6-dioxo cyclohexa-1,4-dienyl)-2,6,10,14-tetramethylhexadecanoic acid (7), 6-hydroxy-2,7-dimethyl-2-(4,8,12-trimethyltridecyl)chroman-8-carbaldehyde (8), 6-hydroxy-2,7-dimethyl-2-(4,8,12-trimethyltridecyl)chroman-8-carboxylic acid (9), calciferol (10), cholecalciferol (11), ergosterol (12), phylloquinone (13), retinol (14) and 3β,7α-dihydroxy-5-cholestenoic acid (15) (Table 5). The resulting MS/MS data for vitamins A, D, E, K as well as the steroidal molecules (4 - 15) showed no similarity to any of the metabolomic biomarkers; for vitamin E type molecules, all had diagnostic fragments characteristic of their chroman rings (m/z 163, 149, 149, 149, 163 and 179 for 4, 5, 6, 7, 8 and 9 respectively), for vitamin D and analogues, diagnostic fragments formed as a result of the loss of the side chain (m/z 271, 273 and 253, for 10, 11 and 13, respectively), for phylloquinone (13), the diagnostic fragment m/z 187 for the quinone ring system was prominent, for vitamin A (14), the fragment m/z 269 (M + H - H2O) loses the cyclohexyl ring moiety to form a diagnostic m/z 145 for retinol and for 3β,7α-dihydroxy-5-cholestenoic acid (15) the diagnostic retro diels alder fragment at m/z 277 was observed. In addition to this, other carboxylic acid standards with a pregnane ring system, as in 15 (for example, chenodeoxycholic acid and cholic acid), do not show losses of CO2 upon MS/MS fragmentation (not shown). However, MS/MS fragmentation data of hydroxy fatty acid standards 1, 2 and 3 (Table 5) showed peripheral cut ions similar to those produced by MS/MS of the CRC biomarkers, and consistent with what has been described by others for various hydroxylated long-chain fatty acids [29-33]. For example, marker m/z 446 showed peripheral cut ions 427 [M - H - H2O]-, 401 [M - H - CO2]-, 409 [M - H - 2H2O]-, 383 [M - H - CO2 - H2O]- and 365 [M - H - CO2 - 2H2O]- and chain cut ions, 223, 205, 277 as well as others (see Table 5 and Additional File 3). Similar ions were obtained for the other C28, C32 and C36 metabolites (Table 4 and Additional File 9). Collectively, these deductions indicated that the metabolomic markers were not analogues of vitamins A, D, E, K and steroids, but rather long-chain hydroxy fatty acids containing varying degrees of unsaturation. We collectively refer to these metabolites as hydroxylated polyunsaturated ultra long-chain fatty acids (hPULCFAs; where the term 'ultra' has been used to refer to C30 and longer chain fatty acids [34]).

Additional file 3. Tandem mass spectrometry spectra for biomarker m/z 446.

Format: PNG Size: 162KB Download fileOpen Data

Additional file 4. Tandem mass spectrometry spectra for biomarker m/z 448.

Format: PNG Size: 167KB Download fileOpen Data

Additional file 5. Tandem mass spectrometry spectra for biomarker m/z 450.

Format: PNG Size: 197KB Download fileOpen Data

Additional file 6. Tandem mass spectrometry spectra for biomarker m/z 464.

Format: PNG Size: 203KB Download fileOpen Data

Additional file 7. Tandem mass spectrometry spectra for biomarker m/z 466.

Format: PNG Size: 219KB Download fileOpen Data

Additional file 8. Tandem mass spectrometry spectra for biomarker m/z 468.

Format: PNG Size: 192KB Download fileOpen Data

Additional file 9. Tandem mass spectrometry of hydroxylated polyunsaturated ultra long-chain fatty acids.

Format: PNG Size: 1.4MB Download fileOpen Data

Table 4. Tandem-mass spectrometry (MS) analysis of selected 28-carbon containing masses

Table 5. Tandem mass spectrometric results of various standards

Next, an enrichment strategy using bulk serum extracts and a two-stage flash column chromatography approach followed by nuclear magnetic resonance (NMR) analysis was carried out to provide further structural verification of the hPULCFAs. First, reverse phase FCC using a water-acetonitrile solvent gradient was performed and the resulting fractions analysed by LC/MS. Fractions containing the hPULCFAs (fraction 9, Additional Files 10 and 11) were pooled and subjected to normal phase FCC using chloroform-methanol mixtures to obtain an approximately 65% rich semi-purified fraction labelled sample A (Additional File 12). LC and MS/MS analyses (MS2 and MS3) data on sample A were used to track and confirm enrichment of the markers. NMR (1H, 13C and 2D) analyses on sample A and its methyl esters revealed resonances and correlations (Table 6) consistent with very long chain polyunsaturated hydroxy fatty acids with observance of some suppression of resonances for hydrogen atoms attached to sp2 carbons.

Additional file 10. Purification process to obtain hydroxylated polyunsaturated ultra long-chain fatty acids (hPULCFA) enriched fractions from human serum. Dried organic extracts of serum were initially purified in a reversed phase flash column chromatography using water/acetonitrile step solvent gradient to obtain semi purified hPULCFA enriched fraction (F9). Several of F9s were combined for a secondary purification step in a normal phase flash column chromatography using hexane/chloroform/methanol step solvent gradient to obtain highly hPULCFA enriched fraction 7 (F7_2).

Format: PNG Size: 141KB Download fileOpen Data

Additional file 11. Liquid chromatography/mass spectrometry spectra of Stage I fraction 9 (F9) containing a mixture of fatty acids and colorectal cancer biomarkers obtained after fractionating serum extract on reverse phase column.

Format: PNG Size: 293KB Download fileOpen Data

Additional file 12. Liquid chromatography/mass spectrometry spectra of Stage II fraction 7 (F7) containing approximately 65% enrichment of hPULCFAs.

Format: PNG Size: 301KB Download fileOpen Data

Table 6. 1H nuclear magnetic resonance (NMR) data of colorectal cancer (CRC) biomarker pool (sample A) and their methyl esters

Independent validation using MRM methodology

Reduced levels of hPULCFAs in the blood of CRC patients was further confirmed using a MS/MS approach (see methods) in two more independent populations. The approach is based upon the measurement of parent-daughter fragment ion combinations (referred to as MRM) for quantifying analytes [28,35]. We developed an assay to specifically measure semi-quantitatively three of the 28 carbon hPULCFAs with four oxygens (parent masses 446, 448 and 450; C28H46O4, C28H48O4 and C28H50O4, respectively) as described in the methods. The first study comprised 70 treatment-naive CRC subjects and 70 matched controls, all of which were Caucasians from the USA. The [13C1]cholic acid equivalent concentrations of the three 28-carbon hPULCFAs (named according to nominal mass 446, 448 and 450) for each subject are shown in Figure 5A. Significantly lower levels (P < 0.001, actual values shown in Figure 5A) of each of the metabolites was observed in treatment-naive CRC-positive subjects compared to controls. ROC analysis resulted in AUCs of 0.87 ± 0.005 for each of the 28-carbon containing hPULCFAs (Figure 5B). Plotting patients by disease stage showed a slight (but not significant) reduction between stage I and III, with stage IV subjects showing the least reduction (Figures 5C and 5D), albeit it only seven subjects. The corresponding average AUCs of the 28-carbon pool by stage were 0.87 for stage I, 0.88 for stage II, 0.94 for stage III and 0.66 for stage IV.

thumbnailFigure 5. Results of triple-quadrupole multiple reaction monitoring analysis of the Seracare 2 validation sample set. (A) Scatter plots of the concentrations of hydroxylated polyunsaturated ultra long chain fatty acids (hPULCFAs) 446, 448 and 450 expressed as [13C1]-cholic acid equivalents in asymptomatic normal controls and pre-treatment colorectal cancer patients, (B) receiver operating curve (ROC) analysis based upon the corresponding scatter plots in (A). Grey dotted lines indicate the 95% confidence interval. (C) Bar charts of the average concentration equivalents of hPULCFAs by disease stage. Error bars represent standard errors of the mean. (D) ROC analysis by disease stage.

We next used the MRM method to characterize another independent population of CRC and control subjects from Chiba, Japan. Serum from 40 pre-treatment CRC subjects and 40 controls were analysed and a significant reduction was again observed in the CRC-positive group (Figure 6A). The corresponding average AUC for the three metabolites was 0.97 ± 0.014 (Figure 6B). In this study, a significant correlation with stage was observed (P < 0.05) for all comparisons between stages I, II and III/IV (Figures 6C and 6D). The AUCs by stage were 0.93 for stage I, 0.97 for stage II and 1.0 for stage III/IV (two stage IVs were grouped with stage III; Figure 6D).

thumbnailFigure 6. Results of triple-quadrupole multiple reaction monitoring analysis of the Chiba validation sample set. (A) Scatter plots of the concentrations of hydroxylated polyunsaturated ultra long-chain fatty acids (hPULCFAs) 446, 448 and 450 expressed as [13C1]-cholic acid equivalents in asymptomatic normal controls, and pre-treatment colorectal cancer patients, (B) receiver operating curve (ROC) analysis based upon the corresponding scatter plots in (A). Grey dotted lines indicate the 95% confidence interval. (C) Bar charts of the average concentration equivalents of hPULCFAs by disease stage. Error bars represent standard errors of the mean. (D) ROC analysis by disease stage.


We report here on the discovery of novel hydroxylated polyunsaturated ultra long-chain fatty acids containing between 28 and 36 carbons reduced in the serum of CRC patients compared to healthy asymptomatic controls. The utility of non-targeted metabolomics using high resolution FTICR-MS coupled with flow injection technology for biomarker discovery was demonstrated by applying the technology to three independent test populations. In contrast to the 'training/test-set' approach often used by splitting a single sample set in half to validate the performance of biomarkers [36-38], which often relies on complex algorithms (see review [39]) and can result in bias [40], we carried out fully independent discovery analyses on three separate sample sets of matched cases and controls of different ethnic backgrounds collected from multiple sites around the world to ensure a high degree of robustness and minimal chance of sampling bias. Of the top 50 metabolic discriminators discovered in the Osaka set, 44 and 47 of these were also significantly changed in the GCI and Seracare sets, respectively. This remarkable inter-study agreement indicates that not only is non-targeted FTICR-MS technology a reproducible biomarker discovery engine, but that disease-related metabolomic changes can be highly conserved across geographic locations and races. The reduction of hPULCFAs in the serum of CRC patients was further validated by translation of the non-targeted FTICR-MS discovery into a simple targeted TQ-MRM method for three hPULCFAs, which was used on two further independent and ethnically diverse case-control test populations. ROC AUCs generated from the TQ-MRM method on the two validation studies were consistent with those based upon the same fatty acids detected in the three FTICR-MS discovery studies (Figures 3, 5 and 6). In total, five independent study populations collectively comprising 222 treatment-naive CRC patient samples and 220 disease-free asymptomatic controls were evaluated using two different analytical methods. Indeed, the likelihood of the reported association between the reduction of hPULCFAs and CRC being a false positive result across the five independent sets of samples is astronomically low. Meta-analysis was performed on the false positive rates using Fisher's Inverse Chi-square Method (; p = P-values of five independent samples, k = five different samples, C = upper tail of the chi-square distribution with 2 k degrees of freedom ( = 18.31))[41,42]. Based upon the meta-analysis, the resulting P-values for markers 446 and 448 were more significant than the individual P-values, at 2.96 × 10-47 and 8.11 × 10-49, respectively. Although there were differences in the median ages between the CRC and control cohorts in two of the studies, there was no statistically significant trend between age and hPULCFA levels within the individual cohorts and we observed no significant difference between hPULCFA concentrations among the controls from the different populations (not shown). We also observed no differences between genders, and although there were slightly higher BMI levels in the control cohorts for the GCI and Seracare 1 cohorts, the BMIs were matched in the second Seracare validation population suggesting the markers are not related to BMI. A prospective analysis of disease-free subjects equally distributed across various age groups is underway specifically to address any potential age or BMI effects in more detail. Overall our results indicate with a high degree of confidence that a reduction in these metabolites is correlated with the presence of CRC.

The FTICR-MS provided resolution sufficient for confident molecular formula predictions based upon accurate mass in conjunction with extraction, ionization, and statistical correlative information. Although multiple elemental compositions were theoretically assignable to given biomarker masses, only formulas having 28 to 32 carbons, and four to six oxygen were consistently assignable to common masses detected in two or three of the discovery sets. Given a high degree of statistical interaction between the sample-to-sample expression profiles of the hPULCFAs (that is, a high degree of correlation between the relative intensities of the markers across subjects) we suspected they were all part of the same metabolic system and should therefore show related compositions. Detection in negative ionization mode also reduced the likelihood that nitrogen was present in any of the compositions. This information in conjunction with tandem mass spectrometry showing prominent losses of water and carbon dioxide enabled the determination of molecular formulas as shown in Table 3 and Additional File 2. A number of candidate classes of molecules theoretically fitting the molecular formula class were easily excluded using tandem MS. For example, we observed no fragments indicative of condensed ring systems such as those in steroids or vitamin D, and no fragments indicative of chroman ring systems such as those observed in the vitamin E tocopherols. Several other classes of molecules including vitamin K and retinol, and bile acids such as cholic acid and 3β,7α-dihydroxy-5-cholestenoic acid also did not show comparable fragmentation patterns. However, the similarity in fragmentation pattern, particularly in the relative abundances of daughter ions resulting from losses of CO2 and H2O, and chain cut ions from the hPULCFAs to known hydroxy fatty acid standards as well as other fatty acids reported in the literature such as the resolvins and protectins (discussed below), allowed for the identification of the metabolites as hydroxylated polyunsaturated ultra long-chain fatty acids. Examination of the MS/MS data for the C28 series (masses 446, 448, 450, 464, 466 and 448) revealed a consistent 113 Da daughter ion, which we conjecture to represent the carboxy-terminus chain fragment -CH2-CH=CH-CH2-CH2-COOH. In addition, a consistent loss of 54 (-CH=CH-CH2-CH2-) from the [M-(CO2+H2O)] daughter ion was observed for the 446, 448, 464 and 466, but not the 450 and 468 molecules, suggesting that (1) the 450 and 468 may have a saturated carboxy terminal region and (2), that there are likely no hydroxyl moieties within this region of the molecule. MS/MS data of all the C28 and other markers also did not show the diagnostic fragment obtained with a 1,2-diol motif as observed for 1 (base peak is chain cut ion at m/z 115) and NMR on fractions enriched via flash-column chromatography showed lower than expected integration values obtained for the 1H NMR signals at δ 2.78 (methylene interruptions between double bond carbons) and at δ 5.12 - 5.90 (hydrogen atoms on double bond carbons). Cumulatively these results suggested that the hydroxyl groups in the molecules are likely bonded to the carbon atoms between the sp2 carbons at least seven carbons from the carboxy end. Confirmation of the exact positions of the hydroxyl groups and precise locations of unsaturations in individual hPULCFAs using preparatory HPLC and chemical synthesis is in progress and will be reported in subsequent publications.

Interestingly, the metabolite markers reported in this study represent a human-specific metabolic system. We analysed serum samples from multiple species, including rat, mouse and bovine, as well as multiple different sample sources including numerous cell lines, conditioned media, tumour and normal colonic tissue from patients in the GCI discovery set, and brain, liver, adipose and other tissues from various species, all of which failed to show any detectable levels of these hPULCFAs (results not shown). We also could not detect these molecules in various plant tissues or grains, including policosanol extracts which are rich in saturated C28 and longer-chain fatty acids [43,44]. This suggests that the molecules may originate from human-specific metabolic processes, such as specific p450-mediated and/or microbiotic processes. The lack of detection in tumour or normal colonic tissue suggests that the metabolites are not 'tumour-derived markers' and, combined with the high rate of association in stage I cancer, it is not likely that the reduction is the result of tumour burden. Analysis of post-surgery samples is currently in progress to address this question. However, the further reduction of levels observed in some late stage Japanese cases (Figure 6) could be explained if lower levels of the hPULCFAs were indeed indicative of progression rate in this group. It is also important to note that in all control groups reported in this paper, subjects were not colonoscopy-confirmed to be free of tumours or advanced neoplasia. Based upon colonoscopy results by Collins et al in average-risk subjects, up to 10% of an asymptomatic population can be positive for advanced neoplasia [45]. Therefore, the ability of these metabolites to discriminate between subjects at risk and not at risk for CRC is likely under-estimated in our results. Studies are currently in progress to evaluate endoscopy-confirmed controls, to assess the effect of treatment on the markers, and to investigate any possible association with various grades of colon pathologies and non-malignant GI disorders as well as other cancers.

Although fatty acids of this length containing hydroxyl groups have never been reported as far as we are aware, they appear to resemble a class of hydroxylated very long-chain fatty acids knows as the resolvins and protectins that originate from the n3 essential fatty acids EPA and DHA, respectively, which are critical in promoting the resolution of acute inflammation. The inability to sufficiently 'resolve' acute inflammation is the leading theory behind the establishment of chronic inflammatory states which underlie multiple conditions including cancer [46] and Alzheimer's Disease [47]. Of particular relevance is the effect of pro-resolution long-chain hydroxy fatty acid mediators on intestinal inflammatory conditions such as irritable bowl disease (IBD), Crohn's Disease, Colitis and colon cancer. Both Resolvin E1 (RvE1) and Lipoxin A4 (LXA4) have been implicated with protective effects against colonic inflammation. RvE1 was shown to protect against the development of 2,4,6-trinitrobenze sulphonic acid-induced colitis in mice, accompanied by a block in leukocyte infiltration, decreased proinflammatory gene expression, induced nitric oxide synthase, with improvements in survival rates and sustained body weight [48]. Similarly, LXA4 analogues have been shown to attenuate chemokine secretion in human colon ex vivo [49], and attenuated 50% of genes, particularly those regulated by NFκB induced in response to pathogenically induced gastroenteritis [50]. In vivo, LXA4 analogues reduced intestinal inflammation in DSS-induced inflammatory colitis, resulting in significantly reduced weight loss, haematochezia and mortality [50]. Structurally, resolvins and protectins (as well the n6 lipoxins) comprise mono-, di- and tri-hydroxylated products of the parent VLCFAs, catalyzed by various lipoxygenases, cyclooxygenases and p450 enzymes [51-55]. The possibility that the hPULCFAs reported here represent elongation products of these molecules cannot be excluded. Future studies will be required to address the origin, as well as the biological role, if any, that these molecules may play in defending the body against CRC development.

Although we report results from multiple case-control cohorts each having a limited sample size, the average AUC across all the samples reported here was 0.91 ± 0.04, which translates into approximately 75% sensitivity at 90% specificity with little to no disease-stage bias. The real-world screening performance is currently being evaluated through two large ethically approved prospective clinical trials, one in collaboration with the Saskatchewan Cancer Agency and the Saskatchewan Provincial Government (PDI-CT-1; n = 5000), and the other with the University of Calgary (PDI-CT-3 n = 1500). Clinically relevant questions are being addressed, including correlation between hPULCFAs and CRC in a prospective hospital screening environment, correlation with other non-malignant gastrointestinal disorders (such as IBD, Crohn's and colitis), whether there is any correlation with various stages of neoplasia or polyps and family history and whether subjects with low hPULCFA levels show higher incidence rates of CRC than subjects with 'normal' levels over time.

In summary, we have identified a consistent reduction of novel circulating hPULCFAs in CRC patients which could have considerable implications for CRC diagnosis and screening and possibly prevention and treatment. Adherence to currently recommended screening modalities, namely faecal occult blood testing and colonoscopy, is poor due to a number of factors including public acceptance, risk, cost and available resources. The use of a serum-based test to screen the population for subjects who are high risk would focus endoscopy resources on subjects who need it the most, resulting in a higher detection rate, particularly in early stages of the disease. Given the positive prognosis of early-stage therapeutic intervention, it is tempting to speculate that hPULCFA-based screening could one day result in decreased CRC mortality.


We have shown that comprehensive non-targeted metabolomics technology based upon high-resolution FTICR mass spectrometry represents a powerful and robust approach for small-molecule biomarker-driven discovery. Accurate mass measurements combined with conventional MS/MS resulted in the rapid identification of key structural characteristics of the novel metabolites discovered and the assignment of putative chemical structures. The subsequent translation of these metabolite biomarker discoveries into an efficient and clinically viable high-throughput semi-quantitative triple-quadrupole platform represents a significant advancement in the clinical implementation of biomarker discoveries. The reduction of systemic hydroxylated ultra-long chain fatty acids in CRC patients raises intriguing biological and aetiological questions given the large numbers of sporadic CRC cases and the heavy influence of lifestyle and diet on risk. Further research is ongoing regarding the potential role(s) these novel molecules play in CRC progression and whether they have any association with previously established risk factors.


APC: adenomatous polyposis coli; APCI: atmospheric pressure chemical ionization; AUC: area under the curve; BuOH: butanol; CRC: colorectal cancer; CUR: curtain gas; ESI: electrospray ionization; EtOAc: ethylacetate; FCC: flash column chromatography; FTICR-MS: Fourier transform ion cyclotron resonance mass spectrometry; GS: gas source; HPLC: high performance liquid chromatography; hPULCFA: hydroxylated polyunsaturated ultra long-chain fatty acid; LC: liquid chromatography; LXA4: lipoxin A4; MRM: multiple reaction monitoring; MS/MS: tandem mass spectrometry; mz: mass to charge ratio; NC: nebulizer current; NMR: nuclear magnetic resonance; ppm: part per million; Q-TOF: quadrupole time-of-flight; ROC: receiver-operator characteristic; RvE1: resolving E1; SELDI: surface-enhanced laser desorption ionization; TLC: thin layer chromatography; TQ-MRM: triple-quadrupole MRM; VLCFA: very long-chain fatty acid.

Competing interests

The following authors are full-time employees and have received salaries from Phenomenome Discoveries Inc, Saskatoon, Canada: Shawn A Ritchie, Pearson W K Ahiahonu, Dushmanthi Jayasinghe, Doug Heath, Jun Liu, Yingshen Lu, Wei Jin, Amir Kavianpour, Yasuyo Yamazaki, Amin M. Khan, Mohammad Hossain, Khine Khine Su-Myat and Paul L Wood. Dayan B Goodenowe is the co-founder, President and CEO, of Phenomenome Discoveries, Inc. Only Dayan B Goodenowe owns shares in the company. Kevin Krenitsky, Ichiro Takemasa, Masakazu Miyake, Mitsugo Sekimoto, Morito Monden, Hisahiro Matsubara and Fumio Numura have no competing financial interests. None of the authors have non-financial competing interests. Phenomenome Discoveries Inc is financing the article processing fees for the manuscript. Shawn A Ritchie and Dayan B Goodenowe are named inventors on submitted patent applications relating to the discoveries disclosed within the manuscript and have both received a salary from Phenomenome Discoveries Inc (the organization named in the patents).

Authors' contributions

SAR was the group leader, lead author and writer, performed the study design and sample selection, completed all the discovery FTICR-MS data analysis, including statistical analysis, supervised the structural elucidation, MSMS interpretation and overall experimental designs. PWAK performed MS/MS and NMR analysis and interpretation and analysed all comparison MSMS standard data. DJ performed MS/MS and fraction column chromatography enrichment and NMR interpretation. DH developed FTICR-MS analytical methods, including extraction protocols, and helped to develop the TQ-MRM methodology used. JL developed high performance liquid chromatographic methods and analysed samples for HPLC-TOF analysis. YL developed and optimized TQ-MRM methods and analysed validation sample sets. WJ performed and optimized HPLC-TOF and MS/MS analysis and samples, including enriched serum extracts. AK was involved in the optimization of the TQ-MRM methodology, QAQC and analysis of validation (Seracare and Chiba) samples. YY developed the original chromatographic (HPLC) methods used to isolate the hPULCFAs and aided in the experimental design of the Chiba and Osaka sample sets. AMK oversaw the structural elucidation processes including NMR interpretation and MS/MS interpretation. MH aided in MS/MS and NMR data generation and interpretation. KKSM was involved in the biostatistical analysis, including the analysis of HTS data, Fisher analysis and all SAS work. PLW was involved in the interpretation of MS/MS data, NMR data, mechanistic insights into MSMS fragmentation patterns and proposed structural motifs. KK was involved in the sample selection and study design for the genomics collaborative and Seracare Lifesciences sample sets, the selection of matched cases and controls, clinical data organization and verification of disease pathology. IT was involved in the design of clinical trials in Osaka, patient selection and pathology confirmation. MM was involved in the clinical trial design in Osaka, patient selection and discovery analysis, clinical staging, pre-versus post surgery data collection and analysis. MS was the Head of the lower gastroenterological surgery group, Osaka, responsible for patient recruitment and trial design. MM was the group leader at the Osaka Department of Surgery and was involved in the experimental design of the Osaka discovery sample set, data analysis/interpretation and unblinding of data. HM was the head of surgery (for CRC population, Chiba) responsible for the patient enrolment and selection for the Chiba samples. FN was the group leader at Chiba overseeing the entire project, including protocol design and approvals, data analysis and unblinding. DBG was the President and CEO of Phenomenome Discoveries Inc, and oversaw most of the efforts at PDI, was integrally involved in the interpretation of MS/MS data, the development of FTICR methodology, the experimental designs and was also a significant contributor to the format and direction of the manuscript.


We would like to thank the following individuals for their contributions to this work: Hideaki Shimada, Takeshi Tomonaga and Kazuyuki Matsushita for sample collection, processing and clinical data management at Chiba, Japan.


  1. Roy HK, Backman V, Goldberg MJ: Colon cancer screening: the good, the bad, and the ugly.

    Arch Intern Med 2006, 166:2177-2179. PubMed Abstract | Publisher Full Text OpenURL

  2. Ouyang DL, Chen JJ, Getzenberg RH, Schoen RE: Noninvasive testing for colorectal cancer: a review.

    Am J Gastroenterol 2005, 100:1393-1403. PubMed Abstract | Publisher Full Text OpenURL

  3. Davies RJ, Miller R, Coleman N: Colorectal cancer screening: prospects for molecular stool analysis.

    Nat Rev Cancer 2005, 5:199-209. PubMed Abstract | Publisher Full Text OpenURL

  4. Kleivi K, Lind GE, Diep CB, Meling GI, Brandal LT, Nesland JM, Myklebost O, Rognum TO, Giercksky KE, Skotheim RI, Lothe RA: Gene expression profiles of primary colorectal carcinomas, liver metastases, and carcinomatoses.

    Mol Cancer 2007, 6:2. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  5. Solmi R, Ugolini G, Rosati G, Zanotti S, Lauriola M, Montroni I, del Governatore M, Caira A, Taffurelli M, Santini D, Coppola D, Guidotti L, Carinci P, Strippoli P: Microarray-based identification and RT-PCR test screening for epithelial-specific mRNAs in peripheral blood of patients with colon cancer.

    BMC Cancer 2006, 6:250. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  6. Komori T, Takemasa I, Higuchi H, Yamasaki M, Ikeda M, Yamamoto H, Ohue M, Nakamori S, Sekimoto M, Matsubara K, Monden M: Identification of differentially expressed genes involved in colorectal carcinogenesis using a cDNA microarray.

    J Exp Clin Cancer Res 2004, 23:521-527. PubMed Abstract OpenURL

  7. Hegde P, Qi R, Gaspard R, Abernathy K, Dharap S, Earle-Hughes J, Gay C, Nwokekeh NU, Chen T, Saeed AI, Sharov V, Lee NH, Yeatman TJ, Quackenbush J: Identification of tumor markers in models of human colorectal cancer using a 19,200-element complementary DNA microarray.

    Cancer Res 2001, 61:7792-7797. PubMed Abstract | Publisher Full Text OpenURL

  8. Kitahara O, Furukawa Y, Tanaka T, Kihara C, Ono K, Yanagawa R, Nita ME, Takagi T, Nakamura Y, Tsunoda T: Alterations of gene expression during colorectal carcinogenesis revealed by cDNA microarrays after laser-capture microdissection of tumor tissues and normal epithelia.

    Cancer Res 2001, 61:3544-3549. PubMed Abstract | Publisher Full Text OpenURL

  9. Notterman DA, Alon U, Sierk AJ, Levine AJ: Transcriptional gene expression profiles of colorectal adenoma, adenocarcinoma, and normal tissue examined by oligonucleotide arrays.

    Cancer Res 2001, 61:3124-3130. PubMed Abstract | Publisher Full Text OpenURL

  10. Takemasa I, Higuchi H, Yamamoto H, Sekimoto M, Tomita N, Nakamori S, Matoba R, Monden M, Matsubara K: Construction of preferential cDNA microarray specialized for human colorectal carcinoma: molecular sketch of colorectal cancer.

    Biochem Biophys Res Commun 2001, 285:1244-1249. PubMed Abstract | Publisher Full Text OpenURL

  11. Backert S, Gelos M, Kobalz U, Hanski ML, Bohm C, Mann B, Lovin N, Gratchev A, Mansmann U, Moyer MP, Riecken EO, Hanski C: Differential gene expression in colon carcinoma cells and tissues detected with a cDNA array.

    Int J Cancer 1999, 82:868-874. PubMed Abstract | Publisher Full Text OpenURL

  12. Mori Y, Cai K, Cheng Y, Wang S, Paun B, Hamilton JP, Jin Z, Sato F, Berki AT, Kan T, Ito T, Mantzur C, Abraham JM, Meltzer SJ: A genome-wide search identifies epigenetic silencing of somatostatin, tachykinin-1, and 5 other genes in colon cancer.

    Gastroenterology 2006, 131:797-808. PubMed Abstract | Publisher Full Text OpenURL

  13. Chen WD, Han ZJ, Skoletsky J, Olson J, Sah J, Myeroff L, Platzer P, Lu S, Dawson D, Willis J, Pretlow TP, Lutterbaugh J, Kasturi L, Willson JK, Rao JS, Shuber A, Markowitz SD: Detection in fecal DNA of colon cancer-specific methylation of the nonexpressed vimentin gene.

    J Natl Cancer Inst 2005, 97:1124-1132. PubMed Abstract | Publisher Full Text OpenURL

  14. Leung WK, To KF, Man EP, Chan MW, Bai AH, Hui AJ, Chan FK, Sung JJ: Quantitative detection of promoter hypermethylation in multiple genes in the serum of patients with colorectal cancer.

    Am J Gastroenterol 2005, 100:2274-2279. PubMed Abstract | Publisher Full Text OpenURL

  15. Ward DG, Suggett N, Cheng Y, Wei W, Johnson H, Billingham LJ, Ismail T, Wakelam MJ, Johnson PJ, Martin A: Identification of serum biomarkers for colon cancer by proteomic analysis.

    Br J Cancer 2006, 94:1898-1905. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  16. Lou J, Fatima N, Xiao Z, Stauffer S, Smythers G, Greenwald P, Ali IU: Proteomic profiling identifies cyclooxygenase-2-independent global proteomic changes by celecoxib in colorectal cancer cells.

    Cancer Epidemiol Biomarkers Prev 2006, 15:1598-1606. PubMed Abstract | Publisher Full Text OpenURL

  17. Mazzanti R, Solazzo M, Fantappie O, Elfering S, Pantaleo P, Bechi P, Cianchi F, Ettl A, Giulivi C: Differential expression proteomics of human colon cancer.

    Am J Physiol Gastrointest Liver Physiol 2006, 290:G1329-1338. PubMed Abstract | Publisher Full Text OpenURL

  18. Roblick UJ, Hirschberg D, Habermann JK, Palmberg C, Becker S, Kruger S, Gustafsson M, Bruch HP, Franzen B, Ried T, Bergmann T, Auer G, Jornvall H: Sequential proteome alterations during genesis and progression of colon cancer.

    Cell Mol Life Sci 2004, 61:1246-1255. PubMed Abstract | Publisher Full Text OpenURL

  19. de la Chapelle A: Genetic predisposition to colorectal cancer.

    Nat Rev Cancer 2004, 4:769-780. PubMed Abstract | Publisher Full Text OpenURL

  20. Marshall JR: Prevention of colorectal cancer: diet, chemoprevention, and lifestyle.

    Gastroenterol Clin North Am 2008, 37:73-82. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  21. Fearnhead NS, Wilding JL, Bodmer WF: Genetics of colorectal cancer: hereditary aspects and overview of colorectal tumorigenesis.

    Br Med Bull 2002, 64:27-43. PubMed Abstract | Publisher Full Text OpenURL

  22. McGarr SE, Ridlon JM, Hylemon PB: Diet, anaerobic bacterial metabolism, and colon cancer: a review of the literature.

    J Clin Gastroenterol 2005, 39:98-109. PubMed Abstract | Publisher Full Text OpenURL

  23. Aharoni A, Ric de Vos CH, Verhoeven HA, Maliepaard CA, Kruppa G, Bino R, Goodenowe DB: Nontargeted metabolome analysis by use of Fourier Transform Ion Cyclotron Mass Spectrometry.

    Omics 2002, 6:217-234. PubMed Abstract | Publisher Full Text OpenURL

  24. Dettmer K, Aronov PA, Hammock BD: Mass spectrometry-based metabolomics.

    Mass Spectrom Rev 2007, 26:51-78. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  25. Want EJ, Nordstrom A, Morita H, Siuzdak G: From exogenous to endogenous: the inevitable imprint of mass spectrometry in metabolomics.

    J Proteome Res 2007, 6:459-468. PubMed Abstract | Publisher Full Text OpenURL

  26. Pinto DM, Boyd RK, Volmer DA: Ultra-high resolution for mass spectrometric analysis of complex and low-abundance mixtures - the emergence of FTICR-MS as an essential analytical tool.

    Anal Bioanal Chem 2002, 373:378-389. PubMed Abstract | Publisher Full Text OpenURL

  27. Breitling R, Ritchie S, Goodenowe D, Stewart ML, Barrett MP: Ab initio prediction of metabolic networks using Fourier transform mass spectrometry data.

    Metabolomics 2006, 2:155-164. Publisher Full Text OpenURL

  28. Zytkovicz TH, Fitzgerald EF, Marsden D, Larson CA, Shih VE, Johnson DM, Strauss AW, Comeau AM, Eaton RB, Grady GF: Tandem mass spectrometric analysis for amino, organic, and fatty acid disorders in newborn dried blood spots: a two-year summary from the New England Newborn Screening Program.

    Clin Chem 2001, 47:1945-1955. PubMed Abstract | Publisher Full Text OpenURL

  29. Hong S, Gronert K, Devchand PR, Moussignac RL, Serhan CN: Novel docosatrienes and 17S-resolvins generated from docosahexaenoic acid in murine brain, human blood, and glial cells. Autacoids in anti-inflammation.

    J Biol Chem 2003, 278:14677-14687. PubMed Abstract | Publisher Full Text OpenURL

  30. Hong S, Lu Y, Yang R, Gotlinger KH, Petasis NA, Serhan CN: Resolvin D1, protectin D1, and related docosahexaenoic acid-derived products: Analysis via electrospray/low energy tandem mass spectrometry based on spectra and fragmentation mechanisms.

    J Am Soc Mass Spectrom 2007, 18:128-144. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  31. Serhan CN, Hong S, Gronert K, Colgan SP, Devchand PR, Mirick G, Moussignac RL: Resolvins: a family of bioactive products of omega-3 fatty acid transformation circuits initiated by aspirin treatment that counter proinflammation signals.

    J Exp Med 2002, 196:1025-1037. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  32. Lu Y, Hong S, Yang R, Uddin J, Gotlinger KH, Petasis NA, Serhan CN: Identification of endogenous resolvin E1 and other lipid mediators derived from eicosapentaenoic acid via electrospray low-energy tandem mass spectrometry: spectra and fragmentation mechanisms.

    Rapid Commun Mass Spectrom 2007, 21:7-22. PubMed Abstract | Publisher Full Text OpenURL

  33. Murphy RC, Fiedler J, Hevko J: Analysis of nonvolatile lipids by mass spectrometry.

    Chem Rev 2001, 101:479-526. PubMed Abstract | Publisher Full Text OpenURL

  34. Poulos A, Beckman K, Johnson DW, Paton BC, Robinson BS, Sharp P, Usher S, Singh H: Very long-chain fatty acids in peroxisomal disease.

    Adv Exp Med Biol 1992, 318:331-340. PubMed Abstract OpenURL

  35. Johnson DW, Trinh MU: Analysis of isomeric long-chain hydroxy fatty acids by tandem mass spectrometry: application to the diagnosis of long-chain 3-hydroxyacyl CoA dehydrogenase deficiency.

    Rapid Commun Mass Spectrom 2003, 17:171-175. PubMed Abstract | Publisher Full Text OpenURL

  36. Lim JY, Cho JY, Paik YH, Chang YS, Kim HG: Diagnostic application of serum proteomic patterns in gastric cancer patients by ProteinChip surface-enhanced laser desorption/ionization time-of-flight mass spectrometry.

    Int J Biol Markers 2007, 22:281-286. PubMed Abstract OpenURL

  37. Su Y, Shen J, Qian H, Ma H, Ji J, Ma L, Zhang W, Meng L, Li Z, Wu J, Jin G, Zhang J, Shou C: Diagnosis of gastric cancer using decision tree classification of mass spectral data.

    Cancer Sci 2007, 98:37-43. PubMed Abstract | Publisher Full Text OpenURL

  38. Chen YD, Zheng S, Yu JK, Hu X: Artificial neural networks analysis of surface-enhanced laser desorption/ionization mass spectra of serum protein pattern distinguishes colorectal cancer from healthy population.

    Clin Cancer Res 2004, 10:8380-8385. PubMed Abstract | Publisher Full Text OpenURL

  39. Ringner M, Peterson C, Khan J: Analyzing array data using supervised methods.

    Pharmacogenomics 2002, 3:403-415. PubMed Abstract | Publisher Full Text OpenURL

  40. Baggerly KA, Morris JS, Coombes KR: Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments.

    Bioinformatics 2004, 20:777-785. PubMed Abstract | Publisher Full Text OpenURL

  41. LV H: Meta-Analysis.

    Journal of Educational Statistics 1992, 17:279-296. OpenURL

  42. Fisher RA: Statistical methods for research workers. 4th edition. London: Oliver & Boyd; 1932. OpenURL

  43. Marinangeli CP, Kassis AN, Jain D, Ebine N, Cunnane SC, Jones PJ: Comparison of composition and absorption of sugarcane policosanols.

    Br J Nutr 2007, 97:381-388. PubMed Abstract | Publisher Full Text OpenURL

  44. Wang MF, Lian HZ, Mao L, Zhou JP, Gong HJ, Qian BY, Fang Y, Li J: Comparison of various extraction methods for policosanol from rice bran wax and establishment of chromatographic fingerprint of policosanol.

    J Agric Food Chem 2007, 55:5552-5558. PubMed Abstract | Publisher Full Text OpenURL

  45. Collins JF, Lieberman DA, Durbin TE, Weiss DG: Accuracy of screening for fecal occult blood on a single stool sample obtained by digital rectal examination: a comparison with recommended sampling practice.

    Ann Intern Med 2005, 142:81-85. PubMed Abstract OpenURL

  46. Das UN: Essential fatty acids: biochemistry, physiology and pathology.

    Biotechnol J 2006, 1:420-439. PubMed Abstract | Publisher Full Text OpenURL

  47. Das UN: Folic acid and polyunsaturated fatty acids improve cognitive function and prevent depression, dementia, and Alzheimer's disease--but how and why?

    Prostaglandins Leukot Essent Fatty Acids 2008, 78:11-19. PubMed Abstract | Publisher Full Text OpenURL

  48. Arita M, Yoshida M, Hong S, Tjonahen E, Glickman JN, Petasis NA, Blumberg RS, Serhan CN: Resolvin E1, an endogenous lipid mediator derived from omega-3 eicosapentaenoic acid, protects against 2,4,6-trinitrobenzene sulfonic acid-induced colitis.

    Proc Natl Acad Sci USA 2005, 102:7671-7676. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  49. Goh J, Baird AW, O'Keane C, Watson RW, Cottell D, Bernasconi G, Petasis NA, Godson C, Brady HR, MacMathuna P: Lipoxin A(4) and aspirin-triggered 15-epi-lipoxin A(4) antagonize TNF-alpha-stimulated neutrophil-enterocyte interactions in vitro and attenuate TNF-alpha-induced chemokine release and colonocyte apoptosis in human intestinal mucosa ex vivo.

    J Immunol 2001, 167:2772-2780. PubMed Abstract | Publisher Full Text OpenURL

  50. Gewirtz AT, Collier-Hyams LS, Young AN, Kucharzik T, Guilford WJ, Parkinson JF, Williams IR, Neish AS, Madara JL: Lipoxin a4 analogs attenuate induction of intestinal epithelial proinflammatory gene expression and reduce the severity of dextran sodium sulfate-induced colitis.

    J Immunol 2002, 168:5260-5267. PubMed Abstract | Publisher Full Text OpenURL

  51. Serhan CN: Controlling the resolution of acute inflammation: a new genus of dual anti-inflammatory and proresolving mediators.

    J Periodontol 2008, 79:1520-1526. PubMed Abstract | Publisher Full Text OpenURL

  52. Schwab JM, Chiang N, Arita M, Serhan CN: Resolvin E1 and protectin D1 activate inflammation-resolution programmes.

    Nature 2007, 447:869-874. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  53. Serhan CN, Gotlinger K, Hong S, Lu Y, Siegelman J, Baer T, Yang R, Colgan SP, Petasis NA: Anti-inflammatory actions of neuroprotectin D1/protectin D1 and its natural stereoisomers: assignments of dihydroxy-containing docosatrienes.

    J Immunol 2006, 176:1848-1859. PubMed Abstract | Publisher Full Text OpenURL

  54. Serhan CN: Novel chemical mediators in the resolution of inflammation: resolvins and protectins.

    Anesthesiol Clin 2006, 24:341-364. PubMed Abstract | Publisher Full Text OpenURL

  55. Schwab JM, Serhan CN: Lipoxins and new lipid mediators in the resolution of inflammation.

    Curr Opin Pharmacol 2006, 6:414-420. PubMed Abstract | Publisher Full Text OpenURL

Pre-publication history

The pre-publication history for this paper can be accessed here: