BMC Bioinformatics

official impact factor 3.03

Open Access Highly Access Methodology article

An application of statistics to comparative metagenomics

Beltran Rodriguez-Brito1, Forest Rohwer2,3 and Robert A Edwards1,2,3,4*

Author Affiliations

1 Computational Science Research Center, San Diego State University, San Diego, USA

2 Center for Microbial Sciences, San Diego State University, San Diego, USA

3 Department of Biology, San Diego State University, San Diego, USA

4 Fellowship for Interpretation of Genomes, Burr Ridge, USA

For all author emails, please log on.

BMC Bioinformatics 2006, 7:162 doi:10.1186/1471-2105-7-162

Published: 20 March 2006

Additional files

Additional File 1:

Step-by-step description of the methods used to calculate the differences between subsystems

Format: TGZ Size: 11KB Download file

Open Data

Additional File 2:

A tar gzip compressed archive of source code and instructions for calculating difference of medians as described here. Version 0.01 of the software, released under the terms of the Gnu General Public License (GPL).

Format: PDF Size: 34KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional File 3:

Presence of subsystems in different environmental samples in tab-separated text format.

Format: TXT Size: 68KB Download file

Open Data

Additional File 4:

Presence of subsystems in the AMD, Sargasso, and SEED samples in tab-separated text format.

Format: TXT Size: 22KB Download file

Open Data

Additional File 5:

Abundance of selected Kegg Pathways and Subsystems that are approximately similar as detected either based on the Normalized differences of Tringe et al (reference 6) or using the method described herein. Note that the relative gene content of the subsystems and Kegg pathways is never identical.

Format: DOC Size: 108KB Download file

This file can be viewed with: Microsoft Word Viewer

Open Data