Table 2

Clustering for Redundancy with the Sargasso Sea Sequences

The number of sequences in each database at different redundancy levels. Sargasso comprised sections ead, eae, eaf, eag, eah, eai and eak of the Sargasso Sea resource; BactArch was a combination of bacterial and archaeal sequences from the SWISS-PROT and TREMBL databases.

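| Database | 100% | 90% | 80% | 70% | 60% | 50% |
|----------|------|-----|-----|-----|-----|-----|
| Sargasso | 780756 | 509450 | 394592 | 310768 | 245027 | 188241 |
| BactArch | 761237 | 535059 | 485811 | 434773 | 379386 | 318309 |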
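
To compare how quickly the two databases shrink, the counts can be expressed as a fraction of the 100% column. The following is a minimal sketch, not part of the original analysis: the names `LEVELS`, `COUNTS` and `retained_fraction` are hypothetical, and treating the 100% column as the baseline is an assumption made here for illustration.

```python
# Sequence counts copied from Table 2, ordered by redundancy level.
LEVELS = [100, 90, 80, 70, 60, 50]
COUNTS = {
    "Sargasso": [780756, 509450, 394592, 310768, 245027, 188241],
    "BactArch": [761237, 535059, 485811, 434773, 379386, 318309],
}


def retained_fraction(counts):
    """Return each count as a fraction of the first (100% level) count.

    Assumption: the 100% column is used as the baseline.
    """
    base = counts[0]
    return [c / base for c in counts]


for name, counts in COUNTS.items():
    fractions = retained_fraction(counts)
    row = ", ".join(f"{lvl}%: {frac:.1%}" for lvl, frac in zip(LEVELS, fractions))
    print(f"{name}: {row}")
```

On these counts, roughly 24% of the Sargasso sequences remain at the 50% level, against roughly 42% for BactArch, so the Sargasso set loses proportionally more sequences as the redundancy level is lowered.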


Tress et al. BMC Bioinformatics 2006 7:213   doi:10.1186/1471-2105-7-213
