Table 1

Coverage of unicellular organisms in COGs

| Species | Number of annotated proteins | Number (and percentage) of proteins in COGs | Number of COGs that include the given species |
|---|---|---|---|
| Bacteria | | | |
| Proteobacteria (Gram-negative) | | | |
| Agrobacterium tumefaciens | 5299 | 4398 (83%) | 1978 |
| Brucella melitensis | 3198 | 2678 (84%) | 1654 |
| Caulobacter crescentus | 3737 | 2958 (79%) | 1734 |
| Mesorhizobium loti | 7275 | 5653 (78%) | 2175 |
| Sinorhizobium meliloti | 6205 | 5207 (84%) | 2084 |
| Rickettsia conorii | 1374 | 891 (65%) | 733 |
| Rickettsia prowazekii | 835 | 727 (87%) | 647 |
| Buchnera sp. | 574 | 567 (99%) | 559 |
| Escherichia coli K12 | 4279 | 3623 (85%) | 2131 |
| Escherichia coli O157:H7 | 5324 | 4050 (76%) | 2190 |
| Escherichia coli O157:H7 EDL933 | 5361 | 4023 (75%) | 2200 |
| Salmonella typhi | 4553 | 3724 (82%) | 2167 |
| Yersinia pestis | 4083 | 3341 (82%) | 1993 |
| Haemophilus influenzae | 1714 | 1597 (93%) | 1317 |
| Pasteurella multocida | 2015 | 1829 (91%) | 1455 |
| Vibrio cholerae | 3463 | 2929 (85%) | 1918 |
| Pseudomonas aeruginosa | 5567 | 4660 (84%) | 2243 |
| Xylella fastidiosa | 2832 | 1740 (61%) | 1310 |
| Neisseria meningitidis MC58 | 2079 | 1561 (75%) | 1255 |
| Neisseria meningitidis Z2491 | 2065 | 1573 (76%) | 1260 |
| Ralstonia solanacearum | 5116 | 3931 (77%) | 2018 |
| Campylobacter jejuni | 1634 | 1328 (81%) | 1093 |
| Helicobacter pylori 26695 | 1576 | 1127 (72%) | 920 |
| Helicobacter pylori J99 | 1491 | 1106 (74%) | 921 |
| Low-GC Gram-positive bacteria | | | |
| Bacillus halodurans | 4066 | 3149 (77%) | 1744 |
| Bacillus subtilis | 4112 | 3125 (76%) | 1771 |
| Clostridium acetobutylicum | 3848 | 2879 (75%) | 1549 |
| Lactococcus lactis | 2267 | 1798 (79%) | 1208 |
| Listeria innocua | 3043 | 2428 (80%) | 1522 |
| Mycoplasma genitalium | 484 | 385 (80%) | 362 |
| Mycoplasma pneumoniae | 689 | 431 (63%) | 383 |
| Mycoplasma pulmonis | 782 | 514 (66%) | 426 |
| Ureaplasma urealyticum | 614 | 418 (68%) | 378 |
| Staphylococcus aureus | 2625 | 2071 (79%) | 1419 |
| Streptococcus pneumoniae | 2094 | 1586 (76%) | 1105 |
| Streptococcus pyogenes | 1697 | 1356 (80%) | 1030 |
| Actinobacteria | | | |
| Corynebacterium glutamicum | 3040 | 2162 (71%) | 1339 |
| Mycobacterium tuberculosis H37Rv | 3927 | 2843 (72%) | 1450 |
| Mycobacterium tuberculosis CDC1551 | 4187 | 2756 (66%) | 1434 |
| Mycobacterium leprae | 1605 | 1180 (74%) | 927 |
| Hyperthermophilic bacteria | | | |
| Aquifex aeolicus | 1560 | 1349 (86%) | 1088 |
| Thermotoga maritima | 1858 | 1565 (84%) | 1167 |
| Cyanobacteria | | | |
| Synechocystis sp. | 3167 | 2346 (74%) | 1427 |
| Nostoc sp. | 6129 | 3832 (63%) | 1673 |
| Other bacteria | | | |
| Borrelia burgdorferi | 1638 | 701 (43%) | 577 |
| Treponema pallidum | 1036 | 737 (71%) | 639 |
| Chlamydia trachomatis | 895 | 644 (72%) | 587 |
| Chlamydophila pneumoniae | 1054 | 667 (63%) | 603 |
| Deinococcus radiodurans | 3182 | 2322 (73%) | 1495 |
| Fusobacterium nucleatum | 2067 | 1556 (75%) | 1143 |
| Archaea | | | |
| Euryarchaeota | | | |
| Archaeoglobus fulgidus | 2420 | 1953 (81%) | 1244 |
| Methanocaldococcus jannaschii | 1758 | 1448 (82%) | 1117 |
| Methanothermobacter thermautotrophicus | 1873 | 1500 (80%) | 1123 |
| Methanopyrus kandleri | 1691 | 1253 (74%) | 1022 |
| Methanosarcina acetivorans | 4540 | 3142 (69%) | 1462 |
| Pyrococcus abyssi | 1769 | 1506 (85%) | 1065 |
| Pyrococcus horikoshii | 1801 | 1425 (79%) | 1019 |
| Thermoplasma acidophilum | 1482 | 1261 (85%) | 890 |
| Thermoplasma volcanium | 1499 | 1277 (85%) | 900 |
| Halobacterium sp. | 2622 | 1809 (69%) | 1109 |
| Crenarchaeota | | | |
| Aeropyrum pernix | 1840 | 1236 (67%) | 947 |
| Pyrobaculum aerophilum | 2605 | 1529 (59%) | 1015 |
| Sulfolobus solfataricus | 2977 | 2207 (74%) | 1084 |
| Eukaryota | | | |
| Saccharomyces cerevisiae | 6338 | 3012 (48%) | 1299 |
| Schizosaccharomyces pombe | 4979 | 2774 (56%) | 1282 |
| Encephalitozoon cuniculi | 1996 | 1105 (55%) | 696 |
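
The percentage column of Table 1 appears to be the number of proteins in COGs divided by the number of annotated proteins, rounded to the nearest whole percent. The sketch below reproduces that arithmetic for four rows copied from the table. The function name `coverage_percent` and the rounding rule are assumptions made for illustration, not something the table states.

```python
def coverage_percent(annotated: int, in_cogs: int) -> int:
    """Return the share of annotated proteins assigned to COGs, as a whole percent.

    Assumption: the table rounds to the nearest integer percentage.
    """
    if annotated <= 0:
        raise ValueError("annotated must be a positive count")
    return round(100 * in_cogs / annotated)


# (species, annotated proteins, proteins in COGs), values copied from Table 1
rows = [
    ("Buchnera sp.", 574, 567),
    ("Escherichia coli K12", 4279, 3623),
    ("Borrelia burgdorferi", 1638, 701),
    ("Saccharomyces cerevisiae", 6338, 3012),
]

for species, annotated, in_cogs in rows:
    print(f"{species}: {in_cogs} ({coverage_percent(annotated, in_cogs)}%)")
```

Under this assumption the four rows give 99%, 85%, 43% and 48%, matching the values printed in the table.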


Tatusov et al. BMC Bioinformatics 2003 4:41 doi:10.1186/1471-2105-4-41
