Table 2

Most common 8-, 9- and 10-mers in Pasteurellacean genome sequences

Genomea

%G+C

Size (Mb)

Most common 10-merb (number, fold over-rep)

Most common 9-merb (number, fold over-rep)

Most common 8-merb (number, fold over-rep)


Hin

38.1

1.8

AAAGTGCGGT (1115, 429X)

AAGTGCGGT (1471, 175X)

AAGTGCGG (1687, 63X)

Aac

44.4

2.1

AAAGTGCGGT (1422, 384X)

AAGTGCGGT (1760, 132X)

AAGTGCGG (1863, 39X)

Pmu

40.4

2.3

AAAGTGCGGT (700, 200X)

AAGTGCGGT (927, 79X)

AAGTGCGG (1013, 26X)

Hso

37

2.1

AAAGTGCGGT (776, 273X)

AAGTGCGGT (1216, 135X)

AAGTGCGG (1446, 51X)

Msu

42.5

2.3

AAAGTGCGGT (1297, 333X)

AAGTGCGGT (1485, 111X)

AAGTGCGG (1622, 46X)

Apl

41.4

2.2

ACAAGCGGTC (429, 187X)

ACAAGCGGT (742, 68X)

CAAGCGGT (1361, 36X)

Mha

41

2.7

ACAAGCGGTC (506, 181X)

ACAAGCGGT (973, 70X)

CAAGCGGT (1636, 48X)

Hduc

38.2

1.76

TTTTGCAAAA (106, 9.6X)

AATAAGCGGTc (95, 21X)

AACAAGCGGTc (85, 31X)

AATAAAAAA (251, 2.8X)

ACAAGCGGTc (199, 23X)

AAAAATAA (680, 2.3X)

CAAGCGGTc (464, 16X)


a Hin: H. influenzae; Aac: A.s actinomycetemcomitans; Pmu: P. multocida; Hso: H. somnus; Msu: M. succiniciproducens; Apl: A. pleuropneumoniae; Mha: M. haemolytica; Hdu: H. ducreyi.

b The first number in parentheses is the combined number of copies of the sequence and its reverse complement. The second number in parentheses is the fold over-representation of each repeat compared to a random-sequence genome of the same size and base composition.

c The additional entries for H. ducreyi are the copy numbers of the most frequent USS-like repeats.

Redfield et al. BMC Evolutionary Biology 2006 6:82   doi:10.1186/1471-2148-6-82

Open Data