Table 1

Summary statistics for 15 genomes. t: millions of years since WGD. n: number of genes. m: number of single copy genes. d: halving distance.

genome

t

n

m

d

AU

above/on/below

p


S. cerevisiae

150

5616

4498

135

313

0.89

0.24

6598/5796/4178

0.17

C. glabrata

150

5180

4382

252

112

0.92

0.63

2119/3563/2667

0.18

V. polyspora

150

5112

4164

202

180

0.90

0.43

6354/4298/2624

0.21

S. bayanus

150

5857

4773

186

276

0.90

0.34

7300/5556/4007

0.17

N. castelli

150

5213

4053

221

251

0.88

0.38

5159/4412/2876

0.21


Paramecium

20

38626

14576

214

5276

0.55

0.02

6763/3155/904

0.49


Populus

70

20082

7228

2600

1020

0.53

0.41

1615/1412/542

0.62


Arabidopsis

50

25655

13267

2701

718

0.68

0.43

6626/2645/327

0.29


fugu

350

14251

12653

374

222

0.941

0.468

6709/3735/3245

0.27

medaka

350

14564

13352

362

143

0.957

0.597

8958/4838/2708

0.24

stickleback

350

16726

14876

519

240

0.941

0.561

871/5178/4110

0.27

tetraodon

350

17120

16088

310

129

0.969

0.601

13279/6184/3360

0.18


chicken

450

10077

8495

686

49

0.915

0.867

2776/2259/1262

0.46

opossum

450

13339

11589

751

69

0.93

0.858

8124/4182/1570

0.27

human

450

13828

12144

673

88

0.935

0.799

7484/3383/1761

0.29


AU (m > 0): analytical units, excluding adjacent pairs of paralogs. proportion of paralogs lost. normalized having distance (d depends only on the configuration of the pairs of duplicate genes). "above/on/below": number of real-simulated comparisons above, on and below the diagonal, with "above" indicating single copy concentration. p: geometric parameter of the AU distribution (Figure 3)

Sankoff et al. BMC Genomics 2010 11:313   doi:10.1186/1471-2164-11-313

Open Data