Table 1

Repetitive sequences in the soybean genome quantified using the difference between the contigs produced by an assembly algorithm with conservative parameters, and the predictions of the Lander-Waterman model for sampling a completely non-repetitive genome

Number of reads in contig

Predicted by model

Observed number of contigs

Repetitive reads (Observed-predicted)


2

41,126

42,221

2,189

3

2,511

9,742

21,693

4

153

3,498

13,379

5

9

1,646

8,183

6

1

937

5,619

7

0

634

4,438

> 7

0

4,213

238,389


total 293,890


Swaminathan et al. BMC Genomics 2007 8:132   doi:10.1186/1471-2164-8-132

Open Data