Table 3

The Not-N algorithm and its implementation by the Minimum SNPs computer program. A. Data for seven hypothetical sequence types (STs) at six single-nucleotide polymorphisms (SNPs). B. Not-N analysis output of the alignment at A. Four sets of two SNPs are identified, all of which reach 100% discrimination. C. Result obtained if positions 3 and 4 are excluded.

A.


Sequence ID

SNP 1

SNP 2

SNP 3

SNP 4

SNP 5

SNP 6


ST 1*

G

G

G

A

T

T

ST 2*

A

G

T

T

T

G

ST 3*

G

G

G

C

T

G

ST 4

A

G

A

G

A

T

ST 5

A

A

A

G

T

T

ST 6

A

G

T

C

A

A

ST 7

A

A

A

G

T

G


Consensus (STs 1, 2 & 3)

Not informative

Not-ACT

Not-AC

Not-G

Not-ACG

Not-AC


>ST4

n/a

+

-

-

-

+

>ST5

n/a

-

-

-

+

+

>ST6

n/a

+

+

+

-

-

>ST7

n/a

-

-

-

+

+


Confidence (%)

Position not used

50

75

75

50

25


*"Group of interest" sequences


B.


SNP set

SNP 1 position and consensus

Cumulative discriminatory power (%)

SNP 2 position and consensus

Cumulative discriminatory power


1

3, NOT AC,

75

5, NOT ACG

100

2

3, NOT AC

75

6, NOT AC

100

3

4, NOT G

75

5, NOT ACG

100

4

4, NOT G

75

6, NOT AC

100


C.


SNP set

SNP 1 position and consensus

Cumulative discriminatory power (%)

SNP 2 position and consensus

Cumulative discriminatory power (%)


1

2, NOT ACT,

50

5, NOT ACG

100


Price et al. BMC Bioinformatics 2007 8:278   doi:10.1186/1471-2105-8-278

Open Data