Scarcity Scores. (A) Scarcity scores (eq 2) plotted as a function of 3-gram index, sorted in descending order. (B) Histogram of scarcity scores. The abscissa shows the number of 3-grams having scarcity scores values lying in successive ranges of size 0.25 shown along the ordinate. The peak occurs at = 9.0 ± 0.125. Unique 3-grams (Figure 1) lie in the range > 10.908, or i < 480, indicated by the arrow, and their observed cumulative frequency is < 1.83 10-5.
Tobi and Bahar BMC Bioinformatics 2007 8:226 doi:10.1186/1471-2105-8-226