Process of counting n-mer frequencies. Given a value for n, the first step is generating all of the n-mer words that are possible. In the next step, we count the number of times that each word appears in the sequence. Finally, we normalize the frequency vector by dividing each number of occurrences by the total number of n-mers.
Higashi et al. BMC Genomics 2012 13(Suppl 5):S1 doi:10.1186/1471-2164-13-S5-S1