Table 1

Orthographic Feature

Features 1–11

e.g.

Features 12–21

e.g.


Comma

,

OneCap

T

Dot

.

AllCaps

CSF

Parenthesis

() []

CapLowAlpha

All

RomanDigit

II

CapMixAlpha

IgM

GreekLetter

Beta

LowMixAlpha

kDa

StopWord

in, at

AlphaDigitAlpha

H2A

ATCGsequence

ACAG

AlphaDigit

T4

OneDigit

5

DigitAlphaDigit

6C2

AllDigits

60

DigitAlpha

19D

DigitCommaDigit

1,25

Others

Other

DigitDotDigit

0.5


Zhou et al. BMC Bioinformatics 2005 6(Suppl 1):S7   doi:10.1186/1471-2105-6-S1-S7

Open Data