Table 3

The distribution of words and sentences in the scheme-annotated CRA corpus

S1              OBJ     METH    RES     CON
Words           61483   39163   89575   35564
Sentences       2145    1396    3203    1241
Sentences (%)   27%     17%     40%     16%


S2              BKG     OBJ     METH    RES     CON     REL     FUT
Words           36828   23493   41544   89538   30752   2456    1174
Sentences       1429    674     1473    3185    1082    95      47
Sentences (%)   18%     8%      18%     40%     14%     1%      1%


S3              HYP     MOT     BKG     GOAL    OBJT    EXP     MOD     METH    OBS     RES     CON
Words           2676    4277    28028   10612   15894   22444   1157    17982   17402   75951   29362
Sentences       99      172     1088    294     474     805     41      637     744     2582    1049
Sentences (%)   1%      2%      14%     4%      6%      10%     1%      8%      9%      32%     13%
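
The percentage row of Table 3 appears to give each category's share of all sentences in the corpus, rounded to the nearest whole percent. This is an inference from the figures, not something the table states. The sketch below checks that reading for scheme S1 in Python; the names `s1_sentences` and `percentage_row` are illustrative and do not come from the paper.

```python
# Sentence counts per category, copied from the S1 block of Table 3.
s1_sentences = {"OBJ": 2145, "METH": 1396, "RES": 3203, "CON": 1241}


def percentage_row(counts):
    """Return each category's share of the total, rounded to a whole percent.

    Assumption: the table's percentages are shares of the sentence total.
    """
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


s1_percent = percentage_row(s1_sentences)
print(s1_percent)  # {'OBJ': 27, 'METH': 17, 'RES': 40, 'CON': 16}
```

The same function applied to the S2 and S3 sentence counts should reproduce their percentage rows, since all three schemes annotate the same 7985 sentences.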


Guo et al. BMC Bioinformatics 2011 12:69   doi:10.1186/1471-2105-12-69
