Table 5

Frequency of coreferential types across domains

Subdomain

Coref. NPs

Personal

Non-personal

Anaphoric determiner


Newswire

0.141

(227185/1614963)

0.420

(95400)

0.432

(98075)

0.084

(19121)

Ethics

0.095

(24126/252710)

0.038

(906)

0.684

(16498)

0.267

(6447)

Education

0.078

(25992/334071)

0.011

(282)

0.722

(18761)

0.262

(6810)

Medical Informatics

0.058

(318719/5516880)

0.003

(955)

0.564

(179774)

0.430

(136983)

Public Health

0.057

(182046/3207601)

0.012

(2183)

0.651

(118560)

0.332

(60395)

Therapeutics

0.054

(13928/257030)

0.009

(120)

0.640

(8919)

0.345

(4807)

Psychiatry

0.053

(26733/500995)

0.016

(429)

0.617

(16481)

0.363

(9704)

Obstetrics

0.052

(14093/270168)

0.043

(600)

0.625

(8808)

0.328

(4621)

Geriatrics

0.051

(25706/500126)

0.015

(374)

0.614

(15784)

0.368

(9469)

Genetics

0.051

(441324/8598457)

0.002

(1083)

0.499

(220079)

0.495

(218598)

Pediatrics

0.050

(17390/351237)

0.028

(488)

0.597

(10390)

0.371

(6457)

Biochemistry

0.049

(657806/13324719)

0.000

(296)

0.505

(332027)

0.493

(324072)

PMC Average

0.048

(6037808/124612679)

0.008

(49679)

0.548

(3305860)

0.441

(2660534)

Molecular Biology

0.048

(65547/1365525)

0.000

(32)

0.508

(33276)

0.490

(32098)

Tropical Medicine

0.047

(72539/1542496)

0.007

(489)

0.570

(41324)

0.421

(30519)

Critical Care

0.047

(73103/1560348)

0.008

(569)

0.570

(41637)

0.418

(30589)

Biomedical Engineering

0.046

(17502/380556)

0.003

(50)

0.527

(9224)

0.467

(8173)

Ophthalmology

0.046

(14342/313401)

0.027

(394)

0.613

(8791)

0.357

(5113)

Environmental Health

0.044

(148239/3350018)

0.009

(1323)

0.534

(79129)

0.453

(67166)

Medicine

0.044

(103490/2344444)

0.006

(666)

0.572

(59198)

0.419

(43311)

Virology

0.043

(63323/1464489)

0.002

(140)

0.490

(31057)

0.504

(31913)

Science

0.043

(470150/10903540)

0.002

(1053)

0.518

(243402)

0.477

(224195)

Rheumatology

0.043

(69365/1630635)

0.003

(181)

0.527

(36573)

0.468

(32474)

Microbiology

0.043

(129326/3042730)

0.001

(72)

0.488

(63061)

0.510

(65965)

Neurology

0.042

(82817/1967337)

0.005

(402)

0.516

(42766)

0.476

(39445)

Genetics, Medical

0.041

(29605/721049)

0.015

(452)

0.460

(13632)

0.522

(15464)

Neoplasms

0.041

(154990/3780813)

0.004

(660)

0.523

(81084)

0.471

(72947)

Communicable Diseases

0.041

(65003/1588678)

0.020

(1280)

0.497

(32286)

0.481

(31270)

Pharmacology

0.041

(15892/388714)

0.001

(15)

0.535

(8506)

0.462

(7338)

Veterinary Medicine

0.041

(21566/529841)

0.007

(145)

0.563

(12140)

0.428

(9229)

Vascular Diseases

0.041

(20669/508466)

0.004

(92)

0.565

(11684)

0.428

(8855)

Physiology

0.040

(27113/672176)

0.000

(11)

0.522

(14163)

0.474

(12862)

Embryology

0.040

(30720/767573)

0.001

(27)

0.506

(15547)

0.491

(15078)

Pulmonary Medicine

0.040

(53096/1339071)

0.002

(132)

0.551

(29245)

0.444

(23590)

Gastroenterology

0.040

(17422/440064)

0.012

(216)

0.567

(9886)

0.418

(7285)

Botany

0.039

(48611/1257981)

0.000

(19)

0.532

(25875)

0.466

(22665)

Endocrinology

0.039

(18351/476147)

0.006

(107)

0.556

(10208)

0.436

(7992)

Biotechnology

0.037

(21374/571783)

0.001

(23)

0.507

(10830)

0.490

(10475)

Cell Biology

0.037

(51864/1401952)

0.000

(17)

0.510

(26456)

0.487

(25267)

Complementary Therapies

0.025

(15558/632625)

0.008

(131)

0.673

(10467)

0.314

(4882)


Lippincott et al. BMC Bioinformatics 2011 12:212   doi:10.1186/1471-2105-12-212

Open Data