Table 1

Retrieval accuracy for different subsets of SCOP database with the new and old finite-size correction
Method 25th percentile 50th percentile Full database
New correction 0.10373 ± 0.00022 0.10073 ± 0.00019 0.08535 ± 0.00013
Old correction 0.09201 ± 0.00020 0.09282 ± 0.00017 0.08358 ± 0.00014

The three subsets contain proteins shorter than 91 residues (25th percentile by length), shorter than 137 residues (50th percentile by length), and the full database. ROC-4852 scores are presented with an error (one standard deviation). The 25th percentile database contains 2533 sequences, the 50th percentile database contains 5008 sequences, and the full database contains 10,569 sequences. There are 4852 queries.

Park et al.

Park et al. BMC Research Notes 2012 5:286   doi:10.1186/1756-0500-5-286

Open Data