Distribution of relevance scores in relevant and irrelevant documents. In the diagram, the red line indicates the relevance scores in the relevant document set and the blue dots indicate the relevance scores in the irrelevant document set. With the classification threshold placed at the green line, classification performance is promising: precision 0.745 and recall 0.676, with an AUC of 0.819.
Wang et al. BMC Bioinformatics 2009 10(Suppl 1):S55 doi:10.1186/1471-2105-10-S1-S55
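As an illustration of how the figures quoted in the caption are defined, the sketch below computes precision and recall at a fixed score threshold and AUC from two sets of relevance scores. The scores, the threshold value, and the function names are hypothetical and are not taken from the paper. It assumes that a document is classified as relevant when its score is at or above the threshold. Note that AUC is computed over all score pairs and does not depend on the chosen threshold.

```python
def precision_recall(relevant_scores, irrelevant_scores, threshold):
    """Precision and recall when scores >= threshold are classified as relevant."""
    tp = sum(s >= threshold for s in relevant_scores)    # relevant, accepted
    fp = sum(s >= threshold for s in irrelevant_scores)  # irrelevant, accepted
    fn = len(relevant_scores) - tp                       # relevant, rejected
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


def auc(relevant_scores, irrelevant_scores):
    """AUC as the probability that a relevant document outscores an
    irrelevant one, counting ties as one half (pairwise rank statistic)."""
    wins = 0.0
    for r in relevant_scores:
        for i in irrelevant_scores:
            if r > i:
                wins += 1.0
            elif r == i:
                wins += 0.5
    return wins / (len(relevant_scores) * len(irrelevant_scores))


# Hypothetical toy scores, for illustration only (not the paper's data).
relevant = [0.9, 0.8, 0.6, 0.4]
irrelevant = [0.7, 0.5, 0.3, 0.2, 0.1]
threshold = 0.55  # plays the role of the green line in the figure

precision, recall = precision_recall(relevant, irrelevant, threshold)
area = auc(relevant, irrelevant)
print(f"precision={precision:.3f} recall={recall:.3f} AUC={area:.3f}")
```

Moving the threshold up generally trades recall for precision, while the AUC value stays the same because it summarizes the ranking over all possible thresholds.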