Unachievable Region in Precision-Recall Space and Its Effect on Empirical Evaluation

dc.contributor.affiliationUniversity of Wisconsin-Madison Department of Computer Sciences
dc.contributor.authorSantos Costa, Vitor
dc.contributor.authorPage, David
dc.contributor.authorDavis, Jesse
dc.contributor.authorBoyd, Kendrick
dc.date.accessioned2012-07-12T17:35:08Z
dc.date.available2012-07-12T17:35:08Z
dc.date.issued2012-05-30
dc.description.abstractPrecision-recall (PR) curves and the areas under them are widely used to summarize machine learning results, especially for data sets exhibiting class skew. They are often used analogously to ROC curves and the area under ROC curves. It is known that PR curves vary as class skew changes. What was not recognized before this paper is that there is a region of PR space that is completely unachievable, and the size of this region depends only on the skew. This paper precisely characterizes the size of that region and discusses its implications for empirical evaluation methodology in machine learning.en
dc.identifier.citationTR1772en
dc.identifier.urihttp://digital.library.wisc.edu/1793/61736
dc.publisherUniversity of Wisconsin-Madison Department of Computer Sciencesen
dc.subjectF1 scoreen
dc.subjectprecision-recall curvesen
dc.titleUnachievable Region in Precision-Recall Space and Its Effect on Empirical Evaluationen
dc.typeTechnical Reporten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TR1772 prspace_techreport.pdf
Size:
409.12 KB
Format:
Adobe Portable Document Format
Description:
TR1772

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.03 KB
Format:
Item-specific license agreed upon to submission
Description: