skip to main content
10.1145/1553374.1553398acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
research-article

Nonparametric estimation of the precision-recall curve

Published:14 June 2009Publication History

ABSTRACT

The Precision-Recall (PR) curve is a widely used visual tool to evaluate the performance of scoring functions in regards to their capacities to discriminate between two populations. The purpose of this paper is to examine both theoretical and practical issues related to the statistical estimation of PR curves based on classification data. Consistency and asymptotic normality of the empirical counterpart of the PR curve in sup norm are rigorously established. Eventually, the issue of building confidence bands in the PR space is considered and a specific resampling procedure based on a smoothed and truncated version of the empirical distribution of the data is promoted. Arguments of theoretical and computational nature are presented to explain why such a bootstrap is preferable to a "naive" bootstrap in this setup.

References

  1. Bertail, P., Clémençon, S., & Vayatis, N. (2008). On bootstrapping the ROC curve. In Proc. of Neur. Inf. Proc. Syst. 2008, Vancouver, Canada.Google ScholarGoogle Scholar
  2. Bucklew, J. (2003). Introduction to rare event simulation. Springer.Google ScholarGoogle Scholar
  3. Clémençon, S., & Vayatis, N. (2008). Tree-structured ranking rules and approximation of the optimal ROC curve. Proceedings of the 2008 conference on Algorithmic Learning Theory. Lect. Notes Art. Int. 5254, pp. 22--37, Springer. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Csorgo, M., & Revesz, P. (1981). Strong approximations in probability and statistics. Academic Press.Google ScholarGoogle Scholar
  5. Davis, J., & Goadrich, M. (2006). The relationship between precision-recall and ROC curves. In Proceedings of the 23 rd International Conference on Machine Learning, Vol. 148, pp. 233--240. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Efron, B. (1979). Bootstrap methods: another look at the jacknife. Annals of Statistics, 7, 1--26.Google ScholarGoogle ScholarCross RefCross Ref
  7. Falk, M., & Reiss, R. (1989). Weak convergence of smoothed and nonsmoothed bootstrap quantile estimates. Annals of Probability, 17, 362--371.Google ScholarGoogle ScholarCross RefCross Ref
  8. Giné, E., & Guillou, A. (2002). Rates of strong uniform consistency for multivariate kernel density estimators. Ann. Inst. Poincaré (B), Probabilités et Statistiques, 38, 907--921.Google ScholarGoogle ScholarCross RefCross Ref
  9. Horvath, L., Horvath, Z., & Zhou (2008). Confidence bands for ROC curves. Journal of Statistical Planning and Inference, 138, 1894--1904.Google ScholarGoogle ScholarCross RefCross Ref
  10. Hsieh, F., & Turnbull, B. (1996). Nonparametric and semi-parametric statistical estimation of the ROC curve. The Annals of Statistics, 24, 25--40.Google ScholarGoogle ScholarCross RefCross Ref
  11. Macskassy, S., & Provost, F. (2004). Confidence bands for ROC curves: methods and an empirical study. In Proceedings of the first Workshop on ROC Analysis in Artif. Int. at Eur. Conf. on Artif. Int. 2004.Google ScholarGoogle Scholar
  12. Macskassy, S., Provost, F., & Rosset, S. (2005). Bootstrapping the ROC curve: an empirical evaluation. In Proceedings of Int. Conf. Mach. Learn.-2005 Workshop on ROC Analysis in Machine Learning. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Manning, C. M., & Schutze, H. (1999). Foundations of statistical natural language processing. MIT Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Raghavan, V., Bollmann, P., & Jung, G. (1989). A critical investigation of recall and precision as measures of retrieval system performance. ACM Trans. Inf. Syst., 7, 205--229. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Shao, G., & Tu, J. (1995). The jackknife and bootstrap. Springer, NY.Google ScholarGoogle Scholar
  16. Shorack, G., & Wellner, J. (1986). Empirical processes with applications to statistics. Wiley, NY.Google ScholarGoogle Scholar
  17. Silverman, B., & Young, G. (1987). The bootstrap: to smooth or not to smooth? Biometrika, 74, 469--479.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Nonparametric estimation of the precision-recall curve

                  Recommendations

                  Comments

                  Login options

                  Check if you have access through your login credentials or your institution to get full access on this article.

                  Sign in
                  • Published in

                    cover image ACM Other conferences
                    ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning
                    June 2009
                    1331 pages
                    ISBN:9781605585161
                    DOI:10.1145/1553374

                    Copyright © 2009 Copyright 2009 by the author(s)/owner(s).

                    Publisher

                    Association for Computing Machinery

                    New York, NY, United States

                    Publication History

                    • Published: 14 June 2009

                    Permissions

                    Request permissions about this article.

                    Request Permissions

                    Check for updates

                    Qualifiers

                    • research-article

                    Acceptance Rates

                    Overall Acceptance Rate140of548submissions,26%

                  PDF Format

                  View or Download as a PDF file.

                  PDF

                  eReader

                  View online with eReader.

                  eReader