ABSTRACT
Every day, more technologies and services are backed by complex machine-learned models that consume large amounts of data to provide a myriad of useful services. While users are willing to provide personal data to enable these services, their trust in and engagement with the systems could be improved by giving them insight into how the models' decisions are made. Complex ML systems are highly effective, but many are black boxes that offer no insight into how they arrive at their choices; those that do typically explain themselves at the model level rather than the instance level. In this work we present a method for deriving explanations for instance-level decisions in tree ensembles. As this family of models accounts for a large portion of industrial machine learning, this work opens up the possibility of transparent models at scale.
Index Terms
- Transparent Tree Ensembles