research-article

150 Successful Machine Learning Models: 6 Lessons Learned at Booking.com

Authors:
Lucas Bernardi, Booking.com, Amsterdam, Netherlands
Themistoklis Mavridis, Booking.com, Amsterdam, Netherlands
Pablo Estevez, Booking.com, Amsterdam, Netherlands

KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, July 2019, Pages 1743–1751. https://doi.org/10.1145/3292500.3330744

Published: 25 July 2019

ABSTRACT

Booking.com is the world's largest online travel agent, where millions of guests find their accommodation and millions of accommodation providers list their properties, including hotels, apartments, bed and breakfasts, guest houses, and more. In recent years we have applied Machine Learning to improve the experience of our customers and our business. While most of the Machine Learning literature focuses on the algorithmic or mathematical aspects of the field, not much has been published about how Machine Learning can deliver meaningful impact in an industrial environment where commercial gains are paramount. We analyzed about 150 successful customer-facing applications of Machine Learning, developed by dozens of teams at Booking.com, exposed to hundreds of millions of users worldwide, and validated through rigorous Randomized Controlled Trials. Following the phases of a Machine Learning project, we describe our approach, the many challenges we found, and the lessons we learned while scaling up such a complex technology across our organization. Our main conclusion is that an iterative, hypothesis-driven process, integrated with other disciplines, was fundamental to building 150 successful products enabled by Machine Learning.

Published in
KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
July 2019
3305 pages
ISBN: 9781450362016
DOI: 10.1145/3292500
General Chairs: Ankur Teredesai (KenSci), Vipin Kumar (University of Minnesota)
Program Chairs: Ying Li (EV Analysis Corporation), Rómer Rosales (LinkedIn), Evimaria Terzi (Boston University), George Karypis (University of Minnesota)
Copyright © 2019 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
business impact
data science
e-commerce
experimentation
machine learning
product development
Acceptance Rates
KDD '19 paper acceptance rate: 110 of 1,200 submissions (9%). Overall acceptance rate: 1,133 of 8,635 submissions (13%).
Article Metrics
- Total Citations: 60
- Total Downloads: 15,374
- Downloads (last 12 months): 279
- Downloads (last 6 weeks): 39