skip to main content
10.1145/3292500.3330744acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

150 Successful Machine Learning Models: 6 Lessons Learned at Booking.com

Published:25 July 2019Publication History

ABSTRACT

Booking.com is the world's largest online travel agent where millions of guests find their accommodation and millions of accommodation providers list their properties including hotels, apartments, bed and breakfasts, guest houses, and more. During the last years we have applied Machine Learning to improve the experience of our customers and our business. While most of the Machine Learning literature focuses on the algorithmic or mathematical aspects of the field, not much has been published about how Machine Learning can deliver meaningful impact in an industrial environment where commercial gains are paramount. We conducted an analysis on about 150 successful customer facing applications of Machine Learning, developed by dozens of teams in Booking.com, exposed to hundreds of millions of users worldwide and validated through rigorous Randomized Controlled Trials. Following the phases of a Machine Learning project we describe our approach, the many challenges we found, and the lessons we learned while scaling up such a complex technology across our organization. Our main conclusion is that an iterative, hypothesis driven process, integrated with other disciplines was fundamental to build 150 successful products enabled by Machine Learning.

References

  1. Ioannis Arapakis, Xiao Bai, and B. Barla Cambazoglu. 2014. Impact of Response Latency on User Behavior in Web Search. In Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR '14). ACM, New York, NY, USA, 103--112. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. L Bernardi, J Kamps, Y Kiseleva, MJI Mueller, T Bogers, and M Koolen. 2015. The continuous cold start problem in e-commerce recommender systems. In CEUR Workshop Proceedings, Vol. 1448. CEUR-WS. org.Google ScholarGoogle Scholar
  3. Alex Deng and Victor Hu. 2015. Diluted treatment effect estimation for trigger analysis in online controlled experiments. In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining. ACM, 349--358. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Miroslav Dud'ik, John Langford, and Lihong Li. 2011. Doubly Robust Policy Evaluation and Learning. In Proceedings of the 28th International Conference on International Conference on Machine Learning (ICML'11). Omnipress, USA, 1097--1104. http://dl.acm.org/citation.cfm?id=3104482.3104620 Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Dietmar Jannach, Paul Resnick, Alexander Tuzhilin, and Markus Zanker. 2016. Recommender systems--beyond matrix completion. Commun. ACM , Vol. 59, 11 (2016), 94--102. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Raphael Lopez Kaufman, Jegar Pitchforth, and Lukas Vermeer. 2017. Democratizing online controlled experiments at Booking. com. arXiv preprint arXiv:1710.08217 (2017).Google ScholarGoogle Scholar
  7. Timo Kluck. 2016. Using multivariant tests to determine performance impact. (2016). Retrieved Dec 04, 2017 from https://booking.ai/using-multivariant-tests-to-determine-performance-impact-c249ab9bfc16Google ScholarGoogle Scholar
  8. Pavel Levin, Nishikant Dhanuka, Talaat Khalil, Fedor Kovalev, and Maxim Khalilov. 2017. Toward a full-scale neural machine translation in production: the Booking. com use case. arXiv preprint arXiv:1709.05820 (2017).Google ScholarGoogle Scholar
  9. Xiao-Li Li and Bing Liu. 2005. Learning from positive and unlabeled examples with different data distributions. Machine Learning: ECML 2005 (2005), 218--229. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Athanasios Noulas and M Stafseng Einarsen. 2014. User engagement through topic modelling in travel. Proceeding of the Second Workshop on User Engagement Optimization. 2--7.Google ScholarGoogle Scholar
  11. James M Robins, Andrea Rotnitzky, and Lue Ping Zhao. 1994. Estimation of regression coefficients when some regressors are not always observed. Journal of the American statistical Association , Vol. 89, 427 (1994), 846--866.Google ScholarGoogle ScholarCross RefCross Ref
  12. Kagan Tumer and Joydeep Ghosh. 2003. Bayes error rate estimation using classifier ensembles. International Journal of Smart Engineering System Design , Vol. 5, 2 (2003), 95--109.Google ScholarGoogle ScholarCross RefCross Ref
  13. Kiri Wagstaff. 2012. Machine learning that matters. arXiv preprint arXiv:1206.4656 (2012). Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. 150 Successful Machine Learning Models: 6 Lessons Learned at Booking.com

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
              July 2019
              3305 pages
              ISBN:9781450362016
              DOI:10.1145/3292500

              Copyright © 2019 Owner/Author

              Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 25 July 2019

              Check for updates

              Qualifiers

              • research-article

              Acceptance Rates

              KDD '19 Paper Acceptance Rate110of1,200submissions,9%Overall Acceptance Rate1,133of8,635submissions,13%

              Upcoming Conference

              KDD '24

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader