Latent Cross: Making Use of Context in Recurrent Recommender Systems

Authors:

Alex Beutel
Google, Inc., Mountain View, CA, USA

Paul Covington
Google, Inc., Mountain View, CA, USA

Sagar Jain
Google, Inc., Mountain View, CA, USA

Can Xu
Google, Inc., Mountain View, CA, USA

Jia Li
University of Illinois at Chicago & Google, Inc., Chicago, IL, USA

Vince Gatto
Google, Inc., Mountain View, CA, USA

Ed H. Chi
Google, Inc., Mountain View, CA, USA

WSDM '18: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, February 2018, Pages 46–54. https://doi.org/10.1145/3159652.3159727

Published: 02 February 2018

ABSTRACT

The success of recommender systems often depends on their ability to understand and make use of the context of the recommendation request. Significant research has focused on how time, location, interfaces, and a plethora of other contextual features affect recommendations. However, in using deep neural networks for recommender systems, researchers often ignore these contexts or incorporate them as ordinary features in the model.

In this paper, we study how to effectively treat contextual data in neural recommender systems. We begin with an empirical analysis of the conventional approach of treating context as features in feed-forward recommenders and demonstrate that this approach is inefficient in capturing common feature crosses. We apply this insight to design a state-of-the-art RNN recommender system. We first describe our RNN-based recommender system in use at YouTube. Next, we offer "Latent Cross," an easy-to-use technique to incorporate contextual data in the RNN by first embedding the context feature and then performing an element-wise product of the context embedding with the model's hidden states. We demonstrate the improvement in performance from using this Latent Cross technique in multiple experimental settings.
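
The two steps named in the abstract (embed the context feature, then take an element-wise product with a hidden state) can be sketched in a few lines. This is only an illustration under stated assumptions, not the system described in the paper: the "device" feature, the embedding table, its random initialisation, and the function names make_embedding_table and latent_cross are all hypothetical, and plain Python lists stand in for the tensors a real RNN would use.

```python
import random
from typing import Dict, List, Sequence

Vector = List[float]


def make_embedding_table(vocab: Sequence[str], dim: int, seed: int = 0) -> Dict[str, Vector]:
    """Hypothetical lookup table: one randomly initialised vector per context value."""
    rng = random.Random(seed)
    return {value: [rng.gauss(0.0, 0.1) for _ in range(dim)] for value in vocab}


def latent_cross(hidden: Sequence[float], context_embedding: Sequence[float]) -> Vector:
    """Element-wise product of a hidden state with a context embedding."""
    if len(hidden) != len(context_embedding):
        raise ValueError("hidden state and context embedding must have the same length")
    return [h * c for h, c in zip(hidden, context_embedding)]


# Step 1: embed the context feature (here, a made-up "device" feature).
device_embeddings = make_embedding_table(["mobile", "desktop"], dim=3)

# Step 2: modulate a hidden state with the embedding of the observed context.
hidden_state = [1.0, 2.0, 3.0]
crossed = latent_cross(hidden_state, device_embeddings["mobile"])

# Fixed values make the arithmetic easy to check by hand.
print(latent_cross([1.0, 2.0, 3.0], [0.5, 0.0, -1.0]))  # [0.5, 0.0, -3.0]
```

In this sketch the context value rescales every dimension of the hidden state directly, which is the kind of multiplicative feature cross the abstract says is captured inefficiently when context is fed in as an ordinary input feature.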

Index Terms

Information systems

Published in
WSDM '18: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining
February 2018
821 pages
ISBN: 9781450355810
DOI: 10.1145/3159652
General Chairs:
Yi Chang (Jilin University, Huawei Inc.)
Chengxiang Zhai (University of Illinois Urbana-Champaign)
Program Chairs:
Yan Liu (University of Southern California)
Yoelle Maarek (Amazon)
Copyright © 2018 Owner/Author
This work is licensed under a Creative Commons Attribution-ShareAlike International 4.0 License.
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
contextual recommendation
recommender system
recurrent neural network
Qualifiers
- research-article
Acceptance Rates
WSDM '18 Paper Acceptance Rate: 81 of 514 submissions, 16%
Overall Acceptance Rate: 498 of 2,863 submissions, 17%
Article Metrics
- Total Citations: 196
- Total Downloads: 5,615
- Downloads (Last 12 months): 617
- Downloads (Last 6 weeks): 74