
Factor in the neighbors: Scalable and accurate collaborative filtering

Author:
Yehuda Koren

Yahoo! Research, Haifa, Israel


ACM Transactions on Knowledge Discovery from Data, Volume 4, Issue 1, Article No. 1, pp. 1–24. https://doi.org/10.1145/1644873.1644874

Published: 18 January 2010

Abstract

Recommender systems provide users with personalized suggestions for products or services. These systems often rely on collaborative filtering (CF), where past transactions are analyzed in order to establish connections between users and products. The most common approach to CF is based on neighborhood models, which originate from similarities between products or users. In this work we introduce a new neighborhood model with improved prediction accuracy. Unlike previous approaches that are based on heuristic similarities, we model neighborhood relations by minimizing a global cost function. Further accuracy improvements are achieved by extending the model to exploit both explicit and implicit feedback from the users. Past models were limited by the need to compute all pairwise similarities between items or users, the number of which grows quadratically with input size. In particular, this limitation vastly complicates adopting user similarity models, due to the typically large number of users. Our new model overcomes these limitations by factoring the neighborhood model, thus making both item-item and user-user implementations scale linearly with the size of the data. The methods are tested on the Netflix data, with encouraging results.
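The quadratic growth mentioned in the abstract can be illustrated with a small counting sketch. It is not the article's model: it only compares how many values must be stored when every unordered pair of items gets its own weight with how many are stored when each item instead carries two short vectors. The function names, the two-vectors-per-item layout, and the rank of 50 are assumptions made for illustration, and the sketch counts stored parameters only, not the running time the abstract refers to.

```python
def pairwise_weight_count(n_items: int) -> int:
    """Number of distinct unordered item pairs, n(n-1)/2."""
    return n_items * (n_items - 1) // 2


def factored_parameter_count(n_items: int, rank: int) -> int:
    """Stored values when each item has two vectors of length `rank`.

    The two-vector layout is a hypothetical choice for this sketch.
    """
    return 2 * n_items * rank


if __name__ == "__main__":
    # rank=50 is an arbitrary illustrative value, not taken from the article.
    for n in (1_000, 10_000, 100_000):
        print(n, pairwise_weight_count(n), factored_parameter_count(n, rank=50))
```

Multiplying the number of items by ten multiplies the pairwise count by roughly one hundred, while the factored count grows only tenfold.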


Index Terms

1. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Document filtering
      2. Information extraction
  2. Information systems applications
    1. Data mining


Published in

ACM Transactions on Knowledge Discovery from Data Volume 4, Issue 1
January 2010
135 pages
ISSN: 1556-4681
EISSN: 1556-472X
DOI: 10.1145/1644873

Copyright © 2010 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 18 January 2010
- Accepted: 1 May 2009
- Revised: 1 April 2009
- Received: 1 January 2009
Author Tags
Netflix Prize
Recommender systems
collaborative filtering
Qualifiers
- research-article
- Research
- Refereed