ABSTRACT
Lack of calibrated product sizing in popular categories such as apparel and shoes leads to customers purchasing incorrect sizes, which in turn results in high return rates due to fit issues. We address the problem of product size recommendations based on customer purchase and return data. We propose a novel approach based on Bayesian logit and probit regression models with ordinal categories Small, Fit, Largeto model size fits as a function of the difference between latent sizes of customers and products. We propose posterior computation based on mean-field variational inference, leveraging the Polya-Gamma augmentation for the logit prior, that results in simple updates, enabling our technique to efficiently handle large datasets. Our Bayesian approach effectively deals with issues arising from noise and sparsity in the data providing robust recommendations. Offline experiments with real-life shoe datasets show that our model outperforms the state-of-the-art in 5 of 6 datasets. and leads to an improvement of 17-26% in AUC over baselines when predicting size fit outcomes.
- Adomavicius, G., and Tuzhilin, A. Context-aware recommender systems. In Recommender systems handbook. Springer, 2011, pp. 217--253.Google ScholarCross Ref
- Agarwal, D., and Chen, B.-C. Regression-based latent factor models. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (New York, NY, USA, 2009), KDD '09, ACM, pp. 19--28. Google ScholarDigital Library
- Aggarwal, C. C. Recommender systems. Springer, 2016. Google ScholarCross Ref
- Albert, J. H., and Chib, S. Bayesian analysis of binary and polychotomous response data. Journal of the American statistical Association 88, 422 (1993), 669--679.Google Scholar
- Bauer, J., and Nanopoulos, A. A framework for matrix factorization based on general distributions. In Proceedings of the 8th ACM Conference on Recommender Systems (New York, NY, USA, 2014), RecSys '14, ACM, pp. 249--256. Google ScholarDigital Library
- Bishop, C. M. Pattern Recognition and Machine Learning. Springer, 2006. Google ScholarDigital Library
- Charlin, L., Ranganath, R., McInerney, J., and Blei, D. M. Dynamic poisson factorization. In Proceedings of the 9th ACM Conference on Recommender Systems (New York, NY, USA, 2015), RecSys '15, ACM, pp. 155--162. Google ScholarDigital Library
- Chen, M.-H., and Dey, D. K. Bayesian analysis for correlated ordinal data models. BIOSTATISTICS-BASEL- 5 (2000), 133--158.Google Scholar
- Chu, W., and Park, S.-T. Personalized recommendation on dynamic content using predictive bilinear models. In Proceedings of the 18th International Conference on World Wide Web (New York, NY, USA, 2009), WWW '09, ACM, pp. 691--700. Google ScholarDigital Library
- Church, K., Smyth, B., Cotter, P., and Bradley, K. Mobile information access: A study of emerging search behavior on the mobile internet. ACM Trans. Web 1, 1 (May 2007). Google ScholarDigital Library
- Gopalan, P. K., Charlin, L., and Blei, D. Content-based recommendations with poisson factorization. In Advances in Neural Information Processing Systems (2014), pp. 3176--3184. Google ScholarDigital Library
- Greene, W. H., and Hensher, D. A. Modeling ordered choices: A primer. Cambridge University Press, 2010.Google ScholarCross Ref
- Grimmer, J. An introduction to bayesian inference via variational approximations. Political Analysis 19, 1 (2010), 32--47.Google ScholarCross Ref
- Harvey, M., Carman, M. J., Ruthven, I., and Crestani, F. Bayesian latent variable models for collaborative item rating prediction. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management (New York, NY, USA, 2011), CIKM '11, ACM, pp. 699--708. Google ScholarDigital Library
- Joshi, B., Iutzeler, F., and Amini, M.-R. Asynchronous distributed matrix factorization with similar user and item based regularization. In Proceedings of the 10th ACM Conference on Recommender Systems (New York, NY, USA, 2016), RecSys '16, ACM, pp. 75--78. Google ScholarDigital Library
- Karatzoglou, A., Amatriain, X., Baltrunas, L., and Oliver, N. Multiverse recommendation: n-dimensional tensor factorization for context-aware collaborative filtering. In Proceedings of the fourth ACM conference on Recommender systems (2010), ACM, pp. 79--86. Google ScholarDigital Library
- Kawakatsu, H., and Largey, A. G. EM algorithms for ordered probit models with endogenous regressors. The Econometrics Journal 12, 1 (2009), 164--186.Google ScholarCross Ref
- Koren, Y., Bell, R., and Volinsky, C. Matrix factorization techniques for recommender systems. Computer 42, 8 (2009). Google ScholarDigital Library
- Liang, D., Altosaar, J., Charlin, L., and Blei, D. M. Factorization meets the item embedding: Regularizing matrix factorization with item co-occurrence. In Proceedings of the 10th ACM conference on recommender systems (2016), ACM, pp. 59--66. Google ScholarDigital Library
- McAuley, J., and Leskovec, J. Hidden factors and hidden topics: understanding rating dimensions with review text. In Proceedings of the 7th ACM conference on Recommender systems (2013), ACM, pp. 165--172. Google ScholarDigital Library
- McKinley, T. J., Morters, M., Wood, J. L., et al. Bayesian model choice in cumulative link ordinal regression models. Bayesian Analysis 10, 1 (2015), 1--30.Google ScholarCross Ref
- Narita, A., Hayashi, K., Tomioka, R., and Kashima, H. Tensor factorization using auxiliary information. Data Mining and Knowledge Discovery 25, 2 (2012), 298--324. Google ScholarDigital Library
- Palmisano, C., Tuzhilin, A., and Gorgoglione, M. Using context to improve predictive modeling of customers in personalization applications. IEEE transactions on knowledge and data engineering 20, 11 (2008), 1535--1549. Google ScholarDigital Library
- Panniello, U., Tuzhilin, A., Gorgoglione, M., Palmisano, C., and Pedone, A. Experimental comparison of pre-vs. post-filtering approaches in contextaware recommender systems. In Proceedings of the third ACM conference on Recommender systems (2009), ACM, pp. 265--268. Google ScholarDigital Library
- Polson, N. G., Scott, J. G., and Windle, J. Bayesian inference for logistic models using pólya--gamma latent variables. Journal of the American statistical Association 108, 504 (2013), 1339--1349.Google Scholar
- Qin, Z., Rishabh, I., and Carnahan, J. A scalable approach for periodical personalized recommendations. In Proceedings of the 10th ACM Conference on Recommender Systems (New York, NY, USA, 2016), RecSys '16, ACM, pp. 23--26. Google ScholarDigital Library
- Rafailidis, D., and Nanopoulos, A. Modeling the dynamics of user preferences in coupled tensor factorization. In Proceedings of the 8th ACM Conference on Recommender Systems (New York, NY, USA, 2014), RecSys '14, ACM, pp. 321--324. Google ScholarDigital Library
- Salakhutdinov, R., and Mnih, A. Bayesian probabilistic matrix factorization using markov chain monte carlo. In Proceedings of the 25th International Conference on Machine Learning (New York, NY, USA, 2008), ICML '08, ACM, pp. 880--887. Google ScholarDigital Library
- Salakhutdinov, R., and Mnih, A. Probabilistic matrix factorization. In Advances in Neural Information Processing Systems (2008), vol. 20. Google ScholarDigital Library
- Sembium, V., Rastogi, R., Saroop, A., and Merugu, S. Recommending product sizes to customers. In Proceedings of the Eleventh ACM Conference on Recommender Systems (New York, NY, USA, 2017), RecSys '17, ACM, pp. 243--250. Google ScholarDigital Library
- Vasile, F., Smirnova, E., and Conneau, A. Meta-prod2vec: Product embeddings using side-information for recommendation. In Proceedings of the 10th ACM Conference on Recommender Systems (2016), ACM, pp. 225--232 Google ScholarDigital Library
Index Terms
- Bayesian Models for Product Size Recommendations
Recommendations
A note on mean-field variational approximations in Bayesian probit models
We correct some conclusions presented by Consonni and Marin (2007) on the performance of mean-field variational approximations to Bayesian inferences in the case of a simple probit model. We show that some of their presentations are misleading and thus ...
Mean-field variational approximate Bayesian inference for latent variable models
The ill-posed nature of missing variable models offers a challenging testing ground for new computational techniques. This is the case for the mean-field variational Bayesian inference. The behavior of this approach in the setting of the Bayesian probit ...
Collaborative Variational Autoencoder for Recommender Systems
KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data MiningModern recommender systems usually employ collaborative filtering with rating information to recommend items to users due to its successful performance. However, because of the drawbacks of collaborative-based methods such as sparsity, cold start, etc., ...
Comments