article

Free Access

Learning bilinear model for matching queries and documents

Authors:
Wei Wu

Microsoft Research Asia, Beijing, P. R. China

Microsoft Research Asia, Beijing, P. R. China
View Profile

,
Zhengdong Lu

Huawei Noah's Ark Lab, Shatin, Hong Kong

Huawei Noah's Ark Lab, Shatin, Hong Kong
View Profile

,
Hang Li

Huawei Noah's Ark Lab, Shatin, Hong Kong

Huawei Noah's Ark Lab, Shatin, Hong Kong
View Profile

Authors Info & Claims

The Journal of Machine Learning Research Volume 14 Issue 1pp 2519–2548

Published:01 January 2013Publication History

The Journal of Machine Learning Research

Abstract

The task of matching data from two heterogeneous domains naturally arises in various areas such as web search, collaborative filtering, and drug design. In web search, existing work has designed relevance models to match queries and documents by exploiting either user clicks or content of queries and documents. To the best of our knowledge, however, there has been little work on principled approaches to leveraging both clicks and content to learn a matching model for search. In this paper, we propose a framework for learning to match heterogeneous objects. The framework learns two linear mappings for two objects respectively, and matches them via the dot product of their images after mapping. Moreover, when different regularizations are enforced, the framework renders a rich family of matching models. With orthonormal constraints on mapping functions, the framework subsumes Partial Least Squares (PLS) as a special case. Alternatively, with a l₁+l₂ regularization, we obtain a new model called Regularized Mapping to Latent Structures (RMLS). RMLS enjoys many advantages over PLS, including lower time complexity and easy parallelization. To further understand the matching framework, we conduct generalization analysis and apply the result to both PLS and RMLS. We apply the framework to web search and implement both PLS and RMLS using a click-through bipartite with metadata representing features of queries and documents. We test the efficacy and scalability of RMLS and PLS on large scale web search problems. The results show that both PLS and RMLS can significantly outperform baseline methods, while RMLS substantially speeds up the learning process.

References

J. Abernethy, F. Bach, T. Evgeniou, and J.P. Vert. A new approach to collaborative filtering: Operator estimation with spectral regularization. JMLR '09, 10:803-826, 2009. Google Scholar
S. Agarwal and P. Niyogi. Stability and generalization of bipartite ranking algorithms. In COLT'05, pages 32-47, 2005. Google Scholar
F. Bach. Exploring large feature spaces with hierarchical multiple kernel learning. In NIPS'08, pages 105-112, 2008.Google Scholar
F. Bach, G. Lanckriet, and M. Jordan. Multiple kernel learning, conic duality, and the SMO algorithm. In ICML'04, 2004. Google Scholar
R. Baeza-Yates and A. Tiberi. Extracting semantic relations from query logs. In SIGKDD'07, pages 76-85, 2007. Google Scholar
B. Bai, J. Weston, D. Grangier, R. Collobert, K. Sadamasa, Y. Qi, O. Chapelle, and K. Weinberger. Supervised semantic indexing. In CIKM'09, pages 187-196, 2009. Google Scholar
P. L. Bartlett and S. Mendelson. Rademacher and gaussian complexities: Risk bounds and structural results. JMLR'02, 3:463-482, 2002. Google Scholar
C. Burges, R. Ragno, and Q. Le. Learning to rank with nonsmooth cost functions. In NIPS'06, pages 395-402. 2006.Google Scholar
Y. Cao, J. Xu, T.Y. Liu, H. Li, Y. Huang, and H.W. Hon. Adapting ranking svm to document retrieval. In SIGIR '06, pages 186-193, 2006. Google Scholar
Z. Cao, T. Qin, T.Y. Liu, M.F. Tsai, and H. Li. Learning to rank: from pairwise approach to listwise approach. In ICML '07, pages 129-136, 2007. Google Scholar
T.Q. Chen, W.N. Zhang, Q.X. Lu, K.L. Chen, Z. Zhao, and Y. Yu. Svdfeature: A toolkit for feature-based collaborative filtering. JMLR'12, 13, 2012. Google Scholar
W. Chen, T.Y. Liu, and Z. Ma. Two-layer generalization analysis for ranking using rademacher average. NIPS'10, 23:370-378, 2010.Google Scholar
C. Cortes. Invited talk: Can learning kernels help performance? In ICML'09, page 161, 2009. Google Scholar
K. Crammer and Y. Singer. Pranking with ranking. In NIPS'01, pages 641-647, 2001.Google Scholar
N. Craswell and M. Szummer. Random walks on the click graph. In SIGIR'07, pages 239-246, 2007. Google Scholar
N. Cristianini, J. Shawe-taylor, A. Elisseeff, and J. Kandola. On kernel-target alignment. In NIPS'01, 2001.Google Scholar
J.V. Davis, B. Kulis, P. Jain, S. Sra, and I.S. Dhillon. Information-theoretic metric learning. In ICML'07, page 216, 2007. Google Scholar
S. Deerwester, S. Dumais, G. Furnas, T. Landauer, and R. Harshman. Indexing by latent semantic analysis. JASIS'90, 41:391-407, 1990.Google Scholar
J.H. Friedman. Greedy function approximation: a gradient boosting machine. Ann. Statist, 29(5): 1189-1232, 2001.Google Scholar
J. Gao, K. Toutanova, and W. Yih. Clickthrough-based latent semantic models for web search. In SIGIR'11, pages 675-684, 2011. Google Scholar
D. Grangier and S. Bengio. A discriminative kernel-based model to rank images from text queries. IEEE Transactions on PAMI, 30(8):1371-1384, 2008. Google Scholar
D.R. Hardoon, S. Szedmak, and J. Shawe-Taylor. Canonical correlation analysis: An overview with application to learning methods. Neural Computation, 16(12):2639-2664, 2004. Google Scholar
R. Herbrich, T. Graepel, and K. Obermayer. Large margin rank boundaries for ordinal regression. NIPS'99, pages 115-132, 1999.Google Scholar
T. Hertz, A. Bar-Hillel, and D. Weinshall. Boosting margin based distance functions for clustering. In ICML'04, page 50, 2004. Google Scholar
T. Hofmann. Latent semantic models for collaborative filtering. ACM Trans. Inf. Syst., 22:89-115, 2004. Google Scholar
S. CH. Hoi, W. Liu, M.R. Lyu, and W.Y. Ma. Learning distance metrics with contextual constraints for image retrieval. In CVPR'06, volume 2, pages 2072-2078, 2006. Google Scholar
S. JacobGoldberger and R. GeoffHinton. Neighbourhood components analysis. NIPS'04, 2004.Google Scholar
K. Jarvelin and J. Kekalainen. Ir evaluation methods for retrieving highly relevant documents. In SIGIR'00, pages 41-48, 2000. Google Scholar
T. Joachims. Optimizing search engines using clickthrough data. In KDD '02, pages 133-142, 2002. Google Scholar
I.T. Jolliffe. Principal Component Analysis. Springer verlag, 2002.Google Scholar
G. R. G. Lanckriet, N. Cristianini, P. Bartlett, L. E. Ghaoui, and M. I. Jordan. Learning the kernel matrix with semi-definite programming. In ICML'02, pages 323-330, 2002. Google Scholar
H. Li. Learning to rank for information retrieval and natural language processing. Synthesis Lectures on Human Language Technologies, 4(1):1-113, 2011.Google Scholar
T.Y. Liu. Learning to rank for information retrieval. Foundations and Trends in Information Retrieval , 3(3):225-331, 2009. Google Scholar
H. Ma, H.X. Yang, I. King, and M. R. Lyu. Learning latent semantic relations from clickthrough data for query suggestion. In CIKM'08, pages 709-718, 2008. Google Scholar
C. Micchelli and M. Pontil. Learning the kernel function via regularization. JMLR'05, 6:1099- 1125, 2005. Google Scholar
C.S. Ong, A.J. Smola, and R.C. Williamson. Learning the kernel with hyperkernels. JMLR '05, 6: 1043-1071, 2005. Google Scholar
J.M. Ponte and W.B. Croft. A language modeling approach to information retrieval. In SIGIR'98, pages 275-281, 1998. Google Scholar
S. E. Robertson, S. Walker, S. Jones, M. Hancock-Beaulieu, and M. Gatford. Okapi at trec-3. In TREC, 1994.Google Scholar
R. Rosipal and N. Krämer. Overview and recent advances in partial least squares. Subspace, Latent Structure and Feature Selection, pages 34-51, 2006. Google Scholar
C. Rudin, C. Cortes, M. Mohri, and R. E. Schapire. Margin-based ranking meets boosting in the middle. In COLT'05, pages 63-78, 2005. Google Scholar
G. Salton and M. McGill. Introduction to Modern Information Retrieval. McGraw-Hill, Inc., New York, NY, USA, 1986. Google Scholar
P.J. Schreier. A unifying discussion of correlation analysis for complex random vectors. Signal Processing, IEEE Transactions on, 56(4):1327-1336, 2008. Google Scholar
M. Schultz and T. Joachims. Learning a distance metric from relative comparisons. NIPS'03, 2003.Google Scholar
N. Srebro, J.D.M. Rennie, and T. Jaakkola. Maximum-margin matrix factorization. NIPS'05, pages 1329-1336, 2005.Google Scholar
M. Sugiyama. Dimensionality reduction of multimodal labeled data by local fisher discriminant analysis. JMLR'07, 8:1027-1061, 2007. Google Scholar
M. Varma and B. R. Babu. More generality in efficient multiple kernel learning. In ICML'09, page 134, 2009. Google Scholar
J.A. Wegelin. A survey of partial least squares (pls) methods, with emphasis on the two-block case. Technical Report, No.371, Seattle: Department of Statistics, Univ. of Wash., 2000.Google Scholar
K.Q. Weinberger and L.K. Saul. Distance metric learning for large margin nearest neighbor classification. JMLR'09, 10:207-244, 2009. Google Scholar
W. Wu, J. Xu, H. Li, and O. Satoshi. Learning a robust relevance model for search using kernel methods. JMLR'11, 12:1429-1458, 2011. Google Scholar
E.P. Xing, A.Y. Ng, M.I. Jordan, and S. Russell. Distance metric learning with application to clustering with side-information. NIPS'03, pages 521-528, 2003.Google Scholar
J. Xu and H. Li. Adarank: a boosting algorithm for information retrieval. In SIGIR' 07, pages 391-398, 2007. Google Scholar
J. Xu, H. Li, and Z.L. Zhong. Relevance ranking using kernels. In AIRS '10, 2010.Google Scholar
L. Yang, R. Jin, R. Sukthankar, and Y. Liu. An efficient algorithm for local distance metric learning. In AAAI'06, page 543, 2006. Google Scholar
Y. Ying, K. Huang, and C. Campbell. Sparse Metric Learning via Smooth Optimization. NIPS'09, pages 521-528, 2009.Google Scholar
C.X. Zhai and J. Lafferty. A study of smoothing methods for language models applied to information retrieval. ACM Trans. Inf. Syst., 22(2):179-214, 2004. Google Scholar

Index Terms

Learning bilinear model for matching queries and documents
1. Computing methodologies
  1. Machine learning
2. Information systems
  1. Information retrieval

Index terms have been assigned to the content through auto-classification.

Recommendations

Identifying popular search goals behind search queries to improve web search ranking
AIRS'11: Proceedings of the 7th Asia conference on Information Retrieval Technology

Web users usually have a certain search goal before they submit a search query. However, many laypersons can't transform their search goals into suitable queries. Thus, understanding original search goals behind a query is very important for search ...
Read More
Quality-biased ranking for queries with commercial intent
WWW '13 Companion: Proceedings of the 22nd International Conference on World Wide Web

Modern search engines are good enough to answer popular commercial queries with mainly highly relevant documents. However, our experiments show that users behavior on such relevant commercial sites may differ from one to another web-site with the same ...
Read More
Improving Ranking Consistency for Web Search by Leveraging a Knowledge Base and Search Logs
CIKM '15: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management

In this paper, we propose a new idea called ranking consistency in web search. Relevance ranking is one of the biggest problems in creating an effective web search system. Given some queries with similar search intents, conventional approaches typically ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

The Journal of Machine Learning Research Volume 14, Issue 1
January 2013
3717 pages
ISSN:1532-4435
EISSN:1533-7928
Issue’s Table of Contents
Sponsors
In-Cooperation
Publisher
JMLR.org
Publication History
- Published: 1 January 2013
Published in jmlr Volume 14, Issue 1
Author Tags
generalization analysis
partial least squares
regularized mapping to latent structures
web search
Qualifiers
- article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 8
  Total Citations
  View Citations
- 168
  Total Downloads
- Downloads (Last 12 months)10
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Learning bilinear model for matching queries and documents

The Journal of Machine Learning Research

Abstract

References

Cited By

Index Terms

Recommendations

Identifying popular search goals behind search queries to improve web search ranking

Quality-biased ranking for queries with commercial intent

Improving Ranking Consistency for Web Search by Leveraging a Knowledge Base and Search Logs

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Learning bilinear model for matching queries and documents

The Journal of Machine Learning Research

Abstract

References

Cited By

Index Terms

Recommendations

Identifying popular search goals behind search queries to improve web search ranking

Quality-biased ranking for queries with commercial intent

Improving Ranking Consistency for Web Search by Leveraging a Knowledge Base and Search Logs

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media