skip to main content
Skip header Section
Learning to Rank for Information Retrieval and Natural Language ProcessingApril 2011
Publisher:
  • Morgan & Claypool Publishers
ISBN:978-1-60845-707-6
Published:22 April 2011
Pages:
114
Skip Bibliometrics Section
Bibliometrics
Skip Abstract Section
Abstract

Learning to rank refers to machine learning techniques for training the model in a ranking task. Learning to rank is useful for many applications in information retrieval, natural language processing, and data mining. Intensive studies have been conducted on the problem recently and significant progress has been made. This lecture gives an introduction to the area including the fundamental problems, existing approaches, theories, applications, and future work. The author begins by showing that various ranking problems in information retrieval and natural language processing can be formalized as two basic ranking tasks, namely ranking creation (or simply ranking) and ranking aggregation. In ranking creation, given a request, one wants to generate a ranking list of offerings based on the features derived from the request and the offerings. In ranking aggregation, given a request, as well as a number of ranking lists of offerings, one wants to generate a new ranking list of the offerings. Ranking creation (or ranking) is the major problem in learning to rank. It is usually formalized as a supervised learning task. The author gives detailed explanations on learning for ranking creation and ranking aggregation, including training and testing, evaluation, feature creation, and major approaches. Many methods have been proposed for ranking creation. The methods can be categorized as the pointwise, pairwise, and listwise approaches according to the loss functions they employ. They can also be categorized according to the techniques they employ, such as the SVM based, Boosting SVM, Neural Network based approaches. The author also introduces some popular learning to rank methods in details. These include PRank, OC SVM, Ranking SVM, IR SVM, GBRank, RankNet, LambdaRank, ListNet & ListMLE, AdaRank, SVM MAP, SoftRank, Borda Count, Markov Chain, and CRanking. The author explains several example applications of learning to rank including web search, collaborative filtering, definition search, keyphrase extraction, query dependent summarization, and re-ranking in machine translation. A formulation of learning for ranking creation is given in the statistical learning framework. Ongoing and future research directions for learning to rank are also discussed. Table of Contents: Introduction / Learning for Ranking Creation / Learning for Ranking Aggregation / Methods of Learning to Rank / Applications of Learning to Rank / Theory of Learning to Rank / Ongoing and Future Work

Cited By

  1. ACM
    Chowdhury T, Rahimi R and Allan J Rank-LIME: Local Model-Agnostic Feature Attribution for Learning to Rank Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval, (33-37)
  2. Werner T (2022). A review on instance ranking problems in statistical learning, Machine Language, 111:2, (415-463), Online publication date: 1-Feb-2022.
  3. ACM
    Guo J, Cai Y, Fan Y, Sun F, Zhang R and Cheng X (2022). Semantic Models for the First-Stage Retrieval: A Comprehensive Review, ACM Transactions on Information Systems, 40:4, (1-42), Online publication date: 31-Oct-2022.
  4. ACM
    Omri S and Sinz C Learning to rank for test case prioritization Proceedings of the 15th Workshop on Search-Based Software Testing, (16-24)
  5. Khan M, Azim A, Liscano R, Smith K, Chang Y, Garcon S and Tauseef Q Failure prediction using machine learning in IBM WebSphere liberty continuous integration environment Proceedings of the 31st Annual International Conference on Computer Science and Software Engineering, (63-72)
  6. Chen Y, Zhuang Z and Qin W Learning to Rank High Closeness Centrality Nodes in a Given Network based on RankNet Method 2021 IEEE 17th International Conference on Automation Science and Engineering (CASE), (1695-1700)
  7. Sekar S, Vojnovic M and Yun S (2021). A Test Score-Based Approach to Stochastic Submodular Optimization, Management Science, 67:2, (1075-1092), Online publication date: 1-Feb-2021.
  8. Zeng A, Yu H, Da Q, Zhan Y, Yu Y, Zhou J and Miao C (2021). Improving search engine efficiency through contextual factor selection, AI Magazine, 42:2, (50-58), Online publication date: 1-Jun-2021.
  9. He Y, Liu J and Ning X (2020). Drug Selection via Joint Push and Learning to Rank, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 17:1, (110-123), Online publication date: 1-Jan-2020.
  10. ACM
    Patro G, Chakraborty A, Banerjee A and Ganguly N Towards Safety and Sustainability: Designing Local Recommendations for Post-pandemic World Proceedings of the 14th ACM Conference on Recommender Systems, (358-367)
  11. ACM
    Wang Y, Chen Q, He C, Liu H and Wu X Knowledge Base Question Answering System Based on Knowledge Graph Representation Learning Proceedings of the 2020 the 4th International Conference on Innovation in Artificial Intelligence, (170-179)
  12. ACM
    Pasumarthi R, Bruch S, Wang X, Li C, Bendersky M, Najork M, Pfeifer J, Golbandi N, Anil R and Wolf S TF-Ranking Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, (2970-2978)
  13. ACM
    Ding K, Ma K and Wang S Intrinsic Image Popularity Assessment Proceedings of the 27th ACM International Conference on Multimedia, (1979-1987)
  14. ACM
    Yu H, Jatowt A, Joho H, Jose J, Yang X and Chen L WassRank Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, (24-32)
  15. ACM
    Pasumarthi R, Bruch S, Bendersky M and Wang X Neural Learning to Rank using TensorFlow Ranking Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval, (253-254)
  16. ACM
    Aliannejadi M, Zamani H, Crestani F and Croft W In Situ and Context-Aware Target Apps Selection for Unified Mobile Search Proceedings of the 27th ACM International Conference on Information and Knowledge Management, (1383-1392)
  17. ACM
    Zamani H, Dehghani M, Croft W, Learned-Miller E and Kamps J From Neural Re-Ranking to Neural Ranking Proceedings of the 27th ACM International Conference on Information and Knowledge Management, (497-506)
  18. ACM
    Aliannejadi M, Zamani H, Crestani F and Croft W Target Apps Selection The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, (215-224)
  19. ACM
    Zamani H, Croft W and Culpepper J Neural Query Performance Prediction using Weak Supervision from Multiple Signals The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, (105-114)
  20. Wang J, Wang Z, Gao C, Sang N and Huang R (2017). DeepList, IEEE Transactions on Circuits and Systems for Video Technology, 27:3, (513-524), Online publication date: 1-Mar-2017.
  21. (2017). Automatically generating effective search queries directly from community question-answering questions for finding related questions, Expert Systems with Applications: An International Journal, 77:C, (11-19), Online publication date: 1-Jul-2017.
  22. Zhang M, Liu Y, Luan H and Sun M (2016). Listwise ranking functions for statistical machine translation, IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 24:8, (1464-1472), Online publication date: 1-Aug-2016.
  23. ACM
    Ibrahim M and Carman M (2016). Comparing Pointwise and Listwise Objective Functions for Random-Forest-Based Learning-to-Rank, ACM Transactions on Information Systems, 34:4, (1-38), Online publication date: 14-Sep-2016.
  24. ACM
    Wang Q, Dimopoulos C and Suel T Fast First-Phase Candidate Generation for Cascading Rankers Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, (295-304)
  25. ACM
    Na S (2015). Two-Stage Document Length Normalization for Information Retrieval, ACM Transactions on Information Systems, 33:2, (1-40), Online publication date: 26-Feb-2015.
  26. Wang M, Lu Z, Li H and Liu Q Syntax-based deep matching of short texts Proceedings of the 24th International Conference on Artificial Intelligence, (1354-1361)
  27. ACM
    Cao J, Huang Z and Yang Y Spatial-aware Multimodal Location Estimation for Social Images Proceedings of the 23rd ACM international conference on Multimedia, (119-128)
  28. ACM
    Schamoni S and Riezler S Combining Orthogonal Information in Large-Scale Cross-Language Information Retrieval Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, (943-946)
  29. Qian L, Gao J and Jagadish H (2015). Learning user preferences by adaptive pairwise comparison, Proceedings of the VLDB Endowment, 8:11, (1322-1333), Online publication date: 1-Jul-2015.
  30. ACM
    Xia X, Lo D, Wang X, Zhang C and Wang X Cross-language bug localization Proceedings of the 22nd International Conference on Program Comprehension, (275-278)
  31. ACM
    Yi X, Hong L, Zhong E, Liu N and Rajan S Beyond clicks Proceedings of the 8th ACM Conference on Recommender systems, (113-120)
  32. ACM
    Zamani H, Shakery A and Moradi P Regression and Learning to Rank Aggregation for User Engagement Evaluation Proceedings of the 2014 Recommender Systems Challenge, (29-34)
  33. Li H and Xu J (2014). Semantic Matching in Search, Foundations and Trends in Information Retrieval, 7:5, (343-469), Online publication date: 12-Jun-2014.
  34. Wang Y and Lin J The Impact of Future Term Statistics in Real-Time Tweet Search Proceedings of the 36th European Conference on IR Research on Advances in Information Retrieval - Volume 8416, (567-572)
  35. ACM
    Bendersky M, Garcia-Pueyo L, Harmsen J, Josifovski V and Lepikhin D Up next Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, (1769-1778)
  36. ACM
    Mishne G, Dalton J, Li Z, Sharma A and Lin J Fast data in the era of big data Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, (1147-1158)
  37. ACM
    Lin J and Ryaboy D (2013). Scaling big data mining infrastructure, ACM SIGKDD Explorations Newsletter, 14:2, (6-19), Online publication date: 30-Apr-2013.
  38. ACM
    Macdonald C, Santos R, Ounis I and He B (2013). About learning models with multiple query-dependent features, ACM Transactions on Information Systems, 31:3, (1-39), Online publication date: 1-Jul-2013.
  39. ACM
    Asadi N and Lin J (2013). Fast candidate generation for real-time tweet search with bloom filter chains, ACM Transactions on Information Systems, 31:3, (1-36), Online publication date: 1-Jul-2013.
  40. ACM
    Jiang D, Pei J and Li H (2013). Mining search and browse logs for web search, ACM Transactions on Intelligent Systems and Technology, 4:4, (1-37), Online publication date: 1-Sep-2013.
  41. Asadi N and Lin J Training efficient tree-based models for document ranking Proceedings of the 35th European conference on Advances in Information Retrieval, (146-157)
  42. ACM
    Asadi N and Lin J Effectiveness/efficiency tradeoffs for candidate generation in multi-stage retrieval architectures Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval, (997-1000)
  43. ACM
    Ji Z and Wang B Learning to rank for question routing in community question answering Proceedings of the 22nd ACM international conference on Information & Knowledge Management, (2363-2368)
  44. ACM
    Lu X, Wu F, Tang S, Zhang Z, He X and Zhuang Y A low rank structural large margin method for cross-modal ranking Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval, (433-442)
  45. ACM
    Sharma A and Yan B Pairwise learning in recommendation Proceedings of the 7th ACM conference on Recommender systems, (193-200)
  46. ACM
    Wu W, Li H and Xu J Learning query and document similarities from click-through bipartite graph with metadata Proceedings of the sixth ACM international conference on Web search and data mining, (687-696)
  47. ACM
    Wang C, Bi K, Hu Y, Li H and Cao G Extracting search-focused key n-grams for relevance ranking in web search Proceedings of the fifth ACM international conference on Web search and data mining, (343-352)
  48. ACM
    Bendersky M, Metzler D and Croft W Effective query formulation with multiple information sources Proceedings of the fifth ACM international conference on Web search and data mining, (443-452)
  49. ACM
    Nanongkai D, Lall A, Das Sarma A and Makino K Interactive regret minimization Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, (109-120)
  50. ACM
    Lin J and Kolcz A Large-scale machine learning at twitter Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, (793-804)
  51. ACM
    Asadi N and Lin J Fast candidate generation for two-phase document ranking Proceedings of the 21st ACM international conference on Information and knowledge management, (2419-2422)
  52. Farkas R and Schmid H Forest reranking through subtree ranking Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, (1038-1047)
  53. Ganitkevitch J, Cao Y, Weese J, Post M and Callison-Burch C Joshua 4.0 Proceedings of the Seventh Workshop on Statistical Machine Translation, (283-291)
  54. ACM
    Silva A and Martins B Tag recommendation for georeferenced photos Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Location-Based Social Networks, (57-64)
  55. ACM
    Candeias R and Martins B Learning to associate relevant photos to georeferenced textual documents Proceedings of the 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, (445-448)
Contributors
  • Microsoft Research Asia

Recommendations