ABSTRACT
In a pilot application based on web search engine called Web-based Relation Completion (WebRC), we propose to join two columns of entities linked by a predefined relation by mining knowledge from the web through a web search engine. To achieve this, a novel retrieval task Relation Query Expansion (RelQE) is modelled: given an entity (query), the task is to retrieve documents containing entities in predefined relation to the given one. Solving this problem entails expanding the query before submitting it to a web search engine to ensure that mostly documents containing the linked entity are returned in the top K search results. In this paper, we propose a novel Learning-based Relevance Feedback (LRF) approach to solve this retrieval task. Expansion terms are learned from training pairs of entities linked by the predefined relation and applied to new entity-queries to find entities linked by the same relation. After describing the approach, we present experimental results on real-world web data collections, which show that the LRF approach always improves the precision of top-ranked search results to up to 8.6 times the baseline. Using LRF, WebRC also shows performances way above the baseline.
- S. Adafre, M. de Rijke, and E. Sang. Entity retrieval. Proceedings of RANLP, Bulgaria, September, 2007.Google Scholar
- R. French. The computational modeling of analogy-making. Trends in Cognitive Sciences, 6(5):200--205, 2002.Google ScholarCross Ref
- V. Lavrenko and W. Croft. Relevance based language models. In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pages 120--127, 2001. Google ScholarDigital Library
- Z. Li, L. Sitbon, L. Wang, X. Zhou, and X. Du. Approximate Membership Localization (AML) for Web-Based Join. In Proceedings of the 19th International Conference on Information and Knowledge Management, pages 1321--1324, 2010. Google ScholarDigital Library
- Y. Lv and C. Zhai. Positional relevance model for pseudo-relevance feedback. In Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval, pages 579--586, 2010. Google ScholarDigital Library
- S. Needleman and C. Wunsch. A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of molecular biology, 48(3):443--453, 1970.Google ScholarCross Ref
- P. Turney and M. Littman. Corpus-based learning of analogies and semantic relations. In Machine Learning Journal, Volume 60, Numbers 1--3, 2005. Google ScholarDigital Library
- C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval. In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pages 334--342, 2001. Google ScholarDigital Library
- J. Zhu, A. de Vries, G. Demartini, and T. Iofciu. Evaluating relation retrieval for entities and experts. In Proceedings of the SIGIR 2008 Workshop on Future Challenges in Expertise Retrieval (fCHER), 2008.Google Scholar
Index Terms
- Learning-based relevance feedback for web-based relation completion
Recommendations
Relevant Term Suggestion Based on Pseudo Relevance Feedback from Web Contexts
The Outreach of Digital Libraries: A Globalized Resource NetworkAbstractMost search engines rely on query logs to give query suggestions. By mining potential relevant terms surrounding the query from Web resources, we aim at improving query formulation and retrieval effectiveness without query logs. In this paper, we ...
Click data as implicit relevance feedback in web search
Search sessions consist of a person presenting a query to a search engine, followed by that person examining the search results, selecting some of those search results for further review, possibly following some series of hyperlinks, and perhaps ...
Toward Feedback-Based Web Search Engine
WAINA '09: Proceedings of the 2009 International Conference on Advanced Information Networking and Applications WorkshopsThe capability to easily find useful information on the Web becomes increasingly difficult as the available content increases. To assist users in finding relevant information Web Search Engines are developed. Those engines apply different strategies to ...
Comments