ABSTRACT
The inherent ambiguity of short keyword queries demands enhanced methods for Web retrieval. In this paper we propose to improve such Web queries by expanding them with terms collected from each user's Personal Information Repository, thus implicitly personalizing the search output. We introduce five broad techniques for generating the additional query keywords by analyzing user data at increasing granularity levels, ranging from term and compound level analysis up to global co-occurrence statistics, as well as using external thesauri. Our extensive empirical analysis under four different scenarios shows that some of these approaches perform very well, especially on ambiguous queries, producing a very strong increase in the quality of the output rankings. Subsequently, we move this personalized search framework one step further and propose to make the expansion process adaptive to various features of each query. A separate set of experiments indicates that the adaptive algorithms bring an additional statistically significant improvement over the best static expansion approach.
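To make the general idea of expanding a query with terms from a user's own documents concrete, here is a minimal sketch in Python. It is not one of the five techniques from the paper: it assumes a simple document-level co-occurrence count, and the names `expand_query`, `tokenize`, `STOPWORDS`, and `num_terms` are hypothetical choices made for this illustration only.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not taken from the paper).
STOPWORDS = frozenset(
    {"the", "a", "an", "of", "and", "to", "in", "for", "is", "on", "with"}
)


def tokenize(text):
    """Lowercase the text and return its alphanumeric tokens, minus stopwords."""
    return [t for t in re.findall(r"[a-z0-9]+", text.lower()) if t not in STOPWORDS]


def expand_query(query, personal_docs, num_terms=2):
    """Append to the query the terms that most often share a document with it.

    A term scores one point for every personal document that contains both
    the term and at least one of the query terms. Ties are broken
    alphabetically so that the output is deterministic.
    """
    query_terms = tokenize(query)
    query_set = set(query_terms)
    scores = Counter()
    for doc in personal_docs:
        doc_terms = set(tokenize(doc))
        if not query_set & doc_terms:
            continue  # this document says nothing about the query
        for term in doc_terms - query_set:
            scores[term] += 1
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return query_terms + [term for term, _ in ranked[:num_terms]]


if __name__ == "__main__":
    docs = [
        "Jaguar XK engine maintenance guide",
        "Notes on the jaguar engine and exhaust",
        "Recipe for apple pie",
    ]
    print(expand_query("jaguar", docs, num_terms=1))
```

For a user whose files discuss cars, the ambiguous query "jaguar" picks up "engine" in this toy setup, which is the kind of implicit disambiguation the abstract describes. A realistic system would weight terms more carefully than by raw document counts.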
Index Terms
- Personalized query expansion for the web