skip to main content
10.1145/2484028.2484053acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Beliefs and biases in web search

Published:28 July 2013Publication History

ABSTRACT

People's beliefs, and unconscious biases that arise from those beliefs, influence their judgment, decision making, and actions, as is commonly accepted among psychologists. Biases can be observed in information retrieval in situations where searchers seek or are presented with information that significantly deviates from the truth. There is little understanding of the impact of such biases in search. In this paper we study search-related biases via multiple probes: an exploratory retrospective survey, human labeling of the captions and results returned by a Web search engine, and a large-scale log analysis of search behavior on that engine. Targeting yes-no questions in the critical domain of health search, we show that Web searchers exhibit their own biases and are also subject to bias from the search engine. We clearly observe searchers favoring positive information over negative and more than expected given base rates based on consensus answers from physicians. We also show that search engines strongly favor a particular, usually positive, perspective, irrespective of the truth. Importantly, we show that these biases can be counterproductive and affect search outcomes; in our study, around half of the answers that searchers settled on were actually incorrect. Our findings have implications for search engine design, including the development of ranking algorithms that con-sider the desire to satisfy searchers (by validating their beliefs) and providing accurate answers and properly considering base rates. Incorporating likelihood information into search is particularly important for consequential tasks, such as those with a medical focus.

References

  1. Agichtein, E., Brill, E., and Dumais, S. (2006). Improving web search ranking by incorporating user behavior information. Proc. SIGIR, 19--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Ariely, D. (2008). Predictably Irrational: The Hidden Forces that Shape Our Decisions. Harper Collins.Google ScholarGoogle Scholar
  3. Baron, J. (2007). Thinking and Deciding. Cambridge Press.Google ScholarGoogle Scholar
  4. Belkin, N.J., Oddy, R.N., and Brooks, H.M. (1982). ASK for information retrieval: Part I - background and theory. J. Documentation, 38(2): 61--71.Google ScholarGoogle ScholarCross RefCross Ref
  5. Bennett, P.N. et al. (2012). Modeling the impact of short- and long-term behavior on search personalization. Proc. SIGIR, 185--194. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Bilenko, M. and White, R.W. (2008). Mining the search trails of the surfing crowds: identifying relevant websites from user activity. Proc. WWW, 51--60. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Brennan, R.L. and Prediger, D.J. (1981). Coefficient Kappa: Some uses, misuses, and alternatives. Educational and Psychological Measurement, (41): 687--699.Google ScholarGoogle Scholar
  8. Cho, J. and Roy, S. (2004). Impact of search engines on page popularity. Proc. WWW, 20--29. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Clarke, C., Agichtein, E., Dumais, S., and White R.W. (2007). The influence of caption features on clickthrough patterns in Web search. Proc. SIGIR, 135--142. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Craswell, N., Zoeter, O., Taylor, M., and Ramsey, B. (2008). An experimental comparison of click position-bias models. Proc. WSDM, 87--94. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Dou, Z., Song, R., and Wen, J.R. (2007). A large-scale evaluation and analysis of personalization search strategy. Proc. WWW, 581--590. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Dumais, S. et al. (2002). Web question answering: is more always better? Proc. SIGIR, 291--298. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Fleiss, J.L. (1971). Measuring nominal scale agreement among many raters. Psychological Bulletin, 76(5): 378--382.Google ScholarGoogle ScholarCross RefCross Ref
  14. Fortunato, S., Flammini, A., Menczer, F., and Vespignani, A. (2006). Topical interests and the mitigation of search engine bias. PNAS, 103(34): 12684--12689.Google ScholarGoogle ScholarCross RefCross Ref
  15. Gigerenzer, G. and Todd, P.M. (2000). Simple Heuristics That Make Us Smart. Oxford University Press.Google ScholarGoogle Scholar
  16. Ieong, S., Mishra, N., Sadikov, E., and Zhang, I. (2012). Do-main bias in Web search. Proc. WSDM, 413--422. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Ingwersen, P. (1994). Polyrepresentation of information needs and semantic entities: Elements of a cognitive theory for information retrieval interaction. Proc. SIGIR, 101--110. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Inlander, C.B. (1993). Good operations, Bad operations: The People's Medical Society's Guide to Surgery. Viking Adult.Google ScholarGoogle Scholar
  19. Joachims, T. (2002). Optimizing search engines using click-through data. Proc. SIGKDD, 132--142. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Joachims, T. et al. (2007). Evaluating the accuracy of implicit feedback from clicks and query reformulations in web search. ACM TOIS, 25(2). Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Kahneman, D. and Tversky, A. (1974). Judgment under un-certainty: heuristics and biases. Science, 185(4157): 1214--1231.Google ScholarGoogle Scholar
  22. Klayman, J. and Ha, Y. (1987). Confirmation, disconfirmation, and information in hypothesis testing. Psychological Review, 94: 211--228.Google ScholarGoogle ScholarCross RefCross Ref
  23. Kuhlthau, C. (1991). Inside the search process: Information seeking from the user's perspective. JASIST, 42(5): 361--371.Google ScholarGoogle ScholarCross RefCross Ref
  24. Marchionini, G. (1995). Information Seeking in Electronic Environments. Cambridge University Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Mowshowitz, A. and Kawaguchi, A. (2002). Bias on the Web. CACM, 45(9): 56--60. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Nickerson, R.S. (1998). Confirmation bias: a ubiquitous phenomenon in many guises. Rev. Gen. Psych., 2(2): 175--220.Google ScholarGoogle ScholarCross RefCross Ref
  27. Pariser, E. (2011). The Filter Bubble: What is the Internet Hiding from You? Penguin Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Popper, K. (1959). The Logic of Scientific Discovery. Basic Books.Google ScholarGoogle Scholar
  29. Radlinski, F. and Joachims, T. (2006). Minimally invasive randomization for collecting unbiased preferences from click-through logs. Proc. AAAI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Salton, G., Wong, A., and Yang, C.S. (1975). A vector space model for automatic indexing. CACM, 18(11): 613--620. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Saracevic, T. (1997). The stratified model of information retrieval interaction: Extensions and applications. Proc. ASIS, 34: 313--327.Google ScholarGoogle Scholar
  32. Schwarz, J. and Morris, M.R. (2011). Augmenting Web pages and search results to help people find trustworthy information online. Proc. SIGCHI, 1245--1254. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Simon, H. (1991). Bounded rationality and organizational learning. Organization Science, 2(1): 125--134.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Snow, R., O'Connor, B., Jurafsky, D., and Ng, A.Y. (2008). Cheap and fast -- but is it good? Evaluating non-expert annotations for natural language tasks. Proc. EMNLP, 254--263. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Sontag, D. et al. (2012). Probabilistic models for personalizing web search. Proc. WSDM, 433--442. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Tang, T.T., Hawking, D., Craswell, N., and Griffiths, K. (2005). Focused crawling for both topical relevance and quality of medical information. Proc. CIKM, 147--154. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Taylor, R.S. (1968). Question-negotiation and information seeking in libraries. College and Res. Libraries, 29: 178--194.Google ScholarGoogle ScholarCross RefCross Ref
  38. Teevan, J., Dumais, S.T., and Horvitz, E. (2005). Personalizing search via automated analysis of interests and activities. Proc. SIGIR, 449--456. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Tversky, A. and Kahneman, D. (1973). Availability: A heuristic for judging frequency and probability. Cognitive Psychology, 5(1): 207--233.Google ScholarGoogle ScholarCross RefCross Ref
  40. Vaughn, L. and Thelwall, M. (2004). Search engine coverage bias: evidence and possible causes. IP&M, 40(4): 693--707. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Wason, P.C. (1960). On the failure to eliminate hypotheses in a conceptual task, Q. J. of Exp. Psychology, 12: 129--140.Google ScholarGoogle ScholarCross RefCross Ref
  42. White, R.W., Bennett, P.N., and Dumais, S.T. (2010). Predict-ing short-term interests using activity-based search context. Proc. CIKM, 1009--1018. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. White, R.W. and Drucker, S.M. (2007). Investigating behavioral variability in Web search. Proc. WWW, 21--30. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. White, R.W. and Horvitz, E. (2009). Cyberchondria: Studies of the escalation of medical concerns in web search. ACM TOIS, 27(4): 23. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. White, R.W. and Horvitz, E. (2012). Studies on the onset and persistence of medical concerns in search logs. Proc. SIGIR, 265--274. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Xiang, B. et al. (2010). Context-aware ranking in web search. Proc. SIGIR, 451--458. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Yue, Y., Patel, R., and Roehrig, H. (2010). Beyond position bias: Examining result attractiveness as a source of presentation bias in clickthrough data. Proc. WWW, 1011--1018. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Zhai, C., Cohen, W.W., and Lafferty, J. (2003). Beyond independent relevance: Methods and evaluation metrics for subtopic retrieval. Proc. SIGIR, 10--17. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Beliefs and biases in web search

        Recommendations

        Reviews

        Ganapathy Mani

        It is a fact widely accepted by psychologists that people's judgments are influenced by their beliefs and unconscious biases. Although companies that provide Internet-related services excel in tracking user behavior online, little or no thought is given to the beliefs and unconscious biases that may influence the decisions of users. In this paper, White studies search-related biases through various probes: (1) a log analysis of user search behavior in a web search engine, (2) human labeling of captions and results returned by that search engine, and (3) a survey that reflects the retrospective view of the search. White also analyzes the users' health domain searches with yes or no questions in which he observes that the searchers themselves have their own biases. The paper also shows that the search engines, irrespective of the truth, always favor positive results. These findings can have profound implications on information retrieval system design, ranking, and recommendation systems. Findings of this novel research, which thoroughly investigates the biases of users, can have significant implications on how search engines interpret user search behavior and beliefs. The research sheds light on the cognitive biases of users, and on how a search engine can misinterpret these biases and, in turn, produce wrong search results. This research can indeed inspire new research collaborations between cognitive and computer sciences to further investigate user beliefs and unconscious biases that can influence online behavior. Online Computing Reviews Service

        Access critical reviews of Computing literature here

        Become a reviewer for Computing Reviews.

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGIR '13: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
          July 2013
          1188 pages
          ISBN:9781450320344
          DOI:10.1145/2484028

          Copyright © 2013 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 28 July 2013

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          SIGIR '13 Paper Acceptance Rate73of366submissions,20%Overall Acceptance Rate792of3,983submissions,20%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader