Abstract
Bias in Web data and use taints the algorithms behind Web-based applications, delivering equally biased results.
Supplemental Material
Available for Download
Further Readings
- ACM U.S. Public Policy Council. Statement on Algorithmic Transparency and Accountability, ACM, Washington, D.C., Jan. 2017; https://www.acm.org/binaries/content/assets/public-policy/2017_usacm_statement_algorithms.pdfGoogle Scholar
- Agarwal, D., Chen, B-C., and Elango, P. Explore/exploit schemes for Web content optimization. In Proceedings of the Ninth IEEE International Conference on Data Mining (Miami, FL, Dec. 6--9). IEEE Computer Society Press, 2009. Google ScholarDigital Library
- Baeza-Yates, R., Castillo, C., and López, V. Characteristics of the Web of Spain. Cybermetrics 9, 1 (2005), 1--41.Google Scholar
- Baeza-Yates, R. and Castillo, C. Relationship between Web links and trade (poster). In Proceedings of the 15th International Conference on the World Wide Web (Edinburgh, U.K., May 23--26). ACM Press, New York, 2006, 927--928. Google ScholarDigital Library
- Baeza-Yates, R., Castillo, C., and Efthimiadis, E.N. Characterization of national Web domains. ACM Transactions on Internet Technology 7, 2 (May 2007), article 9. Google ScholarDigital Library
- Baeza-Yates, R., Pereira, Á., and Ziviani, N. Genealogical trees on the Web: A search engine user perspective. In Proceedings of the 17th International Conference on the World Wide Web (Beijing, China, Apr 21--25). ACM Press, New York, 2008, 367--376. Google ScholarDigital Library
- Baeza-Yates, R. Incremental sampling of query logs. In Proceedings of the 38th ACM SIGIR Conference (Santiago, Chile, Aug. 9--13). ACM Press, New York, 2015, 1093--1096. Google ScholarDigital Library
- Baeza-Yates, R. and Saez-Trumper, D. Wisdom of the crowd or wisdom of a few? An analysis of users' content generation. In Proceedings of the 26th ACM Conference on Hypertext and Social Media (Guzelyurt, TRNC, Cyprus, Sept. 1--4). ACM Press, New York, 2015, 69--74. Google ScholarDigital Library
- Bolukbasi, R., Chang, K.W., Zou, J., Saligrama, V., and Kalai, A. Man is to computer programmer as woman is to homemaker? De-biasing word embeddings. In Proceedings of the 30th Conference on Neural Information Processing Systems (Barcelona, Spain, Dec. 5--10). Curran Associates, Inc., Red Hook, NY, 2016, 4349--4357. Google ScholarDigital Library
- Caliskan, A., Bryson, J.J., and Narayanan, A. Semantics derived automatically from language corpora contain human-like biases. Science 356, 6334 (Apr. 2017), 183--186.Google ScholarCross Ref
- Chapelle, O. and Zhang, Y. A dynamic Bayesian network click model for Web search ranking. In Proceedings of the 18th International Conference on the World Wide Web (Madrid, Spain, Apr. 20--24). ACM Press, New York, 2009, 1--10. Google ScholarDigital Library
- Dupret, G.E. and Piwowarski, B. A user-browsing model to predict search engine click data from past observations. In Proceedings of the 31st ACM SIGIR Conference (Singapore, July 20--24). ACM Press, New York, 2008, 331--338. Google ScholarDigital Library
- Fetterly, D., Manasse, M., and Najork, M. 0n the evolution of clusters of near-duplicate webpages. Journal of Web Engineering 2, 4 (Oct. 2003), 228--246. Google ScholarDigital Library
- Gong, W., Lim, E.-P., and Zhu, F. Characterizing silent users in social media communities. In Proceedings of the Ninth International AAAI Conference on Web and Social Media (Oxford, U.K., May 26--29). AAAI, Fremont, CA, 2015, 140--149.Google Scholar
- Graells-Garrido, E. and Lalmas, M. Balancing diversity to countermeasure geographical centralization in microblogging platforms. In Proceedings of the 25th ACM Conference on Hypertext and Social Media (Santiago, Chile, Sept. 1--4). ACM Press, New York, 2014, 231--236. Google ScholarDigital Library
- Graells-Garrido, E., Lalmas, M., and Menczer, F. First women, second sex: Gender bias in Wikipedia. In Proceedings of the 26th ACM Conference on Hypertext and Social Media (Guzelyurt, TRNC, Cyprus, Sept. 1--4). ACM Press, New York, 2015, 165--174. Google ScholarDigital Library
- Lazer, D.M.J. et al. The science of fake news. Science 359, 6380 (Mar. 2018), 1094--1096.Google ScholarCross Ref
- Mediative. The Evolution of Google's Search Results Pages & Effects on User Behaviour, White paper, 2014; http://www.mediative.com/SERPGoogle Scholar
- Mercer, A., Deane, C., and McGeeney, K. Why 2016 Election Polls Missed Their Mark, Pew Research Center, Washington, D.C., Nov 2016; http://www.pewresearch.org/fact-tank/2016/11/09/why-2016-election-polls-missed-their-mark/Google Scholar
- Olteanu, A., Castillo, C., Diaz, F., and Kiciman, E. Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries, SSRN, Rochester, NY, Dec. 20, 2016; https://ssrn.com/abstract=2886526Google ScholarCross Ref
- Pariser, E. The Filter Bubble: How the New Personalized Web Is Changing What We Read and How We Think, Penguin, London, U.K., 2011. Google ScholarDigital Library
- Saez-Trumper, D., Castillo, C., and Lalmas, M. Social media news communities: Gatekeeping, coverage, and statement bias. In Proceedings of the ACM International Conference on Information and Knowledge Management (San Francisco, CA, Oct. 27-Nov. 1). ACM Press, New York, 2013, 1679--1684. Google ScholarDigital Library
- Silberzahn, R. and Uhlmann, E.L. Crowdsourced research: Many hands make tight work. Nature 526, 7572 (Oct. 2015), 189--191; https://psyarxiv.com/qkwst/Google ScholarCross Ref
- Smith, M., Patil, D.J., and Muñoz, C. Big Data: A Report on Algorithmic Systems, Opportunity, and Civil Rights. Executive Office of the President, Washington, D.C., 2016; https://obamawhitehouse.archives.gov/sites/default/files/microsites/ostp/2016_0504_data_discrimination.pdfGoogle Scholar
- Wagner, C., Garcia, D., Jadidi, M., and Strohmaier, M. It's a man's Wikipedia? Assessing gender inequality in an online encyclopedia. In Proceedings of the Ninth International AAAI Conference on Web and Social Media (Oxford, U.K., May 26--29). AAAI, Fremont, CA, 2015, 454--463.Google Scholar
- Wang, T. and Wang, D. Why Amazon's ratings might mislead you: The story of herding effects. Big Data 2, 4 (Dec. 2014), 196--204.Google ScholarCross Ref
- White, R. Beliefs and biases in Web search. In Proceedings of the 36th ACM SIGIR Conference (Dublin, Ireland, July 28-Aug. 1). ACM Press, New York, 2013, 3--12. Google ScholarDigital Library
- Wu, S., Hofman, J.M., Mason, W.A., and Watts, D.J. Who says what to whom on Twitter. In Proceedings of the 20th International Conference on the World Wide Web (Hyderabad, India, Mar. 28--Apr. 1). ACM Press, New York, 2011, 705--714. Google ScholarDigital Library
- Zipf, G.K. Human Behavior and the Principle of Least Effort, Addison-Wesley Press, Cambridge, MA, 1949.Google Scholar
Index Terms
- Bias on the web
Recommendations
Handling Web Bias 2019: Chairs' Welcome and Workshop Summary
WebSci '19: Companion Publication of the 10th ACM Conference on Web ScienceA key aspect of the Web Science conference is exploring the ethical challenges of technologies, data, algorithms, platforms, and people in the Web as well as detecting, preventing and predicting anomalies in web data including algorithmic and data ...
Bias on the web and beyond: an accessibility point of view
W4A '20: Proceedings of the 17th International Web for All ConferenceThe Web is the most powerful communication medium and the largest public data repository that humankind has created. Its content ranges from great reference sources such as Wikipedia to ugly fake news. Indeed, social (digital) media is just an ...
Comments