Addressing Age-Related Bias in Sentiment Analysis

Authors:
Mark Diaz

Northwestern University, Evanston, IL, USA

Northwestern University, Evanston, IL, USA
View Profile

,
Isaac Johnson

Northwestern University, Evanston, IL, USA

Northwestern University, Evanston, IL, USA
View Profile

,
Amanda Lazar

University of Maryland, College Park, MD, USA

University of Maryland, College Park, MD, USA
View Profile

,
Anne Marie Piper

Northwestern University, Evanston, IL, USA

Northwestern University, Evanston, IL, USA
View Profile

,
Darren Gergle

Northwestern University, Evanston, IL, USA

Northwestern University, Evanston, IL, USA
View Profile

CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing SystemsApril 2018Paper No.: 412Pages 1–14https://doi.org/10.1145/3173574.3173986

Published:21 April 2018Publication History

CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems

Pages 1–14

ABSTRACT

Computational approaches to text analysis are useful in understanding aspects of online interaction, such as opinions and subjectivity in text. Yet, recent studies have identified various forms of bias in language-based models, raising concerns about the risk of propagating social biases against certain groups based on sociodemographic factors (e.g., gender, race, geography). In this study, we contribute a systematic examination of the application of language models to study discourse on aging. We analyze the treatment of age-related terms across 15 sentiment analysis models and 10 widely-used GloVe word embeddings and attempt to alleviate bias through a method of processing model training data. Our results demonstrate that significant age bias is encoded in the outputs of many sentiment analysis algorithms and word embeddings. We discuss the models' characteristics in relation to output bias and how these models might be best incorporated into research.

References

Paul Baker and Amanda Potts. 2013. "Why do white people have thin lips?" Google and the perpetuation of stereotypes via auto-complete search forms. Critical Discourse Studies 10, May 2015: 187--204.Google ScholarCross Ref
Shaowen Bardzell. 2010. Feminist HCI: Taking Stock and Outlining an Agenda for Design. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '10), 1301--1310. Google ScholarDigital Library
Shaowen Bardzell and Jeffrey Bardzell. 2011. Towards a Feminist HCI Methodology: Social Science, Feminism, and HCI. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '11), 675--684. Google ScholarDigital Library
Eric P.S. Baumer, Xiaotong Xu, Christine Chu, Shion Guha, and Geri K. Gay. 2017. When Subjects Interpret the Data: Social Media Non-use as a Case for Adapting the Delphi Method to CSCW. Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW '17): 1527--1543. Google ScholarDigital Library
Michael S Bernstein, Eytan Bakshy, Moira Burke, Brian Karrer, and Menlo Park. 2013. Quantifying the Invisible Audience in Social Networks. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '13): 21--30. Google ScholarDigital Library
Tolga Bolukbasi, Kai-Wei Chang, James Zou, Venkatesh Saligrama, and Adam Kalai. 2016. Quantifying and Reducing Stereotypes in Word Embeddings. arXiv preprint.Google Scholar
Tolga Bolukbasi, Kai-Wei Chang, James Zou, Venkatesh Saligrama, and Adam Kalai. 2016. Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings. In 30th Conference on Neural Information Processing Systems (NIPS 2016). Google ScholarDigital Library
Danah Boyd, Karen Levy, and Alice Marwick. 2014. The Networked Nature of Algorithmic Discrimination. Data and Discrimination: Collected Essays. Open Technology Institute.Google Scholar
Robin Brewer, Meredith Ringel Morris, and Anne Marie Piper. 2016. "Why would anybody do this?": Understanding Older Adults' Motivations and Challenges in Crowd Work. Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16): 2246--2257. Google ScholarDigital Library
Robin Brewer and Anne Marie Piper. 2016. "Tell It Like It Really Is": A Case of Online Content Creation and Sharing Among Older Adult Bloggers. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16), 5529--5542. Google ScholarDigital Library
Alexander Budanitsky and Graeme Hirst. 2006. Evaluating WordNet-based Measures of Lexical Semantic Relatedness. Computational Linguistics 32, 1. Google ScholarDigital Library
Robert N. Butler. 1969. Age-ism: Another form of bigotry. Gerontologist 9, 4: 243--246.Google ScholarCross Ref
Aylin Caliskan, Joanna J Bryson, and Arvind Narayanan. 2017. Semantics derived automatically from language corpora necessarily contain human biases. Science 356: 183--186.Google ScholarCross Ref
Aylin Caliskan-islam, Joanna J Bryson, and Arvind Narayanan. 2016. Semantics derived automatically from language corpora necessarily contain human biases. arXiv:1608.07187v2 {cs.AI} 30 Aug 2016: 1--14.Google Scholar
Le Chen, Alan Mislove, and Christo Wilson. 2015. Peeking Beneath the Hood of Uber. Proceedings of the 2015 Internet Measurement Conference (IMC '15): 495--508. Google ScholarDigital Library
Kate Crawford. 2016. Can an Algorithm be Agonistic? Ten Scenes from Life in Calculated Publics. Science, Technology, & Human Values 41, 1: 77--92.Google ScholarCross Ref
Kimberle Crenshaw. 1991. Mapping the Margins: Intersectionality, Identity Politics, and Violence Against Women of Color. Stanford Law Review 43, 6: 1241--1299.Google ScholarCross Ref
Dmitry Davidov, Oren Tsur, and Ari Rappoport. 2010. Enhanced sentiment learning using Twitter hashtags and smileys. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters (COLING '10), 241--249. Google ScholarDigital Library
Mark Davies. 2008. The Corpus of Contemporary American English (COCA): 520 million words, 1990present. BYE, Brigham Young University.Google Scholar
Michael A Devito. 2016. From Editors to Algorithms. Digital Journalism: 1--21.Google Scholar
Michael Devito, Darren Gergle, and Jeremy Birnholtz. 2017. "Algorithms ruin everything": # RIPTwitter, Folk Theories, and Resistance to Algorithmic Change in Social Media. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI), In press. Google ScholarDigital Library
N Diakopoulos. 2014. Algorithmic accountability reporting: On the investigation of black boxes. Tow Center for Digital Journalism: A Tow/Knight Brief.Google Scholar
Lynn Dombrowski, Ellie Harmon, and Sarah Fox. 2016. Social Justice-Oriented Interaction Design: Outlining Key Design Strategies and Commitments. Proceedings of the Designing Interactive Systems Conference (DIS '16): 656--671. Google ScholarDigital Library
Motahhare Eslami, Karrie Karahalios, Christian Sandvig, Kristen Vaccaro, Aimee Rickman, Kevin Hamilton, and Alex Kirlik. 2016. First I "like" it, then I hide it: Folk Theories of Social Feeds. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16), 2371--2382. Google ScholarDigital Library
Motahhare Eslami, Aimee Rickman, Kristen Vaccaro, Amirhossein Aleyasen, Andy Vuong, Karrie Karahalios, Kevin Hamilton, and Christian Sandvig. 2015. "I always assumed that I wasn't really that close to {her}": Reasoning about Invisible Algorithms in News Feeds. Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI '15): 153--162. Google ScholarDigital Library
Ronen Feldman. 2013. Techniques and applications for sentiment analysis. Communications of the ACM 56, 4: 82--89. Google ScholarDigital Library
Batya Friedman and Helen Nissenbaum. 1996. Bias in computer systems. ACM Transactions on Information Systems 14, 3: 330--347. Google ScholarDigital Library
Alec Go, Richa Bhayani, and Lei Huang. 2009. Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford 1, 12.Google Scholar
Anthony G Greenwald, Debbie E Mcghee, and Jordan L K Schwartz. 1998. Measuring Individual Differences in Implicit Cognition: The Implicit Association Test. Journal of Personality and Soclal Psychology 74, 6: 1464--1480.Google ScholarCross Ref
Philip J Guo. 2017. Older Adults Learning Computer Programming: Motivations, Frustrations, and Design Opportunities. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '17). Google ScholarDigital Library
Dave Harley and Geraldine Fitzpatrick. 2009. YouTube and intergenerational communication: the case of Geriatric1927. Universal Access in the Information Society 8, 1: 5--20. Google ScholarDigital Library
Minqing Hu and Bing Liu. 2004. Mining and summarizing customer reviews. In Proceedings of the 2004 ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '04), 168-- 177. Google ScholarDigital Library
Eric H. Huang, Richard Socher, Christopher D. Manning, and Andrew Y. Ng. 2012. Improving word representations via global context and multiple word prototypes. Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1 (ACL '12) Vol. 1. Google ScholarDigital Library
Mary Lee Hummert, Teri A. Garstka, Laurie T. O'Brien, Anthony G. Greenwald, and Deborah S. Mellott. 2002. Using the Implicit Association Test to measure age differences in implicit social cognitions. Psychology and Aging 17, 3: 482--495.Google ScholarCross Ref
CJ J. Hutto and Eric Gilbert. 2014. Vader: A parsimonious rule-based model for sentiment analysis of social media text. In Eighth International AAAI Conference on Weblogs and Social Media, 216--225.Google Scholar
Lucas D. Introna and Helen Nissenbaum. 2000. Shaping the Web: why the politics of search engines matters. The Information Society 16: 169--185.Google ScholarCross Ref
Lucas D Introna and David Wood. 2004. Picturing Algorithmic Surveillance: The Politics of Facial Recognition Systems. Surveillance & Society: CCTV Special Issue 2, 2/3.Google Scholar
Lucas Introna and Helen Nissenbaum. 2000. Defining the Web: The Politics of Search Engines. Computer 33, 54--62. Google ScholarDigital Library
Lilly Irani, Janet Vertesi, and Paul Dourish. 2010. Postcolonial computing: a lens on design and development. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '10): 1311--1320. Google ScholarDigital Library
Isaac Johnson, Connor McMahon, Johannes Schöning, and Brent Hecht. 2017. The Effect of Population and "Structural" Biases on Social Media-based Algorithms -- A Case Study in Geolocation Inference Across the Urban- Rural Spectrum. In Proceedings of the 35th Annual ACM Conference on Human Factors in Computing Systems (CHI '17). Google ScholarDigital Library
Matthew Kay, Cynthia Matuszek, and Sean a. Munson. 2015. Unequal Representation and Gender Stereotypes in Image Search Results for Occupations. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI '15), 3819--3828. Google ScholarDigital Library
Rob Kitchin. 2017. Thinking critically about and researching algorithms. Information, Communication & Society 20, 1.Google ScholarCross Ref
Juhi Kulshrestha, Motahhare Eslami, Johnnatan Messias, Muhammad Bilal Zafar, Saptarshi Ghosh, Krishna P. Gummadi, and Karrie Krahalios. 2017. Quantifying Search Bias: Investigating Sources of Bias for Political Searches in Social Media. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW '17), 417--432. Google ScholarDigital Library
Nathan R. Kuncel, Deniz S. Ones, and David M. Klieger. 2014. In Hiring, Algorithms Beat Instinct. Harvard Business Review May.Google Scholar
Joanna N. Lahey. 2010. International Comparison of Age Discrimination Laws. Research on Aging 32, 6: 679--697.Google ScholarCross Ref
K.P. Lasher and P.J. Faulkender. 1993. Measurement of Aging Anxiety: Development of the Anxiety About Aging Scale. The International Journal of Aging and Human Development 37, 4: 247--259.Google ScholarCross Ref
Amanda Lazar, Mark Diaz, Robin Brewer, Chelsea Kim, and Anne Marie Piper. 2017. Going Gray, Failure to Hire, and the Ick Factor: Analyzing How Older Bloggers Talk about Ageism. In Proceedings of the ACM Conference on Computer-Supported Cooperative Work & Social Computing (CSCW '17). Google ScholarDigital Library
Amanda Lazar, Caroline Edasis, and Anne Marie Piper. 2017. A Critical Lens on Dementia and Design in HCI. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI), In press. Google ScholarDigital Library
Becca Levy. 2009. Stereotype Embodiment: A Psychosocial Approach to Aging. Current directions in psychological science 18, 6: 332--336.Google Scholar
Q. Vera Liao, Wai-Tat Fu, and Markus Strohmaier. 2016. #Snowden: Understanding Biases Introduced by Behavioral Differences of Opinion Groups on Social Media. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16), 3352--3363. Google ScholarDigital Library
Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Y. Ng, and Christopher Potts. 2011. Learning word vectors for sentiment analysis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1 (HLT '11). Google ScholarDigital Library
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Distributed Representations of Words and Phrases and their Compositionality. In Proceedings of the 26th International Conference on Neural Information Processing Systems (NIPS'13), 3111-- 3119. Google ScholarDigital Library
Boaz Miller and Isaac Record. 2016. Responsible epistemic technologies: A social-epistemological analysis of autocompleted web search. new media & society: 1--19.Google Scholar
Claire Cain Miller. 2015. Can an Algorithm Hire Better Than a Human. The New York Times.Google Scholar
Karine Nahon. 2015. Where there is Social Media there is Politics. In Forthcoming in Routledge Companion to Social Media and Politics, A. Bruns, E. Skogerbo, C. Christensen, O.A. Larsson and G.S. Enli (eds.). Routledge, NYC, NY.Google Scholar
Helen Nissenbaum. How Computer Systems Embody Values. Computer 34, 3: 118--119. Google ScholarDigital Library
Alana Officer, Mira Leonie Schneiders, Diane Wu, Paul Nash, Jotheeswaran Amuthavalli Thiyagarajan, and John R. Beard. 2016. Valuing older people: Time for a global campaign to combat ageism. Bulletin of the World Health Organization 94, 709--784.Google ScholarCross Ref
Bo Pang and Lillian Lee. 2006. Opinion Mining and Sentiment Analysis. Foundations and Trends® in Information Retrieval 1, 2: 91--231. Google ScholarDigital Library
Frank Pasquale. 2015. The Black Box Society: The Secret Algorithms That Control Money. Harvard University Press. Google ScholarDigital Library
Jeffrey Pennington, Richard Socher, and Christopher D Manning. 2014. GloVe: Global Vectors for Word Representation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing: 1532--1543.Google ScholarCross Ref
Pew Research Center. 2014. Older Adults and Technology Use. April. https://doi.org/202.419.4500Google Scholar
Filipe N. Ribeiro, Matheus Ara??jo, Pollyanna Gon??alves, Marcos Andr?? Gon??alves, and Fabr??cio Benevenuto. 2016. SentiBench - a benchmark comparison of state-of-the-practice sentiment analysis methods. EPJ Data Science 5, 1: 1--29.Google ScholarCross Ref
Jennifer A. Rode. 2011. A theoretical agenda for feminist HCI. Interacting with Computers 23, 5: 393-- 400. Google ScholarDigital Library
Shilad Sen, Margaret E Giesel, Rebecca Gold, Benjamin Hillmann, Matt Lesicko, Samuel Naden, Jesse Russell, Zixiao "Ken" Wang, and Brent Hecht. 2015. Turkers, Scholars, "Arafat" and "Peace": Cultural Communities and Algorithmic Gold Standards. In Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing (CSCW '15), 826--838. Google ScholarDigital Library
Thomas Smyth and Jill Dimond. 2014. Anti-Oppressive Design. interactions 21, 6: 68--71. Google ScholarDigital Library
Richard Socher, Alex Perelygin, Jean Y. Wu, Jason Chuang, Christopher D. Manning, Andrew Y. Ng, and Christopher Potts. 2013. Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of the conference on empirical methods in natural language processing (EMNLP), 1631--1642.Google Scholar
Kate Starbird and Leysia Palen. 2012. (How) will the revolution be retweeted?: information diffusion and the 2011 Egyptian uprising. Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work (CSCW '12): 7--16. Google ScholarDigital Library
L Sweeney. 2013. Discrimination in online ad delivery. acmqueue 11, 3. Google ScholarDigital Library
Maite Taboada, Julian Brooke, Milan Tofiloski, Kimberly Voll, and Manfred Stede. 2011. LexiconBased Methods for Sentiment Analysis. Computational Linguistics 37, 2: 267--307. Google ScholarDigital Library
Antonio Torralba and Alexai A. Efros. 2011. Unbiased look at dataset bias. In 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Google ScholarDigital Library
Zeynep Tufekci. 2014. Algorithmic Harms Beyond Facebook and Google: Emergent Challenges of Computational Agency. Journal on Telecommunications and High Technology Law 13: 203--218.Google Scholar
Marlon Twyman, Brian C. Keegan, and Aaron Shaw. 2016. Black Lives Matter in Wikipedia: Collaboration and Collective Memory around Online Social Movements. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW '17), 1400--1412. Google ScholarDigital Library
W. N. Venables and B. D. Ripley. 2002. Modern Applied Statistics with S. Springer, New York. Google ScholarDigital Library
John Vines, Gary Pritchard, Peter Wright, Patrick Olivier, and Katie Brittain. 2015. An Age-Old Problem: Examining the Discourses of Ageing in HCI and Strategies for Future Research. ACM Transactions on Computer-Human Interaction 22, 1. Google ScholarDigital Library
Claudia Wagner, Eduardo Graells-Garrido, David Garcia, and Filippo Menczer. 2016. Women through the glass ceiling: gender asymmetries in Wikipedia. EPJ Data Science 5, 1.Google ScholarCross Ref
Theresa Wilson, Paul Hoffmann, Swapna Somasundaran, Jason Kessler, Janyce Wiebe, Yejin Choi, Claire Cardie, Ellen Riloff, and Siddharth Patwardhan. OpinionFinder: A system for subjectivity analysis. In Proceedings of hlt/emnlp on interactive demonstrations, 34--35. Google ScholarDigital Library

Index Terms

Addressing Age-Related Bias in Sentiment Analysis
1. Human-centered computing

Recommendations

Joint sentiment/topic model for sentiment analysis
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management

Sentiment analysis or opinion mining aims to use automated tools to detect subjective information such as opinions, attitudes, and feelings expressed in text. This paper proposes a novel probabilistic modeling framework based on Latent Dirichlet ...
Read More
Sentence compression for aspect-based sentiment analysis

Sentiment analysis, which addresses the computational treatment of opinion, sentiment, and subjectivity in text, has received considerable attention in recent years. In contrast to the traditional coarse-grained sentiment analysis tasks, such as ...
Read More
Bias-aware lexicon-based sentiment analysis
SAC '15: Proceedings of the 30th Annual ACM Symposium on Applied Computing

Sentiment analysis of textual content is widely used for automatic summarization of opinions and sentiments expressed by people. With the growing popularity of social media and user-generated content, efficient and effective sentiment analysis is ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems
April 2018
8489 pages
ISBN:9781450356206
DOI:10.1145/3173574
General Chairs:
Regan Mandryk
University of Saskatchewan, Canada
,
Mark Hancock
University of Waterloo, Canada
,
Program Chairs:
Mark Perry
Brunel University London, UK
,
Anna Cox
University College London, UK
Copyright © 2018 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 21 April 2018
Check for updates
Badges
- Best Paper
Author Tags
aging
algorithmic bias
older adults
sentiment analysis
Qualifiers
- research-article
Conference

Acceptance Rates
CHI '18 Paper Acceptance Rate666of2,590submissions,26%Overall Acceptance Rate6,199of26,314submissions,24%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 71
  Total Citations
  View Citations
- 3,944
  Total Downloads
- Downloads (Last 12 months)959
- Downloads (Last 6 weeks)156
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Addressing Age-Related Bias in Sentiment Analysis

CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems

ABSTRACT

References

Cited By

Index Terms

Recommendations

Joint sentiment/topic model for sentiment analysis

Sentence compression for aspect-based sentiment analysis

Bias-aware lexicon-based sentiment analysis