A Survey on Automatic Detection of Hate Speech in Text

Authors:
Paula Fortuna, INESC TEC (ORCID: 0000-0002-2306-9276)
Sérgio Nunes, INESC TEC and Faculty of Engineering, University of Porto, Portugal

ACM Computing Surveys, Volume 51, Issue 4, Article No. 85, pp. 1–30. https://doi.org/10.1145/3232676

Published: 31 July 2018

Abstract

The scientific study of hate speech, from a computer science point of view, is recent. This survey organizes and describes the current state of the field, providing a structured overview of previous approaches, including core algorithms, methods, and main features used. This work also discusses the complexity of the concept of hate speech, defined in many platforms and contexts, and provides a unifying definition. This area has an unquestionable potential for societal impact, particularly in online communities and digital media platforms. The development and systematization of shared resources, such as guidelines, annotated datasets in multiple languages, and algorithms, is a crucial step in advancing the automatic detection of hate speech.

Index Terms

A Survey on Automatic Detection of Hate Speech in Text
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
2. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Information extraction
      2. Sentiment analysis

Published in
ACM Computing Surveys, Volume 51, Issue 4
July 2019
765 pages
ISSN: 0360-0300
EISSN: 1557-7341
DOI: 10.1145/3236632
Editor: Sartaj Sahni, Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL 32611
Copyright © 2018 ACM
© 2018 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.
Publisher: Association for Computing Machinery, New York, NY, United States
Publication History
- Published: 31 July 2018
- Accepted: 1 June 2018
- Revised: 1 May 2018
- Received: 1 October 2017

Author Tags
Hate speech
literature review
natural language processing
opinion mining
text mining
Qualifiers
- survey
- Research
- Refereed

Article Metrics
- Total Citations: 412
- Total Downloads: 8,284
- Downloads (Last 12 months): 1,217
- Downloads (Last 6 weeks): 135