Fake News Early Detection: A Theory-driven Model

Authors:
Xinyi Zhou (ORCID: 0000-0002-2388-254X), Syracuse University, Syracuse, NY, USA
Atishay Jain, Syracuse University, Syracuse, NY, USA
Vir V. Phoha (ORCID: 0000-0002-5390-8253), Syracuse University, Syracuse, NY, USA
Reza Zafarani, Syracuse University, Syracuse, NY, USA

Digital Threats: Research and Practice, Volume 1, Issue 2, Article No. 12, pp. 1–25. https://doi.org/10.1145/3377478

Published: 11 June 2020

Abstract

The massive dissemination of fake news and its potential to erode democracy have increased the demand for accurate fake news detection. Recent advances in this area have proposed novel techniques that detect fake news by exploring how it propagates on social networks. However, to detect fake news at an early stage, i.e., when it has been published on a news outlet but has not yet spread on social media, one cannot rely on propagation information, because it does not yet exist. Hence, there is a strong need for approaches that detect fake news by focusing on news content. In this article, a theory-driven model is proposed for fake news detection. The method investigates news content at four levels: lexicon, syntax, semantic, and discourse. We represent news at each level, relying on well-established theories in social and forensic psychology. Fake news detection is then conducted within a supervised machine learning framework. As interdisciplinary research, our work explores potential fake news patterns, enhances interpretability in fake news feature engineering, and studies the relationships among fake news, deception/disinformation, and clickbait. Experiments conducted on two real-world datasets indicate that the proposed method can outperform the state of the art and enable early detection of fake news when content information is limited.
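
To make the content-only, supervised setup described in the abstract concrete, the following is a minimal sketch under stated assumptions. It is not the authors' implementation. The abstract does not specify the features or the classifier, so everything here is hypothetical: the function names (`featurize`, `train`, `predict`), the two crude style cues, the nearest-centroid classifier standing in for the supervised learner, and the four made-up training sentences. Only the lexicon level is approximated (as word frequencies); the syntax, semantic, and discourse levels are not modeled.

```python
import math
import re
from collections import Counter


def featurize(text):
    """Map raw news text to a sparse feature dict."""
    # Lexicon-level view: relative frequency of each lowercased word.
    words = re.findall(r"[a-z']+", text.lower())
    feats = {w: c / len(words) for w, c in Counter(words).items()}
    # Crude style cues, prefixed with '#' so they cannot collide with words.
    # Illustrative stand-ins only, not the paper's theory-driven features.
    cased = re.findall(r"[A-Za-z']+", text)
    n = len(cased) or 1
    feats["#exclamation_rate"] = text.count("!") / n
    feats["#all_caps_rate"] = sum(t.isupper() and len(t) > 1 for t in cased) / n
    return feats


def centroid(vectors):
    """Average a list of sparse feature dicts, treating missing keys as 0."""
    total = Counter()
    for vec in vectors:
        total.update(vec)  # adds values key by key
    return {k: v / len(vectors) for k, v in total.items()}


def distance(a, b):
    """Euclidean distance between two sparse feature dicts."""
    keys = set(a) | set(b)
    return math.sqrt(sum((a.get(k, 0.0) - b.get(k, 0.0)) ** 2 for k in keys))


def train(labeled_texts):
    """Supervised step: compute one centroid per class label."""
    by_label = {}
    for text, label in labeled_texts:
        by_label.setdefault(label, []).append(featurize(text))
    return {label: centroid(vecs) for label, vecs in by_label.items()}


def predict(model, text):
    """Assign the label whose centroid is closest to the text's features."""
    feats = featurize(text)
    return min(model, key=lambda label: distance(model[label], feats))


# Made-up toy examples for illustration; not drawn from any real dataset.
TOY_DATA = [
    ("SHOCKING!!! You won't BELIEVE what happened next!", "fake"),
    ("UNBELIEVABLE miracle cure doctors HATE! Click now!", "fake"),
    ("The city council approved the budget on Tuesday after a public hearing.", "true"),
    ("Researchers published the study results in a peer reviewed journal.", "true"),
]
MODEL = train(TOY_DATA)

if __name__ == "__main__":
    print(predict(MODEL, "SHOCKING miracle! You won't BELIEVE this cure!"))
    print(predict(MODEL, "The council published the budget results on Tuesday."))
```

Because the sketch uses only the article text, it can be applied as soon as an article is published, which is the early-detection setting the abstract describes. A realistic system would replace the toy features with representations at each of the four levels and the nearest-centroid rule with a stronger supervised classifier.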

Published in
Digital Threats: Research and Practice Volume 1, Issue 2
Field Notes
June 2020
139 pages
EISSN:2576-5337
DOI:10.1145/3403598
Editors: Arun Lakhotia (University of Louisiana at Lafayette and Cythereal, USA), Leigh Metcalf (CERT, USA)
Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 11 June 2020
- Online AM: 7 May 2020
- Accepted: 1 December 2019
- Revised: 1 November 2019
- Received: 1 April 2019
Author Tags
Fake news
click-bait
disinformation
fake news detection
feature engineering
interdisciplinary research
news verification
Qualifiers
- research-article
- Research
- Refereed
Article Metrics
- Total Citations: 139
- Total Downloads: 15,396
- Downloads (Last 12 months): 1,811
- Downloads (Last 6 weeks): 231