Abstract
Fake news is an issue of pressing concern, given its recent rise as a potential threat to high-quality journalism and well-informed public discourse. The Fake News Challenge (FNC-1) was organized in early 2017 to encourage the development of machine-learning-based classification systems for stance detection (i.e., for identifying whether a particular news article agrees with, disagrees with, discusses, or is unrelated to a particular news headline), thus helping in the detection and analysis of possible instances of fake news. This article presents a novel approach to this stance detection problem, based on combining string similarity features with a deep neural network architecture that leverages ideas previously advanced in the context of learning efficient text representations, document classification, and natural language inference. Specifically, we use bi-directional Recurrent Neural Networks (RNNs), together with max-pooling over the temporal/sequential dimension and neural attention, to represent (i) the headline, (ii) the first two sentences of the news article, and (iii) the entire news article. These representations are then combined/compared, complemented with similarity features inspired by other FNC-1 approaches, and passed to a final layer that predicts the stance of the article toward the headline. We also explore the use of external sources of information, specifically large datasets of sentence pairs originally proposed for training and evaluating natural language inference methods, to pre-train specific components of the neural network architecture (e.g., the RNNs used for encoding sentences). The obtained results attest to the effectiveness of the proposed ideas and show that our model, particularly when considering pre-training and the combination of neural representations together with similarity features, slightly outperforms the previous state of the art.
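As a rough illustration of the pipeline outlined in the abstract, the following Python sketch shows max-pooling over the sequential dimension, one possible way of combining/comparing two pooled representations, and one simple string similarity feature. The abstract does not specify these details, so everything here is an assumption made for illustration: the function names are hypothetical, the matching scheme (concatenation, absolute difference, element-wise product) is a common choice in natural language inference models rather than necessarily the one used in the article, token-level Jaccard overlap stands in for the unspecified similarity features, and the small hand-written vectors stand in for the outputs of a bi-directional RNN.

```python
from typing import List

Vector = List[float]


def max_pool_over_time(states: List[Vector]) -> Vector:
    """Element-wise maximum over the sequence (time) dimension.

    `states` holds one hidden-state vector per token, e.g. the
    outputs of a bi-directional RNN (stand-in values are used below).
    """
    return [max(column) for column in zip(*states)]


def compare(u: Vector, v: Vector) -> Vector:
    """Assumed matching scheme: [u; v; |u - v|; u * v]."""
    abs_diff = [abs(a - b) for a, b in zip(u, v)]
    product = [a * b for a, b in zip(u, v)]
    return u + v + abs_diff + product


def jaccard_similarity(text_a: str, text_b: str) -> float:
    """Token-level Jaccard overlap, an illustrative similarity feature."""
    tokens_a = set(text_a.lower().split())
    tokens_b = set(text_b.lower().split())
    if not tokens_a and not tokens_b:
        return 0.0
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


# Stand-in hidden states (2 tokens, 2 dimensions each).
headline_states = [[0.1, 0.9], [0.4, 0.2]]
body_states = [[0.3, 0.5], [0.7, 0.1]]

headline_vec = max_pool_over_time(headline_states)
body_vec = max_pool_over_time(body_states)

# Neural matching features followed by the hand-crafted similarity
# feature; a final classification layer would consume this vector.
features = compare(headline_vec, body_vec) + [
    jaccard_similarity("Dog bites man", "Man bites dog again")
]
```

In this sketch the final feature vector has nine entries: eight from the comparison of the two pooled vectors and one from the string similarity feature. A trained softmax layer over the four stance classes (agree, disagree, discuss, unrelated) would sit on top of such a vector; it is omitted here because its parameters would have to be learned.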
Index Terms
- Combining Similarity Features and Deep Representation Learning for Stance Detection in the Context of Checking Fake News