skip to main content
10.1145/3209978.3210144acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

Identifying Clickbait: A Multi-Strategy Approach Using Neural Networks

Authors Info & Claims
Published:27 June 2018Publication History

ABSTRACT

Online media outlets, in a bid to expand their reach and subsequently increase revenue through ad monetisation, have begun adopting clickbait techniques to lure readers to click on articles. The article fails to fulfill the promise made by the headline. Traditional methods for clickbait detection have relied heavily on feature engineering which, in turn, is dependent on the dataset it is built for. The application of neural networks for this task has only been explored partially. We propose a novel approach considering all information found in a social media post. We train a bidirectional LSTM with an attention mechanism to learn the extent to which a word contributes to the post's clickbait score in a differential manner. We also employ a Siamese net to capture the similarity between source and target information. Information gleaned from images has not been considered in previous approaches. We learn image embeddings from large amounts of data using Convolutional Neural Networks to add another layer of complexity to our model. Finally, we concatenate the outputs from the three separate components, serving it as input to a fully connected layer. We conduct experiments over a test corpus of 19538 social media posts, attaining an F1 score of 65.37% on the dataset bettering the previous state-of-the-art, as well as other proposed approaches, feature engineering or otherwise.

References

  1. Ankesh Anand, Tanmoy Chakraborty, and Noseong Park . 2017. We used Neural Networks to Detect Clickbaits: You won't believe what happened Next! Advances in Information Retrieval. 39th European Conference on IR Research (ECIR 17) (Lecture Notes in Computer Science). Springer.Google ScholarGoogle Scholar
  2. Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio . 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014).Google ScholarGoogle Scholar
  3. Prakhar Biyani, Kostas Tsioutsiouliklis, and John Blackmer . 2016. "8 Amazing Secrets for Getting More Clicks": Detecting Clickbaits in News Streams Using Article Informality. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI'16). AAAI Press, 94--100. deftempurl%http://dl.acm.org/citation.cfm?id=3015812.3015827 tempurl Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Abhijnan Chakraborty, Bhargavi Paranjape, Sourya Kakarla, and Niloy Ganguly . 2016. Stop Clickbait: Detecting and preventing clickbaits in online news media. 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) (2016), 9--16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Y. Le Cun, B. Boser, J. S. Denker, R. E. Howard, W. Habbard, L. D. Jackel, and D. Henderson . 1990. Advances in Neural Information Processing Systems 2. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, Chapter Handwritten Digit Recognition with a Back-propagation Network, 396--404. deftempurl%http://dl.acm.org/citation.cfm?id=109230.109279 tempurl Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Pieter-Tjerk de Boer, Dirk P. Kroese, Shie Mannor, and Reuven Y. Rubinstein . 2005. A Tutorial on the Cross-Entropy Method. Annals of Operations Research Vol. 134, 1 (01 Feb . 2005), 19--67.Google ScholarGoogle ScholarCross RefCross Ref
  7. C'ıcero Nogueira Dos Santos and Bianca Zadrozny . 2014. Learning Character-level Representations for Part-of-speech Tagging Proceedings of the 31st International Conference on International Conference on Machine Learning - Volume 32 (ICML'14). JMLR.org, II--1818--II--1826. deftempurl%http://dl.acm.org/citation.cfm?id=3044805.3045095 tempurl Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Xavier Glorot and Yoshua Bengio . 2010. Understanding the difficulty of training deep feedforward neural networks. Aistats, Vol. Vol. 9. 249--256.Google ScholarGoogle Scholar
  9. Sepp Hochreiter and Jürgen Schmidhuber . 1997. Long short-term memory. Neural computation Vol. 9, 8 (1997), 1735--1780. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Quoc Le and Tomas Mikolov . 2014. Distributed representations of sentences and documents Proceedings of the 31st International Conference on Machine Learning (ICML-14). 1188--1196. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. George Loewenstein . 1994. The Psychology of Curiosity: A Review and Reinterpretation. Vol. 116 (07 . 1994), 75--98.Google ScholarGoogle Scholar
  12. Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean . 2013. Efficient Estimation of Word Representations in Vector Space. CoRR Vol. abs/1301.3781 (2013). deftempurl%http://arxiv.org/abs/1301.3781 tempurlGoogle ScholarGoogle Scholar
  13. Paul Neculoiu, Maarten Versteegh, and Mihai Rotaru . 2016. Learning Text Similarity with Siamese Recurrent Networks. (01 . 2016).Google ScholarGoogle Scholar
  14. Martin Potthast, Tim Gollub, Kristof Komlossy, Sebastian Schuster, Matti Wiegmann, Erika Garces, Matthias Hagen, and Benno Stein . 2017. Crowdsourcing a Large Corpus of Clickbait on Twitter (to appear).Google ScholarGoogle Scholar
  15. Martin Potthast, Sebastian Köpsel, Benno Stein, and Matthias Hagen . 2016. Clickbait Detection. In Advances in Information Retrieval. 38th European Conference on IR Research (ECIR 16) (Lecture Notes in Computer Science), bibfieldeditorNicola Ferro, Fabio Crestani, Marie-Francine Moens, Josiane Mothe, Fabrizio Silvestri, Giorgio Maria Di Nunzio, Claudia Hauff, and Gianmaria Silvello (Eds.), Vol. Vol. 9626. Springer, Berlin Heidelberg New York, 810--817.Google ScholarGoogle Scholar
  16. Radim v Rehr uv rek and Petr Sojka . 2010. Software Framework for Topic Modelling with Large Corpora Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. ELRA, Valletta, Malta, 45--50. http://is.muni.cz/publication/884893/enGoogle ScholarGoogle Scholar
  17. Karen Simonyan and Andrew Zisserman . 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR Vol. abs/1409.1556 (2014).Google ScholarGoogle Scholar
  18. Philippe Thomas . 2017. Clickbait Identification using Neural Networks. CoRR Vol. abs/1710.08721 (2017). showeprint{arxiv}1710.08721deftempurl%http://arxiv.org/abs/1710.08721 tempurlGoogle ScholarGoogle Scholar
  19. Matthew D Zeiler . 2012. ADADELTA: an adaptive learning rate method. arXiv preprint arXiv:1212.5701 (2012).Google ScholarGoogle Scholar
  20. Yiwei Zhou . 2017. Clickbait Detection in Tweets Using Self-attentive Network. CoRR Vol. abs/1710.05364 (2017). showeprint{arxiv}1710.05364deftempurl%http://arxiv.org/abs/1710.05364 tempurlGoogle ScholarGoogle Scholar

Index Terms

  1. Identifying Clickbait: A Multi-Strategy Approach Using Neural Networks

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGIR '18: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
          June 2018
          1509 pages
          ISBN:9781450356572
          DOI:10.1145/3209978

          Copyright © 2018 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 27 June 2018

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • short-paper

          Acceptance Rates

          SIGIR '18 Paper Acceptance Rate86of409submissions,21%Overall Acceptance Rate792of3,983submissions,20%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader