research-article

Free Access

MemeSequencer: Sparse Matching for Embedding Image Macros

Authors:
Abhimanyu Dubey

Massachusetts Institute of Technology, Cambridge, MA, USA

Massachusetts Institute of Technology, Cambridge, MA, USA
View Profile

,
Esteban Moro

Massachusetts Institute of Technology & Universidad Carlos III de Madrid, Cambridge, MA, USA

Massachusetts Institute of Technology & Universidad Carlos III de Madrid, Cambridge, MA, USA
View Profile

,
Manuel Cebrian

Massachusetts Institute of Technology, Cambridge, MA, USA

Massachusetts Institute of Technology, Cambridge, MA, USA
View Profile

,
Iyad Rahwan

Massachusetts Institute of Technology, Cambridge, MA, USA

Massachusetts Institute of Technology, Cambridge, MA, USA
View Profile

WWW '18: Proceedings of the 2018 World Wide Web ConferenceApril 2018Pages 1225–1235https://doi.org/10.1145/3178876.3186021

Published:10 April 2018Publication History

WWW '18: Proceedings of the 2018 World Wide Web Conference

Pages 1225–1235

ABSTRACT

The analysis of the creation, mutation, and propagation of social media content on the Internet is an essential problem in computational social science, affecting areas ranging from marketing to political mobilization. A first step towards understanding the evolution of images online is the analysis of rapidly modifying and propagating memetic imagery or "memes". However, a pitfall in proceeding with such an investigation is the current incapability to produce a robust semantic space for such imagery, capable of understanding differences in Image Macros. In this study, we provide a first step in the systematic study of image evolution on the Internet, by proposing an algorithm based on sparse representations and deep learning to decouple various types of content in such images and produce a rich semantic embedding. We demonstrate the benefits of our approach on a variety of tasks pertaining to memes and Image Macros, such as image clustering, image retrieval, topic prediction and virality prediction, surpassing the existing methods on each. In addition to its utility on quantitative tasks, our method opens up the possibility of obtaining the first large-scale understanding of the evolution and propagation of memetic imagery.

References

{n. d.}. reddit: the front page of the internet. http://www.reddit.com/. ({n. d.}). Accessed: 2017-09--30.Google Scholar
Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, et al. {n. d.}. TensorFlow: A System for Large-Scale Machine Learning.Google Scholar
Lada A Adamic, Thomas M Lento, Eytan Adar, and Pauline C Ng. 2016. Information evolution in social networks. In Proceedings of the Ninth ACM International Conference on Web Search and Data Mining. ACM, 473--482. Google ScholarDigital Library
Edoardo Amaldi and Viggo Kann. 1998. On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems. Theoretical Computer Science 209, 1--2 (1998), 237--260. Google ScholarDigital Library
Eytan Bakshy, Itamar Rosenn, Cameron Marlow, and Lada Adamic. 2012. The role of social networks in information diffusion. In Proceedings of the 21st international conference on World Wide Web. ACM, 519--528. Google ScholarDigital Library
Albert-László Barabási. 2016. Network science. Cambridge university press.Google Scholar
Jonah Berger and Katherine L Milkman. 2012. What makes online content viral? Journal of marketing research 49, 2 (2012), 192--205.Google ScholarCross Ref
Jonah Berger and Eric M Schwartz. 2011. What drives immediate and ongoing word of mouth? Journal of Marketing Research 48, 5 (2011), 869--880.Google ScholarCross Ref
Vincent Buskens and Jeroen Weesie. 2000. Cooperation via social networks. Analyse & Kritik 22, 1 (2000), 44--74.Google ScholarCross Ref
Vincent Buskens and Kazuo Yamaguchi. 1999. A new model for information diffusion in heterogeneous social networks. Sociological methodology 29, 1 (1999), 281--325.Google Scholar
Emmanuel J Candes, Justin K Romberg, and Terence Tao. 2006. Stable signal recovery from incomplete and inaccurate measurements. Communications on pure and applied mathematics 59, 8 (2006), 1207--1223.Google Scholar
Scott Shaobing Chen, David L Donoho, and Michael A Saunders. 2001. Atomic decomposition by basis pursuit. SIAM review 43, 1 (2001), 129--159. Google ScholarDigital Library
Justin Cheng, Lada Adamic, P Alex Dow, Jon Michael Kleinberg, and Jure Leskovec. 2014. Can cascades be predicted?. In Proceedings of the 23rd international conference on World wide web. ACM, 925--936. Google ScholarDigital Library
Justin Cheng, Lada A Adamic, Jon M Kleinberg, and Jure Leskovec. 2016. Do Cascades Recur?. In Proceedings of the 25th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 671--681. Google ScholarDigital Library
Ronan Collobert and Jason Weston. 2008. A unified architecture for natural language processing: Deep neural networks with multitask learning. In Proceedings of the 25th international conference on Machine learning. ACM, 160--167. Google ScholarDigital Library
Michele Coscia. 2013. Competition and Success in the Meme Pool: A Case Study on Quickmeme. com.Google Scholar
Michele Coscia. 2014. Average is boring: How similarity kills a meme's success. Scientific reports 4 (2014), 6477.Google Scholar
David L Davies and Donald W Bouldin. 1979. A cluster separation measure. IEEE transactions on pattern analysis and machine intelligence 2 (1979), 224--227. Google ScholarDigital Library
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. IEEE, 248--255.Google ScholarCross Ref
Arturo Deza and Devi Parikh. 2015. Understanding image virality. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1818--1826.Google ScholarCross Ref
Jeff Donahue, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng, and Trevor Darrell. 2014. Decaf: A deep convolutional activation feature for generic visual recognition. In International conference on machine learning. 647--655. Google ScholarDigital Library
David L Donoho. 2006. For most large underdetermined systems of linear equations the minimal-norm solution is also the sparsest solution. Communications on pure and applied mathematics 59, 6 (2006), 797--829.Google Scholar
Cícero Nogueira Dos Santos and Maira Gatti. 2014. Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts. In COLING. 69--78.Google Scholar
Abhimanyu Dubey and Sumeet Agarwal. 2017. Modeling Image Virality with Pairwise Spatial Transformer Networks. In Proceedings of the 2017 ACM on Multi- media Conference, MM 2017, Mountain View, CA, USA, October 23--27, 2017. 663--671. Google ScholarDigital Library
Phil Edwards. {n. d.}. The reason every meme uses that one font. https://www.vox.com/2015/7/26/9036993/meme-font-impact/. ({n. d.}). Accessed: 2017-09--30.Google Scholar
James P Gleeson, Kevin P O'Sullivan, Raquel A Baños, and Yamir Moreno. 2016. Effects of network structure, competition and memory time on social spreading phenomena. Physical Review X 6, 2 (2016), 021019.Google Scholar
Alex Graves and Jurgen Schmidhuber. 2005. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks 18, 5 (2005), 602--610. Google ScholarDigital Library
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.Google ScholarCross Ref
Chih-Wei Hsu and Chih-Jen Lin. 2002. A comparison of methods for multiclass support vector machines. IEEE transactions on Neural Networks 13, 2 (2002), 415--425. Google ScholarDigital Library
Thorsten Joachims. 2002. Optimizing search engines using clickthrough data. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 133--142. Google ScholarDigital Library
Eric Jones, Travis Oliphant, Pearu Peterson, et al. 2001--. SciPy: Open source scientific tools for Python. (2001--). http://www.scipy.org/ {Online; accessed 9/30/2017}.Google Scholar
Aditya Khosla, Atish Das Sarma, and Raffay Hamid. 2014. What makes an image popular?. In Proceedings of the 23rd international conference on World wide web. ACM, 867--876. Google ScholarDigital Library
Ryan Kiros, Yukun Zhu, Ruslan R Salakhutdinov, Richard Zemel, Raquel Urtasun, Antonio Torralba, and Sanja Fidler. 2015. Skip-thought vectors. In Advances in neural information processing systems. 3294--3302. Google ScholarDigital Library
Michele Knobel and Colin Lankshear. {n. d.}. Online memes, affinities, and cultural production. ({n. d.}).Google Scholar
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. 1097--1105. Google ScholarDigital Library
Himabindu Lakkaraju, Julian J McAuley, and Jure Leskovec. 2013. What's in a Name? Understanding the Interplay between Titles, Content, and Communities in Social Media. (2013).Google Scholar
Jure Leskovec, Lars Backstrom, and Jon Kleinberg. 2009. Meme-tracking and the dynamics of the news cycle. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 497--506. Google ScholarDigital Library
M Douglas McIlroy. 1960. Macro instruction extensions of compiler languages. Commun. ACM 3, 4 (1960), 214--220. Google ScholarDigital Library
LR Medsker and LC Jain. 2001. Recurrent neural networks. Design and Applications 5 (2001).Google Scholar
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems. 3111--3119. Google ScholarDigital Library
Nasrin Mostafazadeh, Nathanael Chambers, Xiaodong He, Devi Parikh, Dhruv Batra, Lucy Vanderwende, Pushmeet Kohli, and James Allen. 2016. A corpus and evaluation framework for deeper understanding of commonsense stories. arXiv preprint arXiv:1604.01696 (2016).Google Scholar
Ruth Page. 2012. The linguistics of self-branding and micro-celebrity in Twitter: The role of hashtags. Discourse & communication 6, 2 (2012), 181--201.Google Scholar
Adam Paszke, Sam Gross, and Soumith Chintala. 2017. PyTorch. (2017).Google Scholar
Slobodan Petrovic. 2006. A comparison between the silhouette index and the davies-bouldin index in labelling ids clusters. In Proceedings of the 11th Nordic Workshop of Secure IT Systems. 53--64.Google Scholar
R Rehurek and P Sojka. 2011. Gensim--python framework for vector space modelling. NLP Centre, Faculty of Informatics, Masaryk University, Brno, Czech Republic 3, 2 (2011).Google Scholar
Daniel M Romero, Brendan Meeder, and Jon Kleinberg. 2011. Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter. In Proceedings of the 20th international conference on World wide web. ACM, 695--704. Google ScholarDigital Library
Kazumi Saito, Ryohei Nakano, and Masahiro Kimura. 2008. Prediction of information diffusion probabilities for independent cascade model. In Knowledge-based intelligent information and engineering systems. Springer, 67--75. Google ScholarDigital Library
Limor Shifman. 2013. Memes in a digital world: Reconciling with a conceptual troublemaker. Journal of Computer-Mediated Communication 18, 3 (2013), 362--377.Google ScholarCross Ref
Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).Google Scholar
Krishna Kumar Singh and Yong Jae Lee. 2016. End-to-end localization and ranking for relative attributes. In European Conference on Computer Vision. Springer, 753--769.Google ScholarCross Ref
Ray Smith. 2007. An overview of the Tesseract OCR engine. In Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on, Vol. 2. IEEE, 629--633. Google ScholarDigital Library
Thomas W Valente. 1995. Network models of the diffusion of innovations. (1995).Google Scholar
Lilian Weng and Filippo Menczer. 2015. Topicality and impact in social media: diverse messages, focused messengers. PloS one 10, 2 (2015), e0118410.Google ScholarCross Ref
Lilian Weng, Filipspo Menczer, and Yong-Yeol Ahn. 2013. Virality prediction and community structure in social networks. Scientific reports 3 (2013), 2522.Google Scholar
Lilian Weng, Filippo Menczer, and Yong-Yeol Ahn. 2014. Predicting Successful Memes Using Network and Community Structure.. In ICWSM.Google Scholar
John Wright, Allen Y Yang, Arvind Ganesh, S Shankar Sastry, and Yi Ma. 2009. Robust face recognition via sparse representation. IEEE transactions on pattern analysis and machine intelligence 31, 2 (2009), 210--227. Google ScholarDigital Library
Fanyi Xiao and Yong Jae Lee. 2015. Discovering the spatial extent of relative attributes. In Proceedings of the IEEE International Conference on Computer Vision. 1458--1466. Google ScholarDigital Library
Zichao Yang, Diyi Yang, Chris Dyer, Xiaodong He, Alexander J Smola, and Eduard H Hovy. 2016. Hierarchical Attention Networks for Document Classification.. In HLT-NAACL. 1480--1489.Google Scholar
Chenxi Zhang, Jizhou Gao, Oliver Wang, Pierre Georgel, Ruigang Yang, James Davis, Jan-Michael Frahm, and Marc Pollefeys. 2014. Personal photograph enhancement using internet photo collections. IEEE transactions on visualization and computer graphics 20, 2 (2014), 262--275. Google ScholarDigital Library
Ye Zhang and Byron Wallace. 2015. A sensitivity analysis of (and practitioners' guide to) convolutional neural networks for sentence classification. arXiv preprint arXiv:1510.03820 (2015).Google Scholar
Xiaoqing Zheng, Hanyang Chen, and Tianyu Xu. 2013. Deep Learning for Chinese Word Segmentation and POS Tagging. In EMNLP. 647--657.Google Scholar

Index Terms

MemeSequencer: Sparse Matching for Embedding Image Macros
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations
        Image representations
      2. Computer vision tasks
        Visual content-based indexing and retrieval
    2. Knowledge representation and reasoning
      1. Semantic networks
2. Information systems
  1. World Wide Web
    1. Web applications
      1. Social networks

Recommendations

Modeling Image Virality with Pairwise Spatial Transformer Networks
MM '17: Proceedings of the 25th ACM international conference on Multimedia

The study of virality and information diffusion is a topic gaining traction rapidly in the computational social sciences. Computer vision and social network analysis research have also focused on understanding the impact of content and information ...
Read More
Co-design in the wild: a case study on meme creation tools
PDC '16: Proceedings of the 14th Participatory Design Conference: Full papers - Volume 1

The internet meme has become a vital form of self-expression in social communities throughout the Internet. The tools facilitating meme-creation, specifically image macros, have been little-studied but endow non-technical users with the ability to ...
Read More
Measuring influence on Twitter
i-KNOW '11: Proceedings of the 11th International Conference on Knowledge Management and Knowledge Technologies

There are currently over 175 million Twitter accounts worldwide, making Twitter one of the most popular and most observed Social Media platform. But Twitter is not so much a social network where the exchange of personal information is facilitated -- in ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WWW '18: Proceedings of the 2018 World Wide Web Conference
April 2018
2000 pages
ISBN:9781450356398
General Chairs:
Pierre-Antoine Champin
Universitè Claude Bernard Lyon 1, France
,
Fabien Gandon
Inria, Université Côte d'Azur, CNRS, I3S, France
,
Lionel Médini
Université Claude Bernard Lyon 1, France
,
Program Chairs:
Mounia Lalmas
Spotify, UK
,
Panagiotis G. Ipeirotis
New York University, USA
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
International World Wide Web Conferences Steering Committee
Republic and Canton of Geneva, Switzerland
Publication History
- Published: 10 April 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
content understanding
embeddings
feature extraction
image macros
image virality
social network analysis
sparse representation
Qualifiers
- research-article
Conference

Acceptance Rates
WWW '18 Paper Acceptance Rate170of1,155submissions,15%Overall Acceptance Rate1,899of8,196submissions,23%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 15
  Total Citations
  View Citations
- 714
  Total Downloads
- Downloads (Last 12 months)115
- Downloads (Last 6 weeks)17
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

MemeSequencer: Sparse Matching for Embedding Image Macros

WWW '18: Proceedings of the 2018 World Wide Web Conference

ABSTRACT

References

Cited By

Index Terms

Recommendations

Modeling Image Virality with Pairwise Spatial Transformer Networks

Co-design in the wild: a case study on meme creation tools

Measuring influence on Twitter