ABSTRACT
Analyzing how social media content is created, mutated, and propagated on the Internet is an essential problem in computational social science, with implications for areas ranging from marketing to political mobilization. A first step towards understanding the evolution of images online is the analysis of rapidly mutating and propagating memetic imagery, or "memes". An obstacle to such an investigation, however, is the current inability to produce a robust semantic space for this imagery, one capable of capturing differences between Image Macros. In this study, we take a first step towards the systematic study of image evolution on the Internet by proposing an algorithm, based on sparse representations and deep learning, that decouples the various types of content in such images and produces a rich semantic embedding. We demonstrate the benefits of our approach on a variety of tasks pertaining to memes and Image Macros, such as image clustering, image retrieval, topic prediction, and virality prediction, surpassing existing methods on each. Beyond its utility on quantitative tasks, our method opens up the possibility of obtaining the first large-scale understanding of the evolution and propagation of memetic imagery.
Index Terms
- MemeSequencer: Sparse Matching for Embedding Image Macros