ABSTRACT
In this work, we present a fashion-focused Creative Commons dataset that combines general images with a large share of images focused on fashion (i.e., relevant to particular clothing items or fashion accessories). The dataset contains 4810 images and related metadata. In addition, we provide a ground truth for the images' tags. Ground truth generation for large-scale datasets is a necessary but expensive task, and traditional expert-based approaches do not scale. For this reason, we turn to crowdsourcing to collect ground truth labels; in particular, we use the commercial crowdsourcing platform Amazon Mechanical Turk (AMT). Two groups of annotators (trusted annotators known to the authors and crowdsourcing workers on AMT) participated in the ground truth creation. We analyze the annotation agreement between the two groups and discuss applications of the dataset in different contexts. The dataset contributes to research areas such as crowdsourcing for multimedia, multimedia content analysis, and the design of systems that can elicit fashion preferences from users.
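The agreement analysis between the two annotator groups could be approached along the following lines. This is only a minimal sketch: the abstract does not state which agreement measure is used, so Cohen's kappa over per-image majority votes is an assumption, and the function names and the "yes"/"no" labels are hypothetical.

```python
from collections import Counter


def majority_vote(votes):
    """Return the most frequent label among one image's votes (hypothetical helper)."""
    return Counter(votes).most_common(1)[0][0]


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa between two equal-length label sequences.

    Assumption: each sequence holds one aggregated label per image,
    e.g. the majority vote of the trusted group and of the AMT group.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and the same length")
    n = len(labels_a)
    # Observed agreement: fraction of images where both groups agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement from each group's own label frequencies.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    categories = freq_a.keys() | freq_b.keys()
    expected = sum(freq_a[c] * freq_b[c] for c in categories) / (n * n)
    if expected == 1.0:
        # Both groups used a single identical label everywhere.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Toy data, not taken from the dataset: is each image fashion-related?
trusted = ["yes", "yes", "no", "no"]
crowd = [majority_vote(v) for v in (
    ["yes", "yes", "no"],
    ["no", "no", "yes"],
    ["no", "no", "no"],
    ["no", "yes", "no"],
)]
kappa = cohen_kappa(trusted, crowd)
```

Aggregating crowd votes before comparison is one possible design choice; a multirater measure computed over individual votes would be an alternative when workers per image vary.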
Index Terms
- Fashion-focused creative commons social dataset