ABSTRACT
Online video chat services, such as Chatroulette, Omegle, and vChatter are becoming increasingly popular and have attracted millions of users. One critical problem encountered in such applications is the presence of misbehaving users ("flashers") and obscene content. Automatically filtering out obscene content from these systems in an efficient manner poses a difficult challenge. This paper presents a novel Fine-Grained Cascaded (FGC) classification solution that significantly speeds up the compute-intensive process of classifying misbehaving users by dividing image feature extraction into multiple stages and filtering out easily classified images in earlier stages, thus saving unnecessary computation costs of feature extraction in later stages. Our work is further enhanced by integrating new webcam-related contextual information (illumination and color) into the classification process, and a 2-stage soft margin SVM algorithm for combining multiple features. Evaluation results using real-world data set obtained from Chatroulette show that the proposed FGC based classification solution significantly outperforms state-of-the-art techniques.
Supplemental Material
- Picblock web site. http://www.cinchworks.com/.Google Scholar
- Chatroulette web site. http://www.chatroulette.com/.Google Scholar
- G. Csurka, C. Dance, L. Fan, J. Willamowski, and C. Bray. Visual categorization with bags of keypoints. In Workshop on statistical learning in computer vision, ECCV, volume 1, page 22, 2004.Google Scholar
- N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR, volume 1, pages 886 --893 vol. 1, june 2005. Google ScholarDigital Library
- T. Deselaers, L. Pimenidis, and H. Ney. Bag-of-visual-words models for adult image classification and filtering. In ICPR 2008., pages 1--4, dec. 2008.Google ScholarCross Ref
- L. Fei-Fei and P. Perona. A bayesian hierarchical model for learning natural scene categories. In CVPR, volume 2, pages 524--531 vol. 2, june 2005. Google ScholarDigital Library
- M. Fleck, D. Forsyth, and C. Bregler. Finding naked people. In ECCV'96, pages 593--602. Springer Berlin / Heidelberg, 1996. Google ScholarDigital Library
- P. Gehler and S. Nowozin. On feature combination for multiclass object classification. In ICCV, pages 221--228. IEEE, 2009.Google ScholarCross Ref
- M. Hammami, Y. Chahir, and L. Chen. Webguard: A web filtering engine combining textual, structural, and visual content-based analysis. IEEE Trans. on Knowl. and Data Eng., 18:272--284, February 2006. Google ScholarDigital Library
- W. Hu, O. Wu, Z. Chen, Z. Fu, and S. Maybank. Recognition of pornographic web pages by classifying texts and images. IEEE Transactions on Pattern Analysis and Machine Intelligence, pages 1019--1034, 2007. Google ScholarDigital Library
- C. Jansohn, A. Ulges, and T. M. Breuel. Detecting Pornographic Video Content by Combining Image Features with Motion Information. In Proceeding of the seventeen ACM international conference on Multimedia, 2009. Google ScholarDigital Library
- M. J. Jones and J. M. Rehg. Statistical color models with application to skin detection. In International Journal of Computer Vision, pages 274--280, 1999. Google ScholarDigital Library
- P. Kakumanu, S. Makrogiannis, and N. Bourbakis. A survey of skin-color modeling and detection methods. Pattern Recognition, 40(3):1106--1122, 2007. Google ScholarDigital Library
- M. Kloft, U. Brefeld, S. Sonnenburg, P. Laskov, K. R. Müller, and A. Zien. Efficient and accurate lp-norm multiple kernel learning. NIPS, 22(22):997--1005, 2009.Google Scholar
- S. Lazebnik, C. Schmid, and J. Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In CVPR, volume 2, pages 2169--2178, 2006. Google ScholarDigital Library
- J.-S. Lee, Y.-M. Kuo, P.-C. Chung, and E.-L. Chen. Naked image detection based on adaptive and extensible skin color model. Pattern Recognition, 40(8):2261--2270, 2007. Google ScholarDigital Library
- P. Lee, S. Hui, and A. Fong. An intelligent categorization engine for bilingual web content filtering. IEEE Transactions on Multimedia, 7(6):1183--1190, dec. 2005. Google ScholarDigital Library
- A. Lopes, S. de Avila, A. Peixoto, R. Oliveira, and A. Araújo. A bag-of-features approach based on hue-sift descriptor for nude detection. In Proceedings of the 17th European Signal Processing Conference, Glasgow, Scotland, 2009.Google Scholar
- D. Lowe. Object recognition from local scale-invariant features. In ICCV, volume 2, pages 1150 --1157 vol.2, 1999. Google ScholarDigital Library
- Flasher detection algorithm aims to clean up video chat. January 19, 2011. http://www.technologyreview.com/blog/arxiv/26281/.Google Scholar
- Myyearbook live web site. http://live.myyearbook.com/.Google Scholar
- T. Ojala, M. Pietikäinen, and D. Harwood. A comparative study of texture measures with classification based on featured distributions. Pattern Recognition, pages 51--59, 1996.Google Scholar
- Omegle web site. http://www.omegle.com/.Google Scholar
- J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In ICCV, pages 1--8, 2007.Google ScholarCross Ref
- P. Pudil, J. Novovicová, and J. Kittler. Floating search methods in feature selection. Pattern recognition letters, 15(11):1119--1125, 1994. Google ScholarDigital Library
- G. Ratsch. Robust boosting via convex optimization: Theory and applications. PhD thesis, University of Potsdam, Potsdam, Germany, 2001.Google Scholar
- Tinychat web site. http://tinychat.com/.Google Scholar
- J. Uijlings, A. Smeulders, and R. Scha. Real-time bag of words, approximately. In Proceeding of the ACM International Conference on Image and Video Retrieval, page 6. ACM, 2009. Google ScholarDigital Library
- A. Vedaldi and B. Fulkerson. VLFeat: An open and portable library of computer vision algorithms. http://www.vlfeat.org/, 2011.Google Scholar
- A. Vedaldi, V. Gulshan, M. Varma, and A. Zisserman. Multiple kernels for object detection. In ICCV, 2009.Google ScholarCross Ref
- A. Vedaldi and A. Zisserman. Efficient additive kernels via explicit feature maps. In CVPR, pages 3539--3546. IEEE, 2010.Google ScholarCross Ref
- X. Xing, Y.-l. Liang, H. Cheng, J. Dang, R. Han, X. Liu, Q. Lv, and S. Mishra. Safevchat: Detecting obscene content and misbehaving users in online video chat services. In WWW, 2011. Google ScholarDigital Library
- J. Ze Wang, J. Li, G. Wiederhold, and O. Firschein. System for screening objectionable images. Computer Communications, 21(15):1355--1360, 1998. Google ScholarDigital Library
Index Terms
- Efficient misbehaving user detection in online video chat services
Recommendations
SafeVchat: detecting obscene content and misbehaving users in online video chat services
WWW '11: Proceedings of the 20th international conference on World wide webOnline video chat services such as Chatroulette, Omegle, and vChatter that randomly match pairs of users in video chat sessions are fast becoming very popular, with over a million users per month in the case of Chatroulette. A key problem encountered in ...
Scalable misbehavior detection in online video chat services
KDD '12: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data miningThe need for highly scalable and accurate detection and filtering of misbehaving users and obscene content in online video chat services has grown as the popularity of these services has exploded in popularity. This is a challenging problem because ...
Detecting Misbehavior in Online Video Chat Services
Randomly pairing up strangers in online video chat services has quickly gained popularity over the past few years. One major issue that has emerged is the existence of misbehaviors and obscene content. Detecting misbehavior in online video chat services ...
Comments