ABSTRACT
User Generated Content (UGC) is re-shaping the way people watch video and TV, with millions of video producers and consumers. In particular, UGC sites are creating new viewing patterns and social interactions, empowering users to be more creative, and developing new business opportunities. To better understand the impact of UGC systems, we have analyzed YouTube, the world's largest UGC VoD system. Based on a large amount of data collected, we provide an in-depth study of YouTube and other similar UGC systems. In particular, we study the popularity life-cycle of videos, the intrinsic statistical properties of requests and their relationship with video age, and the level of content aliasing or of illegal content in the system. We also provide insights on the potential for more efficient UGC VoD systems (e.g. utilizing P2P techniques or making better use of caching). Finally, we discuss the opportunities to leverage the latent demand for niche videos that are not reached today due to information filtering effects or other system scarcity distortions. Overall, we believe that the results presented in this paper are crucial in understanding UGC systems and can provide valuable information to ISPs, site administrators, and content owners with major commercial and technical implications.
- Daum UCC. http://ucc.daum.net.Google Scholar
- Imdb statistics. http://www.imdb.com/database_statistics.Google Scholar
- Lovefilm. http://www.lovefilm.com.Google Scholar
- Netflix prize. http://www.netflixprize.com.Google Scholar
- Yahoo! Movies. http://movies.yahoo.com.Google Scholar
- YouTube. http://www.youtube.com.Google Scholar
- Surveys: Internet Traffic Touched by YouTube, January 2006. http://www.lightreading.com/document.asp?doc_id=115816.Google Scholar
- L. Amaral, A. Scala, M. Barthélémy, and H. E. Stanley. Classes of Small-World Networks. In Proc. Natl. Acad. Sci. USA, 2000.Google ScholarCross Ref
- C. Anderson. A Problem With the LongTail. http://www.longtail.com/scifoo.ppt.Google Scholar
- C. Anderson. The Long Tail: Why the Future of Business Is Selling Less of More. Hyperion, 2006. Google ScholarDigital Library
- E. Auchard. Participation on Web 2.0 Sites Remains Weak, April 2007. http://www.reuters.com/article/internetNews/idUSN1743638820070418.Google Scholar
- A.-L. Barabási and R. Albert. Emergence of Scaling in Random Networks. Science, 286:509--512, 1999.Google Scholar
- S. Bausch and L. Han. YouTube U.S. Web Traffic Grows 75 Percent Week over Week, July 2006. Neilsen/Netratings, http://www.nielsen-netratings.com/pr/pr_060721_2.pdf.Google Scholar
- B. Cheng, X. Liu, Z. Zhang, and H. Jin. A Measurement Study of a Peer-to-Peer Video-on-Demand System. In Proc. of IPTPS, 2007.Google Scholar
- J. Cho and S. Roy. Impact of Search Engines on Page Popularity. In Proc. of WWW, 2004. Google ScholarDigital Library
- C. Costa, I. Cunha, A. Borges, C. Ramos, M. Rocha, J. Almeida, and B. Ribeiro-Neto. Analyzing client interactivity in streaming media. In Proc. of WWW, 2004. Google ScholarDigital Library
- M. E. Crovella and A. Bestavros. Self-Similarity in World Wide Web Traffic: Evidence and Possible Causes. IEEE/ACM ToN, 5(6):835--846, 1997. Google ScholarDigital Library
- T. Do, K. A. Hua, and M. Tantaoui. P2VoD: Providing Fault Tolerant Video-on-Demand Streaming in Peer-to-Peer Environment. Proc. of IEEE ICC, 2004.Google ScholarCross Ref
- A. B. Downey. The Structural Cause of File Size Distributions. In Proc. of IEEE MASCOTS, 2001. Google ScholarDigital Library
- T. Fenner, M. Levene, and G. Loizou. A Stochastic Evolutionary Model Exhibiting Power-Law Behaviour with an Exponential Cutoff. Physica, (13), 2005.Google Scholar
- S. Fortunato, A. Flammini, F. Menczer, and A. Vespignani. Topical Interests and the Mitigation of Search Engine Bias. In Proc. Natl. Acad. Sci. USA, 2006.Google ScholarCross Ref
- C. Gkantsidis, T. Karagiannis, P. Rodriguez, and M. Vojnovic. Planet Scale Software Updates. In Proc. of ACM SIGCOMM, 2006. Google ScholarDigital Library
- L. Gomes. Will all of us get our 15 minutes on a youtube video?, The Wall Street Journal Online, August 2006.Google Scholar
- C. Griwodz, M. Biig, and L. C. Wolf. Long-term Movie Popularity Models in Video-on-Demand Systems. In Proc. of ACM Multimedia, 1997. Google ScholarDigital Library
- S. Guha, S. Annapureddy, C. Gkantsidis, D. Gunawardena, and P. Rodriguez. Is High-Quality VoD Feasible using P2P Swarming? In Proc. of WWW, 2007. Google ScholarDigital Library
- K. P. Gummadi, R. J. Dunn, S. Saroiu, S. D. Gribble, H. M. Levy, and J. Zahorjan. Measurement, Modeling, and Analysis of a Peer-to-Peer File-Sharing Workload. In Proc. of ACM SOSP, 2003. Google ScholarDigital Library
- Y. Guo, K. Suh, J. Kurose, and D. Towsley. P2Cast: Peer-to-Peer Patching Scheme for VoD Service. In Proc. of WWW, 2003. Google ScholarDigital Library
- B. Holt, H. R. Lynn, and M. Sowers. Analysis of Copyrighted Videos on YouTube.com. http://www.vidmeter.com/i/vidmeter_copyright_report.pdf.Google Scholar
- C. Huang, J. Li, and K. Ross. Peer-Assisted VoD: Making Internet Video Distribution Cheap. In Proc. of IPTPS, 2007.Google Scholar
- Y. Ijiri and H. Simon. Skew Distributions and the Size of Business Firms. North Holland, Amsterdam, 1977.Google Scholar
- D. A. L. Li, J. Doyle, and W. Willinger. Towards a Theory of Scale-Free Graphs: Definition, Properties, and Implications. Internet Mathematics, 2(4), 2006.Google Scholar
- E. Limpert, W. A. Stahel, and M. Abbt. Log-normal Distributions across the Sciences: Keys and Clues. BioScience, 51(5):341, 2001.Google ScholarCross Ref
- N. Magharei and R. Rejaie. PRIME: Peer-to-Peer Receiver-drIven MEsh-based Streaming. In Proc. of IEEE INFOCOM, 2007.Google Scholar
- N. Miller. Manifesto for a New Age. Wired Magazine, March 2007.Google Scholar
- M. Mitzenmacher. A Brief History of Generative Models for Power Law and Lognormal Distributions. Internet Mathematics, 1(2):226--251, 2004.Google ScholarCross Ref
- S. Mossa, M. Barthélémy, H. E. Stanley, and L. A. N. Amaral1. Truncation of Power Law Behavior in "Scale-Free" Network Models due to Information Filtering. Phys. Rev. Lett., (13), 2002.Google Scholar
- M. E. J. Newman. Power laws, Pareto distributions and Zipf 's law. Contemporary Physics, 46:323, 2005.Google ScholarCross Ref
- V. M. W. Gong, Y. Liu and D. Towsley. On the Tails of Web File Size Distributions. In Proc. of 39th Allerton Conference on Communication, Control, and Computing, 2001.Google Scholar
- H. Yu, D. Zheng, B. Y. Zhao, and W. Zheng. Understanding User Behavior in Large-Scale Video-on-Demand Systems. In Proc. of ACM Eurosys, 2006. Google ScholarDigital Library
- G. U. Yule. A Mathematical Theory of Evolution, Based on the Conclusions of Dr. J. C. Willis, F. R. S. Royal Society of London Philosophical Transactions Series B, 213:21--87, 1925.Google ScholarCross Ref
Index Terms
I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system
Recommendations
Exploring the user-generated content (UGC) uploading behavior on youtube
WWW '14 Companion: Proceedings of the 23rd International Conference on World Wide WebYouTube is the world's largest video sharing platform where both professional and non-professional users participate in creating, uploading, and viewing content. In this work, we analyze content in the music category created by the non-professionals, ...
CPH-VoD: A Novel CDN---P2P-Hybrid Architecture Based VoD Scheme
11th International Conference on Web Information Systems Engineering --- WISE 2010 - Volume 6488Taking advantages of both CDN and P2P networks has been considered as a feasible solution for large-scale video stream delivering systems. Recent researches have shown great interested in CDN-P2P-hybrid architecture and ISP-friendly P2P content ...
On adapting HTTP protocol to content centric networking
CFI '12: Proceedings of the 7th International Conference on Future Internet TechnologiesDesigned around host-reachability, today's Internet architecture faces many limitations while serving content-oriented applications which generate most traffic load to the Internet. CCN (Content Centric Networking) [1] is one of the most important ...
Comments