skip to main content
10.1145/2713168.2713178acmconferencesArticle/Chapter ViewAbstractPublication PagesmmsysConference Proceedingsconference-collections
research-article

Video composition by the crowd: a system to compose user-generated videos in near real-time

Published:18 March 2015Publication History

ABSTRACT

To compose high-quality movies directors need life-long learning and talent. User-generated video defines a new era of video production in which non-professionals record videos and share them on platforms such as YouTube. As hiring professional directors results in high costs, our work focuses on replacing those directors by crowdsourcing. The proposed system allows users to record and stream live videos to servers on which workers create a video mashup. A smartphone application for recording live video has been designed that supports the composition in the crowd by a multi-modal analysis of the recording quality. The contributions of this work are: The proposed system demonstrates that composing a large number of video views can be achieved in near real-time. Second, the system achieves comparable video quality for user-generated video in comparison to manual composition. Third, it offers insights on how to design real-time capable crowdsourcing systems. Fourth, by leveraging multi-modal features that can already be evaluated during recording the number of streams considered for presentation can be reduced.

References

  1. S. A. Ay, R. Zimmermann, and S. H. Kim. Viewable scene modeling for geospatial video search. In Proceeding of the 16th ACM international conference on Multimedia - MM '08, pages 309--318, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. S. Bernstein, J. Brandt, R. C. Miller, and D. R. Karger. Crowds in two seconds: enabling realtime crowd-powered interfaces. In ACM Symposium on User Interface Software and Technology, pages 33--42, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. F. Cricri, K. Dabov, I. D. D. Curcio, S. Mate, and M. Gabbouj. Multimodal extraction of events and of information about the recording activity in user generated videos. Multimedia Tools and Applications, 70(1):119--158, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. F. Cricri, M. Roininen, J. Leppaenen, S. Mate, I. Curcio, S. Uhlmann, and M. Gabbouj. Sport type classification of mobile videos. IEEE Transactions on Multimedia, 16(4): 917--932, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. Engström, M. Esbjörnsson, and O. Juhlin. Mobile Collaborative Live Video Mixing. In Int. Conference on Human Computer Interaction with Mobile Devices and Services, pages 157--166, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. R. Graham. An efficient algorithm for determining the convex hull of a finite planar set. In Information Processing Letters, 1972.Google ScholarGoogle Scholar
  7. ITU. ITU-R Recommendation P. 910. Technical report, 2008.Google ScholarGoogle Scholar
  8. ITU. ITU-R Recommendation BT.500. Technical report, 2012.Google ScholarGoogle Scholar
  9. S. Kim, Y. Lu, and G. Constantinou. MediaQ: mobile multimedia management system. In 5th ACM Multimedia Systems Conference, pages 224--235, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. W. S. Lasecki, K. I. Murray, S. White, R. C. Miller, and J. P. Bigham. Real-time crowd control of existing interfaces. In ACM Symposium on User Interface Software and Technology, pages 23--32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. A. Mughal and O. Juhlin. Context-dependent software solutions to handle video synchronization and delay in collaborative live mobile video production. Personal and Ubiquitous Computing, 18(3):709--721, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. J. Murphy, C. D. Miller, W. S. Lasecki, and J. P. Bigham. Adaptive time windows for real-time crowd captioning. In Extended Abstracts on Human Factors in Computing Systems, pages 13--18, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M. K. Saini, R. Gadde, S. Yan, and W. T. Ooi. MoViMash: Online Mobile Video Mashup. In ACM Int. Conference on Multimedia, pages 139--148, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. P. Shrestha, P. H. de With, H. Weda, M. Barbieri, and E. H. Aarts. Automatic mashup generation from multiple-camera concert recordings. In ACM Int. Conference on Multimedia, pages 541--550, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. G. Wang, B. Seo, and R. Zimmermann. Automatic positioning data correction for sensor-annotated mobile videos. In ACM Int. Conference on Advances in Geographic Information Systems, pages 470--473, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Wilk and W. Effelsberg. The influence of camera shakes, harmful occlusions and camera misalignment on the perceived quality in user-generated video. In IEEE Int. Conference on Multimedia and Expo, pages 1--6, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  17. T. Yan, V. Kumar, and D. Ganesan. Crowdsearch: exploiting crowds for accurate real-time image search on mobile phones. In ACM Int. Conference on Mobile Systems, Applications, and Services, pages 77--90, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. K.-c. Yang, C. Guest, and P. Das. Perceptual Sharpness Metric (PSM) for Compressed Video. In IEEE Int. Conference on Multimedia and Expo, pages 777--780, 2006.Google ScholarGoogle Scholar

Index Terms

  1. Video composition by the crowd: a system to compose user-generated videos in near real-time

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MMSys '15: Proceedings of the 6th ACM Multimedia Systems Conference
      March 2015
      277 pages
      ISBN:9781450333511
      DOI:10.1145/2713168

      Copyright © 2015 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 18 March 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      MMSys '15 Paper Acceptance Rate12of41submissions,29%Overall Acceptance Rate176of530submissions,33%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader