research-article

Video composition by the crowd: a system to compose user-generated videos in near real-time

Authors:
Stefan Wilk

TU Darmstadt, Germany

TU Darmstadt, Germany
View Profile

,
Stephan Kopf

University of Mannheim

University of Mannheim
View Profile

,
Wolfgang Effelsberg

TU Darmstadt, Germany

TU Darmstadt, Germany
View Profile

MMSys '15: Proceedings of the 6th ACM Multimedia Systems ConferenceMarch 2015Pages 13–24https://doi.org/10.1145/2713168.2713178

Published:18 March 2015Publication History

MMSys '15: Proceedings of the 6th ACM Multimedia Systems Conference

Pages 13–24

ABSTRACT

To compose high-quality movies directors need life-long learning and talent. User-generated video defines a new era of video production in which non-professionals record videos and share them on platforms such as YouTube. As hiring professional directors results in high costs, our work focuses on replacing those directors by crowdsourcing. The proposed system allows users to record and stream live videos to servers on which workers create a video mashup. A smartphone application for recording live video has been designed that supports the composition in the crowd by a multi-modal analysis of the recording quality. The contributions of this work are: The proposed system demonstrates that composing a large number of video views can be achieved in near real-time. Second, the system achieves comparable video quality for user-generated video in comparison to manual composition. Third, it offers insights on how to design real-time capable crowdsourcing systems. Fourth, by leveraging multi-modal features that can already be evaluated during recording the number of streams considered for presentation can be reduced.

References

S. A. Ay, R. Zimmermann, and S. H. Kim. Viewable scene modeling for geospatial video search. In Proceeding of the 16th ACM international conference on Multimedia - MM '08, pages 309--318, 2008. Google ScholarDigital Library
M. S. Bernstein, J. Brandt, R. C. Miller, and D. R. Karger. Crowds in two seconds: enabling realtime crowd-powered interfaces. In ACM Symposium on User Interface Software and Technology, pages 33--42, 2011. Google ScholarDigital Library
F. Cricri, K. Dabov, I. D. D. Curcio, S. Mate, and M. Gabbouj. Multimodal extraction of events and of information about the recording activity in user generated videos. Multimedia Tools and Applications, 70(1):119--158, 2012. Google ScholarDigital Library
F. Cricri, M. Roininen, J. Leppaenen, S. Mate, I. Curcio, S. Uhlmann, and M. Gabbouj. Sport type classification of mobile videos. IEEE Transactions on Multimedia, 16(4): 917--932, 2014. Google ScholarDigital Library
A. Engström, M. Esbjörnsson, and O. Juhlin. Mobile Collaborative Live Video Mixing. In Int. Conference on Human Computer Interaction with Mobile Devices and Services, pages 157--166, 2008. Google ScholarDigital Library
R. Graham. An efficient algorithm for determining the convex hull of a finite planar set. In Information Processing Letters, 1972.Google Scholar
ITU. ITU-R Recommendation P. 910. Technical report, 2008.Google Scholar
ITU. ITU-R Recommendation BT.500. Technical report, 2012.Google Scholar
S. Kim, Y. Lu, and G. Constantinou. MediaQ: mobile multimedia management system. In 5th ACM Multimedia Systems Conference, pages 224--235, 2014. Google ScholarDigital Library
W. S. Lasecki, K. I. Murray, S. White, R. C. Miller, and J. P. Bigham. Real-time crowd control of existing interfaces. In ACM Symposium on User Interface Software and Technology, pages 23--32. Google ScholarDigital Library
M. A. Mughal and O. Juhlin. Context-dependent software solutions to handle video synchronization and delay in collaborative live mobile video production. Personal and Ubiquitous Computing, 18(3):709--721, 2013. Google ScholarDigital Library
M. J. Murphy, C. D. Miller, W. S. Lasecki, and J. P. Bigham. Adaptive time windows for real-time crowd captioning. In Extended Abstracts on Human Factors in Computing Systems, pages 13--18, 2013. Google ScholarDigital Library
M. K. Saini, R. Gadde, S. Yan, and W. T. Ooi. MoViMash: Online Mobile Video Mashup. In ACM Int. Conference on Multimedia, pages 139--148, 2012. Google ScholarDigital Library
P. Shrestha, P. H. de With, H. Weda, M. Barbieri, and E. H. Aarts. Automatic mashup generation from multiple-camera concert recordings. In ACM Int. Conference on Multimedia, pages 541--550, 2010. Google ScholarDigital Library
G. Wang, B. Seo, and R. Zimmermann. Automatic positioning data correction for sensor-annotated mobile videos. In ACM Int. Conference on Advances in Geographic Information Systems, pages 470--473, 2012. Google ScholarDigital Library
S. Wilk and W. Effelsberg. The influence of camera shakes, harmful occlusions and camera misalignment on the perceived quality in user-generated video. In IEEE Int. Conference on Multimedia and Expo, pages 1--6, 2014.Google ScholarCross Ref
T. Yan, V. Kumar, and D. Ganesan. Crowdsearch: exploiting crowds for accurate real-time image search on mobile phones. In ACM Int. Conference on Mobile Systems, Applications, and Services, pages 77--90, 2010. Google ScholarDigital Library
K.-c. Yang, C. Guest, and P. Das. Perceptual Sharpness Metric (PSM) for Compressed Video. In IEEE Int. Conference on Multimedia and Expo, pages 777--780, 2006.Google Scholar

Index Terms

Video composition by the crowd: a system to compose user-generated videos in near real-time
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
        Video summarization

Recommendations

Modus Operandi of Crowd Workers: The Invisible Role of Microtask Work Environments

The ubiquity of the Internet and the widespread proliferation of electronic devices has resulted in flourishing microtask crowdsourcing marketplaces, such as Amazon MTurk. An aspect that has remained largely invisible in microtask crowdsourcing is that ...
Read More
Quantifying the Invisible Labor in Crowd Work
CSCW2

Crowdsourcing markets provide workers with a centralized place to find paid work. What may not be obvious at first glance is that, in addition to the work they do for pay, crowd workers also have to shoulder a variety of unpaid invisible labor in these ...
Read More
Crowd Anatomy Beyond the Good and Bad: Behavioral Traces for Crowd Worker Modeling and Pre-selection
Abstract
The suitability of crowdsourcing to solve a variety of problems has been investigated widely. Yet, there is still a lack of understanding about the distinct behavior and performance of workers within microtasks. In this paper, we first introduce a ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MMSys '15: Proceedings of the 6th ACM Multimedia Systems Conference
March 2015
277 pages
ISBN:9781450333511
DOI:10.1145/2713168
General Chair:
Wei Tsang Ooi
National University of Singapore, Singapore
,
Program Chairs:
Wu-chi Feng
Portland State University
,
Feng Liu
Portland State University
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 18 March 2015
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
crowdsourcing
video composition
video mashup
Qualifiers
- research-article
Conference

Acceptance Rates
MMSys '15 Paper Acceptance Rate12of41submissions,29%Overall Acceptance Rate176of530submissions,33%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 9
  Total Citations
  View Citations
- 362
  Total Downloads
- Downloads (Last 12 months)10
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Video composition by the crowd: a system to compose user-generated videos in near real-time

MMSys '15: Proceedings of the 6th ACM Multimedia Systems Conference

ABSTRACT

References

Cited By

Index Terms

Recommendations

Modus Operandi of Crowd Workers: The Invisible Role of Microtask Work Environments

Quantifying the Invisible Labor in Crowd Work

Crowd Anatomy Beyond the Good and Bad: Behavioral Traces for Crowd Worker Modeling and Pre-selection