demonstration

repoVizz: a framework for remote storage, browsing, annotation, and exchange of multi-modal data

Authors:
Oscar Mayor

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
Quim Llimona

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
Marco Marchini

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
Panos Papiotis

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

,
Esteban Maestre

Universitat Pompeu Fabra, Barcelona, Spain

Universitat Pompeu Fabra, Barcelona, Spain
View Profile

MM '13: Proceedings of the 21st ACM international conference on MultimediaOctober 2013Pages 415–416https://doi.org/10.1145/2502081.2502247

Published:21 October 2013Publication History

MM '13: Proceedings of the 21st ACM international conference on Multimedia

Pages 415–416

ABSTRACT

In this technical demo we present repoVizz (http://repovizz.upf.edu), an integrated online system capable of structural formatting and remote storage, browsing, exchange, annotation, and visualization of synchronous multi-modal, time-aligned data. Motivated by a growing need for data-driven collaborative research, repoVizz aims to resolve commonly encountered difficulties in sharing or browsing large collections of multi-modal data. At its current state, repoVizz is designed to hold time-aligned streams of heterogeneous data: audio, video, motion capture, physiological signals, extracted descriptors, annotations, et cetera. Most popular formats for audio and video are supported, while Broadcast WAVE or CSV formats are adopted for streams other than audio or video (e.g., motion capture or physiological signals). The data itself is structured via customized XML files, allowing the user to (re-) organize multi-modal data in any hierarchical manner, as the XML structure only holds metadata and pointers to data files. Datasets are stored in an online database, allowing the user to interact with the data remotely through a powerful HTML5 visual interface accessible from any standard web browser; this feature can be considered a key aspect of repoVizz since data can be explored, annotated, or visualized from any location or device. Data exchange and upload/download is made easy and secure via a number of data conversion tools and a user/permission management system.

Supplemental Material

mm164de.mp4

mp4

47 MB

Download

References

O. Mayor and J. Llop and E. Maestre, Repovizz: a multimodal on-line database and browsing tool for music performane research, Proceedings of Int. Symposium for Music Information Retrieval, 2011.Google Scholar

Index Terms

repoVizz: a framework for remote storage, browsing, annotation, and exchange of multi-modal data

Recommendations

Semiotic schemas: A framework for grounding language in action and perception
Special volume on connecting language to the world

A theoretical framework for grounding language is introduced that provides a computational path from sensing and motor action to words and speech acts. The approach combines concepts from semiotics and schema theory to develop a holistic approach to ...
Read More
A probabilistic multimodal approach for predicting listener backchannels

During face-to-face interactions, listeners use backchannel feedback such as head nods as a signal to the speaker that the communication is working and that they should continue speaking. Predicting these backchannel opportunities is an important ...
Read More
An extension of the multimodal presentation markup language (MPML) to a three-dimensional VRML space

We are conducting research into multimodal presentations that make use of anthropomorphic character agents as a new type of multimodal media that can be used to effectively communicate information in conjunction with the World Wide Web (WWW) and we are ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MM '13: Proceedings of the 21st ACM international conference on Multimedia
October 2013
1166 pages
ISBN:9781450324045
DOI:10.1145/2502081
General Chairs:
Alejandro (Alex) Jaimes
Yahoo!, Spain
,
Nicu Sebe
University of Trento, Italy
,
Nozha Boujemaa
INRIA, France
,
Program Chairs:
Daniel Gatica-Perez
IDIAP & EPFL, Switzerland
,
David A. Shamma
Yahoo!, USA
,
Marcel Worring
University of Amsterdam, The Netherlands
,
Roger Zimmermann
National University of Singapore, Singapore
Copyright © 2013 Copyright is held by the owner/author(s)
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 21 October 2013
Check for updates
Author Tags
multimodal
Qualifiers
- demonstration
Conference

Acceptance Rates
MM '13 Paper Acceptance Rate47of235submissions,20%Overall Acceptance Rate995of4,171submissions,24%
More
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 7
  Total Citations
  View Citations
- 190
  Total Downloads
- Downloads (Last 12 months)9
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

repoVizz: a framework for remote storage, browsing, annotation, and exchange of multi-modal data

MM '13: Proceedings of the 21st ACM international conference on Multimedia

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

Semiotic schemas: A framework for grounding language in action and perception

A probabilistic multimodal approach for predicting listener backchannels

An extension of the multimodal presentation markup language (MPML) to a three-dimensional VRML space