Abstract
In this paper, we propose an optimal key frame representation scheme based on global statistics for video shot retrieval. Each pixel in this optimal key frame is constructed by considering the probability of occurrence of those pixels at the corresponding pixel position among the frames in a video shot. Therefore, this constructed key frame is called temporally maximum occurrence frame (TMOF), which is an optimal representation of all the frames in a video shot. The retrieval performance of this representation scheme is further improved by considering the k pixel values with the largest probabilities of occurrence and the highest peaks of the probability distribution of occurrence at each pixel position for a video shot. The corresponding schemes are called k-TMOF and k-pTMOF, respectively. These key frame representation schemes are compared to other histogram-based techniques for video shot representation and retrieval. In the experiments, three video sequences in the MPEG-7 content set were used to evaluate the performances of the different key frame representation schemes. Experimental results show that our proposed representations outperform the alpha-trimmed average histogram for video retrieval.
Index Terms
- A new key frame representation for video segment retrieval
Recommendations
Content-based video retrieval and compression: a unified solution
ICIP '97: Proceedings of the 1997 International Conference on Image Processing (ICIP '97) 3-Volume Set-Volume 1 - Volume 1Video compression and retrieval have been treated as separate problems in the past. We present an object-based video representation that facilitates both compression and retrieval. Typically in retrieval applications, a video sequence is subdivided in ...
Using depth features to retrieve monocular video shots
CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrievalIn recent years, many features have been suggested to facilitate the task of video retrieval. However, up to now, three-dimensional (3D) scene information has not been utilized to retrieve shots from a database consisting of monocular video sequences. ...
On the significance of cluster-temporal browsing for generic video retrieval: a statistical analysis
MM '06: Proceedings of the 14th ACM international conference on MultimediaIn this paper, we test statistically the effect of content-based browsing in generic video retrieval. Using TRECVID 2004 and 2005 experiments, we demonstrate that content-based browsing improves retrieval over sequential queries and relevance feedback. ...
Comments