Abstract
A novel motion activity descriptor and its extraction from a compressed MPEG (MPEG-1/2) video stream are presented. The descriptor consists of two parts, a temporal descriptor and a spatial descriptor. To get the temporal descriptor, the “motion intensity” is first computed based on P frame macroblock information. Then the motion intensity histogram is generated for a given video unit as the temporal descriptor. To get the spatial descriptor, the average magnitude of the motion vector in a P frame is used to threshold the macro-blocks into “zero” and “non-zero” types. The average magnitude of the motion vectors and three types of runs of zeros in the frame are then taken as the spatial descriptor. Experimental results show that the proposed descriptor is fast, and that the combination of the temporal and spatial attributes is effective. Key elements of the intensity parameter, spatial parameters and the temporal histogram of the descriptor have been adopted by the draft MPEG-7 standard [10].
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Reference
Shih-Fu Chang, William Chen, Horace J.Meng, Hari Sundaram and Di Zhong, "A Fully Automated Content-Based Video Search Engine Supporting Spatiotemoral Queries", IEEE Trans. on Circuits and Systems for Video Technology, 8(5), pp.602–615, 1998.
A. Divakaran and H. Sun, “A Descriptor for spatial distribution of motion activity”, Proc. SPIE Conf. on Storage and Retrieval from Image and Video Databases, San Jose, CA 24-28 Jan. 2000.
Y. Deng and B.S. Manjunath, "NeTra-V: toward an object-based video representation", IEEE Transactions on Circuits and Systems for Video Technology, vol.8, (no.5), p.616-27, Sep 1998.
A. Gersho and R.M. Gray, “Vector Quantization and Signal Compression,” Kluwer Academis, 1991
B. G. Haskell, A. Puri and A. N. Netravali, “Digital Video: An Introduction to MPEG 2, ” Chapman and Hall, 1997.
W. Y. Ma and B. S. Manjunath, "NETRA: A toolbox for navigat ing large image databases," IEEE International Conference on Image Processing, pp. 568–571,1997
X. Sun, M. Kankanhalli, Y. Zhu and J. Wu, “Content-Based Representative Frame Extraction for Digital Video,” International Conference on Multimedia Computing and Systems, pp. 190–194, 1998
Hongjiang Zhang, Chien-Yong Low, and Stephen W. Smoliar, “Video parsing and browsing using compressed data,” Multimedia Tools and Applications, 1(1): pp.89–111, 1995.
URL: http://www.cselt.it/mpeg/, official MPEG site.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sun, X., Divakaran, A., Manjunath, B.S. (2001). A Motion Activity Descriptor and Its Extraction in Compressed Domain. In: Shum, HY., Liao, M., Chang, SF. (eds) Advances in Multimedia Information Processing — PCM 2001. PCM 2001. Lecture Notes in Computer Science, vol 2195. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45453-5_58
Download citation
DOI: https://doi.org/10.1007/3-540-45453-5_58
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42680-6
Online ISBN: 978-3-540-45453-3
eBook Packages: Springer Book Archive