Abstract
This chapter provides a brief overview of the recent hot research topics and challenges in video coding and systems.This chapter consists of four parts. The first part introduces the perceptual coding-related technology, and the second part details the emerging Internet media-oriented coding. The third part talks about the future challenges and the last part summarizes this chapter.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Au O, Li S, Zou R, Dai W, Sun L (2012) Digital photo album compression based on global motion compensation and intra/inter prediction. In: 2012 international conference on audio, language and image processing (ICALIP). IEEE, pp 84–90
Bay H, Tuytelaars T, Van Gool L (2006) SURF: speeded up robust features. In: Computer vision—ECCV 2006. Springer, Heidelberg, pp 404–417
Chai D, Ngan K (1997) Foreground/background video coding scheme. In: Proceedings of 1997 IEEE international symposium on circuits and systems, ISCAS’97, vol 2. IEEE, pp 1448–1451
Chandrasekhar V, Takacs G, Chen D, Tsai S, Grzeszczuk R, Girod B (2009) CHOG: compressed histogram of gradients a low bit-rate feature descriptor. In: IEEE conference on computer vision and pattern recognition, CVPR 2009. IEEE, pp 2504–2511
Chen Z, Guillemot C (2010) Perceptually-friendly H. 264/AVC video coding based on foveated just-noticeable-distortion model. IEEE Trans Circuits Syst Video Technol 20(6):806–819
Chen CP, Chen CS, Chung KL, Lu HI, Tang GY (2004) Image set compression through minimal-cost prediction structures. In: ICIP, pp 1289–1292
Chen Z, Han J, Ngan KN (2006) Dynamic bit allocation for multiple video object coding. IEEE Trans Multimed 8(6):1117–1124
Chen DM, Tsai SS, Chandrasekhar V, Takacs G, Singh J, Girod B (2009) Tree histogram coding for mobile image matching. In: Data compression conference, DCC’09. IEEE, pp 143–152
Chen J, Zheng J, Xu F, Villasenor J (2012) Adaptive frequency weighting for high-performance video coding. IEEE Trans Circuits Syst Video Technol 22(7):1027–1036
Chetouani A, Mostafaoui G, Beghdadi A (2009) Deblocking method using a perceptual recursive filter. In: 2009 16th IEEE international conference on image processing (ICIP). IEEE, pp 3925–3928
Chou CH, Li YC (1995) A perceptually tuned subband image coder based on the measure of just-noticeable-distortion profile. IEEE Trans Circuits Syst Video Technol 5(6):467–476
Dong J, Ngan KN, Fong CK, Cham WK (2009) 2-d order-16 integer transforms for HD video coding. IEEE Trans Circuits Syst Video Technol 19(10):1462–1474
Duan LY, Gao F, Chen J, Lin J, Huang T (2013) Compact descriptors for mobile visual search and MPEG CDVS standardization. In: 2013 IEEE international symposium on circuits and systems (ISCAS). IEEE, pp 885–888
Eitz M, Richter R, Hildebrand K, Boubekeur T, Alexa M (2011) Photosketcher: interactive sketch-based image synthesis. IEEE Comput Graph Appl 31(6):56–66
Elad M (2010) Sparse and redundant representations: from theory to applications in signal and image processing. Springer, New York
Horev I, Bryt O, Rubinstein R (2012) Adaptive image compression using sparse dictionaries. In: 2012 19th international conference on systems, signals and image processing (IWSSIP). IEEE, pp 592–595
Huang T, Dong S, Tian Y (2014) Representing visual objects in HEVC coding loop. IEEE J Emerg Sel Top Circuits Syst 4(1):5–16
Huang YH, Ou TS, Su PY, Chen HH (2010) Perceptual rate-distortion optimization using structural similarity index as quality metric. IEEE Trans Circuits Syst Video Technol 20(11):1614–1624
Itti L (2004) Automatic foveation for video compression using a neurobiological model of visual attention. IEEE Trans Image Process 13(10):1304–1318
Ji R, Duan LY, Chen J, Gao W (2012) Towards compact topical descriptors. In: 2012 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 2925–2932
Jia Y, Lin W, Kassim AA (2006) Estimating just-noticeable distortion for video. IEEE Trans Circuits Syst Video Technol 16(7):820–829
Karadimitriou K, Tyler JM (1998) The centroid method for compressing sets of similar images. Pattern Recognit Lett 19(7):585–593
Kwon D and Budagavi M (2013) RCE3: Results of test 3.3 on Intra motion compensation, JCTVC-N0205, 14th Meeting, Vienna, AT, 25, 2013
Lan C, Xu J, Sullivan G, Wu F (2012) Intra transform skipping. In: Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11 Document JCTVC-I0408, 9th Meeting: Geneva
Lin W (2005) Computational models for just-noticeable difference. In: Digital video image quality and perceptual coding. CRC Press, Boca Raton
Liu H, Klomp N, Heynderickx I (2010) A perceptually relevant approach to ringing region detection. IEEE Trans Image Process 19(6):1414–1426
Liu A, Lin W, Narwaria M (2012) Image quality assessment based on gradient similarity. IEEE Trans Image Process 21(4):1500–1512
Liu D, Sun X, Wu F, Li S, Zhang YQ (2007) Image compression with edge-based inpainting. IEEE Trans Circuits Syst Video Technol 17(10):1273–1287
Liu D, Sun X, Wu F, Zhang YQ (2008) Edge-oriented uniform intra prediction. IEEE Trans Image Process 17(10):1827–1836
Lou J, Sun MT (2011) Rate-distortion optimized rate-allocation for motion-compensated predictive video codecs using pixelrank. J Vis Commun Image Represent 22(2):107–116
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Ma L, Wu F, Zhao D, Gao W, Ma S (2008) Learning-based image restoration for compressed image through neighboring embedding. In: Advances in multimedia information processing-PCM 2008. Springer, Berlin, pp 279–286
Ma Z, Xu M, Ou YF, Wang Y (2012) Modeling of rate and perceptual quality of compressed video as functions of frame rate and quantization stepsize and its applications. IEEE Trans Circuits Syst Video Technol 22(5):671–682
Mai ZY, Yang CL, Kuang KZ, Po LM (2006) A novel motion estimation method based on structural similarity for h. 264 inter prediction. In: 2006 IEEE international conference on acoustics, speech and signal processing, ICASSP 2006 Proceedings. IEEE, vol 2, pp II–II
Makar M, Chandrasekhar V, Tsai S, Chen D, Girod B (2014) Interframe coding of feature descriptors for mobile augmented reality
Mrak M, Xu JZ (2012) Improving screen content coding in HEVC by transform skipping. In: 2012 proceedings of the 20th European signal processing conference (EUSIPCO). IEEE, pp 1209–1213
Musatenko YS, Kurashov VN (1998) Correlated image set compression system based on new fast efficient algorithm of Karhunen-Loeve transform. In: Photonics East (ISAM, VVDC, IEMB), international society for optics and photonics, pp 518–529
Ndjiki-Nya P, Bull D, Wiegand T (2009) Perception-oriented video coding based on texture analysis and synthesis. In: 2009 16th IEEE international conference on image processing (ICIP). IEEE, pp 2273–2276
Nguyen AG, Hwang JN (2002) A novel hybrid hvpc/mathematical model rate control for low bit-rate streaming video. Signal Process: Image Commun 17(5):423–440
Olshausen B, Field D (1996) Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381(6583):607
Pan F, Sun Y, Lu Z, Kassim A (2005) Complexity-based rate distortion optimization with perceptual tuning for scalable video coding. In: IEEE international conference on image processing, ICIP 2005, vol 3. IEEE, pp III-37
Robinson D (1965) The mechanics of human smooth pursuit eye movement. J Physiol 180(3):569
Scharnowski F, Hermens F, Herzog MH (2007) Blochs law and the dynamics of feature fusion. Vis Res 47(18):2444–2452
Seshadrinathan K, Bovik AC (2010) Motion tuned spatio-temporal quality assessment of natural videos. IEEE Trans Image Process 19(2):335–350
Seyler A, Budrikis Z (1959) Measurements of temporal adaptation to spatial detail vision. Nature 184:1215–1217
Sheikh HR, Bovik AC (2006) Image information and visual quality. IEEE Trans Image Process 15(2):430–444
Shi Z, Sun X, Wu F (2014) Photo album compression for cloud storage using local features. IEEE J Emerg Sel Top Circuits Syst 4(1):17–28
Sun C, Wang HJ, Li H (2008) Macroblock-level rate-distortion optimization with perceptual adjustment for video coding. In: Data compression conference, DCC 2008. IEEE, pp 546–546
Tang CW, Chen CH, Yu YH, Tsai CJ (2006) Visual sensitivity guided bit allocation for video coding. IEEE Trans Multimed 8(1):11–18
Wang Z, Bovik A (2006) Modern image quality assessment (synthesis lectures on image, video, and multimedia processing). San Rafael, CA: Morgan Claypool
Wang Y, Jiang T, Ma S, Gao W (2012) Novel spatio-temporal structural information based video quality metric. IEEE Trans Circuits Syst Video Technol 22(7):989–998
Wang S, Rehman A, Wang Z, Ma S, Gao W (2013) Perceptual video coding based on SSIM-inspired divisive normalization. IEEE Trans Image Process 22(4):1418–1429
Wang H, Ma M, Jiang YG, Wei Z (2014) A framework of video coding for compressing near-duplicate videos. In: MultiMedia modeling. Springer, Berlin, pp 518–528
Webster AA, Jones CT, Pinson MH, Voran SD, Wolf S (1993) Objective video quality assessment system based on human perception. In: IS&T/SPIE’s symposium on electronic imaging: science and technology, international society for optics and photonics, pp 15–26
Weinzaepfel P, Jégou H, Pérez P (2011) Reconstructing an image from its local descriptors. In: 2011 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 337–344
Wu HR, Reibman AR, Lin W, Pereira F, Hemami SS (2013) Perceptual visual signal compression and transmission. Proc IEEE
Xiong H, Pan Z, Ye X, Chen CW (2013) Sparse spatio-temporal representation with adaptive regularized dictionary learning for low bit-rate video coding. IEEE Trans Circuits Syst Video Technol 23(4):710–728
Yang X, Ling W, Lu Z, Ong EP, Yao S (2005) Just noticeable distortion model and its applications in video coding. Signal Process: Image Commun 20(7):662–680
Yeung CH, Au OC, Tang K, Yu Z, Luo E, Wu Y, Tu SF (2011) Compressing similar image sets using low frequency template. In: 2011 IEEE international conference on multimedia and expo (ICME). IEEE, pp 1–6
Yue H, Sun X, Yang J, Wu F (2013) Cloud-based image coding for mobile devices toward thousands to one compression. IEEE Trans Multimed 15(4):845
Zhai G, Cai J, Lin W, Yang X, Zhang W (2008) Three dimensional scalable video adaptation via user-end perceptual quality assessment. IEEE Trans Broadcast 54(3):719–727
Zhang L, Zhang D, Mou X (2011) FSIM: a feature similarity index for image quality assessment. IEEE Trans Image Process 20(8):2378–2386
Zhang X, Huang T, Tian Y, Gao W (2014) Background-modeling based adaptive prediction for surveillance video coding
Zhu C, Sun X, Wu F, Li H (2008) Video coding with spatio-temporal texture synthesis and edge-based inpainting. In: 2008 IEEE international conference on multimedia and expo. IEEE, pp 813–816
Zou R, Au OC, Zhou G, Dai W, Hu W, Wan P (2013) Personal photo album compression and management. In: 2013 IEEE international symposium on circuits and systems (ISCAS). IEEE, pp 1428–1431
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Gao, W., Ma, S. (2014). Hot Research Topics in Video Coding and Systems. In: Advanced Video Coding Systems. Springer, Cham. https://doi.org/10.1007/978-3-319-14243-2_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-14243-2_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14242-5
Online ISBN: 978-3-319-14243-2
eBook Packages: Computer ScienceComputer Science (R0)