Hot Research Topics in Video Coding and Systems

Gao, Wen; Ma, Siwei

doi:10.1007/978-3-319-14243-2_12

Wen Gao³ &
Siwei Ma³

1359 Accesses

Abstract

This chapter provides a brief overview of the recent hot research topics and challenges in video coding and systems.This chapter consists of four parts. The first part introduces the perceptual coding-related technology, and the second part details the emerging Internet media-oriented coding. The third part talks about the future challenges and the last part summarizes this chapter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Au O, Li S, Zou R, Dai W, Sun L (2012) Digital photo album compression based on global motion compensation and intra/inter prediction. In: 2012 international conference on audio, language and image processing (ICALIP). IEEE, pp 84–90
Google Scholar
Bay H, Tuytelaars T, Van Gool L (2006) SURF: speeded up robust features. In: Computer vision—ECCV 2006. Springer, Heidelberg, pp 404–417
Google Scholar
Chai D, Ngan K (1997) Foreground/background video coding scheme. In: Proceedings of 1997 IEEE international symposium on circuits and systems, ISCAS’97, vol 2. IEEE, pp 1448–1451
Google Scholar
Chandrasekhar V, Takacs G, Chen D, Tsai S, Grzeszczuk R, Girod B (2009) CHOG: compressed histogram of gradients a low bit-rate feature descriptor. In: IEEE conference on computer vision and pattern recognition, CVPR 2009. IEEE, pp 2504–2511
Google Scholar
Chen Z, Guillemot C (2010) Perceptually-friendly H. 264/AVC video coding based on foveated just-noticeable-distortion model. IEEE Trans Circuits Syst Video Technol 20(6):806–819
Article Google Scholar
Chen CP, Chen CS, Chung KL, Lu HI, Tang GY (2004) Image set compression through minimal-cost prediction structures. In: ICIP, pp 1289–1292
Google Scholar
Chen Z, Han J, Ngan KN (2006) Dynamic bit allocation for multiple video object coding. IEEE Trans Multimed 8(6):1117–1124
Article Google Scholar
Chen DM, Tsai SS, Chandrasekhar V, Takacs G, Singh J, Girod B (2009) Tree histogram coding for mobile image matching. In: Data compression conference, DCC’09. IEEE, pp 143–152
Google Scholar
Chen J, Zheng J, Xu F, Villasenor J (2012) Adaptive frequency weighting for high-performance video coding. IEEE Trans Circuits Syst Video Technol 22(7):1027–1036
Article Google Scholar
Chetouani A, Mostafaoui G, Beghdadi A (2009) Deblocking method using a perceptual recursive filter. In: 2009 16th IEEE international conference on image processing (ICIP). IEEE, pp 3925–3928
Google Scholar
Chou CH, Li YC (1995) A perceptually tuned subband image coder based on the measure of just-noticeable-distortion profile. IEEE Trans Circuits Syst Video Technol 5(6):467–476
Article Google Scholar
Dong J, Ngan KN, Fong CK, Cham WK (2009) 2-d order-16 integer transforms for HD video coding. IEEE Trans Circuits Syst Video Technol 19(10):1462–1474
Article Google Scholar
Duan LY, Gao F, Chen J, Lin J, Huang T (2013) Compact descriptors for mobile visual search and MPEG CDVS standardization. In: 2013 IEEE international symposium on circuits and systems (ISCAS). IEEE, pp 885–888
Google Scholar
Eitz M, Richter R, Hildebrand K, Boubekeur T, Alexa M (2011) Photosketcher: interactive sketch-based image synthesis. IEEE Comput Graph Appl 31(6):56–66
Article Google Scholar
Elad M (2010) Sparse and redundant representations: from theory to applications in signal and image processing. Springer, New York
Google Scholar
Horev I, Bryt O, Rubinstein R (2012) Adaptive image compression using sparse dictionaries. In: 2012 19th international conference on systems, signals and image processing (IWSSIP). IEEE, pp 592–595
Google Scholar
Huang T, Dong S, Tian Y (2014) Representing visual objects in HEVC coding loop. IEEE J Emerg Sel Top Circuits Syst 4(1):5–16
Article Google Scholar
Huang YH, Ou TS, Su PY, Chen HH (2010) Perceptual rate-distortion optimization using structural similarity index as quality metric. IEEE Trans Circuits Syst Video Technol 20(11):1614–1624
Article Google Scholar
Itti L (2004) Automatic foveation for video compression using a neurobiological model of visual attention. IEEE Trans Image Process 13(10):1304–1318
Article Google Scholar
Ji R, Duan LY, Chen J, Gao W (2012) Towards compact topical descriptors. In: 2012 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 2925–2932
Google Scholar
Jia Y, Lin W, Kassim AA (2006) Estimating just-noticeable distortion for video. IEEE Trans Circuits Syst Video Technol 16(7):820–829
Article Google Scholar
Karadimitriou K, Tyler JM (1998) The centroid method for compressing sets of similar images. Pattern Recognit Lett 19(7):585–593
Article Google Scholar
Kwon D and Budagavi M (2013) RCE3: Results of test 3.3 on Intra motion compensation, JCTVC-N0205, 14th Meeting, Vienna, AT, 25, 2013
Google Scholar
Lan C, Xu J, Sullivan G, Wu F (2012) Intra transform skipping. In: Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11 Document JCTVC-I0408, 9th Meeting: Geneva
Google Scholar
Lin W (2005) Computational models for just-noticeable difference. In: Digital video image quality and perceptual coding. CRC Press, Boca Raton
Google Scholar
Liu H, Klomp N, Heynderickx I (2010) A perceptually relevant approach to ringing region detection. IEEE Trans Image Process 19(6):1414–1426
Article MathSciNet Google Scholar
Liu A, Lin W, Narwaria M (2012) Image quality assessment based on gradient similarity. IEEE Trans Image Process 21(4):1500–1512
Article MathSciNet Google Scholar
Liu D, Sun X, Wu F, Li S, Zhang YQ (2007) Image compression with edge-based inpainting. IEEE Trans Circuits Syst Video Technol 17(10):1273–1287
Article Google Scholar
Liu D, Sun X, Wu F, Zhang YQ (2008) Edge-oriented uniform intra prediction. IEEE Trans Image Process 17(10):1827–1836
Article MathSciNet Google Scholar
Lou J, Sun MT (2011) Rate-distortion optimized rate-allocation for motion-compensated predictive video codecs using pixelrank. J Vis Commun Image Represent 22(2):107–116
Article Google Scholar
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Article Google Scholar
Ma L, Wu F, Zhao D, Gao W, Ma S (2008) Learning-based image restoration for compressed image through neighboring embedding. In: Advances in multimedia information processing-PCM 2008. Springer, Berlin, pp 279–286
Google Scholar
Ma Z, Xu M, Ou YF, Wang Y (2012) Modeling of rate and perceptual quality of compressed video as functions of frame rate and quantization stepsize and its applications. IEEE Trans Circuits Syst Video Technol 22(5):671–682
Article Google Scholar
Mai ZY, Yang CL, Kuang KZ, Po LM (2006) A novel motion estimation method based on structural similarity for h. 264 inter prediction. In: 2006 IEEE international conference on acoustics, speech and signal processing, ICASSP 2006 Proceedings. IEEE, vol 2, pp II–II
Google Scholar
Makar M, Chandrasekhar V, Tsai S, Chen D, Girod B (2014) Interframe coding of feature descriptors for mobile augmented reality
Google Scholar
Mrak M, Xu JZ (2012) Improving screen content coding in HEVC by transform skipping. In: 2012 proceedings of the 20th European signal processing conference (EUSIPCO). IEEE, pp 1209–1213
Google Scholar
Musatenko YS, Kurashov VN (1998) Correlated image set compression system based on new fast efficient algorithm of Karhunen-Loeve transform. In: Photonics East (ISAM, VVDC, IEMB), international society for optics and photonics, pp 518–529
Google Scholar
Ndjiki-Nya P, Bull D, Wiegand T (2009) Perception-oriented video coding based on texture analysis and synthesis. In: 2009 16th IEEE international conference on image processing (ICIP). IEEE, pp 2273–2276
Google Scholar
Nguyen AG, Hwang JN (2002) A novel hybrid hvpc/mathematical model rate control for low bit-rate streaming video. Signal Process: Image Commun 17(5):423–440
Google Scholar
Olshausen B, Field D (1996) Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381(6583):607
Article Google Scholar
Pan F, Sun Y, Lu Z, Kassim A (2005) Complexity-based rate distortion optimization with perceptual tuning for scalable video coding. In: IEEE international conference on image processing, ICIP 2005, vol 3. IEEE, pp III-37
Google Scholar
Robinson D (1965) The mechanics of human smooth pursuit eye movement. J Physiol 180(3):569
Article Google Scholar
Scharnowski F, Hermens F, Herzog MH (2007) Blochs law and the dynamics of feature fusion. Vis Res 47(18):2444–2452
Article Google Scholar
Seshadrinathan K, Bovik AC (2010) Motion tuned spatio-temporal quality assessment of natural videos. IEEE Trans Image Process 19(2):335–350
Article MathSciNet Google Scholar
Seyler A, Budrikis Z (1959) Measurements of temporal adaptation to spatial detail vision. Nature 184:1215–1217
Article Google Scholar
Sheikh HR, Bovik AC (2006) Image information and visual quality. IEEE Trans Image Process 15(2):430–444
Article Google Scholar
Shi Z, Sun X, Wu F (2014) Photo album compression for cloud storage using local features. IEEE J Emerg Sel Top Circuits Syst 4(1):17–28
Article Google Scholar
Sun C, Wang HJ, Li H (2008) Macroblock-level rate-distortion optimization with perceptual adjustment for video coding. In: Data compression conference, DCC 2008. IEEE, pp 546–546
Google Scholar
Tang CW, Chen CH, Yu YH, Tsai CJ (2006) Visual sensitivity guided bit allocation for video coding. IEEE Trans Multimed 8(1):11–18
Article Google Scholar
Wang Z, Bovik A (2006) Modern image quality assessment (synthesis lectures on image, video, and multimedia processing). San Rafael, CA: Morgan Claypool
Google Scholar
Wang Y, Jiang T, Ma S, Gao W (2012) Novel spatio-temporal structural information based video quality metric. IEEE Trans Circuits Syst Video Technol 22(7):989–998
Article Google Scholar
Wang S, Rehman A, Wang Z, Ma S, Gao W (2013) Perceptual video coding based on SSIM-inspired divisive normalization. IEEE Trans Image Process 22(4):1418–1429
Article MathSciNet Google Scholar
Wang H, Ma M, Jiang YG, Wei Z (2014) A framework of video coding for compressing near-duplicate videos. In: MultiMedia modeling. Springer, Berlin, pp 518–528
Google Scholar
Webster AA, Jones CT, Pinson MH, Voran SD, Wolf S (1993) Objective video quality assessment system based on human perception. In: IS&T/SPIE’s symposium on electronic imaging: science and technology, international society for optics and photonics, pp 15–26
Google Scholar
Weinzaepfel P, Jégou H, Pérez P (2011) Reconstructing an image from its local descriptors. In: 2011 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 337–344
Google Scholar
Wu HR, Reibman AR, Lin W, Pereira F, Hemami SS (2013) Perceptual visual signal compression and transmission. Proc IEEE
Google Scholar
Xiong H, Pan Z, Ye X, Chen CW (2013) Sparse spatio-temporal representation with adaptive regularized dictionary learning for low bit-rate video coding. IEEE Trans Circuits Syst Video Technol 23(4):710–728
Article Google Scholar
Yang X, Ling W, Lu Z, Ong EP, Yao S (2005) Just noticeable distortion model and its applications in video coding. Signal Process: Image Commun 20(7):662–680
Google Scholar
Yeung CH, Au OC, Tang K, Yu Z, Luo E, Wu Y, Tu SF (2011) Compressing similar image sets using low frequency template. In: 2011 IEEE international conference on multimedia and expo (ICME). IEEE, pp 1–6
Google Scholar
Yue H, Sun X, Yang J, Wu F (2013) Cloud-based image coding for mobile devices toward thousands to one compression. IEEE Trans Multimed 15(4):845
Article Google Scholar
Zhai G, Cai J, Lin W, Yang X, Zhang W (2008) Three dimensional scalable video adaptation via user-end perceptual quality assessment. IEEE Trans Broadcast 54(3):719–727
Article Google Scholar
Zhang L, Zhang D, Mou X (2011) FSIM: a feature similarity index for image quality assessment. IEEE Trans Image Process 20(8):2378–2386
Article MathSciNet Google Scholar
Zhang X, Huang T, Tian Y, Gao W (2014) Background-modeling based adaptive prediction for surveillance video coding
Google Scholar
Zhu C, Sun X, Wu F, Li H (2008) Video coding with spatio-temporal texture synthesis and edge-based inpainting. In: 2008 IEEE international conference on multimedia and expo. IEEE, pp 813–816
Google Scholar
Zou R, Au OC, Zhou G, Dai W, Hu W, Wan P (2013) Personal photo album compression and management. In: 2013 IEEE international symposium on circuits and systems (ISCAS). IEEE, pp 1428–1431
Google Scholar

Download references

Author information

Authors and Affiliations

Peking University, Beijing, China
Wen Gao & Siwei Ma

Authors

Wen Gao
View author publications
You can also search for this author in PubMed Google Scholar
Siwei Ma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wen Gao .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Gao, W., Ma, S. (2014). Hot Research Topics in Video Coding and Systems. In: Advanced Video Coding Systems. Springer, Cham. https://doi.org/10.1007/978-3-319-14243-2_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-14243-2_12
Published: 16 January 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14242-5
Online ISBN: 978-3-319-14243-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics