Parallel Variable-Length Encoding on GPGPUs

Balevic, Ana

doi:10.1007/978-3-642-14122-5_6

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 6043)

Included in the following conference series:

European Conference on Parallel Processing

Abstract

Variable-Length Encoding (VLE) is a process of reducing input data size by replacing fixed-length data words with codewords of shorter length. As VLE is one of the main building blocks in systems for multimedia compression, its efficient implementation is essential. The massively parallel architecture of modern general-purpose graphics processing units (GPGPUs) has been successfully used to accelerate inherently parallel compression blocks, such as image transforms and motion estimation. VLE, on the other hand, is an inherently serial process, because a variable number of bits must be written to the compressed data stream for each codeword. The introduction of atomic operations on the latest GPGPUs enables many threads to write to the output memory locations in parallel. We present a novel data-parallel algorithm for variable-length encoding using atomic operations, which achieves speedups of up to 35-50x on a CUDA-enabled GPGPU.
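The general idea behind a data-parallel VLE can be illustrated with a small sketch. It is not the algorithm from the chapter, whose details are not given in the abstract; it only shows one common way to remove the serial dependency, under the following assumptions: the codebook `CODEBOOK` and the function name `encode_parallel_style` are hypothetical, the output is packed into 32-bit words, and the bit offset of each codeword is obtained from an exclusive prefix sum over the codeword lengths. The loop over symbols stands in for one thread per symbol, and each `|=` marks the place where a GPU kernel would need an atomic OR, since neighbouring codewords can land in the same output word.

```python
from itertools import accumulate

# Hypothetical prefix-free codebook: symbol -> (codeword bits, length in bits).
CODEBOOK = {
    "a": (0b0, 1),
    "b": (0b10, 2),
    "c": (0b110, 3),
    "d": (0b111, 3),
}

WORD_BITS = 32  # assumed width of one output word


def encode_parallel_style(symbols):
    """Pack codewords into 32-bit words; every symbol is processed independently."""
    # Step 1 (independent per symbol): look up codeword and length.
    codes = [CODEBOOK[s] for s in symbols]
    lengths = [length for _, length in codes]

    # Step 2: an exclusive prefix sum of the lengths gives each codeword's
    # starting bit position in the output stream.
    offsets = [0] + list(accumulate(lengths))[:-1]
    total_bits = sum(lengths)

    # Step 3 (independent per symbol): OR the codeword into the output.
    # On a GPU each "|=" would be an atomic OR, because several threads
    # may write into the same word.
    n_words = (total_bits + WORD_BITS - 1) // WORD_BITS
    out = [0] * n_words
    for (code, length), offset in zip(codes, offsets):
        word, bit = divmod(offset, WORD_BITS)
        room = WORD_BITS - bit  # free bits left in this word
        if length <= room:
            out[word] |= code << (room - length)
        else:
            # The codeword straddles a word boundary: split it in two.
            spill = length - room
            out[word] |= code >> spill
            out[word + 1] |= (code & ((1 << spill) - 1)) << (WORD_BITS - spill)
    return out, total_bits
```

Calling `encode_parallel_style("abcd")` returns the packed words together with the number of valid bits; the unused low-order bits of the last word stay zero. Because the writes only ever set bits, their order does not matter, which is why an atomic OR is sufficient in a truly concurrent version.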

Author information

Authors and Affiliations

University of Stuttgart, Germany
Ana Balevic

Editor information

Editors and Affiliations

Institute for Applied Mathematics, Delft University of Technology, 2628, Delft, The Netherlands
Hai-Xiang Lin
Scaledinfra technologies GmbH, Köllnerhofgasse 3/15A, 1010, Vienna, Austria
Michael Alexander
VTT, Kaitovayla 1, 90570, Oulu, Finland
Martti Forsell
Technische Universität Dresden, 01069, Dresden, Germany
Andreas Knüpfer
Institute for Computer Science, Technical University of Innsbruck, 6020, Innsbruck, Austria
Radu Prodan
Instituto Superior Técnico/INESC-ID., Rua Alves Redol 9, 1000-029, Lisbon, Portugal
Leonel Sousa
Jülich Supercomputing Centre, 52425, Jülich, Germany
Achim Streit

About this paper

Cite this paper

Balevic, A. (2010). Parallel Variable-Length Encoding on GPGPUs. In: Lin, HX., et al. Euro-Par 2009 – Parallel Processing Workshops. Euro-Par 2009. Lecture Notes in Computer Science, vol 6043. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14122-5_6

DOI: https://doi.org/10.1007/978-3-642-14122-5_6
Published: 17 June 2010
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14121-8
Online ISBN: 978-3-642-14122-5
eBook Packages: Computer Science, Computer Science (R0)
