Abstract
The eigenvalues and eigenvectors of a symmetric matrix are of interest in a myriad of applications. One of the fastest and most accurate numerical techniques for the eigendecomposition is the Algorithm of Multiple Relatively Robust Representations (MRRR), the first stable algorithm that computes the eigenvalues and eigenvectors of a tridiagonal symmetric matrix in O(n 2) arithmetic operations. In this paper we present a parallelization of the MRRR algorithm for data parallel coprocessors using the CUDA programming environment. The results demonstrate the potential of data-parallel coprocessors for scientific computations: compared to routine sstemr, LAPACK’s implementation of MRRR, our parallel algorithm provides 10-fold speedups.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Golub, G.H., Loan, C.F.V.: Matrix Computations, 3rd edn. Johns Hopkins University Press, Baltimore (1996)
Parlett, B.N.: The Symmetric Eigenvalue Problem. Prentice-Hall, Inc., Upper Saddle River (1998)
Martin, R.S., Reinsch, C., Wilkinson, J.H.: Householder’s Tridiagonalization of a Symmetric Matrix. Numer. Math. 11, 181–195 (1968)
Dhillon, I.S.: A New O(n 2) Algorithm for the Symmetric Tridiagonal Eigenvalue/Eigenvector Problem. PhD thesis, EECS Department, University of California, Berkeley (1997)
Bientinesi, P., Dhillon, I.S., van de Geijn, R.A.: A Parallel Eigensolver for Dense Symmetric Matrices Based on Multiple Relatively Robust Representations. SIAM J. Sci. Comput. 27(1), 43–66 (2005)
NVIDIA Corporation: CUDA Programming Guide. First edn. NVIDIA Corporation, 2701 San Toman Expressway, Santa Clara, CA 95050, USA (2007)
Anderson, E., Bai, Z., Bischof, C., Blackford, S., Demmel, J., Dongarra, J., Du Croz, J., Greenbaum, A., Hammarling, S., McKenney, A., Sorensen, D.: LAPACK Users’ Guide, 3rd edn. Society for Industrial and Applied Mathematics, Philadelphia (1999)
Dhillon, I.S., Parlett, B.N., Vömel, C.: The Design and Implementation of the MRRR Algorithm. ACM Trans. Math. Softw. 32(4), 533–560 (2006)
Dhillon, I.S., Parlett, B.N.: Multiple Representations to Compute Orthogonal Eigenvectors of Symmetric Tridiagonal Matrices. Linear Algebra and its Applications 387(1), 1–28 (2004)
Lessig, C.: Eigenvalue Computation with CUDA. Technical report, NVIDIA Corporation (August 2007)
Harris, M., Sengupta, S., Owens, J.: Parallel Prefix Sum (Scan) with CUDA. In: Nguyen, H. (ed.) GPU Gems 3. Addison Wesley, Reading (August 2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lessig, C., Bientinesi, P. (2010). On Parallelizing the MRRR Algorithm for Data-Parallel Coprocessors. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds) Parallel Processing and Applied Mathematics. PPAM 2009. Lecture Notes in Computer Science, vol 6067. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14390-8_41
Download citation
DOI: https://doi.org/10.1007/978-3-642-14390-8_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14389-2
Online ISBN: 978-3-642-14390-8
eBook Packages: Computer ScienceComputer Science (R0)