Abstract
Computing platforms are increasingly moving to accelerated architectures, and here we deal particularly with GPUs. In Norman et al. (2011), a method was developed for atmospheric simulation to improve efficiency on large, distributed-memory machines by reducing communication demand and increasing the time step. Here, we improve upon this method to further target GPU-accelerated platforms by reducing GPU memory accesses, removing a synchronization point, and clustering computations. The modified code ran more than two times faster than the original in some cases even though more computations were required, demonstrating the importance of improving memory handling on the GPU. Furthermore, we discovered that the modification also has a near 100 % hit rate in fast, on-chip L1 cache and discuss the reasons for this. Finally, we remark on further potential improvements to GPU efficiency.
This submission was written by the author(s) acting in (his/her/their) own independent capacity and not on behalf of UT-Battelle, LLC, or its affiliates or successors.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ahmad N, Linderman J (2007) Euler solutions using flux-based wave decomposition. Int J Numer Methods Fluids 54:47–72
Capdeville G (2008) A central weno scheme for solving hyperbolic conservation laws on non-uniform meshes. J Comput Phys 227:2977–3014
Cullen MJP, Davies T (1991) A conservative split-explicit integration scheme with fourth-order horizontal advection. Q J R Meteorol Soc 117:993–1002
Durran DR (1991) The third-order Adams-Bashforth method: an attractive alternative to leapfrog time differencing. Monthly Weather Rev 119:702–720
Evans KJ, Rouson DW, Salinger AG, Taylor MA, Weijer W, White IJB (2009). A scalable and adaptable solution framework within components of the community climate system model. In: ICCS 2009 proceedings of the 9th international conference on computational science.
Gassmann A (2005) An improved two-time-level split-explicit integration scheme for non-hydrostatic compressible models. Meteorol Atmos Phys 88:23–38
Giraldo FX, Restelli M (2008) A study of spectral element and discontinuous Galerkin methods for the Navier-Stokes equations in nonhydrostatic mesoscale atmospheric modeling: equation sets and test cases. J Comput Phys 227:3849–3877
Khairoutdinov M, Randall D, DeMott C (2005) Simulations of the atmospheric general circulation using a cloud-resolving model as a superparameterization of physical processes. J Atmos Sci 62:2136–2154
Klemp JB, Skamarock WC, Dudhia J (2007) Conservative split-explicit time integration methods for the compressible nonhydrostatic equations. Monthly Weather Rev 135:2897–2913
Knoll DA, Keyes DE (2004) Jacobian-free Newton-Krylov methods: a survey of approaches and applications. J Comput Phys 193:357–397
Leveque RJ (2002) Finite volume methods for hyperbolic problems. Cambridge University Press, Cambridge
Lin S-J, Rood RB (1997) An explicit flux-form semi-Lagrangian shallow-water model on the sphere. Q J R Meteorol Soc 123:2477–2498
Nair RD, Choi H-W, Tufo HM (2009) Computational aspects of a scalable high-order discontinuous Galerkin atmospheric dynamical core. Comput Fluids 38:309–319
Norman MR, Nair RD, Semazzi FHM (2011) A low communication and large time step explicit finite-volume solver for non-hydrostatic atmospheric dynamics. J Comput Phys 230(4):1567–1584
Staniforth A, Cote J (1991) Semi-lagrangian integration schemes for atmospheric models: a review. Monthly Weather Rev 119(9):2206–2223
Taylor MA, Tribbia JJ, Iskandarani M (1997) The spectral element method for the shallow water equations on the sphere. J Comput Phys 130:92–108
Taylor MA, Edwards J, Thomas S, Nair R (2007) A mass and energy conserving spectral element atmospheric dynamical core on the cubed-sphere grid. J Phys Conf Ser 78:012074
Williamson DL (2007) The evolution of dynamical cores for global atmospheric models. J Meteorol Soc Jpn 85B:241–269
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Norman, M.R. (2013). Targeting Atmospheric Simulation Algorithms for Large, Distributed-Memory, GPU-Accelerated Computers. In: Yuen, D., Wang, L., Chi, X., Johnsson, L., Ge, W., Shi, Y. (eds) GPU Solutions to Multi-scale Problems in Science and Engineering. Lecture Notes in Earth System Sciences. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16405-7_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-16405-7_17
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16404-0
Online ISBN: 978-3-642-16405-7
eBook Packages: Earth and Environmental ScienceEarth and Environmental Science (R0)