Next Article in Journal
Electron Correlations in Local Effective Potential Theory
Next Article in Special Issue
Evaluation of External Memory Access Performance on a High-End FPGA Hybrid Computer
Previous Article in Journal
Highly Excited States from a Time Independent Density Functional Method
Previous Article in Special Issue
Automatic Generation of Massively Parallel Codes from ExaSlang
Article Menu

Export Article

Open AccessArticle
Computation 2016, 4(3), 29; doi:10.3390/computation4030029

DiamondTorre Algorithm for High-Performance Wave Modeling

1
Keldysh Institute of Applied Mathematics, Moscow 125047, Russia
2
Kintech Lab, Moscow 123298, Russia
These authors contributed equally to this work.
*
Author to whom correspondence should be addressed.
Academic Editor: Demos T. Tsahalis
Received: 12 May 2016 / Revised: 27 June 2016 / Accepted: 8 August 2016 / Published: 12 August 2016
(This article belongs to the Special Issue High Performance Computing (HPC) Software Design)
View Full-Text   |   Download PDF [6186 KB, uploaded 12 August 2016]   |  

Abstract

Effective algorithms of physical media numerical modeling problems’ solution are discussed. The computation rate of such problems is limited by memory bandwidth if implemented with traditional algorithms. The numerical solution of the wave equation is considered. A finite difference scheme with a cross stencil and a high order of approximation is used. The DiamondTorre algorithm is constructed, with regard to the specifics of the GPGPU’s (general purpose graphical processing unit) memory hierarchy and parallelism. The advantages of these algorithms are a high level of data localization, as well as the property of asynchrony, which allows one to effectively utilize all levels of GPGPU parallelism. The computational intensity of the algorithm is greater than the one for the best traditional algorithms with stepwise synchronization. As a consequence, it becomes possible to overcome the above-mentioned limitation. The algorithm is implemented with CUDA. For the scheme with the second order of approximation, the calculation performance of 50 billion cells per second is achieved. This exceeds the result of the best traditional algorithm by a factor of five. View Full-Text
Keywords: LRnLA; wave equation; finite difference; stencil; GPGPU; CUDA LRnLA; wave equation; finite difference; stencil; GPGPU; CUDA
Figures

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. (CC BY 4.0).

Scifeed alert for new publications

Never miss any articles matching your research from any publisher
  • Get alerts for new papers matching your research
  • Find out the new papers from selected authors
  • Updated daily for 49'000+ journals and 6000+ publishers
  • Define your Scifeed now

SciFeed Share & Cite This Article

MDPI and ACS Style

Levchenko, V.; Perepelkina, A.; Zakirov, A. DiamondTorre Algorithm for High-Performance Wave Modeling. Computation 2016, 4, 29.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Computation EISSN 2079-3197 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top