
516 Results Found

  • Article
  • Open Access
2 Citations
3,068 Views
23 Pages

17 July 2023

The LU factorization of very large sparse matrices requires a significant amount of computing resources, including memory and broadband communication. A hybrid MPI + OpenMP + CUDA algorithm named SuperLU3D can efficiently compute the LU factorization...

  • Article
  • Open Access
1 Citation
2,437 Views
21 Pages

13 April 2023

In this paper, a GPU-accelerated Cholesky decomposition technique and a coupled anisotropic random field are suggested for use in the modeling of diversion tunnels. Combining the advantages of GPU and CPU processing with MATLAB programming control yi...

  • Article
  • Open Access
6 Citations
3,359 Views
18 Pages

The importance of autonomous marine vehicles is increasing in a wide range of ocean science and engineering applications. Multi-objective optimization, where trade-offs between multiple conflicting objectives are achieved (such as minimizing expected...

  • Article
  • Open Access
16 Citations
3,709 Views
16 Pages

A GPU-Accelerated and LTS-Based Finite Volume Shallow Water Model

  • Peng Hu,
  • Zixiong Zhao,
  • Aofei Ji,
  • Wei Li,
  • Zhiguo He,
  • Qifeng Liu,
  • Youwei Li and
  • Zhixian Cao

15 March 2022

This paper presents a GPU (Graphics Processing Unit)-accelerated and LTS (Local-time-Step)-based finite volume Shallow Water Model (SWM). The model performance is compared against the other five model versions (Single CPU versions with/without LTS, M...

  • Article
  • Open Access
7 Citations
2,640 Views
33 Pages

19 June 2023

In this work, we introduce a scalable and efficient GPU-accelerated methodology for volumetric particle advection and finite-time Lyapunov exponent (FTLE) calculation, focusing on the analysis of Lagrangian coherent structures (LCS) in large-scale di...

  • Article
  • Open Access
1 Citation
2,551 Views
15 Pages

Lightweight GPU-Accelerated Parallel Processing of the SCHISM Model Using CUDA Fortran

  • Hongchun Zhang,
  • Qian Cao,
  • Changmao Wu,
  • Guangjun Xu,
  • Yuli Liu,
  • Xingru Feng,
  • Meibing Jin and
  • Changming Dong

The SCHISM model is widely used for ocean numerical simulations, but its computational efficiency is constrained by the substantial resources it requires. To enhance its performance, this study develops GPU–SCHISM, a GPU-accelerated parallel ve...

  • Article
  • Open Access
9 Citations
4,404 Views
20 Pages

25 March 2023

The precision of numerical overland flow models is limited by their computational cost. A GPU-accelerated 2D shallow flow model is developed to overcome this challenge in this study. The model employs a Godunov-type finite volume method (FVM) to solv...

  • Article
  • Open Access
1 Citation
1,466 Views
21 Pages

7 January 2025

The traditional finite element program is executed on the CPU; however, it is challenging for the CPU to compute the ultra-large scale finite element model. In this paper, we present a set of efficient algorithms based on GPU acceleration technology...

  • Article
  • Open Access
1,609 Views
17 Pages

Extreme rainstorms are difficult to predict and often result in catchment-scale rainfall flooding, leading to substantial economic losses globally. Enhancing the numerical computational efficiency of flood models is essential for improving flood fore...

  • Article
  • Open Access
11 Citations
2,628 Views
9 Pages

Toward Real-Time Giga-Voxel Optoacoustic/Photoacoustic Microscopy: GPU-Accelerated Fourier Reconstruction with Quasi-3D Implementation

  • Pavel Subochev,
  • Florentin Spadin,
  • Valeriya Perekatova,
  • Aleksandr Khilov,
  • Andrey Kovalchuk,
  • Ksenia Pavlova,
  • Alexey Kurnikov,
  • Martin Frenz and
  • Michael Jaeger

29 December 2021

We propose a GPU-accelerated implementation of frequency-domain synthetic aperture focusing technique (SAFT) employing truncated regularized inverse k-space interpolation. Our implementation achieves sub-1s reconstruction time for data sizes of up to...

  • Article
  • Open Access
3 Citations
2,518 Views
24 Pages

Inverse synthetic aperture radar (ISAR) imaging techniques are frequently used in target classification and recognition applications, due to their capability to produce high-resolution images for moving targets. In order to meet the demand of ISAR imag...

  • Article
  • Open Access
1 Citation
680 Views
17 Pages

25 April 2025

Microstructure simulations of continuous casting billets are vital for understanding solidification mechanisms and optimizing process parameters. However, the commonly used CA (Cellular Automaton) model is limited by grid anisotropy, which affects th...

  • Article
  • Open Access
2 Citations
1,881 Views
19 Pages

8 November 2023

The study of ship waves is important for ship detection, coastal erosion and wave drag. This paper proposed a highly parallel numerical computation method for efficiently simulating three-dimensional nonlinear Kelvin waves. First, a numerical model...

  • Article
  • Open Access
4 Citations
3,150 Views
15 Pages

4 December 2021

In this study, a CUDA Fortran-based GPU-accelerated Laplace equation model was developed and applied to several cases. The Laplace equation is one of the equations that can physically analyze the groundwater flows, and is an equation that can provide...

  • Feature Paper
  • Article
  • Open Access
1,089 Views
15 Pages

11 October 2025

Pseudospectral methods are effective tools for solving optimal control problems, but they result in large-scale nonlinear programming (NLP) problems that are computationally demanding. A major bottleneck is the repeated evaluation of the objective fu...

  • Communication
  • Open Access
1 Citation
2,749 Views
17 Pages

2 July 2024

Modeling and simulating the underwater optical imaging process can assist in optimizing the configuration of underwater optical imaging technology. Based on the Monte Carlo (MC) method, we propose an optical imaging model which is tailored for deep-s...

  • Article
  • Open Access
3 Citations
2,427 Views
18 Pages

GPU Accelerated Processing Method for Feature Point Extraction and Matching in Satellite SAR Images

  • Lei Dong,
  • Niangang Jiao,
  • Tingtao Zhang,
  • Fangjian Liu and
  • Hongjian You

14 February 2024

This paper addresses the challenge of extracting feature points and image matching in Synthetic Aperture Radar (SAR) satellite images, particularly focusing on large-scale embedding. The widely used Scale-Invariant Feature Transform (SIFT) algorithm, success...

  • Article
  • Open Access
8 Citations
2,187 Views
11 Pages

16 June 2022

The application of the traditional planar acoustics method is limited due to the low accuracy when computing the echo characteristics of underwater targets. Based on the concept of the shooting and bouncing ray which considers multiple reflections on...

  • Article
  • Open Access
1 Citation
2,033 Views
12 Pages

Heterogeneous CPU-GPU Accelerated Subgridding in the FDTD Modelling of Microwave Breakdown

  • Jian Feng,
  • Kaihong Song,
  • Ming Fang,
  • Wei Chen,
  • Guoda Xie,
  • Zhixiang Huang and
  • Xianliang Wu

14 November 2022

Microwave breakdown is crucial to the transmission of high-power microwave (HPM) devices, where a growing number of studies have analyzed the complex interactions between electromagnetic waves and the evolving plasma from theoretical and analytical p...

  • Communication
  • Open Access
12 Citations
3,833 Views
15 Pages

GPU-Accelerated Monte Carlo Simulation for a Single-Photon Underwater Lidar

  • Yupeng Liao,
  • Mingjia Shangguan,
  • Zhifeng Yang,
  • Zaifa Lin,
  • Yuanlun Wang and
  • Sihui Li

5 November 2023

The Monte Carlo (MC) simulation, due to its ability to accurately simulate the backscattered signal of lidar, plays a crucial role in the design, optimization, and interpretation of the backscattered signal in lidar systems. Despite the development o...

  • Technical Note
  • Open Access
3 Citations
3,231 Views
15 Pages

GPU Acceleration for SAR Satellite Image Ortho-Rectification

  • Lei Dong,
  • Tingtao Zhang,
  • Fangjian Liu,
  • Rui Liu and
  • Hongjian You

7 April 2024

Synthetic Aperture Radar (SAR) satellite image ortho-rectification requires pixel-level calculations, which are time-consuming. Moreover, for SAR images with large overlapping areas, the processing time for ortho-rectification increases linearly, sig...

  • Article
  • Open Access
10 Citations
5,142 Views
20 Pages

This paper proposes an efficient approach for simulating volumetric deformable objects using the Position-Based Dynamics (PBD) method. Volumetric bodies generated by TetGen are used to represent three-dimensional objects, which accurately capture com...

  • Article
  • Open Access
3 Citations
1,714 Views
22 Pages

GPU Accelerating Algorithms for Three-Layered Heat Conduction Simulations

  • Nicolás Murúa,
  • Aníbal Coronel,
  • Alex Tello,
  • Stefan Berres and
  • Fernando Huancas

9 November 2024

In this paper, we consider the finite difference approximation for a one-dimensional mathematical model of heat conduction in a three-layered solid with interfacial conditions for temperature and heat flux between the layers. The finite difference sc...

  • Article
  • Open Access
11 Citations
2,737 Views
18 Pages

23 October 2023

Traditional simultaneous localization and mapping (SLAM) performs well in a static environment; however, with the abrupt increase of dynamic points in dynamic environments, the algorithm is influenced by a lot of meaningless information, leading to l...

  • Feature Paper
  • Article
  • Open Access
15 Citations
4,451 Views
25 Pages

A Study of Multi-Component Oscillating-Foil Hydrokinetic Turbines with a GPU-Accelerated Boundary Element Method

  • Panagiotis E. Koutsogiannakis,
  • Evangelos S. Filippas and
  • Kostas A. Belibassakis

21 November 2019

A biomimetic semi-activated oscillating-foil device with multiple foils in a parallel configuration is studied for the extraction of marine renewable energy. For the present investigation, an unsteady boundary element method (BEM) is used for the sim...

  • Article
  • Open Access
1,111 Views
23 Pages

27 March 2025

With the exponential growth of big data, efficient groupby aggregation (GA) has become critical for real-time analytics across industries. GA is a key method for extracting valuable information. Current CPU-based solutions (such as large-scale parall...

  • Article
  • Open Access
7 Citations
3,955 Views
30 Pages

GoRG: Towards a GPU-Accelerated Multiview Hyperspectral Depth Estimation Tool for Medical Applications

  • Jaime Sancho,
  • Pallab Sutradhar,
  • Gonzalo Rosa,
  • Miguel Chavarrías,
  • Angel Perez-Nuñez,
  • Rubén Salvador,
  • Alfonso Lagares,
  • Eduardo Juárez and
  • César Sanz

14 June 2021

HyperSpectral (HS) images have been successfully used for brain tumor boundary detection during resection operations. Nowadays, these classification maps coexist with other technologies such as MRI or IOUS that improve a neurosurgeon’s action, with t...

  • Article
  • Open Access
18 Citations
4,646 Views
13 Pages

12 July 2019

We developed a GPU-accelerated 2D physically based distributed rainfall runoff model for a PC environment. The governing equations were derived from the diffusive wave model for surface flow and the Horton infiltration model for rainfall loss. A nume...

  • Article
  • Open Access
2 Citations
1,583 Views
24 Pages

GPU-Accelerated Fock Matrix Computation with Efficient Reduction

  • Satoki Tsuji,
  • Yasuaki Ito,
  • Haruto Fujii,
  • Nobuya Yokogawa,
  • Kanta Suzuki,
  • Koji Nakano,
  • Victor Parque and
  • Akihiko Kasagi

25 April 2025

In quantum chemistry, constructing the Fock matrix is essential to compute Coulomb interactions among atoms and electrons and, thus, to determine electron orbitals and densities. In the fundamental framework of quantum chemistry such as the Hartree...

  • Article
  • Open Access
1,693 Views
19 Pages

11 September 2025

Using artificial intelligence tools to evaluate financial derivatives has become increasingly popular. PSO (particle swarm optimization) is one such tool. We present a comprehensive study of PSO for pricing American options on GPUs using OpenCL. PSO...

  • Article
  • Open Access
2,781 Views
15 Pages

Field Programmable Gate Arrays (FPGAs), renowned for their reconfigurable nature, offer unmatched flexibility and cost-effectiveness in engineering experimentation. They stand as the quintessential platform for hardware acceleration and prototype val...

  • Communication
  • Open Access
6 Citations
4,892 Views
15 Pages

GPU-Accelerated Signal Processing for Passive Bistatic Radar

  • Xinyu Zhao,
  • Peng Liu,
  • Bingnan Wang and
  • Yaqiu Jin

19 November 2023

Passive bistatic radar is a novel radar technology that passively detects targets without actively emitting signals. Since passive bistatic radar entails larger data volumes and computations compared to traditional active radiation radar, the develop...

  • Article
  • Open Access
1 Citation
995 Views
20 Pages

Low-Earth-Orbit (LEO) satellite networks offer a promising avenue for achieving global connectivity, despite certain technical and economic challenges such as high implementation costs and the complexity of network management. Nonetheless, real-time...

  • Article
  • Open Access
7 Citations
3,661 Views
14 Pages

9 October 2021

A prototype of a three-dimensional (3-D) radiation model is developed using the lattice Boltzmann method (LBM) and implemented on a graphical processing unit (GPU) to accelerate the model’s computational speed. This radiative transfer-lattice Boltzma...

  • Article
  • Open Access
7 Citations
3,541 Views
18 Pages

A GPU-Accelerated Modern Fortran Version of the ECHO Code for Relativistic Magnetohydrodynamics

  • Luca Del Zanna,
  • Simone Landi,
  • Lorenzo Serafini,
  • Matteo Bugli and
  • Emanuele Papini

6 January 2024

The numerical study of relativistic magnetohydrodynamics (MHD) plays a crucial role in high-energy astrophysics but unfortunately is computationally demanding, given the complex physics involved (high Lorentz factor flows, extreme magnetization, and...

  • Review
  • Open Access
29 Citations
10,287 Views
34 Pages

25 July 2024

Computer Vision (CV) has become increasingly important for Single-Board Computers (SBCs) due to their widespread deployment in addressing real-world problems. Specifically, in the context of smart cities, there is an emerging trend of developing end-...

  • Article
  • Open Access
2 Citations
2,457 Views
42 Pages

20 June 2024

Traditional population-based metaheuristic algorithms are effective in solving complex real-world problems but require careful strategy selection and parameter tuning. Metaphorless population-based optimization algorithms have gained importance due t...

  • Article
  • Open Access
21 Citations
22,628 Views
26 Pages

GPU Acceleration of CFD Simulations in OpenFOAM

  • Federico Piscaglia and
  • Federico Ghioldi

8 September 2023

We introduce algorithmic advancements designed to expedite simulations in OpenFOAM using GPUs. These developments include the following. (a) The amgx4Foam library, which connects the open-source AmgX library from NVIDIA to OpenFOAM. Matrix generation...

  • Article
  • Open Access
1,312 Views
14 Pages

Accelerating Batched Power Flow on Heterogeneous CPU-GPU Platform

  • Jiao Hao,
  • Zongbao Zhang,
  • Zonglin He,
  • Zhengyuan Liu,
  • Zhengdong Tan and
  • Yankan Song

12 December 2024

As the scale of China’s interconnected power grid continues to expand, traditional serial computing methods are no longer sufficient for the rapid analysis and computation of electrical networks with tens of thousands of nodes due to their smal...

  • Article
  • Open Access
1,093 Views
21 Pages

16 October 2025

With the increasing emphasis on energy-efficient computing, edge devices accelerated by graphics processing units (GPUs) are gaining attention for their potential in scientific workloads. These platforms support compute-intensive simulations under st...

  • Article
  • Open Access
7 Citations
4,886 Views
18 Pages

1 March 2022

There are numerous global navigation satellite system-denied regions in urban areas, where the localization of autonomous driving remains a challenge. To address this problem, a high-resolution light detection and ranging (LiDAR) sensor was recently...

  • Article
  • Open Access
2 Citations
3,549 Views
15 Pages

CUDA-Optimized GPU Acceleration of 3GPP 3D Channel Model Simulations for 5G Network Planning

  • Nasir Ali Shah,
  • Mihai T. Lazarescu,
  • Roberto Quasso and
  • Luciano Lavagno

The simulation of massive multiple-input multiple-output (MIMO) channel models is becoming increasingly important for testing and validation of fifth-generation new radio (5G NR) wireless networks and beyond. However, simulation performance tends to...

  • Article
  • Open Access
3 Citations
2,592 Views
24 Pages

28 August 2023

Slightly off-axis digital holography is proposed using transmission grating to obtain quantitative phase distribution. The experimental device is based on an improved 4f optical system in which a two-window input plane is used to form the object beam...

  • Article
  • Open Access
4 Citations
3,922 Views
9 Pages

GPU Accelerated PIC and SIC for OFDM-NOMA

  • Talgat Manglayev,
  • Refik Caglar Kizilirmak and
  • Nor Asilah Wati Abdul Hamid

Non-orthogonal multiple access (NOMA) is a candidate multiple access scheme for the fifth-generation (5G) cellular networks. In NOMA systems, all users operate at the same frequency and time, which poses a challenge in the decoding process at the rec...

  • Article
  • Open Access
2,188 Views
13 Pages

23 January 2025

Polygon reconstruction is widely used across various fields. Although the current polygon reconstruction algorithms have achieved near-linear time complexity, they still fail to meet the speed demands imposed by the exponential growth in polygon numb...

  • Article
  • Open Access
13 Citations
4,248 Views
24 Pages

14 January 2021

The heart consists of a complex network of billions of cells. Under physiological conditions, cardiac cells propagate electrical signals in space, generating the heartbeat in a synchronous and coordinated manner. When such a synchronization fails, li...

  • Article
  • Open Access
8 Citations
3,360 Views
23 Pages

20 November 2023

Patch-based methods improve the performance of infrared small target detection, transforming the detection problem into a Low-Rank Sparse Decomposition (LRSD) problem. However, two challenges hinder the success of these methods: (1) The interference...

  • Article
  • Open Access
6 Citations
3,104 Views
28 Pages

Real-Time Edge Computing vs. GPU-Accelerated Pipelines for Low-Cost Microscopy Applications

  • Gloria Bueno,
  • Lucia Sanchez-Vargas,
  • Alberto Diaz-Maroto,
  • Jesus Ruiz-Santaquiteria,
  • Maria Blanco,
  • Jesus Salido and
  • Gabriel Cristobal

26 February 2025

Environmental microscopy is crucial for analyzing microorganisms, but traditional optical microscopes are often expensive, bulky, and impractical for field use. AI-driven image recognition, powered by deep learning models like YOLO, enhances microsco...

  • Article
  • Open Access
38 Citations
9,373 Views
32 Pages

Validation of the GPU-Accelerated CFD Solver ELBE for Free Surface Flow Problems in Civil and Environmental Engineering

  • Christian F. Janßen,
  • Dennis Mierke,
  • Micha Überrück,
  • Silke Gralher and
  • Thomas Rung

This contribution is dedicated to demonstrating the high potential and manifold applications of state-of-the-art computational fluid dynamics (CFD) tools for free-surface flows in civil and environmental engineering. All simulations were performed wi...

  • Article
  • Open Access
3 Citations
2,853 Views
11 Pages

17 January 2024

Simulation of atomic force microscopy (AFM) computationally emulates experimental scanning of a biomolecular structure to produce topographic images that can be correlated with measured images. Its application to the enormous amount of available high...
