27 Results Found

  • Article
  • Open Access
11 Citations
5,119 Views
18 Pages

7 April 2020

It is challenging to build a real-time information retrieval system, especially for systems with high-dimensional big data. To structure big data, many hashing algorithms that map similar data items to the same bucket to advance the search have been...

  • Article
  • Open Access
1,734 Views
31 Pages

Hardware Design of DRAM Memory Prefetching Engine for General-Purpose GPUs

  • Freddy Gabbay,
  • Benjamin Salomon,
  • Idan Golan and
  • Dolev Shema

General-purpose graphics processing units (GPGPUs) face significant performance limitations due to memory access latencies, particularly when traditional memory hierarchies and thread-switching mechanisms prove insufficient for complex a...

  • Article
  • Open Access
16 Citations
4,493 Views
22 Pages

Three-Dimensional Combined Finite-Discrete Element Modeling of Shear Fracture Process in Direct Shearing of Rough Concrete–Rock Joints

  • Gyeongjo Min,
  • Daisuke Fukuda,
  • Sewook Oh,
  • Gyeonggyu Kim,
  • Younghun Ko,
  • Hongyuan Liu,
  • Moonkyung Chung and
  • Sangho Cho

12 November 2020

A three-dimensional combined finite-discrete element method (FDEM), parallelized by a general-purpose graphic-processing-unit (GPGPU), was applied to identify the fracture process of rough concrete–rock joints under direct shearing. The...

  • Article
  • Open Access
1 Citation
3,685 Views
24 Pages

Performing similarity analysis on trajectories consisting of massive numbers of tracking points is computationally challenging. We introduce a progressive minimum bounding rectangle (MBR) and minimum distance (MINDIST) approach to process the K Best...

  • Article
  • Open Access
5 Citations
3,074 Views
26 Pages

Parallel PSO for Efficient Neural Network Training Using GPGPU and Apache Spark in Edge Computing Sets

  • Manuel I. Capel,
  • Alberto Salguero-Hidalgo and
  • Juan A. Holgado-Terriza

26 August 2024

The training phase of a deep learning neural network (DLNN) is a computationally demanding process, particularly for models comprising multiple layers of intermediate neurons. This paper presents a novel approach to accelerating DLNN training using th...

  • Article
  • Open Access
695 Views
22 Pages

Lagrangian Relaxation (LR) is an effective method for solving spatial optimization problems in geospatial analysis and GIS. Among others, it has been used to solve the classic p-median problem that served as a unified local model in GIS since the 199...

  • Article
  • Open Access
1,256 Views
18 Pages

This paper proposes an optimized shared memory access technique to enhance parallel processing performance and reduce memory accesses for the ARIA block cipher in GPU environments. To overcome the limited size of GPU shared memory, we merged ARIA’...

  • Article
  • Open Access
14 Citations
5,623 Views
12 Pages

1 March 2020

Today, many big data applications require massively parallel tasks to compute complicated mathematical operations. To perform parallel tasks, platforms like CUDA (Compute Unified Device Architecture) and OpenCL (Open Computing Language) are widely us...

  • Article
  • Open Access
750 Views
26 Pages

2 November 2025

This paper proposes and implements a method to efficiently parallelize constraint solving in rigid body simulation using GPUs. Rigid body simulation is widely used in robot development, computer games, movies, and other fields, and there is a growing...

  • Article
  • Open Access
2,102 Views
20 Pages

11 February 2025

This paper presents Boundary-Aware Concurrent Queue (BACQ), a high-performance queue designed for modern GPUs, which focuses on high concurrency in massively parallel environments. BACQ operates at the warp level, leveraging intra-warp locality to im...

  • Article
  • Open Access
6 Citations
4,410 Views
24 Pages

Real-Time Parallel-Serial LiDAR-Based Localization Algorithm with Centimeter Accuracy for GPS-Denied Environments

  • Jakub Niedzwiedzki,
  • Adam Niewola,
  • Piotr Lipinski,
  • Piotr Swaczyna,
  • Aleksander Bobinski,
  • Pawel Poryzala and
  • Leszek Podsedkowski

11 December 2020

In this paper, we introduce a real-time parallel-serial algorithm for autonomous robot positioning for GPS-denied, dark environments, such as caves and mine galleries. To achieve a good complexity-accuracy trade-off, we fuse data from light detection...

  • Article
  • Open Access
2 Citations
3,694 Views
9 Pages

20 December 2020

A GPGPU (General-Purpose Graphics Processing Unit) consists of hardware resources that can execute tens of thousands of threads simultaneously. However, in reality, the parallelism is limited as resource allocation is performed by the base unit called...

  • Article
  • Open Access
15 Citations
6,119 Views
15 Pages

DiamondTorre Algorithm for High-Performance Wave Modeling

  • Vadim Levchenko,
  • Anastasia Perepelkina and
  • Andrey Zakirov

Effective algorithms for solving numerical modeling problems of physical media are discussed. The computation rate of such problems is limited by memory bandwidth if implemented with traditional algorithms. The numerical solution of the wave equation i...

  • Article
  • Open Access
261 Views
22 Pages

26 December 2025

General-Purpose Graphics Processing Units (GPGPUs) rely on warp scheduling and control flow management to organize parallel thread execution, making efficient control flow mechanisms essential for modern GPGPU design. Currently, the mainstream RISC-V...

  • Article
  • Open Access
5 Citations
8,290 Views
16 Pages

A General-Purpose Graphics Processing Unit (GPGPU)-Accelerated Robotic Controller Using a Low Power Mobile Platform

  • Syed Tahir Hussain Rizvi,
  • Gianpiero Cabodi,
  • Denis Patti and
  • Muhammad Majid Gulzar

Robotic controllers have to execute various complex independent tasks repeatedly. Massive processing power is required by the motion controllers to compute the solution of these computationally intensive algorithms. General-purpose graphics processin...

  • Article
  • Open Access
4 Citations
4,639 Views
29 Pages

The simulation of fire is a challenging task due to its occurrence on multiple space-time scales and the non-linear interaction of multiple physical processes. Current state-of-the-art software such as the Fire Dynamics Simulator (FDS) implements mos...

  • Article
  • Open Access
5 Citations
4,061 Views
17 Pages

Multi-Gbps LDPC Decoder on GPU Devices

  • Jingxin Dai,
  • Hang Yin,
  • Yansong Lv,
  • Weizhang Xu and
  • Zhanxin Yang

25 October 2022

To meet the high throughput requirement of communication systems, the design of high-throughput low-density parity-check (LDPC) decoders has attracted significant attention. This paper proposes a high-throughput GPU-based LDPC decoder, aiming at the...

  • Article
  • Open Access
1 Citation
2,782 Views
14 Pages

The growing number of space objects increases the potential risk of damage to satellites and of space debris generated by collisions. Conjunction assessment analysis is one of the keys to evaluating the collision risk of satellites and sa...

  • Article
  • Open Access
4 Citations
9,924 Views
20 Pages

25 February 2014

Segmentation in ultrasound (US) images is a challenge in computer vision, due to the high signal noise, artifacts that produce discontinuities in the boundaries and shadows that hide part of the received signal. In this paper, a solution based on ell...

  • Article
  • Open Access
1 Citation
1,412 Views
37 Pages

Parallel Implicit Solvers for 2D Numerical Models on Structured Meshes

  • Yaoxin Zhang,
  • Mohammad Z. Al-Hamdan and
  • Xiaobo Chao

12 July 2024

This paper presents the parallelization of two widely used implicit numerical solvers for the solution of partial differential equations on structured meshes, namely, the ADI (Alternating-Direction Implicit) solver for tridiagonal linear systems and...

  • Article
  • Open Access
12 Citations
4,808 Views
23 Pages

A Pragmatic Approach to the Design of Advanced Precision Terrain-Aided Navigation for UAVs and Its Verification

  • Jungshin Lee,
  • Chang-Ky Sung,
  • Juhyun Oh,
  • Kyungjun Han,
  • Sangwoo Lee and
  • Myeong-Jong Yu

28 April 2020

Autonomous unmanned aerial vehicles (UAVs) require highly reliable navigation information. Generally, navigation systems with the inertial navigation system (INS) and global navigation satellite system (GNSS) have been widely used. However, the GNSS...

  • Article
  • Open Access
4 Citations
4,364 Views
30 Pages

22 February 2021

Generation and propagation of waves in a numerical wave tank constructed using Weakly Compressible Smoothed Particle Hydrodynamics (WCSPH) are considered here. Numerical wave tank simulations have been carried out with implementations of different We...

  • Article
  • Open Access
4,159 Views
26 Pages

9 January 2021

Research on autonomous cars has become one of the main research paths in the automotive industry, with many critical issues that remain to be explored while considering the overall methodology and its practical applicability. In this paper, we presen...

  • Article
  • Open Access
15 Citations
5,489 Views
19 Pages

Evaluation of NVIDIA Xavier NX Platform for Real-Time Image Processing for Plasma Diagnostics

  • Bartłomiej Jabłoński,
  • Dariusz Makowski,
  • Piotr Perek,
  • Patryk Nowak vel Nowakowski,
  • Aleix Puig Sitjes,
  • Marcin Jakubowski,
  • Yu Gao,
  • Axel Winter and
  • The W7-X Team

12 March 2022

Machine protection is a core task of real-time image diagnostics aiming for steady-state operation in nuclear fusion devices. The paper evaluates the applicability of the newest low-power NVIDIA Jetson Xavier NX platform for image plasma diagnostics....

  • Article
  • Open Access
10 Citations
3,312 Views
20 Pages

17 March 2022

Recent years have seen an increase in demand for the demolition of obsolete and potentially hazardous structures, including reinforced concrete (RC) structures, using blasting techniques. However, because the risk of failure is significantly higher w...

  • Feature Paper
  • Article
  • Open Access
34 Citations
4,968 Views
32 Pages

Flapping foils located beneath or to the side of the hull of the ship can be used as unsteady thrusters, augmenting ship propulsion in waves. The basic setup is composed of a horizontal wing, which undergoes an induced vertical motion due to the ship...

  • Article
  • Open Access
6 Citations
3,321 Views
15 Pages

GPU@SAT DevKit: Empowering Edge Computing Development Onboard Satellites in the Space-IoT Era

  • Gionata Benelli,
  • Giovanni Todaro,
  • Matteo Monopoli,
  • Gianluca Giuffrida,
  • Massimiliano Donati and
  • Luca Fanucci

4 October 2024

Advancements in technology have driven the miniaturization of embedded systems, making them more cost-effective and energy-efficient for wireless applications. As a result, the number of connectable devices in Internet of Things (IoT) networks has in...