Next Article in Journal
Knowledge-Enhanced Zero-Shot Graph Learning-Based Mobile Application Identification
Previous Article in Journal
DnCNN-Based Denoising Model for Low-Dose Myocardial CT Perfusion Imaging
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

A High-Performance Branch Control Mechanism for GPGPU Based on RISC-V Architecture

1
School of Electronic Engineering, Beijing University of Posts and Telecommunications,Beijing 100876, China
2
School of Information and Communication Engineering, Hainan University, Haikou 570228, China
*
Author to whom correspondence should be addressed.
Electronics 2026, 15(1), 125; https://doi.org/10.3390/electronics15010125 (registering DOI)
Submission received: 19 November 2025 / Revised: 17 December 2025 / Accepted: 25 December 2025 / Published: 26 December 2025

Abstract

General-Purpose Graphics Processing Units (GPGPUs) rely on warp scheduling and control flow management to organize parallel thread execution, making efficient control flow mechanisms essential for modern GPGPU design. Currently, the mainstream RISC-V GPGPU Vortex adopts the Single Instruction Multiple Threads (SIMT) stack control mechanism. This approach introduces high complexity and performance overhead, becoming a major limitation for further improving control efficiency. To address this issue, this paper proposes a thread-mask-based branch control mechanism for the RISC-V architecture. The mechanism introduces explicit mask primitives at the Instruction Set Architecture (ISA) level and directly manages the active status of threads within a warp through logical operations, enabling branch execution without jumps and thus reducing the overhead of the original control flow mechanism. Unlike traditional thread mask mechanisms in GPUs, our design centers on RISC-V and realizes co-optimization at both the ISA and microarchitecture levels. The mechanism was modeled and validated on Vortex SimX. Experimental results show that, compared with the Vortex SIMT stack mechanism, the proposed approach maintains correct control semantics while reducing branch execution cycles by an average of 31% and up to 40%, providing a new approach for RISC-V GPGPU control flow optimization.
Keywords: RISC-V; GPGPU; ISA; thread mask; branch divergence RISC-V; GPGPU; ISA; thread mask; branch divergence

Share and Cite

MDPI and ACS Style

Cheng, Y.; Man, Y.; Zhou, X. A High-Performance Branch Control Mechanism for GPGPU Based on RISC-V Architecture. Electronics 2026, 15, 125. https://doi.org/10.3390/electronics15010125

AMA Style

Cheng Y, Man Y, Zhou X. A High-Performance Branch Control Mechanism for GPGPU Based on RISC-V Architecture. Electronics. 2026; 15(1):125. https://doi.org/10.3390/electronics15010125

Chicago/Turabian Style

Cheng, Yao, Yi Man, and Xinbing Zhou. 2026. "A High-Performance Branch Control Mechanism for GPGPU Based on RISC-V Architecture" Electronics 15, no. 1: 125. https://doi.org/10.3390/electronics15010125

APA Style

Cheng, Y., Man, Y., & Zhou, X. (2026). A High-Performance Branch Control Mechanism for GPGPU Based on RISC-V Architecture. Electronics, 15(1), 125. https://doi.org/10.3390/electronics15010125

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop