Next Article in Journal
A Hybrid Algorithm with a Data Augmentation Method to Enhance the Performance of the Zero-Inflated Bernoulli Model
Previous Article in Journal
A Cloud-Based Approach to Modeling ERP Information Flows Using a Bivariate Pólya–Aeppli Process
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach

by
Najla Sassi
* and
Wassim Jaziri
Department of Management Information Systems, School of Business, King Faisal University, Hofuf 31982, Saudi Arabia
*
Author to whom correspondence should be addressed.
Mathematics 2025, 13(11), 1700; https://doi.org/10.3390/math13111700
Submission received: 18 April 2025 / Revised: 20 May 2025 / Accepted: 21 May 2025 / Published: 22 May 2025
(This article belongs to the Section E1: Mathematics and Computer Science)

Abstract

As data-centric applications become increasingly complex, understanding effective query optimization in large-scale relational databases is crucial for managing this complexity. Yet, traditional cost-based and heuristic approaches simply do not scale, adapt, or remain accurate in highly dynamic multi-join queries. This research work proposes the reinforcement learning and graph-based hybrid query optimizer (GRQO), the first ever to apply reinforcement learning and graph theory for optimizing query execution plans, specifically in join order selection and cardinality estimation. By employing proximal policy optimization for adaptive policy learning and using graph-based schema representations for relational modeling, GRQO effectively traverses the combinatorial optimization space. Based on TPC-H (1 TB) and IMDB (500 GB) workloads, GRQO runs 25% faster in query execution time, scales 30% better, reduces CPU and memory use by 20–25%, and reduces the cardinality estimation error by 47% compared to traditional cost-based optimizers and machine learning-based optimizers. These findings highlight the ability of GRQO to optimize performance and resource efficiency in database management in cloud computing, data warehousing, and real-time analytics.
Keywords: query optimization; reinforcement learning; graph neural networks; join order selection; large-scale databases; resource efficiency; scalability query optimization; reinforcement learning; graph neural networks; join order selection; large-scale databases; resource efficiency; scalability

Share and Cite

MDPI and ACS Style

Sassi, N.; Jaziri, W. Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach. Mathematics 2025, 13, 1700. https://doi.org/10.3390/math13111700

AMA Style

Sassi N, Jaziri W. Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach. Mathematics. 2025; 13(11):1700. https://doi.org/10.3390/math13111700

Chicago/Turabian Style

Sassi, Najla, and Wassim Jaziri. 2025. "Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach" Mathematics 13, no. 11: 1700. https://doi.org/10.3390/math13111700

APA Style

Sassi, N., & Jaziri, W. (2025). Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach. Mathematics, 13(11), 1700. https://doi.org/10.3390/math13111700

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop