Leveraging Machine Learning (ML) to Enhance the Structural Properties of a Novel Alkali Activated Bio-Composite

Mahamat, Assia Aboubakar; Boukar, Moussa Mahamat; Obianyo, Ifeyinwa Ijeoma; Nshimiyimana, Philbert; Ngayakamo, Blasius; Leklou, Nordine; Bih, Numfor Linda

doi:10.3390/jcs9090464


by Assia Aboubakar Mahamat ¹,²,*, Moussa Mahamat Boukar ³,⁴, Ifeyinwa Ijeoma Obianyo ⁵, Philbert Nshimiyimana ⁶, Blasius Ngayakamo ⁷, Nordine Leklou ⁸ and Numfor Linda Bih

¹ Department of Civil Engineering, African University of Science and Technology, Federal Capital Territory, Abuja 900100, Nigeria

² Departement de Bâtiment, Ecole Nationale Supérieure des Travaux Publics (ENSTP), Yaoundé B.P 510, Cameroon

³ Department of Computer Science, Prime University, Federal Capital Territory, Abuja 900100, Nigeria

⁴ Department of Computer Science, Universitè Virtuelle du Tchad, N’Djamena 5711, Chad

⁵ Department of Civil Engineering, Nile University of Nigeria, Federal Capital Territory, Abuja 900100, Nigeria

⁶ Laboratoire Eco-Matériaux et Habitats Durables (LEMHaD), Institut International d’Ingénierie de l’Eau et de l’Environnement (Institut 2iE), Rue de la Science, Ouagadougou 01 BP 594, Burkina Faso

⁷ Department of Civil Engineering, Dar es Salaam Institute of Technology, Dar es Salaam P.O. Box 2958, Tanzania

⁸ Institut de Recherche EN Gènie Civil ET Mècanique, GeM, CNRS, UMR 6183, Nantes Université, F-44600 Saint-Nazaire, France

* Author to whom correspondence should be addressed.

J. Compos. Sci. 2025, 9(9), 464; https://doi.org/10.3390/jcs9090464

Submission received: 30 July 2025 / Revised: 26 August 2025 / Accepted: 28 August 2025 / Published: 1 September 2025

(This article belongs to the Special Issue Machine Learning Applications in the Design and Analysis of Composite Materials)


Abstract

This study explored the use of Borassus fruit fiber as reinforcement for earthen matrices, producing a Borassus fiber-reinforced composite (BFRC). The experimental results of the tests carried out on its structural properties were used to generate a primary dataset for training and testing machine learning (ML) models. Linear regression (LR), decision tree regression (DTR), and gradient boosting regression (GBR) were used to build an ensemble learning (EL) model for predicting the hygroscopic properties, Young’s modulus, and compressive strength of the BFRC. Fiber content, activator concentration, curing days, dry weight, saturated weight, mass, flexural vibration, longitudinal vibration, correction factor, maximum load, and cross-sectional area were the inputs considered in the structural property prediction. The performance of both the EL model and the single models (SMs) was appraised via three performance metrics, namely mean square error (MSE), root mean square error (RMSE), and the coefficient of determination (R²), to compare the models’ efficiency. Results showed that all models exhibited high accuracy in predicting Young’s modulus and compressive strength. Ensemble learning outperformed the single models in predicting these properties, with MSE, RMSE, and R² of 0.01 MPa², 0.1 MPa, and 99% for compressive strength and 3,923,262.5 Pa², 1980.7 Pa, and 99% for Young’s modulus. However, for hygroscopic behavior, linear regression (LR) demonstrated superior performance compared to the other models, with MSE = 0.13, RMSE = 0.36%, and R² = 99%.

Keywords:

gradient boosting; ensemble learning; Young’s modulus; compressive strength; hygroscopic properties; earthen matrices

1. Introduction

In response to escalating environmental issues linked to the widespread production and utilization of cement and its byproducts, earthen materials, one of the most prevalent traditional building materials, have attracted significant interest in the construction industry in recent decades.

These concerns are mirrored in climate change and the depletion of raw material resources [1], ecological system instability, etc. Consequently, research has focused on identifying and generating “green” alternatives for the construction sector. Earthen materials constitute the best alternative because of their beneficial properties. They have a low thermal conductivity (0.5–1 W/mK) [2], robustness (defined from key factors such as strength and volumetric stability) [3], and low fabrication energy consumption (2856–17,136 GJ/tons) [4]. They also release no greenhouse gases during manufacturing or implementation [5], and they are readily available. These materials are attractive since they tackle the sustainability challenges posed by the extensive utilization of Portland cement, the production of which releases CO₂ and disrupts the environment [6]. Per contra, earthen materials have some drawbacks that hinder their widespread application [7]. Among these drawbacks, their properties are often lower than required for structural application [8]. Therefore, they are mixed with admixtures [9], additives [10], and fiber reinforcement [11], and treated by mechanical [12] or chemical activation [13], during their production. Among many other techniques for earthen matrix production, natural fiber reinforcement is the preferred strengthening technology. The most-utilized fibers are sisal [14], jute [15], and coconut fiber or coir [16].

The Borassus tree belongs to the same plant family as the coconut tree [17]. The fruit’s primary constituents are fiber and a jelly-like substance [18]. It is readily available in Asia and sub-Saharan Africa, where it remains underutilized and is therefore regarded as agro-waste. The use of agro-waste has garnered significant interest from the construction sector due to its renewability, reduced dependence on less sustainable traditional materials, cost-effectiveness, and excellent performance. The palm oil industry is one of the major producers of such waste: of all the fresh fruit processed, 20% ends up as nutshell waste and 30% as fibers and empty bunches [19]. Coconut fibers, or coir, are already utilized in the building and automotive industries [20]. Given their similarity to coir, Borassus fruit fibers are an attractive option for fiber reinforcement in earthen composites for construction applications. In this work, the fiber from the Borassus fruit is used as reinforcement to improve the mechanical properties of the earthen matrix. Adding fibers to mechanically brittle matrices is a strategy for increasing the material’s toughness, ductility, and resistance to deformation: the fibers regulate crack growth throughout the matrix under various loading scenarios and during shrinkage.

Artificial intelligence (AI) is an area of computer science that aims to emulate human intelligence by training computers to perceive and learn from inputs for perception, knowledge representation, reasoning, problem-solving, and planning. There are many different types of innovative AI technologies, all designed to replicate human cognitive abilities. Because of this, these systems can purposefully, intelligently, and adaptably deal with situations that are becoming more complicated and ambiguous [21]. AI is defined by Russell as “the study of how to make machines do things, which at the moment, people do better” [22]; it is a discipline that envelops everything that makes a machine intelligent [23]. Artificial intelligence is usually conceived of as combining data analytics and machine learning (ML). ML, a subfield of AI, enables computers to learn and improve through data analysis without explicit programming. The primary goal of ML algorithms, which are used to create predictive models, is to identify and analyze patterns in datasets [24]. The ML techniques include the following: (i) Supervised ML: computers learn what to do from pairs of inputs and intended outputs in labeled datasets. It is divided into classification and regression. (ii) Unsupervised ML: machines learn the fundamental structures found in unlabeled datasets. It is categorized into clustering and dimension reduction techniques. (iii) Reinforcement learning: a computational approach that learns from the outcome of interactions with the environment. It can be defined as discovering an approach to maximize a scalar reward or reinforcement signal by mapping situations to actions. (iv) Deep learning: this processes huge amounts of data and performs operations such as speech translation and image recognition using intricate algorithms that mimic the structure of the human brain [25].

Supervised ML techniques have gained attention in civil engineering applications, mainly for predicting the mechanical properties of concrete and other composites. Supervised ML trains on data that have already been labeled with the desired output [26]; the resulting model is then used to predict the output for new, unseen data. Many studies have used ML approaches to forecast the strength properties of concrete and of its structural elements [27]. Nguyen et al. [28] employed a machine learning approach to assess the compressive characteristics of geopolymer concrete. In another investigation, the thermo-mechanical properties of a waste-based composite were predicted through ML approaches; the authors found that linear regression (LR) displayed the highest performance, followed by random forest (RF) and gradient boosting (GB) [29].

Ensemble learning (EL) is an ML technique that combines the predictions of several models into a single forecast that is typically more accurate than that of any individual model. EL is divided into three primary categories: bagging, boosting, and stacking. The bagging approach draws random subsets of the provided dataset with replacement and trains several models on these subsets. Boosting trains the models sequentially, with each model trying to correct the errors of the previous one. Random forest is a variant of bagging [30], and adaptive boosting (AdaBoost) is one of the most popular boosting algorithms [31]. The stacking method trains a separate model (called a meta-learner) to combine the predictions from the other models [32]. Shatnawi et al. [33] used an EL model to predict the shear capacity of slender steel fiber-reinforced concrete (SFRC) beams with high accuracy. The model displayed R² values of 0.963 and 0.972 for the testing and training sets, respectively. In addition, both the training and testing sets of the gradient boosting regression tree (GBRT) model had low RMSE and MAE values, indicating that the predictions of the EL model can be trusted with high confidence. Munir et al. [34] utilized gradient boosting (GBoost) to forecast the compressive strength of concrete with recycled and natural aggregate; GBoost outperformed the other models they employed. Random forest, another EL model, outperformed artificial neural network (ANN) models in prediction accuracy when forecasting the strength of geopolymer concrete [35]. In the present study, an EL model was developed from experimental data to forecast the structural characteristics of earthen composites.

The EL developed in this study is derived from gradient boosting regression (GBR), decision tree regression (DTR), and linear regression (LR). An ML model called linear regression (LR) fits a linear equation to the observed data in order to model the connection between a dependent variable and one or more independent variables [36]. The objective is to develop a mathematical formula that, given the values of the independent variables, reliably forecasts the value of the dependent variable. Linear regression is particularly effective when the relationship between variables is linear and can be easily interpreted [37]. Numerous disciplines, notably civil engineering, make extensive use of it. It is important to remember that linear regression assumes that there is a linear relationship, which may restrict its use when the data shows non-linear patterns. In contrast to decision trees, linear regression frequently necessitates meticulous data preprocessing, including managing outliers and missing values, to ensure accurate outcomes [29]. Additionally, while decision trees can handle both categorical and numerical data without extensive preprocessing, linear regression typically requires converting categorical data into numerical representations.

The decision tree regression (DTR) algorithm builds a model in the form of a tree using training data, with each internal node representing a test, the branches representing the test’s outcomes, and the leaves representing the decisions [38]. Tree creation and tree trimming are the two procedures in this form of modeling. The training dataset zone is separated into precisely defined sections as part of the first stage, also known as the tree-building phase. A tree with many branches could be the outcome of this stage. In order to reduce the size of the non-essential or unnecessary decision tree components, the second stage, known as tree pruning, is deciding which branches of the built tree to remove. The decision tree model is a graphical method that directly applies probability analysis; it depicts a mapping relationship between object characteristics and object outcomes [39]. Decision tree models were trained to categorize high-strength concrete mix design techniques based on concrete mix proportions with great accuracy. It was demonstrated that the model could correctly determine the mixing method by which the high-strength concrete mix was designed simply by providing the fundamental proportions of the basic elements [40]. A DTR with ensemble algorithms such as bagging was developed to predict the compressive strength of concrete with waste material. The results demonstrated that the DTR with bagging gives more precise performance than an individual one because DTR with bagging enhances the model accuracy by giving fewer errors [41]. The algorithm for DTR is comparatively straightforward and simple to comprehend. Additionally, it does not require any additional preprocessing to handle continuous and categorical information.

On the other hand, gradient boosting (GBR) is an efficient supervised machine learning technique that constructs an ensemble of weak learners (usually decision trees) one after the other until it produces a single, stronger prediction model. Every weak learner in the series concentrates on fixing the mistakes committed by the ones before it, creating a final model with higher precision [42]. Gradient boosting can provide state-of-the-art performance on a variety of tasks, such as regression, classification, and ranking, by iteratively improving predictions. It may operate with a variety of data formats, including mixed, continuous, and categorical features. It can be easier to comprehend how the model makes predictions by looking at the decision tree structure of weak learners, which can offer some insights into the model’s reasoning [33].

This study intends to develop an ensemble learning (EL) model from linear regression (LR), decision tree regression (DTR), and gradient boosting regressor (GBR) models to compare with these individual models during the prediction of the hygroscopic and mechanical behavior of the Borassus fiber-reinforced composite (BFRC). The prediction of the hygroscopic and mechanical behavior is carried out to assess the structural property of the novel composite. The aim of building the EL model is to improve the prediction’s performance of the single model (SM).

Developed primary datasets were used for EL and SM data training, testing, and validation. The experimental results from the analyses of water absorption, Young’s modulus, and compressive strength were used to create our primary datasets. The main dataset was created in order to build models that can effectively handle modest amounts of data. The importance and originality of this study are as follows: (i) Conducting experimental work for Young’s modulus, compressive strength, and water absorption for BFRC to generate a primary dataset from the experimental results. (ii) Developing EL models and SMs that will efficiently perform on a small-sized dataset. (iii) Pioneering the comparative evaluation of EL vs. SM during the prediction of the structural properties of BFRC using primary dataset. This study develops and evaluates ensemble learning models specifically designed to perform effectively with limited primary data by providing a rigorous comparison to determine the effectiveness of the ensemble learning approach in predicting structural properties of BFRC. The importance of this research is to understand the role of input parameters such as fiber content, activator concentration, curing days, maximum loads on compression, torsion, and flexion on the accuracy of the output from the EL model and SM. Then a comparison of the models’ efficiency (EL and SM) in the case of this primary dataset is performed. Each model’s performance was appraised using the evaluation metrics. The outcome will enable supporting local economies through the valorisation of the Borassus fruit fiber into a sustainable reinforcement for construction applications.

2. Materials and Methods

2.1. Materials

2.1.1. Soil Excavation and Processing

The vegetal soil was excavated from a construction site. The granulometry of the soil was obtained using sieve analysis. The coarse aggregates were crushed; removal of unwanted elements, mechanical grinding, and dry sieving were then performed according to the British standard BS 1377:2 [43]. Moisture content analysis was also carried out on the soil in accordance with BS 1377:2 to evaluate the optimum moisture content, which guides the amount of water needed during sample production. Other soil characteristics, such as specific gravity, dry density, and Atterberg limits, were determined in accordance with ASTM D854-14 [44], D7263-21 [45,46], and D4318-17e1 [46], respectively.

2.1.2. Natural Fiber Extraction

The fiber used as reinforcement to the earthen matrix in this investigation was obtained naturally from Borassus fruit. The extraction was achieved manually and chemical-free on fully ripe Borassus fruit. The fruit was segmented vertically into smaller pieces to separate the mesocarp from the seed. Then it was washed under running tap water for 30 min [18]. The extracted fibers were oven-dried, and two types of fibers were extracted: coarse and fine, both used during this investigation. The natural fiber was used during this investigation without undergoing any chemical treatment, with a uniform length of 3 cm. The details of the Borassus tree, fruit, and extracted fibers are shown in Figure 1.

2.1.3. Composite Production

The processed earthen matrix was mixed with 0.5 wt% of natural Borassus fruit fiber (BF). The alkaline activator (KCO₃) was added to the dry mixture at a concentration of 0.3 wt%, and the mixture was blended in a laboratory mixer for 5 min. Distilled water at room temperature (27 °C) was then added gradually until the required amount was reached. Because the reaction is exothermic, the paste was allowed to cool for a few minutes before being placed into metallic moulds of 10 mm × 10 mm × 10 mm [47] and 40 mm × 40 mm × 160 mm [18] for compression and Young’s modulus testing, respectively. The samples underwent mechanical compression to attain denser and stronger bricks before being demoulded; they were then oven dried for 24 h at 60 °C and left in an oven to cure for the curing periods of 14 and 90 days [18]. Figure 2 summarizes the steps undertaken during this investigation.

2.2. Experimental Program

2.2.1. Mechanical Behavior Experiment

Compressive strength characterizes a material’s ability to withstand compressive forces before failure. It governs the structural integrity of building materials and their suitability for load-bearing or non-load-bearing purposes, making it a significant property. The test was carried out by loading the BFRC specimens monotonically at a rate of 1.2 kN/s in an electromechanical testing machine, UTM7001 Model 4002, in accordance with ASTM C109/C109M-20 [48].

The elastic modulus, or Young’s modulus, is an essential property of construction materials. For the BFRC, the Young’s modulus testing was carried out via the impulse excitation technique (IET) using GrindoSonic equipment [49]. The method was selected because it was more cost-effective than a uniaxial compression test, safe, non-destructive, and quick to perform. While both methods yield precise measurements, destructive testing offers the most reliable results. According to ASTM C 1548–02 [50], its basic method is to measure the resonance frequencies by impulse in three separate vibrational modes: flexural, torsional, and longitudinal, as seen in Figure 3. It was determined according to the following equation [49]:

E = 0.9465 \left( \frac{m f_{f}^{2}}{b} \right) \left( \frac{L^{3}}{t^{3}} \right) T_{1}

(1)

where E is the Young’s modulus (Pa), m is the mass of the specimen (g), f_f is the fundamental resonant frequency of the specimen during flexural vibration (Hz), b is the width of the specimen (mm), L is the length of the specimen (mm), t is the thickness of the specimen (mm), and T₁ is the correction factor that accounts for the finite thickness of the specimen, Poisson’s ratio, and so forth [49].
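As a worked illustration of Equation (1), the short Python sketch below evaluates the flexural Young’s modulus from the measured quantities. The function name and the specimen values are hypothetical and chosen only for illustration; the correction factor T₁ is treated as a given input rather than computed, and the units follow the definitions above (g, Hz, mm, giving Pa).

```python
def youngs_modulus_flexural(mass_g, freq_hz, width_mm, length_mm,
                            thickness_mm, correction=1.0):
    """Evaluate Equation (1): E = 0.9465 (m f_f^2 / b) (L^3 / t^3) T_1, in Pa."""
    inputs = (mass_g, freq_hz, width_mm, length_mm, thickness_mm, correction)
    if min(inputs) <= 0:
        raise ValueError("all inputs must be positive")
    return (0.9465
            * (mass_g * freq_hz ** 2 / width_mm)
            * (length_mm ** 3 / thickness_mm ** 3)
            * correction)


# Hypothetical 40 mm x 40 mm x 160 mm prism (values are NOT from the study):
# mass 400 g, flexural resonance 1000 Hz, correction factor T_1 = 1.0.
E = youngs_modulus_flexural(400.0, 1000.0, 40.0, 160.0, 40.0, 1.0)
print(f"E = {E:.4e} Pa")
```

Because the frequency enters squared and the length-to-thickness ratio cubed, small measurement errors in f_f, L, or t have a much larger effect on E than errors in mass or width.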

2.2.2. Hygroscopic Analysis

The hygroscopic behavior was assessed through a water absorption experiment. The samples were immersed into distilled water at room temperature (27 °C) for 24 h. This was performed to assess the dimensional integrity of the samples when exposed to moisture resulting in their swelling. The testing procedure was replicated from the author’s previous work [51] using the following equation:

W_{a} = \frac{m_{wet} - m_{dry}}{m_{dry}} \times 100

(2)

where W_a is the water absorption (%), m_wet is the specimen’s wet mass (g), and m_dry is the specimen’s dry mass (g).
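Equation (2) can be checked with a one-line calculation. The sketch below is illustrative only; the function name and the masses are hypothetical.

```python
def water_absorption(m_wet_g, m_dry_g):
    """Evaluate Equation (2): percentage mass gain after immersion."""
    if m_dry_g <= 0:
        raise ValueError("dry mass must be positive")
    return (m_wet_g - m_dry_g) / m_dry_g * 100.0


# Hypothetical specimen: 100 g dry, 110 g after 24 h immersion.
wa = water_absorption(110.0, 100.0)
print(f"W_a = {wa:.1f} %")
```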

2.2.3. Scanning Electron Microscopy (SEM)/Energy-Dispersive X-Ray (EDX)

After mechanical failure of the specimens, microstructural analysis was performed. Scanning electron microscopy (SEM) was carried out to determine the morphology of the Borassus fruit-reinforced composite (BFRC). Meanwhile, energy-dispersive X-ray spectroscopy (EDX) was used to obtain semi-quantitative estimates of the chemical composition. This was done using a Carl Zeiss scanning electron microscope that was instrumented with a Model EVO LS10 EDX system.

2.2.4. Machine Learning Models

EL is an ML technique in which several weak learners are combined to produce a single, stronger learner. In this investigation, LR, GBR, and DTR were used to build the EL model, as described in Figure 4. The mathematical formulations of the models used are detailed in Table 1.

The results obtained from the water absorption experiment were used to build a primary dataset composed of 72 observations with 5 input variables (fiber content, activator concentration, curing days, dry weight, and saturated weight) and the water absorption as the output variable, as seen in Table 2. Meanwhile, for the mechanical properties’ evaluation, an experimental test was performed for the compressive strength and Young’s modulus of the BFRC. The experimental results from the mechanical testing were used to generate a primary dataset on the compressive strength and Young’s modulus containing 72 observations for each property. The input variables for the compressive strength and Young’s modulus were 5 (fiber content, activator concentration, curing days, maximum load, and cross-sectional area) and 7 (fiber content, activator concentration, curing days, mass, flexural vibration, longitudinal vibration, and correction factor), as shown in Table 3 and Table 4, respectively. To identify the most effective model between EL and SM for this composite, the prediction outcomes of the models were assessed comparatively based on their performance metrics.

Feature preprocessing: All numerical features were normalized using min–max scaling to ensure comparability and to improve convergence during training. No categorical variables were involved.
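Min–max scaling maps each feature to the range [0, 1] through x' = (x − x_min)/(x_max − x_min). The sketch below is a minimal plain-Python illustration, not the study’s code; the function name is hypothetical, and the only values taken from the text are the two curing periods (14 and 90 days).

```python
def min_max_scale(column):
    """Rescale a list of numbers to [0, 1]; a constant column maps to zeros."""
    lo, hi = min(column), max(column)
    if hi == lo:
        return [0.0 for _ in column]
    return [(value - lo) / (hi - lo) for value in column]


# Curing periods used in the study: 14 and 90 days.
curing_days = [14, 90, 14, 90]
scaled = min_max_scale(curing_days)
print(scaled)
```

A common precaution, assumed here rather than stated in the text, is to compute x_min and x_max on the training split only and reuse them for the test split, so that no information from the test data leaks into training.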

Feature selection: Correlation analysis was performed to identify and eliminate redundant features. Only variables with significant contributions to prediction accuracy were retained.

Performance metrics

A model’s efficiency is best appraised with a combination of performance metrics because each metric estimates the error differently. The training and testing metrics used during this investigation are mean square error (MSE), root mean square error (RMSE), and the coefficient of determination (R²). MSE measures the average of the squared errors; it is a key metric because squaring each error emphasizes larger errors more than smaller ones [52]. RMSE is the square root of MSE; it expresses the prediction error in the same units as the output. R² measures how well the predicted output fits the experimental data [24]. MSE, R², and RMSE are given in Equations (3), (4), and (5), respectively, where y_{exp,i} is the i-th experimental value, y_{pred,i} is the corresponding predicted value, \bar{y}_{exp} is the mean of the experimental values, and n is the total number of data values.

MSE = \frac{1}{n} \sum_{i = 1}^{n} (y_{exp,i} - y_{pred,i})^{2}

(3)

R^{2} = 1 - \frac{\sum_{i = 1}^{n} (y_{exp,i} - y_{pred,i})^{2}}{\sum_{i = 1}^{n} (y_{exp,i} - \bar{y}_{exp})^{2}}, \quad \bar{y}_{exp} = \frac{1}{n} \sum_{i = 1}^{n} y_{exp,i}

(4)

RMSE = \sqrt{MSE}

(5)
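The three metrics can be computed directly from paired experimental and predicted values. The sketch below is a plain-Python illustration with made-up numbers; R² is computed with the usual total sum of squares about the mean of the experimental values.

```python
import math


def mse(y_exp, y_pred):
    """Mean square error: average squared difference."""
    return sum((e - p) ** 2 for e, p in zip(y_exp, y_pred)) / len(y_exp)


def rmse(y_exp, y_pred):
    """Root mean square error: same units as the target."""
    return math.sqrt(mse(y_exp, y_pred))


def r2(y_exp, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_exp = sum(y_exp) / len(y_exp)
    ss_res = sum((e - p) ** 2 for e, p in zip(y_exp, y_pred))
    ss_tot = sum((e - mean_exp) ** 2 for e in y_exp)
    return 1.0 - ss_res / ss_tot


# Hypothetical values, not taken from the study's datasets.
y_exp = [2.0, 4.0, 6.0, 8.0]
y_pred = [2.5, 3.5, 6.5, 7.5]
print(mse(y_exp, y_pred), rmse(y_exp, y_pred), r2(y_exp, y_pred))
```

Note that MSE carries the squared units of the target (for example MPa² for compressive strength), whereas RMSE carries the target’s own units, which is why the two should not be reported with the same unit.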

Framework to establish the machine learning models

Python 2.7.12 was utilized to implement the tests. Figure 5 depicts the various steps from the composite manufacturing to the comparative analysis of the EL and SM models’ performance. During the training and validation of DTR, the optimal values determined through grid search were max_depth = 5, min_samples_split = 2, and min_samples_leaf = 1. The following features of the experimental protocol are included to enhance its reproducibility [53]:

Data: X (matrix), y (vector)

Split: X_train, y_train, X_test, y_test

Tree: T = DecisionTree()

T.build(X_train, y_train)
○ Recursively split nodes:
  ■ Find the best feature f using a splitting criterion (for regression, e.g., reduction in squared error)
  ■ Split the node into child nodes based on f
○ Stop splitting when the stopping criteria are met (e.g., max_depth, min_samples_split, or min_samples_leaf)

Prediction: ŷ = T.predict(x)

Evaluation: Metrics on X_test
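The tree-building steps above can be sketched in plain Python as follows. This is a hand-rolled illustration under stated assumptions, not the study’s implementation: the helper names (`sse`, `best_split`, `build`, `predict`) are hypothetical, the split criterion is assumed to be the reduction in the sum of squared errors, the default stopping parameters mirror the reported grid-search values, and the four data rows are invented.

```python
def sse(values):
    """Sum of squared deviations from the mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def best_split(X, y, min_samples_leaf):
    """Return (feature, threshold) with the lowest child SSE, or None."""
    best, best_cost = None, sse(y)
    for f in range(len(X[0])):
        levels = sorted(set(row[f] for row in X))
        for lo, hi in zip(levels, levels[1:]):
            thr = (lo + hi) / 2.0
            left = [t for row, t in zip(X, y) if row[f] <= thr]
            right = [t for row, t in zip(X, y) if row[f] > thr]
            if min(len(left), len(right)) < min_samples_leaf:
                continue
            cost = sse(left) + sse(right)
            if cost < best_cost:  # keep only splits that reduce the error
                best, best_cost = (f, thr), cost
    return best


def build(X, y, depth=0, max_depth=5, min_samples_split=2, min_samples_leaf=1):
    """Recursively grow a tree; a leaf is the mean target of its samples."""
    split = None
    if depth < max_depth and len(y) >= min_samples_split:
        split = best_split(X, y, min_samples_leaf)
    if split is None:
        return sum(y) / len(y)
    f, thr = split
    left = [(r, t) for r, t in zip(X, y) if r[f] <= thr]
    right = [(r, t) for r, t in zip(X, y) if r[f] > thr]
    grow = lambda part: build([r for r, _ in part], [t for _, t in part],
                              depth + 1, max_depth,
                              min_samples_split, min_samples_leaf)
    return (f, thr, grow(left), grow(right))


def predict(tree, x):
    """Walk from the root to a leaf and return the leaf value."""
    while isinstance(tree, tuple):
        f, thr, left, right = tree
        tree = left if x[f] <= thr else right
    return tree


# Invented rows: [fiber content (wt%), curing days] -> target value.
X_train = [[0.0, 14.0], [0.0, 90.0], [0.5, 14.0], [0.5, 90.0]]
y_train = [1.0, 2.0, 3.0, 4.0]
tree = build(X_train, y_train)
print(predict(tree, [0.5, 90.0]))
```

With so few rows the tree simply memorizes the training data, which is why evaluation on a held-out X_test, as in the protocol, matters.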

For GBR, the best parameters obtained via grid search were n_estimators = 300, max_depth = 3, and learning_rate = 0.1. Meanwhile, its experimental protocol is outlined below [53]:

Initialization:

ŷ₀ = initial model prediction (e.g., average target value)
M = number of weak learners in the ensemble
α = learning rate

Boosting iterations:

For m = 1 to M:
○ Calculate residuals: r_i = y_i − ŷₘ₋₁(x_i)
○ Train weak learner hₘ(x) on (X, r)
○ Update ensemble: ŷₘ(x_i) = ŷₘ₋₁(x_i) + α ∗ hₘ(x_i)

Prediction:

ŷ(x) = ŷ₀ + Σₘ [α ∗ hₘ(x)]

Evaluation:

Metrics on X_test
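The boosting loop above can be illustrated with a small plain-Python sketch. It is not the study’s implementation: the weak learner is assumed to be a one-split regression stump on a single feature (the reported model used trees of depth 3), the helper names are hypothetical, and the data are invented. The defaults n_estimators = 300 and learning_rate = 0.1 mirror the reported grid-search values.

```python
def fit_stump(x, residuals):
    """Fit a one-split regression stump to the residuals of a single feature."""
    best = None
    levels = sorted(set(x))
    for lo, hi in zip(levels, levels[1:]):
        thr = (lo + hi) / 2.0
        left = [r for xi, r in zip(x, residuals) if xi <= thr]
        right = [r for xi, r in zip(x, residuals) if xi > thr]
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        cost = (sum((r - lm) ** 2 for r in left)
                + sum((r - rm) ** 2 for r in right))
        if best is None or cost < best[0]:
            best = (cost, thr, lm, rm)
    if best is None:  # only one distinct x value: predict the mean residual
        mean = sum(residuals) / len(residuals)
        return lambda xi: mean
    _, thr, lm, rm = best
    return lambda xi: lm if xi <= thr else rm


def gradient_boost(x, y, n_estimators=300, learning_rate=0.1):
    """Squared-error gradient boosting: each stump fits the current residuals."""
    y0 = sum(y) / len(y)          # initial prediction: mean target
    pred = [y0] * len(y)
    learners = []
    for _ in range(n_estimators):
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        h = fit_stump(x, residuals)
        learners.append(h)
        pred = [pi + learning_rate * h(xi) for xi, pi in zip(x, pred)]
    return lambda xi: y0 + learning_rate * sum(h(xi) for h in learners)


# Invented one-feature data, for illustration only.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [1.2, 1.9, 3.2, 3.8, 5.1, 6.0]
model = gradient_boost(x, y)

mean_y = sum(y) / len(y)
baseline_mse = sum((yi - mean_y) ** 2 for yi in y) / len(y)
train_mse = sum((yi - model(xi)) ** 2 for xi, yi in zip(x, y)) / len(y)
print(baseline_mse, train_mse)
```

The printed values are in-sample errors; they show that the sequence of residual fits lowers the training error relative to the mean-only starting point, not how the model generalizes.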

For LR, default settings were used with no hyperparameters required. The following experimental procedure was used during the prediction of the LR model:

Initialization:

β = model parameters, initialized to 0
ŷ₀ = initial model prediction (0 when β = 0)
α = learning rate

Training:

For each training pair (x_i, y_i):
○ Calculate the prediction: ŷ_i = β ∗ x_i
○ Calculate the residual: r_i = y_i − ŷ_i
○ Update the model parameters: β = β + α ∗ x_i ∗ r_i

Prediction:

For a new input x:
○ Calculate the predicted value: ŷ = β ∗ x

Evaluation:

Calculate metrics on X_test
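The update rule β = β + α ∗ x_i ∗ r_i listed above can be run as a small plain-Python loop. This is an illustrative sketch, not the study’s code: it assumes a single feature with no intercept term (as in the listed steps), inputs already scaled to [0, 1], and a hypothetical learning rate and epoch count; the function name and data are invented.

```python
def fit_linear_lms(xs, ys, alpha=0.1, epochs=200):
    """Fit y ~ beta * x with per-sample residual updates (no intercept)."""
    beta = 0.0
    for _ in range(epochs):
        for xi, yi in zip(xs, ys):
            residual = yi - beta * xi      # r_i = y_i - y_hat_i
            beta += alpha * xi * residual  # beta = beta + alpha * x_i * r_i
    return beta


# Invented, noise-free data generated from y = 2x on scaled inputs.
xs = [0.0, 0.25, 0.5, 0.75, 1.0]
ys = [0.0, 0.5, 1.0, 1.5, 2.0]
beta = fit_linear_lms(xs, ys)
print(f"beta = {beta:.6f}")
```

Library implementations of ordinary least squares usually solve for β in closed form and include an intercept; the iterative form is shown here only because it matches the steps listed in the text.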

For the EL strategy, a weighted averaging method was employed, where weights were assigned to each model based on its validation performance. This allowed the EL model to leverage the strengths of individual algorithms while minimizing weaknesses. These additions clarify the methodological rigor adopted in the study and ensure reproducibility of the proposed framework.
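A weighted-average ensemble of this kind can be sketched as follows. The text does not state the exact weighting formula, so the sketch assumes weights proportional to the inverse of each model’s validation MSE, normalized to sum to one; this scheme, the function names, and all numbers are hypothetical.

```python
def inverse_mse_weights(val_mse):
    """Assumed scheme: weight each model by 1 / validation MSE, normalized."""
    if any(m <= 0 for m in val_mse.values()):
        raise ValueError("validation MSE values must be positive")
    inverse = {name: 1.0 / m for name, m in val_mse.items()}
    total = sum(inverse.values())
    return {name: value / total for name, value in inverse.items()}


def ensemble_predict(predictions, weights):
    """Combine per-model prediction lists into one weighted average per sample."""
    n_samples = len(next(iter(predictions.values())))
    return [sum(weights[name] * predictions[name][i] for name in predictions)
            for i in range(n_samples)]


# Invented validation errors and test predictions for the three single models.
val_mse = {"LR": 0.5, "DTR": 1.0, "GBR": 1.0}
weights = inverse_mse_weights(val_mse)
predictions = {
    "LR": [10.0, 20.0],
    "DTR": [12.0, 18.0],
    "GBR": [8.0, 22.0],
}
combined = ensemble_predict(predictions, weights)
print(weights, combined)
```

Under this assumed scheme the model with the smallest validation error contributes most to the combined prediction, while the other models still moderate its individual errors.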

3. Results and Discussion

3.1. Physico-Morphological Observations of Borassus Fruit-Reinforced Composite (BFRC)

Using mechanical sieving in compliance with BS 1377:2, the excavated soil’s particle size distribution was ascertained; it showed that about 75% of the soil’s particles were smaller than 80 µm (Figure 6).

This particle size distribution is similar to the one displayed by the soil obtained from the termite hill in a previous work carried out by the authors [54]. The degree to which soil particles pack together depends on their size and arrangement. The fact that about 75% of our soil particles are below 80 µm enables the obtention of a denser earthen matrix, as seen in the micrograph (Figure 7a). On the other hand, from the micrograph, it can be noticed that the soil’s particles depict curved, flaky, and annular morphologies. The curved, flaky, and annular shapes observed in this matrix enable it to pack easily with fewer voids. Thus, the packing and density behavior of the earthen matrix is significantly affected by the soil particles’ shapes. They also affect the water retention and stability of the matrix because the curved and flaky particles create more friction and smaller channels for water to flow through the matrix [55].

The plasticity index and liquid limit—the points above which soil transitions from a solid to a plastic and from a plastic to a liquid state—were evaluated by looking at the excavated soil’s Atterberg limit. This makes it easy to comprehend how the soil behaves during stress and moisture changes, avoiding potential issues with dimensional stability brought on by matrix swelling or shrinking [56]. The strength and dimensional stability of the earthen matrix are significantly impacted by the moisture content, which is found to be 3.55%. It was shown that measuring the matrix’s level of compaction guarantees that the soil particles would bond together efficiently during compaction, increasing stability and strength. By supplying the proper amount of moisture required during the composite construction process, it permits the removal of voids and inhibits erosion. As the moisture content increases, the soil lubricates itself, making it easier to compact and have a higher density [57].

The dry density and specific gravity of the excavated soil were 0.56 g/cm³ and 2.5 respectively; they influence the compaction behavior of the earthen matrix and the void ratio. The results of the physical characterization are shown in Table 5. A higher dry density typically leads to improved strength, stiffness, and bearing capacity. However, excessive dry density can reduce workability and increase susceptibility to cracking. The optimal balance between specific gravity and dry density is crucial for achieving desired mechanical performance in various applications [58].

The morphology of both the earthen matrix and the BNF is very important for understanding the bonding mechanism between the reinforcement and the matrix. The morphological characterization performed on the BNF indicated a length variation of 5 cm to 10 cm for both fine and coarse fibers. These results are within the range of those obtained in a previous work on the Borassus fiber [59]. However, the diameter varied greatly, from 50 µm to 170 µm, with an elongation of 25–30%, as illustrated in Figure 7b. Superficial pores of minimal depth are also visible on the fibers in the micrograph (Figure 7a). Because they are shallow, these pores cannot trap significant moisture, but they can create less friction, resulting in a better adhesion to the earthen matrix [60]. Figure 7c shows the detachment line of the fiber during the mechanical failure.

3.2. Young’s Modulus Prediction

During the prediction of the Young’s modulus of BFRC, all models, both the single models and the ensemble model, exhibited high accuracy, with an R² value of 99%, indicating a strong relationship between the selected input variables (fiber content, activator concentration, curing days, mass, flexural vibration, longitudinal vibration, and correction factor) and the output (Young’s modulus). Linear regression exhibited a strong linear relationship between predicted and experimental values and achieved excellent performance, with MSE = 0.46, RMSE = 0.67, and R² = 99% (Figure 8a). Lower MSE and RMSE values indicate higher model accuracy, whereas higher values indicate lower performance. Although the MSE and RMSE of the other models were comparatively high, linear regression exhibited lower values than all of them. Ensemble learning showed a close fit with minimal deviation, with MSE = 3.98, RMSE = 1.99, and R² = 99% (Figure 8d). Linear regression and ensemble learning were therefore the two best-performing models, with LR showing an almost linear fit and the lowest MSE and RMSE values, and EL the second-lowest. LR might be suitable for linear relationships, while EL can handle more complex patterns. The gradient boosting regressor showed a less perfect fit (Figure 8b), especially for higher Young’s modulus values (after 550,000 Pa). Performance metrics for GBR were MSE = 13.6, RMSE = 3.68, and R² = 99%.

DTR generated results that were almost identical to the GBR results, with the ideal line matching the best-fit line and the prediction line occasionally deviating slightly from both (Figure 8c). The MSE, RMSE, and R² performance scores for DTR were 7.62, 2.76, and 99%, respectively. DTR and GBR performed similarly, with only small departures from the ideal fit, particularly at higher Young’s modulus values. When gradient boosting was used to predict the Young’s modulus of compositionally complex alloys alongside extreme gradient boosting (XGBoost), support vector machine (SVM), LASSO regression, random forest (RF), etc., it displayed the highest accuracy in terms of the performance metrics (R² and MAE) [53]. This similarity with the present investigation can be attributed to the fact that boosting can turn weak learners into strong ones. Decision trees are typically, though not always, employed as the weak base learners. Many boosting techniques construct models stage by stage and generalize them by optimizing an arbitrary differentiable loss function. Boosting techniques also partially mitigate over-fitting, help address collinearity among features, and capture non-linear relations between target properties and inputs [61].
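The stage-wise construction of a boosted model can be illustrated with a minimal sketch. It assumes a squared-error loss, a single input variable, depth-one regression trees (stumps) as weak learners, and a constant learning rate acting as the shrinkage. The function names, toy data, and settings are hypothetical and are not the configuration used in this study.

```python
from statistics import fmean


def fit_stump(x, residuals):
    """Find the threshold and the two leaf values that minimise squared error."""
    best = None
    levels = sorted(set(x))
    for lo, hi in zip(levels, levels[1:]):
        threshold = (lo + hi) / 2.0
        left = [r for xi, r in zip(x, residuals) if xi <= threshold]
        right = [r for xi, r in zip(x, residuals) if xi > threshold]
        left_value, right_value = fmean(left), fmean(right)
        sse = sum((r - left_value) ** 2 for r in left)
        sse += sum((r - right_value) ** 2 for r in right)
        if best is None or sse < best[0]:
            best = (sse, threshold, left_value, right_value)
    return best[1:]


def fit_boosting(x, y, n_stages=50, learning_rate=0.1):
    """Add one stump per stage, each fitted to the current residuals."""
    base = fmean(y)
    predictions = [base] * len(y)
    stumps = []
    for _ in range(n_stages):
        # For squared error, the negative gradient is simply the residual.
        residuals = [yi - pi for yi, pi in zip(y, predictions)]
        threshold, left_value, right_value = fit_stump(x, residuals)
        stumps.append((threshold, left_value, right_value))
        predictions = [
            pi + learning_rate * (left_value if xi <= threshold else right_value)
            for xi, pi in zip(x, predictions)
        ]
    return base, stumps, learning_rate


def predict_boosting(model, xi):
    """Sum the shrunken contributions of all stumps for one input value."""
    base, stumps, learning_rate = model
    return base + sum(
        learning_rate * (lv if xi <= t else rv) for t, lv, rv in stumps
    )


# Hypothetical non-linear toy data (not taken from the experimental dataset).
x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [1.0, 1.2, 1.1, 3.0, 3.2, 3.1, 6.0, 6.2]

model = fit_boosting(x, y)
mean_y = fmean(y)
mse_baseline = fmean([(yi - mean_y) ** 2 for yi in y])
mse_boosted = fmean([(yi - predict_boosting(model, xi)) ** 2 for xi, yi in zip(x, y)])
print("baseline MSE:", mse_baseline)
print("boosted MSE:", mse_boosted)
```

Each stage only corrects what the previous stages left unexplained, which is how a sequence of individually weak stumps can follow a non-linear trend.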

Among the SMs, GBR showed the highest error values. The R² of 99% obtained using LR in this study differs markedly from the value obtained when the mechanical properties of fly-ash/slag-based geopolymer concrete were predicted. In that study, LR achieved an R² of 63.7%, showing that LR can capture non-linear behavior only to a limited extent [62]. In the same investigation, DTR as an SM achieved an R² of 76%, which is lower than the value obtained by DTR in the present investigation.
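For reference, the three metrics used to compare the models can be computed as in the sketch below. The function name and the sample values are hypothetical and serve only to show the definitions; they are not the experimental data.

```python
from math import sqrt


def regression_metrics(y_true, y_pred):
    """Return (MSE, RMSE, R²) for paired observed and predicted values."""
    n = len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    mse = ss_res / n
    rmse = sqrt(mse)              # RMSE is the square root of MSE
    r2 = 1.0 - ss_res / ss_tot    # undefined if all observed values are equal
    return mse, rmse, r2


# Hypothetical observed and predicted values, for illustration only.
observed = [10.0, 12.0, 15.0, 19.0, 24.0]
predicted = [10.5, 11.5, 15.5, 18.0, 24.5]

mse, rmse, r2 = regression_metrics(observed, predicted)
print(f"MSE = {mse:.3f}, RMSE = {rmse:.3f}, R2 = {r2:.3f}")
```

Because MSE and RMSE carry the units of the target (squared and unsquared, respectively), their magnitudes are comparable between models predicting the same property but not between different properties, whereas R² is dimensionless.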

3.3. Compressive Strength Prediction

The results presented in Figure 9a–d demonstrate the performance of the different ML models used to predict the compressive strength of the BFRC. While all models exhibited high accuracy, as evidenced by R² values approaching 1, the specific performance metrics and the plots of predicted versus experimental values revealed distinct characteristics. LR demonstrated a strong linear relationship between predicted and experimental values, as indicated by the almost linear fitting line. It achieved reasonable performance as an SM, with MSE, RMSE, and R² values of 0.073, 0.27, and 99%, respectively. However, its accuracy decreased for higher compressive strength values (11.5 MPa), suggesting potential limitations in capturing non-linear relationships. With a near-perfect R² value of 99% and lower MSE and RMSE values of 0.01 and 0.1, respectively, GBR showed a more accurate fitting line than LR. The smallest difference between the predicted and actual fitting lines showed that the GBR model was robust in capturing both linear and non-linear interactions. DTR showed intermediate performance, with MSE, RMSE, and R² values of 0.04, 0.2, and 99%, respectively. DTR’s performance was consistent across different compressive strength ranges, suggesting potential for generalizability. However, its predicted vs. experimental fitting line was completely vertical, unlike the fitting lines of the other models, which indicates a nonsensical relationship between the input and output variables. EL exhibited excellent accuracy in predicting the compressive strength of BFRC, with MSE, RMSE, and R² values of 0.01, 0.1, and 99%, respectively, comparable to the GBR performance. In a study where DT, LR, RF, and other models were used to predict the compressive strength of fly-ash-based concrete, the DT and GB models demonstrated high efficiency (R² = 99%), achieving smaller errors than the other ML models [63].
This result is similar to the findings of the present study. An EL model built from random forest (RF), regression tree (RT), and gradient boosting (GB) was used to estimate the unconfined compressive strength of cemented paste backfill, where the EL model displayed higher performance than the SMs, similarly to the present findings [64]. Four EL models (AdaBoost, GBDT, XGBoost, and RF) were compared for the prediction of high-performance concrete strength. The GBDT model outperformed the other models, with higher efficiency, as in the present study [65].
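One simple way of combining single models is to average their predictions, as sketched below. The combination rule is an assumption made for illustration, since other schemes such as stacking or weighted voting are equally possible, and the numbers are hypothetical rather than experimental.

```python
def mean_squared_error(y_true, y_pred):
    """Average squared difference between observed and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def average_ensemble(member_predictions):
    """Average the member predictions sample by sample."""
    n_members = len(member_predictions)
    return [sum(sample) / n_members for sample in zip(*member_predictions)]


# Hypothetical compressive strengths (MPa) and predictions of two single models.
observed = [2.0, 4.0, 6.0, 8.0]
model_a = [2.4, 4.3, 6.5, 8.2]  # tends to over-predict
model_b = [1.8, 3.6, 5.7, 7.5]  # tends to under-predict

ensemble = average_ensemble([model_a, model_b])
member_errors = [mean_squared_error(observed, m) for m in (model_a, model_b)]
ensemble_error = mean_squared_error(observed, ensemble)

print("member MSEs:", member_errors)
print("ensemble MSE:", ensemble_error)
```

Under squared error, simple averaging guarantees only that the ensemble MSE does not exceed the mean MSE of its members. A single well-suited model can therefore still outperform the ensemble, which is consistent with the LR results reported for the hygroscopic properties in Section 3.4.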

3.4. Hygroscopic Properties Prediction

The prediction of hygroscopic behavior was assessed using three evaluation metrics: mean squared error (MSE), root mean squared error (RMSE), and R-squared (R²). These metrics provide insights into the accuracy and reliability of the different models. LR demonstrated the best performance among the models (Figure 10a), with the lowest MSE (0.13) and RMSE (0.36) values and the highest R² (99%) value. This indicates a strong linear relationship between the predicted and experimental values, suggesting that LR, as an SM, effectively captures the hygroscopic behavior of the BFRC. GBR exhibited a scattered plot, as shown in Figure 10b, indicating some variability in its predictions; its R² value was still high (95%), with an MSE of 4.07 and an RMSE of 2.01. The R² was lower than that of LR, suggesting that GBR might not be as accurate in capturing the exact linear relationship (Figure 10c). A study using XGBoost for the prediction of water absorption reported an R² value of 94%, which is very close to the present finding, and an MAE of 0.036 [66]. The ensemble learning (EL) model showed improved performance compared to its individual components (Figure 10d). The EL model’s reasonably accurate results, with MSE, RMSE, and R² values of 2.04, 1.42, and 97%, respectively, demonstrate that integrating multiple models can increase the accuracy of hygroscopic predictions. DTR produced a vertical line when the predicted values were plotted against the experimental values, indicating that there was little variability in its predictions. This suggests that DTR might not be suitable for capturing the hygroscopic behavior in this case, with MSE, RMSE, and R² values of 5.89, 2.43, and 93%, respectively. Because DTR provides insufficient information to predict the hygroscopic characteristics, it cannot be used to evaluate the BFRC’s efficiency precisely. However, when DTR was used as part of the EL model, better prediction was demonstrated.
This aligns with the results of a previous work in which DT was used individually and with a bagging approach, which improved its R² from 72% to 92% [57]. LR appears to be the most effective model for predicting hygroscopic behavior in this context. However, its performance might be limited to scenarios with a strong linear relationship between the predictors and the target variable.

The analysis of the models’ performance during the prediction of Young’s modulus shows that LR, as a single model, displayed the lowest MSE and RMSE at an R² of 99%, supporting a linear relationship between the selected inputs and the predicted Young’s modulus. It is followed by the EL model, which exhibited lower errors than DTR and GBR.

3.5. Feature Importance Analysis

A feature importance analysis was performed using the decision tree and gradient boosting models, which provide built-in measures of feature contribution. The results indicate that fiber content (%) and curing days were the most influential predictors of Young’s modulus, followed by water absorption (%), while activator content (%) had a relatively smaller effect. This finding is consistent with the experimental observations and domain knowledge, thereby reinforcing the credibility of the predictive models (Table 6).
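The idea behind the built-in, impurity-based importance of tree models can be sketched in a simplified form: each feature is scored by the largest reduction in squared error that a single split on it achieves, and the scores are normalized to sum to one. This depth-one simplification, the function names, and the data are assumptions made for illustration; they do not reproduce the computation or the values behind Table 6.

```python
from statistics import fmean


def sum_squared_error(values):
    """Squared deviation of the values around their mean."""
    centre = fmean(values)
    return sum((v - centre) ** 2 for v in values)


def best_split_gain(column, target):
    """Largest drop in squared error from one threshold on this feature."""
    total = sum_squared_error(target)
    best_gain = 0.0
    levels = sorted(set(column))
    for lo, hi in zip(levels, levels[1:]):
        threshold = (lo + hi) / 2.0
        left = [y for x, y in zip(column, target) if x <= threshold]
        right = [y for x, y in zip(column, target) if x > threshold]
        gain = total - sum_squared_error(left) - sum_squared_error(right)
        best_gain = max(best_gain, gain)
    return best_gain


def stump_importances(features, target):
    """Normalise the per-feature gains so that they sum to one."""
    gains = {name: best_split_gain(col, target) for name, col in features.items()}
    total = sum(gains.values())
    return {name: gain / total for name, gain in gains.items()}


# Hypothetical samples: the target mostly follows fiber content, then curing days.
features = {
    "fiber_content": [0.5, 0.5, 1.0, 1.0, 1.5, 1.5],
    "curing_days": [14, 90, 14, 90, 14, 90],
    "activator": [0.03, 0.05, 0.05, 0.03, 0.03, 0.05],
}
target = [10.0, 14.0, 20.0, 24.0, 30.0, 34.0]

importances = stump_importances(features, target)
for name, score in sorted(importances.items(), key=lambda item: -item[1]):
    print(f"{name}: {score:.3f}")
```

Full tree and boosting implementations accumulate such gains over every split in every tree, so their rankings can differ from this single-split view when features interact.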

4. Limitations, Challenges, and Practical and Theoretical Implications with Examples to Enhance Decision-Making

This section discusses key limitations and challenges that must be carefully considered when applying model-based predictions in real-world scenarios. By understanding these limitations, we can develop strategies to mitigate their impact and ensure the reliability and robustness of our predictions. These limitations arise from various factors, including the quality of the data used to train the models, the complexity of the underlying relationships, and the inherent uncertainty associated with predictions.

The input used during training has a substantial impact on the model’s performance based on the expected attributes.
EL models can often achieve higher accuracy; however, some SMs compete with EL models in terms of performance.
Making rational choices requires quantifying the degree of uncertainty in the forecasts. Because of the amount and caliber of the dataset, the models were straightforward to comprehend.

The investigation’s implications center on the choice of model, which ought to be determined by the needs of the application. The models’ performance is also impacted by the amount and quality of the experimental data and the choice of predictive techniques.

4.1. Theoretical Implications

Various SMs were used, and the EL model was developed based on the application of the properties predicted. The EL model often leads to better predictions; however, the SMs need to be combined with various parameters taken into consideration.

4.2. Practical Implications

Considering the type of data (e.g., linear vs. non-linear relationships) when choosing a model, integrating predictions from various base models by stacking or by combining different GBR models, and analyzing models using both visual analysis and performance measures (such as R², MSE, and MAE) will assist in detecting potential biases, constraints, and areas in need of development.

5. Conclusions

The mechanical behavior and physical characteristics of Borassus fiber-reinforced earthen composite (BFRC) were examined in this work. The findings revealed that the excavated soil’s particle size distribution and morphology significantly influence the packing density and water retention of the earthen matrix. Some of the focal findings can be summarized as follows:

The morphological characterization of the Borassus fibers showed an appropriate fiber length and diameter for reinforcement, with superficial pores that could improve adhesion to the matrix. The earthen matrix properties are influenced by its moisture content, Atterberg limits, dry density, and specific gravity.
The prediction of the Young’s modulus and compressive strength of the BFRC using machine learning (ML) models demonstrated the superiority of ensemble learning (EL) and gradient boosting regression (GBR) over single models (SMs). These models exhibited high accuracy and robustness in capturing the complex relationships between the input variables and the output properties.
Using ML models to predict the hygroscopic properties, it was shown that LR best captured the linear relationship between the experimental and predicted values. Although the EL model performed better than its component SMs, in this instance it was not superior to LR.

In summary, the models can be used to track material qualities and spot any deviations from planned requirements during production, and the findings can help design high-performance and sustainable earthen composites for a range of applications. This can help engineers and architects select the most suitable materials for specific applications, prevent the use of substandard materials, reduce the risk of structural failure, and ensure structural integrity and safety. The models can be used to optimize the mix design of novel construction materials by predicting the different properties based on the various proportions of the new mix components. To enhance predictive modeling of mechanical and hygroscopic properties, future studies need to (i) examine non-linear models for hygroscopic behavior and (ii) validate models in practical settings.

Author Contributions

Conceptualization, A.A.M., N.L., and N.L.B.; Formal analysis, N.L.B.; Funding acquisition, A.A.M.; Investigation, M.M.B. and P.N.; Methodology, I.I.O., P.N., and B.N.; Supervision, M.M.B. and N.L.; Visualization, I.I.O. and B.N.; Writing—original draft, A.A.M.; Writing—review and editing, A.A.M., M.M.B., I.I.O., P.N., B.N., N.L., and N.L.B. All authors have read and agreed to the published version of the manuscript.

Funding

Work by the first author was supported by the Schlumberger Foundation Faculty for the Future program.

Data Availability Statement

Dataset available on request from the authors. The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

The authors acknowledge Prime University Abuja, Nigeria for their administrative and technical support.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AI	Artificial Intelligence
ANN	Artificial Neural Network
BF	Borassus Fruit Fiber
BFRC	Borassus Fruit Fiber Reinforced Composite
BS	British Standard
DTR	Decision Tree Regressor (or Regression)
EDX	Energy-Dispersive X-ray Spectroscopy
EL	Ensemble Learning
GB	Gradient Boosting
GBR	Gradient Boosting Regression
GBRT	Gradient Boosting Regression Tree
HPC	High-Performance Concrete
HSC	High-Strength Concrete
IET	Impulse Excitation Technique
LR	Linear Regression
LSSVM	Least Squares Support Vector Machine
LSSVR	Least Squares Support Vector Regression
MAE	Mean Absolute Error
MAPE	Mean Absolute Percentage Error
ML	Machine Learning
MSE	Mean Square Error
RAC	Recycled Aggregate Concrete
RF	Random Forest
RMSE	Root Mean Square Error
SEM	Scanning Electron Microscopy
SFRC	Steel Fiber Reinforced Concrete
SMs	Single Models
SVM	Support Vector Machine
UHPC	Ultra-High-Performance Concrete
UTM	Universal Testing Machine
Wa	Water Absorption
wt%	Weight Percent

References

1. Farmer, G.T.; Cook, J. Scientific Principles and the Scientific Method. In Climate Change Science: A Modern Synthesis; Springer: New York, NY, USA, 2012.
2. Huang, G.; Abou-Chakra, A.; Geoffroy, S.; Absi, J. A Multi-Scale Numerical Simulation on Thermal Conductivity of Bio-Based Construction Materials. Constr. Mater. 2022, 2, 148–165.
3. Medvey, B.; Dobszay, G. Durability of Stabilized Earthen Constructions: A Review. Geotech. Geol. Eng. 2020, 38, 2403–2425.
4. Hashemi, A.; Cruickshank, H.; Cheshmehzangi, A. Environmental impacts and embodied energy of construction methods and materials in low-income tropical housing. Sustainability 2015, 7, 7866–7883.
5. McLellan, B.C.; Williams, R.P.; Lay, J.; Van Riessen, A.; Corder, G.D. Costs and carbon emissions for geopolymer pastes in comparison to ordinary portland cement. J. Clean. Prod. 2011, 19, 1080–1090.
6. Zhang, J.; Liu, G.; Chen, B.; Song, D.; Qi, J.; Liu, X. Analysis of CO₂ Emission for the cement manufacturing with alternative raw materials: A LCA-based framework. In Energy Procedia; Elsevier Ltd.: Amsterdam, The Netherlands, 2014; pp. 2541–2545.
7. Mahamat, A.A.; Leklou, N.; Obianyo, I.I.; Stanislas, T.T.; Ayeni, O.; Bih, N.L. Evaluation of the microstructural and physico-mechanical characteristics of cement-stabilized termite hill soil for construction application. Discov. Civ. Eng. 2024, 1, 54.
8. Ngayakamo, B.; Aboubakar, A.M.; Komadja, C.G.; Bello, A.; Onwualu, A.P. Eco-friendly use of eggshell powder as a bio-filler and flux material to enhance technological properties of fired clay bricks. Metall. Mater. Eng. 2021, 27, 371–383.
9. Stazi, F.; Nacci, A.; Tittarelli, F.; Pasqualini, E.; Munafò, P. An experimental study on earth plasters for earthen building protection: The effects of different admixtures and surface treatments. J. Cult. Herit. 2016, 17, 27–41.
10. Toniolo, N.; Rincón, A.; Roether, J.A.; Ercole, P.; Bernardo, E.; Boccaccini, A.R. Extensive reuse of soda-lime waste glass in fly ash-based geopolymers. Constr. Build. Mater. 2018, 188, 1077–1084.
11. Ahmad, J.; Arbili, M.M.; Majdi, A.; Althoey, F.; Farouk Deifalla, A.; Rahmawati, C. Performance of concrete reinforced with jute fibers (natural fibers): A review. J. Eng. Fiber Fabr. 2022, 17, 15589250221121871.
12. Stanislas, T.T.; Komadja, G.C.; Obianyo, I.I.; Ayeni, O.; Mahamat, A.A.; Tendo, J.F.; Junior, H.S. Multivariate regression approaches to predict the flexural performance of cellulose fibre reinforced extruded earth bricks for sustainable buildings. Clean. Mater. 2023, 7, 100180.
13. Provis, J.L. Alkali-activated materials. Cem. Concr. Res. 2018, 114, 40–48.
14. Savastano, H.; Turner, A.; Mercer, C.; Soboyejo, W.O. Mechanical behavior of cement-based materials reinforced with sisal fibers. J. Mater. Sci. 2006, 41, 6938–6948.
15. Concha-Riedel, J.; Araya-Letelier, G.; Antico, F.C.; Reidel, U.; Glade, A. Influence of Jute Fibers to Improve Flexural Toughness, Impact Resistance and Drying Shrinkage Cracking in Adobe Mixes. In Earthen Dwellings and Structures; Springer Transactions in Civil and Environmental Engineering: Cham, Switzerland, 2019; pp. 269–278.
16. Ayeni, O.; Mahamat, A.A.; Bih, N.L.; Stanislas, T.T.; Isah, I.; Junior, H.S.; Boakye, E.; Onwualu, A.P. Effect of Coir Fiber Reinforcement on Properties of Metakaolin-Based Geopolymer Composite. Appl. Sci. 2022, 12, 5478.
17. Morton, J. Notes on Distribution, Propagation, and Products of Borassus Palms (Arecaceae). Econ. Bot. 1988, 42, 420–441.
18. Mahamat, A.A.; Leklou, N.; Obianyo, I.I.; Poullain, P.; Stanislas, T.T.; Ayeni, O.; Bih, N.L.; Savastano, H. Assessment of hygrothermal and mechanical performance of alkali activated Borassus fiber reinforced earth-based bio-composite. J. Build. Eng. 2022, 62, 105411.
19. Chowdhury, M.N.K.; Beg, M.D.H.; Khan, M.R.; Mina, M.F. Modification of oil palm empty fruit bunch fibers by nanoparticle impregnation and alkali treatment. Cellulose 2013, 20, 1477–1490.
20. Sridhar, R. A Review on performance of coir fiber reinforced sand. Int. J. Eng. Technol. 2017, 9, 249–256.
21. Zhang, L.; Pan, Y.; Wu, X.; Skibniewski, M.J. Lecture Notes in Civil Engineering Artificial Intelligence in Construction Engineering and Management; Springer: Singapore, 2021; Available online: http://www.springer.com/series/15087 (accessed on 19 July 2024).
22. Russell, S.J.; Norvig, P.; Davis, E.; Edwards, D.D.; Forsyth, D.; Hay, N.J.; Malik, J.M.; Mittal, V.; Sahami, M.; Thrun, S. Artificial Intelligence A Modern Approach, 3rd ed.; Pearson: London, UK, 2022.
23. Koyamparambath, A.; Adibi, N.; Szablewski, C.; Adibi, S.A.; Sonnemann, G. Implementing Artificial Intelligence Techniques to Predict Environmental Impacts: Case of Construction Products. Sustainability 2022, 14, 3699.
24. Mahamat Boukar, M.; Mahamat, A.A.; Djibrine, O.H. The Impact of Artificial Intelligence (AI) on Content Management Systems (CMS): A Deep Dive. Int. J. Intell. Syst. Appl. Eng. 2024, 12, 552–560. Available online: www.ijisae.org (accessed on 19 July 2024).
25. Oyedele, A.O.; Ajayi, A.O.; Oyedele, L.O. Machine learning predictions for lost time injuries in power transmission and distribution projects. Mach. Learn. Appl. 2021, 6, 100158.
26. Mahamat, A.A.; Boukar, M.M.; Ibrahim, N.M.; Stanislas, T.T.; Bih, N.L.; Obianyo, I.I.; Savastano, H. Machine learning approaches for prediction of the compressive strength of alkali activated termite mound soil. Appl. Sci. 2021, 11, 4754.
27. Li, Y.; Zhang, Q.; Kamiński, P.; Deifalla, A.F.; Sufian, M.; Dyczko, A.; Ben Kahla, N.; Atig, M. Compressive Strength of Steel Fiber-Reinforced Concrete Employing Supervised Machine Learning Techniques. Materials 2022, 15, 4209.
28. Nguyen, K.T.; Nguyen, Q.D.; Le, T.A.; Shin, J.; Lee, K. Analyzing the compressive strength of green fly ash based geopolymer concrete using experiment and machine learning approaches. Constr. Build. Mater. 2020, 247, 118581.
29. Mahamat, A.A.; Boukar, M.M.; Leklou, N.; Obianyo, I.I.; Stanislas, T.T.; Bih, N.L.; Ayeni, O.; Ibrahim, N.M.; Savastano, H. A Machine Learning Led Investigation Predicting the Thermos-mechanical Properties of Novel Waste-based Composite in Construction. Waste Biomass Valorization 2024, 15, 5445–5461.
30. Arabnia, H.R.; Tran, Q.N. (Eds.) Software Tools and Algorithms for Biological Systems; Advances in Experimental Medicine and Biology; Springer: New York, NY, USA, 2011; Volume 696, Available online: https://link.springer.com/10.1007/978-1-4419-7046-6 (accessed on 19 July 2024).
31. Paudel, S.; Pudasaini, A.; Shrestha, R.K.; Kharel, E. Compressive strength of concrete material using machine learning techniques. Clean. Eng. Technol. 2023, 15, 100661.
32. Rincy, T.N.; Gupta, R. Ensemble learning techniques and its efficiency in machine learning: A survey. In Proceedings of the 2nd International Conference on Data, Engineering and Applications (IDEA), Bhopal, India, 28–29 February 2020.
33. Shatnawi, A.; Alkassar, H.M.; Al-Abdaly, N.M.; Al-Hamdany, E.A.; Bernardo, L.F.A.; Imran, H. Shear Strength Prediction of Slender Steel Fiber Reinforced Concrete Beams Using a Gradient Boosting Regression Tree Method. Buildings 2022, 12, 550.
34. Munir, M.J.; Kazmi, S.M.S.; Wu, Y.F.; Lin, X.; Ahmad, M.R. Development of a novel compressive strength design equation for natural and recycled aggregate concrete through advanced computational modeling. J. Build. Eng. 2022, 55, 104690.
35. Upreti, K.; Verma, M.; Agrawal, M.; Garg, J.; Kaushik, R.; Agrawal, C.; Singh, D.; Narayanasamy, R.; Chelladurai, S.J.S. Prediction of Mechanical Strength by Using an Artificial Neural Network and Random Forest Algorithm. J. Nanomater. 2022, 2022, 7791582.
36. Mahamat, A.A.; Boukar, M.M. Machine learning techniques versus classical statistics in strength predictions of eco-friendly masonry units. In Proceedings of the 16th International Conference on Electronics Computer and Computation (ICECCO 2021), Kaskelen, Kazakhstan, 25–26 November 2021.
37. Obianyo, I.I.; Onwualu, A.P.; Mahamat, A.A. Evaluation of Predictive Models for Mechanical Properties of Earth-Based Composites for Sustainable Building Applications. In New Advances in Soft Computing in Civil Engineering, AI-Based Optimization and Prediction; Bekdaş, G., Nigdeli, S.M., Eds.; Studies in Systems, Decision and Control; Springer Nature: Cham, Switzerland, 2024; Volume 547, pp. 179–190. Available online: https://link.springer.com/10.1007/978-3-031-65976-8 (accessed on 19 July 2024).
38. Mangalathu, S.; Jang, H.; Hwang, S.H.; Jeon, J.S. Data-driven machine-learning-based seismic failure mode identification of reinforced concrete shear walls. Eng. Struct. 2020, 208, 110331.
39. Mohanraj, T.; Yerchuru, J.; Krishnan, H.; Nithin Aravind, R.S.; Yameni, R. Development of tool condition monitoring system in end milling process using wavelet features and Hoelder’s exponent with machine learning algorithms. Measurement 2021, 173, 108671.
40. Alghamdi, S.J. Classifying High Strength Concrete Mix Design Methods Using Decision Trees. Materials 2022, 15, 1950.
41. Ahmad, A.; Farooq, F.; Niewiadomski, P.; Ostrowski, K.; Akbar, A.; Aslam, F.; Alyousef, R. Prediction of compressive strength of fly ash based concrete using individual and ensemble algorithm. Materials 2021, 14, 794.
42. Phung, B.N.; Le, T.H.; Mai, H.V.T.; Nguyen, T.A.; Ly, H.B. Advancing basalt fiber asphalt concrete design: A novel approach using gradient boosting and metaheuristic algorithms. Case Stud. Constr. Mater. 2023, 19, e02528.
43. BS 1377-2:2022; Part 2: British Standard Methods of Test for Soils for Civil Engineering Purposes. British Standard Institutions: London, UK, 2020.
44. ASTM D854-23; Standard Test Methods for Specific Gravity of Soil Solids by Water Pycnometer. ASTM International: West Conshohocken, PA, USA, 2002; Volume 4, pp. 1–9.
45. ASTM D7263-21; Test Methods for Laboratory Determination of Density (Unit Weight) of Soil Specimens. ASTM International: West Conshohocken, PA, USA, 2021. Available online: http://www.astm.org/cgi-bin/resolver.cgi?D7263-21 (accessed on 19 July 2024).
46. ASTM D4318-17e1; Test Methods for Liquid Limit, Plastic Limit, and Plasticity Index of Soils. ASTM International: West Conshohocken, PA, USA, 2017. Available online: http://www.astm.org/cgi-bin/resolver.cgi?D4318-17E1 (accessed on 19 July 2024).
47. Mahamat, A.A.; Obianyo, I.I.; Ngayakamo, B.; Bih, N.L.; Ayeni, O.; Azeko, S.T.; Savastano, H. Alkali activation of compacted termite mound soil for eco-friendly construction materials. Heliyon 2021, 7, e06597.
48. ASTM C109/C109M-20; Standard Test Method for Compressive Strength of Hydraulic Cement Mortars (Using 2-in. or [50-mm] Cube Specimens). ASTM International: West Conshohocken, PA, USA, 2020. Available online: https://store.astm.org/c0109_c0109m-20.html (accessed on 19 July 2024).
49. Barnaure, M.; Bonnet, S.; Poullain, P. Earth buildings with local materials: Assessing the variability of properties measured using non-destructive methods. Constr. Build. Mater. 2021, 281, 122613.
50. ASTM C1548–02 (R2007); Standard Test Method for Dynamic Young’s Modulus, Shear Modulus, and Poisson’s Ratio of Refractory Materials by Impulse Excitation of Vibration. ASTM International: West Conshohocken, PA, USA, 2007. Available online: www.astm.org (accessed on 12 January 2024).
51. Linda Bih, N.; Aboubakar Mahamat, A.; Bidossèssi Hounkpè, J.; Azikiwe Onwualu, P.; Boakye, E.E. The Effect of Polymer Waste Addition on the Compressive Strength and Water Absorption of Geopolymer Ceramics. Appl. Sci. 2021, 11, 3540.
52. Mahamat, A.A.; Boukar, M.M. On the Use of Machine Learning Technique to Appraise Thermal Properties of Novel Earthen Composite for Sustainable Housing in Sub-Saharan Africa. In Innovations and Interdisciplinary Solutions for Underserved Areas; Springer: Cham, Switzerland, 2023; pp. 161–170.
53. Mahamat, A.A.; Boukar, M.M.; Leklou, N.; Celino, A.; Obianyo, I.I.; Bih, N.L.; Stanislas, T.T.; Savastano, H. Decision Tree Regression vs. Gradient Boosting Regressor Models for the Prediction of Hygroscopic Properties of Borassus Fruit Fiber. Appl. Sci. 2024, 14, 7540.
54. Mahamat, A.A.; Bih, N.L.; Ayeni, O.; Onwualu, P.A.; Savastano, H.; Soboyejo, W.O. Development of sustainable and eco-friendly materials from termite hill soil stabilized with cement for low-cost housing in Chad. Buildings 2021, 11, 86.
55. Abushanab, W.S.; Moustafa, E.B.; Ghandourah, E.I.; Hussein, H.; Taha, M.A.; Mosleh, A.O. Impact of Hard and Soft Reinforcements on the Microstructure, Mechanical, and Physical Properties of the Surface Composite Matrix Manufactured by Friction Stir Processing. Coatings 2023, 13, 284.
56. Mahamat, A.A.; Dayyabu, A.; Sanusi, A.; Ado, M.; Obianyo, I.I.; Stanislas, T.T.; Bih, N.L. Dimensional stability and strength appraisal of termite hill soil stabilisation using hybrid bio-waste and cement for eco-friendly housing. Heliyon 2022, 8, e09406.
57. Rahmat, M.N.; Ismail, N. Effect of optimum compaction moisture content formulations on the strength and durability of sustainable stabilised materials. Appl. Clay Sci. 2018, 157, 257–266.
58. Alshameri, B. Maximum dry density of sand–kaolin mixtures predicted by using fine content and specific gravity. SN Appl. Sci. 2020, 2, 1693.
59. Obi Reddy, K.; Shukla, M.; Uma Maheswari, C.; Varada Rajulu, A. Mechanical and physical characterization of sodium hydroxide treated Borassus fruit fibers. J. For. Res. 2012, 23, 667–674.
60. Verma, D.; Goh, K.L. Effect of mercerization/alkali surface treatment of natural fibres and their utilization in polymer composites: Mechanical and morphological studies. J. Compos. Sci. 2021, 5, 175.
61. Friedman, J.H. Greedy Function Approximation: A Gradient Boosting Machine. Ann. Stat. 2001, 29, 1189–1232.
62. Amin, M.N.; Khan, K.; Javed, M.F.; Aslam, F.; Qadir, M.G.; Faraz, M.I. Prediction of Mechanical Properties of Fly-Ash/Slag-Based Geopolymer Concrete Using Ensemble and Non-Ensemble Machine-Learning Techniques. Materials 2022, 15, 3478.
63. Wadhawan, S.; Bassi, A.; Singh, R.; Patel, M. Prediction of Compressive Strength for Fly Ash-Based Concrete: Critical Comparison of Machine Learning Algorithms. J. Soft Comput. Civ. Eng. 2023, 7, 68–110.
64. Lu, X.; Zhou, W.; Ding, X.; Shi, X.; Luan, B.; Li, M. Ensemble Learning Regression for Estimating Unconfined Compressive Strength of Cemented Paste Backfill. IEEE Access 2019, 7, 72125–72133.
65. Li, Q.F.; Song, Z.M. High-performance concrete strength prediction based on ensemble learning. Constr. Build. Mater. 2022, 324, 126694.
66. Xiong, W.; Xiao, L.; Han, D.; Yue, W. A prediction model for water absorption in sublayers based on stacking ensemble learning method. Geoenergy Sci. Eng. 2024, 239, 212896.

Figure 1. Borassus tree (a), fruit (b), and extracted fine and coarse fibers (c).

Figure 2. Diagram summarizing the various steps followed during the composite production.

Figure 3. Diagram of the Young’s modulus testing setup.

Figure 4. Diagram showcasing the steps to build the ensemble learning model.

Figure 5. Flow chart describing the BFRC production, laboratory testing, creation of primary dataset, generation of EL, prediction of EL vs. SM with performance comparison.

Figure 6. Particle size distribution of the earthen matrix.

Figure 7. SEM-EDX micrographs of (a) the excavated soil (matrix), (b) Borassus fruit-reinforced composite (BFRC), and (c) natural Borassus fruit fiber (BNF).

Figure 8. Young’s modulus prediction results for (a) LR, (b) GBR, (c) DTR, and (d) EL.

Figure 9. Compressive strength prediction using (a) LR, (b) GBR, (c) DTR, and (d) EL.

Figure 10. Prediction of water absorption using (a) LR, (b) GBR, (c) DTR, and (d) EL.

Table 1. Mathematical formulas of LR, GBR, and DTR.

LR	GBR	DTR
$Y = \beta_{0} + \beta_{1} x + \varepsilon$	$Y = \sum_{m=1}^{M} \gamma_{m} h_{m}(x)$	Non-parametric, hence complex to represent mathematically

where $Y$ is the dependent variable, $x$ is the independent variable, $\beta_{0}$ is the intercept (the value of $Y$ when $x$ is 0), $\beta_{1}$ is the slope (the change in $Y$ for a one-unit change in $x$), $\varepsilon$ is the error term (the difference between the observed and predicted values of $Y$), $M$ is the number of basis functions, $h_{m}(x)$ are the basis functions, and $\gamma_{m}$ are the shrinkage parameters.
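The two closed-form expressions in Table 1 can be illustrated with a minimal, standard-library sketch. It is not the authors' implementation: the toy data, the use of one-split regression stumps as basis functions, a single constant shrinkage value for every stage, and the function names `fit_linear`, `fit_stump`, and `fit_boosted` are all assumptions made for illustration. The constant starting prediction can be read as an extra basis function with weight 1.

```python
from statistics import mean

def fit_linear(xs, ys):
    """Ordinary least squares for Y = beta0 + beta1 * x (one predictor)."""
    x_bar, y_bar = mean(xs), mean(ys)
    num = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    den = sum((x - x_bar) ** 2 for x in xs)
    beta1 = num / den
    return y_bar - beta1 * x_bar, beta1

def fit_stump(xs, residuals):
    """Best one-split stump h(x) under squared error (needs >= 2 distinct x)."""
    best = None
    for threshold in sorted(set(xs))[:-1]:
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        l_mean, r_mean = mean(left), mean(right)
        sse = (sum((r - l_mean) ** 2 for r in left)
               + sum((r - r_mean) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, l_mean, r_mean)
    _, threshold, l_mean, r_mean = best
    return lambda x: l_mean if x <= threshold else r_mean

def fit_boosted(xs, ys, n_stages=500, gamma=0.1):
    """Additive model: base + sum_m gamma * h_m(x), each h_m fitted to residuals."""
    base = mean(ys)
    stumps = []
    preds = [base] * len(xs)
    for _ in range(n_stages):
        residuals = [y - p for y, p in zip(ys, preds)]
        h = fit_stump(xs, residuals)
        stumps.append(h)
        preds = [p + gamma * h(x) for p, x in zip(preds, xs)]
    return lambda x: base + sum(gamma * h(x) for h in stumps)

# Hypothetical toy data (curing days vs. a made-up response), not study values.
days = [14, 28, 56, 90]
response = [1.5, 5.0, 12.0, 23.0]

beta0, beta1 = fit_linear(days, response)
boosted = fit_boosted(days, response)
for d, y in zip(days, response):
    print(d, y, round(beta0 + beta1 * d, 2), round(boosted(d), 2))
```

The linear model is fully described by two numbers, whereas the boosted model is a sum of many small step functions, which is why the tree-based entries in Table 1 are harder to write in closed form.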

Table 2. Details of the experimentally obtained input variables used for the prediction of water absorption.

Fiber Content (%)	Activator (%)	Curing Days	Dry Weight (g)	Saturated Weight (g)	Water Absorption (%)
0.5	0.03	14	479.1	541.5	13.02
0.5	0.03	14	442.2	489.1	10.60
0.5	0.03	14	482.4	535.4	10.99
0.5	0.03	14	501.6	596.5	18.92
0.5	0.03	14	496.9	607.6	22.28
0.5	0.03	14	489.2	605.4	23.75
0.5	0.03	14	488.4	610.7	25.04
0.5	0.03	14	538.6	670.9	24.57
0.5	0.03	14	478.4	640.3	33.83
0.5	0.03	14	489.1	531.5	8.67
0.5	0.03	14	452.2	499.1	10.37
…	…	…	…	…	…
…	…	…	…	…	…
…	…	…	…	…	…
0.5	0.03	90	499.3	615.8	23.33
0.5	0.03	90	452.2	605.7	33.94
0.5	0.03	90	492.2	640.6	30.16
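The water absorption column in Table 2 is consistent with the usual mass-gain definition, i.e., the saturated weight minus the dry weight, divided by the dry weight, expressed as a percentage. The short sketch below recomputes a few rows under that assumption; the function name `water_absorption` is illustrative and not taken from the article.

```python
def water_absorption(dry_g, saturated_g):
    """Water absorption (%) as mass gain relative to the dry mass."""
    return (saturated_g - dry_g) / dry_g * 100.0

# (dry weight g, saturated weight g, reported absorption %) copied from Table 2
rows = [
    (479.1, 541.5, 13.02),
    (482.4, 535.4, 10.99),
    (489.1, 531.5, 8.67),
]

for dry, sat, reported in rows:
    computed = water_absorption(dry, sat)
    print(f"dry={dry} g, saturated={sat} g, "
          f"computed={computed:.2f} %, reported={reported} %")
```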

Table 3. Details of the experimentally obtained input variables used for the prediction of compressive strength.

Fiber Content (%)	Activator (%)	Curing Days	Cross-Sectional Area (mm²)	Maximum Load (N)	Compressive Strength (MPa)
0.5	0.03	14	1000	1290	1.29
0.5	0.03	14	1000	1290	1.29
0.5	0.03	14	1000	1290	1.29
0.5	0.03	14	1000	2290	2.29
0.5	0.03	14	1000	2345	2.35
0.5	0.03	14	970	1234	1.27
0.5	0.03	14	970	1580	1.63
0.5	0.03	14	970	1305	1.35
0.5	0.03	14	970	1495	1.54
0.5	0.03	14	900	1495	1.66
0.5	0.03	14	900	1890	2.10
…	…	…	…	…	…
…	…	…	…	…	…
…	…	…	…	…	…
0.5	0.03	90	800	19,475	24.34
0.5	0.03	90	800	18,543	23.18
0.5	0.03	90	800	17,456	21.82
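The compressive strength values in Table 3 agree with the maximum load divided by the cross-sectional area when the load is read in newtons, since 1 N/mm² equals 1 MPa. The sketch below checks a few rows under that assumption; the function name `compressive_strength` is illustrative.

```python
def compressive_strength(load_n, area_mm2):
    """Compressive strength in MPa: load (N) over area (mm^2); 1 N/mm^2 = 1 MPa."""
    return load_n / area_mm2

# (area mm^2, maximum load, reported strength MPa) copied from Table 3;
# the load is interpreted in newtons here, which is an assumption.
rows = [
    (1000, 1290, 1.29),
    (970, 1234, 1.27),
    (800, 19475, 24.34),
]

for area, load, reported in rows:
    computed = compressive_strength(load, area)
    print(f"area={area} mm^2, load={load}, "
          f"computed={computed:.2f} MPa, reported={reported} MPa")
```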

Table 4. Details of the experimentally obtained input variables used for the prediction of Young’s modulus.

Fiber Content (%)	Activator (%)	Curing Days	Mass (g)	Flexural Vibration (Hz)	Torsional Vibration (Hz)	Correction Factor	Young’s Modulus (Pa)
0.5	0.03	14	465.17	3.15	4.41	1.4115625	462,555.777
0.5	0.03	14	500.12	3.12	4.48	1.4115625	534,674.114
0.5	0.03	14	520.09	2.94	2.71	1.4115625	578,226.139
0.5	0.03	14	421.16	2.74	4.07	1.4115625	379,170.854
0.5	0.03	14	469.08	2.73	4.41	1.4115625	470,364.51
0.5	0.03	14	532.77	3.11	4.35	1.4115625	606,764.603
0.5	0.03	14	487.53	3.13	3.9	1.4115625	508,093.224
0.5	0.03	14	469.46	3.03	3.87	1.4115625	471,126.9
0.5	0.03	14	476.57	3.1	3.18	1.4115625	485,505.454
0.5	0.03	14	520.6	2.59	3.16	1.4115625	579,360.711
0.5	0.03	14	451.6	2	2.63	1.4115625	435,961.943
…	…	…	…	…	…	…	…
…	…	…	…	…	…	…	…
…	…	…	…	…	…	…	…
0.5	0.03	90	425.16	3.02	3.16	1.4115625	386,407.467
0.5	0.03	90	468	2.97	2.63	1.4115625	468,201.089
0.5	0.03	90	512	2.82	3.37	1.4115625	560,377.43

Table 5. Raw material characteristics.

Soil’s Physical Characteristics	Value
Particle size distribution	75% > 80 µm
Specific gravity	2.50
Dry density	0.56 g/cm³
Moisture content	3.55%
Liquid limit	33.50%
Plastic limit	20.30%
Plasticity index	13.20%

BNF Properties	Fine Fibers	Coarse Fibers
Length/diameter	5 cm/50 µm	10 cm/170 µm
Elongation	25%	30%
Modulus	7.5 GPa	8.5 GPa

Table 6. Feature importance.

Feature	Importance (Young’s Modulus)	Importance (Compressive Strength)	Importance (Water Absorption)
Fiber content (%)	0.5535	0.6218	0.4715
Activator (%)	0.2504	0.2227	0.2740
Curing days	0.1961	0.1555	0.2545
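The importances in Table 6 sum to 1 for each target, as normalized importance scores from tree-based ensembles typically do, so each value can be read as a share of the total. The sketch below checks this and ranks the features; the dictionary layout and the function name `rank_features` are illustrative assumptions.

```python
# Importance values copied from Table 6, grouped by predicted property.
importance = {
    "Young's modulus": {
        "Fiber content (%)": 0.5535, "Activator (%)": 0.2504, "Curing days": 0.1961,
    },
    "Compressive strength": {
        "Fiber content (%)": 0.6218, "Activator (%)": 0.2227, "Curing days": 0.1555,
    },
    "Water absorption": {
        "Fiber content (%)": 0.4715, "Activator (%)": 0.2740, "Curing days": 0.2545,
    },
}

def rank_features(scores):
    """Return feature names ordered from most to least important."""
    return sorted(scores, key=scores.get, reverse=True)

for target, scores in importance.items():
    total = sum(scores.values())
    print(f"{target}: total={total:.4f}, ranking={rank_features(scores)}")
```

In all three cases fiber content ranks first, followed by activator content and then curing days.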

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
