Extended Modelling of Molecular Calcium Signalling in Platelets by Combined Recurrent Neural Network and Partial Least Squares Analyses

Tantiwong, Chukiat; Cheung, Hilaire Yam Fung; Dunster, Joanne L.; Gibbins, Jonathan M.; Heemskerk, Johan W. M.; Cavill, Rachel

doi:10.3390/ijms26146820

Open AccessArticle

Extended Modelling of Molecular Calcium Signalling in Platelets by Combined Recurrent Neural Network and Partial Least Squares Analyses

by

Chukiat Tantiwong

^1,2,

Hilaire Yam Fung Cheung

^2,3,

Joanne L. Dunster

¹,

Jonathan M. Gibbins

¹

,

Johan W. M. Heemskerk

^1,4,*

and

Rachel Cavill

^5,*

¹

Institute for Cardiovascular and Metabolic Research (ICMR), School of Biological Sciences, University of Reading, Whiteknights, Reading RG6 6AS, UK

²

Department of Biochemistry, Maastricht University, P.O. Box 616, 6200 MD Maastricht, The Netherlands

³

Institute of Cardiovascular Sciences, College of Medical and Dental Sciences, University of Birmingham, Edgbaston, Birmingham B15 2TT, UK

⁴

Synapse Research Institute Maastricht, Kon. Emmaplein 7, 6217 KD Maastricht, The Netherlands

⁵

Department of Data Science and Knowledge Engineering, Maastricht University, P.O. Box 616, 6200 MD Maastricht, The Netherlands

^*

Authors to whom correspondence should be addressed.

Int. J. Mol. Sci. 2025, 26(14), 6820; https://doi.org/10.3390/ijms26146820

Submission received: 18 December 2024 / Revised: 27 June 2025 / Accepted: 14 July 2025 / Published: 16 July 2025

(This article belongs to the Section Molecular Pathology, Diagnostics, and Therapeutics)

Download

Browse Figures

Versions Notes

Abstract

Platelets play critical roles in haemostasis and thrombosis. The platelet activation process is driven by agonist-induced rises in cytosolic [Ca²⁺]_i, where the patterns of Ca²⁺ responses are still incompletely understood. In this study, we developed a number of techniques to model the [Ca²⁺]_i curves of platelets from a single blood donor. Fura-2-loaded platelets were quasi-simultaneously stimulated with various agonists, i.e., thrombin, collagen, or CRP, in the presence or absence of extracellular Ca²⁺ entry, secondary mediator effects, or Ca²⁺ reuptake into intracellular stores. To understand the calibrated time curves of [Ca²⁺]_i rises, we developed two non-linear models, a multilayer perceptron (MLP) network and an autoregressive network with exogenous inputs (NARX). The trained networks accurately predicted the [Ca²⁺]_i curves for combinations of agonists and inhibitors, with the NARX model achieving an R² of 0.64 for the trend prediction of unforeseen data. In addition, we used the same dataset for the construction of a partial least square (PLS) linear regression model, which estimated the explained variance of each input. The NARX model demonstrated that good fits could be obtained for the nanomolar [Ca²⁺]_i curves modelled, whereas the PLS model gave useful interpretable information on the importance of each variable. These modelling results can be used for the development of novel platelet [Ca²⁺]_i-inhibiting drugs, such as the drug 2-aminomethyl diphenylborinate, blocking Ca²⁺ entry in platelets, or for the evaluation of general platelet signalling defects in patients with a bleeding disorder.

Keywords:

calcium signalling; collagen; neuronal network; platelets; thrombin

1. Introduction

Platelets, derived from megakaryocytes, contribute to haemostasis, thrombosis, and thrombo-inflammation via receptor-induced signalling responses [1,2,3]. Physiologically important receptors are the protease-activated receptors (PAR1/4) for thrombin, the purinergic receptors (P2Y_1/12) for ADP, which signal as G-protein coupled receptors (GPCRs), and the glycoprotein VI (GPVI) receptor for collagen, acting as a protein tyrosine kinase-linked receptor (TKLR) [4]. Since the activation and aggregation of platelets frequently drive arterial thrombotic complications [5], which are prominent causes of death worldwide [6], a clear understanding of the activation process is a must.

In platelets stimulated via GPCRs or TKLRs, rises in cytosolic [Ca²⁺]_i are a common initial event, contributing to essentially all platelet functions [7,8]. The receptor-induced mobilisation of Ca²⁺ from intracellular stores in the endoplasmic reticulum (or dense tubular system) proceeds via inositol 1,4,5-trisphosphate receptors (IP₃Rs), while sarcoplasmic/endoplasmic reticulum Ca²⁺-ATPases (SERCAs) are responsible for the back pumping of Ca²⁺ into the stores (Figure S1) [7,8]. The IP₃R channels are operated via IP₃, which is produced as a result of activation of the GPCRs for thrombin [9] and ADP [10] and upon the activation of GPVI by collagen or collagen-related peptide (CRP) [8].

In the process of store-operated Ca²⁺ entry (SOCE), Ca²⁺ store depletion is coupled to entry of Ca²⁺ from the extracellular medium via Orai1 channels, which open upon interaction with the Ca²⁺ sensor STIM1 (stromal interaction molecule 1) in the endoplasmic reticulum membrane [7]. The back pumping of Ca²⁺ over the plasma membrane occurs via plasma membrane Ca²⁺-ATPases (PMCAs). Furthermore, primary agonists such as thrombin and CRP stimulate the release of autocrine agents, which enhance the Ca²⁺ signalling process. Particularly relevant are the autocrine agents thromboxane A₂ (TxA₂) and ADP, both of which stimulate IP₃ production via GPCRs [11]. Another paracrine-dependent Ca²⁺ entry mechanism is provided by ATP, which activates P2X₁ channels that specifically mediate Ca²⁺ entry [12].

Several pharmacological inhibitors are known to interfere with platelet Ca²⁺ responses. The entry of Ca²⁺ from blood plasma is prevented by the Ca²⁺ chelator EGTA. The back pumping of Ca²⁺ from cytosol to intracellular stores is inhibited by the compound thapsigargin, which accordingly potentiates Orai1-STIM1-dependent entry [7]. The effects of autocrine agents are suppressed by the addition of apyrase (degrading ATP and ADP) and indomethacin (blocking TxA₂ formation). Figure S1 illustrates the actions of these platelet receptors, ligands, inhibitors, and channels relevant for the present study.

The high complexity of Ca²⁺-related signalling in platelets has led to the development of mathematical models, aiming to better understand the process and identify therapeutic targets. Authors have combined the Ca²⁺ fluxes in various platelet compartments into one model based on ordinary differential equations (ODEs) [13]. Even though this system did not include ligand–receptor interactions, it consisted of 34 entities, 35 interactions, and 86 parameters, thus reflecting the complexity of the Ca²⁺ signalling process. An alternative approach presented by Chatterjee and Diamond [14] was to create a neural network model that was trained from the Ca²⁺ response patterns to specific agonists. This neural network, acting as a black box, was able to predict synergistic effects on the Ca²⁺ responses of up to six agonists. A trade-off of the network model was that all the parameters needed to be trained, and, hence, required extensive experimental data. Another limitation of the neural network approach was that it did not predict the contribution of each Ca²⁺ channel and pump to the overall cytosolic [Ca²⁺]_i level.

In the present study, we constructed computational models to predict the magnitude and shape of the [Ca²⁺]_i time curves in platelets in response to collagen, thrombin, and CRP for a given set of experimental conditions in the absence or presence of known inhibitors. We first built two neural network models to predict agonist and inhibitor effects on the [Ca²⁺]_i curves. We then used partial least square (PLS) regression analysis to better understand how specific curve variables contributed to the obtained response. To exclude inter-individual variation, we used a coherent set of Ca²⁺ response curves taken from the platelets of one healthy subject, checked to be representative for five healthy subjects.

2. Results

2.1. Comparing Multiple Agonist-Induced Platelet [Ca²⁺]_i Curves

Using a high-throughput method described before [15], Fura-2-loaded platelets from a representative healthy donor were incubated in the presence of EGTA or CaCl₂ with or without the secondary mediator inhibitors apyrase and indomethacin (AI) and then stimulated with collagen, CRP, or thrombin. Under these various different conditions, agonist-induced rises in [Ca²⁺]_i were measured as nM concentrations over a time period of 540 s. By also varying the agonist concentrations, curves were obtained for over 70 different experimental conditions. By comparing analogous sets of collagen-, thrombin-, and CRP-activated [Ca²⁺]_i curves from five healthy subjects [15], subject 1 was taken as representative for all (Figure S2). As shown in a comparative heatmap, the overall concordance between subjects was high per condition (experiment) and between conditions of the [Ca²⁺]_i rises, with a mean coefficient of variation of 29%. Accordingly, for the present analysis, we used a coherent set of 72 [Ca²⁺]_i time curves obtained with the platelets from subject 1 (Figure 1).

Comparing the set of original traces (Figure S3), typical characteristics were observed, in addition to the expected agonist dose dependency [15]. In general, the [Ca²⁺]_i curves induced by the weak GPVI agonist collagen showed steady increases with lower maximal amplitudes (Exp. 9–31) when compared to the higher amplitude and often biphasic [Ca²⁺]_i rises induced by the strong GPVI agonist CRP (Exp. 37–48). On the other hand, the curves with the PAR1/4 agonist thrombin (Exp. 49–72) often had a transient shape, indicating high activity of the SERCA Ca²⁺ pumps. Other differences included amplitude traces up to four times higher (depending on other variables) in the presence of CaCl₂ compared to EGTA, which can be explained by Orai1-dependent Ca²⁺ entry [15]. Furthermore, we confirmed potent [Ca²⁺]_i increases with CaCl₂ and the SERCA inhibitor thapsigargin, stimulating SOCE and activating the Orai1 channels [7]. The autocrine inhibitors indomethacin and apyrase (IA), in general, lowered most of the curves.

2.2. Workflow of the Modelling Approaches

To prepare the raw experimental data for processing, we first interpolated and smoothed the 72 curves at 1 s time intervals (Figure S4), and then y-axis scaled each curve in the range of 0–1 (Figure S5). The subsequent workflow (Figure 2) consisted of feature generation by combining and squaring the experimental variables (see below) and splitting the curves into training, validation, and test sets. The processed curves were then used as inputs for two types of modelling, i.e., a neural network and a PLS (partial least squares) method. Using neural network analysis, we performed an NARX (non-linear autoregressive network with exogenous input) procedure for trend prediction and an MLP (multilayer perceptron) procedure for magnitude prediction. The combined network was then tested on overall performance. Furthermore, we used PLS regression analysis to comparatively model scalar characteristics of the curves. The results from these approaches were interpreted and cross-checked with each other.

2.3. MLP Network for Magnitude Prediction

We first aimed to understand how the smoothed [Ca²⁺]_i curves of the platelets relied on the chosen experimental conditions (CaCl₂/EGTA, agonist, dose, AI, and thapsigargin). For this purpose, we generated a simple network able to predict the magnitude of the Ca²⁺ signal. After the training and validation of the constructed MLP network, it appeared that the best architecture had three nodes with one hidden layer (Figure 3A). We then generated plots to compare the observed data with the predictions in linear of log scale. In the plots, each data point represented an experimental and predicted magnitude value (Figure S6). The MLP parameters associated with each node are shown in Figure S7, also providing the relative weights of combined and squared parameters. The output plots indicate a reasonable fitting, especially in the log-scale setting. We concluded that the MLP approach provided suitable predictions of curve magnitudes, although this procedure did not predict curve shapes.

2.4. Neural NARX Network for Trend Prediction

To predict the shape or trend of the [Ca²⁺]_i curves, we applied a uniform amplitude scaling of 0–1. The developed NARX method was then used for prediction modelling of the scaled curves. For training of the NARX network, we choose 58 scaled curves (Figure S7), which resulted in the best network architecture (mean R² = 0.84) with three hidden layers and 4 × 12 × 4 nodes (Figure 3B). The suitability of this network was confirmed using a validation set of seven curves (mean R² = 0.71) (Figure S8). Subsequent application of a test set with seven curves resulted in a lesser fitting (R² = 0.64), explained by the transiency induced by thrombin (Figure 4). In comparison, the testing of unscaled curves in the MLP network resulted in a good prediction, especially for the high-magnitude curves.

The NARX predictions provided information on the non-linear shapes of some of the [Ca²⁺]_i curves. Examining the R² trend values, it appeared that these were negative for Exp. 58 (Figure S9) and Exp. 63 (Figure 4). This indicated an explained variance worse than random, and, hence, the inability of fitting. Furthermore, other conditions with thrombin as an agonist (Exp. 67, 70, and 72) gave a relatively low R² < 0.4. This can be explained by the transiency of several thrombin-induced [Ca²⁺]_i rises. The apparently additive information from either procedure prompted us to integrate the neural network results of magnitude and trend prediction.

2.5. Combining the MLP and NARX Networks

For a combined network prediction, we used a training set of 58 curves (Figure S10). The initial training with respect to magnitude and trend predictions was validated and tested using the remaining 14 curves (Figure S11). This combined modelling resulted in an improved outcome. We then performed one-at-a-time (OAT) factor analysis by varying the agonist concentrations from 1 to 10% of maximum at different inhibitor combinations, presenting the results as magnitude curves (Figure 5A) and heatmaps of scalar characteristics (Figure 5B).

The OAT prediction in Figure 5A indicated that the magnitude of the ‘no inhibitor’ [0 0 0] condition was more changed with the thrombin concentration when compared to collagen or CRP. The presence of EGTA reduced the magnitude prediction mostly with collagen and CRP. The thrombin concentration showed the highest magnitude effects in both the absence or presence of inhibitors. Furthermore, thapsigargin increased the overall magnitude effect at different agonist concentrations, particularly in the absence of EGTA (i.e., with CaCl2). Overall, this OAT analysis pointed to a high sensitivity of the [Ca²⁺]_i curves in the order of thrombin > CRP > collagen.

Regarding the scalar curves, we compared three indications for non-linearity, i.e., tmax, ylast, and absdev (Figure 6). The [Ca²⁺]_i peak time (tmax) provided information on early curve saturation (<540 s). The parameter ylast indicated curve transiency when <1, whilst absdev informed on the extent of non-linearity. The heatmaps in Figure 5B show similar trends for ylast and tmax, particularly for thrombin in the absence of thapsigargin. Thus, the prediction indicated that the non-linear curve pattern with thrombin extended to higher agonist concentrations. This transiency was not seen for collagen or CRP. The analysis of absdev showed that most of the thrombin curves were non-monotonic, except for conditions in which thapsigargin and CaCl₂ were present, i.e., resulting in more linear curves (Figure 5B). For all inhibitor conditions, the predicted absdev values for collagen and CRP were in the same lower range. Accordingly, the scaled curve characteristics informed on the Ca²⁺ response patterns at low agonist doses in the absence or presence of CaCl₂, AI, or thapsigargin. Examination of the unscaled curves showed a better picture of the prediction at low agonist doses (Figure S12). The model thereby predicted that already low concentrations of thrombin (<0.2 nM), CRP (<0.2 μg/mL), or collagen (<0.5 μg/mL) produced relevant [Ca²⁺]_i rises, i.e., even below the doses inducing platelet aggregation.

From combining the tested MLP and NARX networks, several conclusions can be drawn. The transient [Ca²⁺]_i curves with thrombin required a different modelling approach than the non-transient responses obtained with other agonists. For the weak GPVI agonist collagen and the strong agonist CRP, the scaling approach showed monotonic curves, being close to linear at low agonist concentrations. Further, the combined modelling indicated additive effects of EGTA (Exp. 36) and AI (Exp. 42) for CRP, of which the former was stronger.

2.6. Partial Least Square (PLS) Regression Analyses

As an integrative approach, we then investigated how each of the experimental variables contributed to the scalar curve characteristics tmax, ylast, and absdev. For this purpose, we used PLS regression analysis as a linear model, directly assessing the impacts of all input variables.

As inputs for the PLS model, we normalised the experimental conditions to six variables, i.e., agonist type, concentration, EGTA/CaCl₂, AI, and thapsigargin, which all varied from zero (none) to one (maximum). This resulted in a six-component model. As indicated in Figure S13, only the first two components contributed to the variance of the target. We then fitted the two-component PLS regression for the curves of magnitude, tmax, ylast, and absdev using the training set of 58 experimental conditions, while keeping the remaining 14 conditions (previous validation and test sets) as test set of the PLS model. Regression analysis was then used to predict the scalar characteristics of the test set, which overall showed a good or underestimated fit, but also gave errors for the training and test sets, which pointed to partial overfitting (Figure S14).

It appeared that the first PLS component of the magnitude prediction had a negative loading in the presence of EGTA and/or AI (Figure 7A), in agreement with the lower levels of [Ca²⁺]_i reached. On the other hand, the presence of thapsigargin resulted in a highly positive loading, linked to the increased [Ca²⁺]_i levels. These opposite loading coefficients reflect that the presence of EGTA prevented the entry of extracellular Ca²⁺, whereas thapsigargin increased this process by inhibiting the SERCA-type Ca²⁺ pumps controlling the STIM1-Orai1 Ca²⁺ entry pathway [15]. The agonists CRP > thrombin had strong positive predictions in component 2, indicating that these variables differed from the inhibitor effects. Furthermore, the ylast and tmax predictions showed opposite loadings in component 1 for thrombin (negative) and thapsigargin (positive) (Figure 7B,C). This reflects the transient, non-linear Ca²⁺ responses observed with thrombin in comparison to the continuously rising curves with thapsigargin. Regarding the absdev prediction, the thrombin condition showed a particularly high positive weight in component 2, in contrast to the negative loading in component 1 for thapsigargin (Figure 7D), also as a consequence of the different curve shapes. Accordingly, the four PLS regression models provided valuable information on the relations between curve sizes and shapes across conditions.

For validation of the combined model, based on magnitude, trend, and scalar predictions of the characteristics of agonist-induced [Ca²⁺]_i time curves, we evaluated the effects of a drug, 2-aminomethyl diphenylborinate (2APB), previously identified as a potent inhibitor of the STIM1-Orai1 Ca²⁺ entry pathway [15]. For this purpose, we generated 16 sets of curves with the variables collagen, thrombin, CRP, thapsigargin, and EGTA/CaCl₂ both in the presence and absence of 2APB. For convenient and logistic reasons, we used only the high agonist concentrations. The raw curves, representative for platelets from three subjects, are provided in Figure S15A. The application of PLS regression analyses using this 16-fold dataset provided interesting insights into platelet Ca²⁺ signalling for all four curve characteristics, including magnitude, tmax, ylast, and absdev, supporting the known action mechanism of the drug. Examining the first two PLS components, the opposite loadings of EGTA and thapsigargin were retained in the magnitude and tmax predictions, in both the absence and presence of the drug (Figure S15B(a,b)). The transiency of the traces with thrombin again appeared as separate dots in the ylast and absdev predictions (Figure S15B(c,d)). Importantly, the PLS linear regression analyses also pointed to a high similarity of the curve profiles in the absence or presence of the drug 2APB, showing highly similar loadings for all variables (Figure S15B(i,ii)). In addition, when the drug was introduced as an additional variable, the loadings in components 1–2 for “Drug” and “EGTA” were close to each other (Figure S15B(iii)). This indicated that, in general, the drug 2APB did not affect the overall curve shapes, but approached the conditions with EGTA, hence confirming its action mechanism as a Ca²⁺ entry blocker regardless of the presence of agonist or other inhibitor.

3. Discussion

The combined modelling approaches presented here introduce a new way to predict the response size and pattern of agonist-induced platelet Ca²⁺ responses under a great variety of conditions. The constructed MLP and NARX neural networks were able to produce mostly correct magnitude curves for [Ca²⁺]_i, whereas the modelling by PLS regression captured the characteristic curve shapes. Our work thereby adds to the idea of a platelet Ca²⁺ calculator introduced by Diamond and colleagues [14], in that, now, curve patterns can also be predicted without mathematical modelling. However, we did not consider the synergistic effects of agonist combinations such as those presented in that study.

It is important to note that, while the present machine learning techniques were able to fit most of the input data, the obtained output did not give a direct biological interpretation. This is in contrast to modelling approaches based on biological concepts, such as enzyme and receptor reaction rates in ODE-based kinetic models. However, the latter approaches cannot easily capture the complex interactions between signalling steps, for instance due to combinations of agonists and inhibitors.

Both the NARX network and PLS regression modelling yielded useful results for understanding the variation in [Ca²⁺]_i curves. The magnitude differences between curves in the presence of EGTA or CaCl₂ (due to Ca²⁺ entry into the platelets) were well captured by the MLP and PLS regression models. The prediction results—i.e., sensitivity for MLP and components 1/2 for PLS—were well interpretable for this variable. On the other hand, NARX outperformed in capturing some curve variables. Thus, the subtle curve magnitude and shape effects (tmax and absdev) induced by thapsigargin were captured by NARX, but not by PLS regression. This illustrates that neural networks such as NARX can easily handle non-linear effects due to their complex activation functions, whereas PLS relies on linear regression analysis.

A specific limitation encountered was the shape differences in the [Ca²⁺]_i curves used for training approaches, i.e., more often transient with thrombin and non-transient with CRP or collagen. Although neural networks can capture any function, they need sufficient data to train for such curve differences. In our case, a limited number of curves per agonist was available for training, which caused an imbalance in this set. One way to fix this problem is to use data augmentation, for example, by a synthetic minority oversampling technique [16].

In the present paper, we used the platelets from a single donor for training all models, which allowed for a detailed investigation of the complex Ca²⁺ signalling pathways involved. We chose this approach because [Ca²⁺]_i curve aspects such as magnitude and shape often vary between blood donors [14]. However, as shown in Figure S2, it was checked for the majority of curves that the chosen subject was representative for four other healthy subjects. On the other hand, the use of blood from a single donor can be seen as a limitation, because the amount of obtained platelets reduced the number of variable experimental conditions and, accordingly, the machine learning models had a limited predictive power. These models can now be used to generate hypotheses for additional experimentation and provide insights that are otherwise not obtained by traditional analytical approaches. Appropriate use is important, ensuring that the data used for training are representative, while independent data are available for validation. However, comparing the platelet responses from a large cohort of healthy donors will increase the accuracy of overall predictions, ultimately aiming to more easily identify systematic aberrations in donors with suspected platelet bleeding disorders. Conversely, the current predictions of [Ca²⁺]_i rises with multiple agonists offer a foundation for estimating the thresholds for platelet activation (OAT) and for testing the effects of new antithrombotic drugs, directly or indirectly targeting platelet Ca²⁺ responses (PLS). Another application could be effect prediction in patients with gain- or loss-of-function mutations in genes encoding for Ca²⁺ response modulators, such as STIM1 and ORAI1 [17].

A solution to this issue is the approach of transfer learning [18], in which a generic model is built for samples from various donors and then refined to obtain adjusted weights per donor. This approach has already been used to build personalised models for drug development [19]. Regardless of the approach followed, modelled analysis will be important to understand the effects of clinically relevant inhibitors of Ca²⁺ signalling pathways, such as P2X₁ Ca²⁺ channel antagonists [20]. In this paper, we examined this for a drug blocking the clinically important STIM1-Orai1 pathway [20], namely 2APB. The PLS regression analyses performed well, capturing the curve size and shape effects of this drug and giving loadings in the models resembling the condition “EGTA”, with no Ca²⁺ entry.

Differently from the neural network models, the PLS regression analysis performed better with the available sample size. The present PLS regression analysis to predict the (scaled) [Ca²⁺]_i curve features would easily allow for comparisons with platelets from more donors. In work by the Diamond laboratory [14], a NARX model was generalised by fitting networks constructed from several donors and determining their average prediction. Our analysis indicates that this can be conducted more easily by PLS regression techniques.

4. Methodology

4.1. Materials

Human α-thrombin was obtained from Kordia (Leiden, The Netherlands); cross-linked collagen-related peptide (CRP-XL) from the University of Cambridge (UK); Fura-2 acetoxymethyl ester from Invitrogen (Carlsbad, CA, USA); and Pluronic F-127 from Molecular Probes (Eugene, OR, USA). Horm-type collagen was obtained from Nycomed (Hoofddorp, The Netherlands). 2-Aminomethyl diphenylborinate (2APB) came from Sigma-Aldrich (St. Louis, MO, USA). Other materials were from sources described before [21].

4.2. Blood Collection and Platelet Preparation

This study was approved by the Medical Ethics Committee of Maastricht University. Blood donor age and sex could not be recorded. Blood taken into 3.2% sodium citrate (Vacuette tubes, Greiner Bio-One, Alphen a/d Rijn, The Netherlands) was obtained from consenting healthy volunteers who had not taken anti-platelet medication in the previous ten days. Platelet counts were within the reference range.

Platelet-rich plasma (PRP) was obtained from citrated blood by centrifuging, after which collected platelets were washed in the presence of apyrase (1 unit/mL) and loaded with Fura-2 acetoxymethyl ester (3 µM) and Pluronic (0.4 µg/mL) at a count of 2 × 10⁸/mL for 40 min at room temperature, as described before [22]. The isolated platelets were finally resuspended at a concentration of 2 × 10⁸/mL in Hepes buffer at pH 7.45 (10 mM Hepes, 136 mM NaCl, 2.7 mM KCl, 2 mM MgCl₂, 5.5 mM glucose, and 0.1% bovine serum albumin).

4.3. Calibrated Cytosolic Ca²⁺ Measurements

In the Fura-2-loaded platelets, changes in cytosolic [Ca²⁺]_i were measured in 96-well plates with a FlexStation 3 (Molecular Devices, San Jose, CA, USA), as previously described [22]. When desired, the platelets in the wells were pretreated with apyrase (0.1 unit/mL) plus indomethacin (20 µM), or with thapsigargin (1 µM) for 10 min. After the addition of either 0.1 mM EGTA or 1 mM CaCl₂, the platelets were stimulated by automated pipetting with one of the following agonists: CRP (1 or 10 µg/mL), collagen (1, 3, 10, or 30 µg/mL), thrombin (0.3, 1, 3, or 10 nM), or none of these (vehicle controls). In wells per row, changes in Fura-2 fluorescence were measured quasi-simultaneously over time at 37 °C by ratiometric fluorometry, including appropriate calibrator controls for obtaining nM concentrations of [Ca²⁺]_i [22]. For the independent testing of pharmacological drugs known to affect SOCE, the platelets were preincubated with 2APB (30 μM), as studied and titrated before [15,23]; the agonist concentrations were maximal: CRP 10 μg/mL or thrombin 10 nM.

4.4. Selection of Platelet [Ca²⁺]_i Curves for Modelling

For the majority of experimental conditions, the Ca²⁺ responses were studied in Fura-2-loaded platelets obtained from 5 healthy donors, thus resulting in calibrated time series of nM [Ca²⁺]_i [15]. For the present modelling approach, a complete set of 72 time curves was taken from subject 1 and checked to be representative for those of all subjects (Figure S2). In Figure 1, the chosen experiments for model validation and testing are highlighted in blue and red, respectively, based on criteria indicated below.

4.5. Preparation of Input Data

The raw curves of [Ca²⁺]_i changes in platelets stimulated with CRP or collagen had a sampling time of 4 s, while those with thrombin had a sampling time of 2 s. To allow for direct comparisons, the raw nM values (Figure S3) were linearly resampled and interpolated to generate 1 s time steps from 0 s to 540 s. To minimise noise disturbances, the curves were smoothed with a Savitzky–Golay filter (Figure S4).

In cases where scaling was needed, the smoothed curves were subjected to a min–max scaling algorithm, giving values between 0 and 1. To scale the input conditions, experimental variables were set as [0, 1], except for the agonist concentrations, which were scaled in the range of [0, 10] (Figure 1). Herein, 0 indicated no agonist or inhibitor present.

For constructing the multilayer perceptron (MLP) network, a regression model was built using the magnitudes of all [Ca²⁺]_i time series. The experimental variables were taken as inputs (Figure 3A), while the mean square error was used as a cost function. This ensured a better fit for the larger values. For this purpose, we set the target (output) for the model as log-scaled values of the nM [Ca²⁺]_i range as log₁₀(max − min). This improved the overall accuracy of log scales.

Considering that the number of total features was small with 6 experimental variables (Figure 1), we also generated polynomial features (quadratic feature combinations), which increased this number from 6 to 27. For the MLP network, the number of hidden layers was set to 1, while the number of nodes was randomly selected from 1 to 10. The network architecture options were chosen as to train only low numbers of parameters to prevent overfitting. Networks were trained 100 times, starting from random weights. As the best structure, the network with a minimal score in the cost function of the validation set was taken. Network training was performed using the Levenberg–Marquardt algorithm, containing a rectified linear unit as the activation function in each node. The modelling was conducted using Matlab R2022a and the Neural Network Toolbox.

4.6. Trend Prediction of NARX Network

A separate neural network was constructed to predict the trends (shapes) of smoothed and scaled [Ca²⁺]_i time curves. To better capture the time dynamics, we chose a non-linear autoregressive network with exogenous input (NARX) and parallel architecture [24,25], which is also known as a closed-loop neural network. For this NARX network, the model’s output y(t) was used to fit the target (i.e., the smoothed and scaled [Ca²⁺]_i curves). The output then generated feedback as an additional input to the network when combined with the experimental condition (Figure 3B). The mathematical expression for [Ca²⁺](t) is then written as follows:

y (t) = f (L_{4} \times f (H_{3} \times y_{h} + L_{3} \times f (H_{2} \times y_{h} + L_{2} \times f (H_{1} \times y_{h} + W \times I + b_{1}) + b_{2}) + b_{3}) + b_{4})

(1)

where y(t) is [Ca²⁺]_i over time, I is an input matrix of the experimental conditions, and y_h is the feedback delay (history) of y. Furthermore, W and H_n are the input matrix weight and feedback delay of y, respectively; b_n are biases; L_n are the weights of each hidden layer; and f is the activation (transfer) function. Note that the product of the matrix is also a matrix, meaning that the equation represents a summation of numerous parameters and functions.

For feedback delays, we chose the values at 1, 3, 6, 10, 15, 21, 28, and 36 s prior to the current value of a [Ca²⁺]_i time series. Hence, these feedback delays kept the information about current values, while preserving the long-term memory of the system. The initial values of the feedback delays were set to zero, as the system was assumed to be in a steady state prior to the agonist-induced activation of platelets. The use of MSE as a cost function allowed us to make predictions of the scaled min–max [Ca²⁺]_i time series. Scaling was performed per time series, implying that each series had the same range [0, 1]. Polynomial features were used also in this network, thus expanding the number of inputs from 6 to 27.

The neural network architecture was optimised to maximise the goodness of fit but to prevent overfitting. We used three hidden layers, with each layer’s size varying between 2 and 20 nodes (not including feedback delays). This gave approximately 7000 different architectures being trained. A randomised grid search was employed to find the best architecture. For training, the Levenberg–Marquardt algorithm was used with a hyperbolic tangent sigmoid as an activation function. Since parameter fitting in the neural network depended on a random seed, each architecture was fitted 100 times, after which the best parameters were used for comparison. The networks were built and trained in Matlab R2022a.

4.7. Parameter Sensitivity Analysis

To perform agonist concentration sensitivity analysis, the method of one-at-a-time (OAT) factor was applied [26]. This kept the variables fixed to the central or baseline value, while changing one variable at a time. Since effects were computed with reference to the same central point in space, this improved the comparability of the outcomes. As default, we set the conditions of EGTA or CaCl₂, autocrine inhibitors (AI) or not, and thapsigargin or not as 1 or 0 (2³ = 8 combinations). Furthermore, we scaled the agonist concentration from 0 to 10% of the maximal concentrations (30 μg/mL collagen, 10 μg/mL CRP, or 10 nM thrombin). The shape of each [Ca²⁺]_i time curve was defined according to four scalar characteristics, namely the magnitude of the response, peak time, relative terminal level, and the mean deviation from a straight line (Figure 2).

4.8. Partial Least Square (PLS) Regression Analysis

Regression analysis with PLS was used as an extension of principal component analysis [27,28], which maximises the covariance between an input matrix X and output matrix Y. In this method, each component has a latent variable t_i, while the linearly weighted combination of the latent variables generates the prediction of outcomes (Y matrix), as follows:

Y = C₁t₁ + C₂t₂ + …, where C_i = a_1ix₁ + a_2ix₂

(2)

The experimental conditions of Figure 1 were used as the X matrix and the scalar characteristics of a [Ca²⁺]_i time series were used as the Y matrix. The number of components in the PLS analysis was taken from the optimal variance achieved. The loading weights depended on the input variables that contributed most to the prediction. By maximising the covariance between explanatory variable X and response variable Y, the most relevant components in X were obtained for changes in Y. Stated otherwise, by examining the loading weights of a few latent variables accounting for most of the explained covariance, we could identify the experimental conditions with the most significant impact on the [Ca²⁺]_i time curves.

5. Conclusions

Of the two developed non-linear models, a multilayer perceptron (MLP) network and an autoregressive network with exogenous inputs (NARX), the trained networks accurately predicted platelet [Ca²⁺]_i curves in the presence of combinations of agonists and inhibitors. The NARX model achieved good results for the trend prediction of unforeseen data. Furthermore, the NARX model demonstrated good fits for the modelled calcium curves, whereas the PLS regression models gave useful interpretable information on the importance of each variable. These modelling results are suitable for the development of novel platelet [Ca²⁺]_i-inhibiting drugs, as we demonstrated for the drug 2APB, blocking agonist-induced Ca²⁺ entry in platelets.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms26146820/s1.

Author Contributions

Conceptualisation, methodology, and formal analysis, C.T., J.L.D., and R.C.; investigation, C.T. and H.Y.F.C.; resources and supervision, J.M.G., J.W.M.H., and R.C.; data curation, C.T. and H.Y.F.C.; writing—original draft preparation, C.T. and R.C.; writing—review and editing, C.T., J.W.M.H., and R.C.; funding acquisition, J.M.G. and J.W.M.H.; manuscript revision, C.T., J.W.M.H., and J.L.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the European Union’s Horizon 2020 research and innovation program under the Marie Sklodowska-Curie grant agreement No. 766118 to all co-authors. C.T. was enrolled in a joint PhD program at the Universities of Maastricht (The Netherlands) and Reading (United Kingdom). H.Y.F.C. was enrolled in a joint PhD program at the Universities of Birmingham (United Kingdom) and Maastricht (The Netherlands).

Institutional Review Board Statement

The study was approved by the local Medical Ethics Committees (Maastricht University Medical Centre, NL31480.068.10, 29 May 2013). All subjects gave full informed consent according to the Declaration of Helsinki, and all methods were performed in accordance with the relevant guidelines and regulations.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study. According to ethical permission, all subjects gave blood without tracing samples to certain individuals.

Data Availability Statement

The data are included in the manuscript as figures, tables, or Supplementary Materials.

Conflicts of Interest

J.W.M.H. is an advisor of the Synapse Research Institute Maastricht. The other authors declare no relevant conflicts of interest.

References

Patel, S.R.; Hartwig, J.H.; Italiano, J.E. The biogenesis of platelets from megakaryocyte proplatelets. J. Clin. Investig. 2005, 115, 3348–3354. [Google Scholar] [CrossRef]
Chesterman, C.; Owe-Young, R.; Macpherson, J.; Krilis, S. Substrate for endothelial prostacyclin production in the presence of platelets exposed to collagen is derived from the platelets rather than the endothelium. Blood 1986, 67, 1744–1750. [Google Scholar] [CrossRef] [PubMed]
Davis, G.E.; Senger, D.R. Endothelial extracellular matrix: Biosynthesis, remodelling, and functions during vascular morphogenesis and neovessel stabilisation. Circ. Res. 2005, 97, 1093–1107. [Google Scholar] [CrossRef] [PubMed]
Van der Meijden, P.E.; Heemskerk, J.W. Platelet biology and functions: New concepts and clinical perspectives. Nat. Rev. Cardiol. 2019, 16, 166–179. [Google Scholar] [CrossRef]
Jerjes-Sánchez, C. Venous and arterial thrombosis: A continuous spectrum of the same disease? Eur. Heart J. 2005, 26, 3–4. [Google Scholar] [CrossRef] [PubMed]
Jackson, S.P. Arterial thrombosis-insidious, unpredictable and deadly. Nat. Med. 2011, 17, 1423–1436. [Google Scholar] [CrossRef]
Mammadova-Bach, E.; Nagy, M.; Heemskerk, J.W.; Nieswandt, N.; Braun, A. Store-operated calcium entry in blood cells in thrombo-inflammation. Cell Calcium 2019, 77, 39–48. [Google Scholar] [CrossRef]
Versteeg, H.H.; Heemskerk, J.W.; Levi, M.; Reitsma, P.S. New fundamentals in hemostasis. Physiol. Rev. 2013, 93, 327–358. [Google Scholar] [CrossRef]
Watson, S.P.; McConnell, R.T.; Lapetina, E.G. The rapid formation of inositol phosphates in human platelets by thrombin is inhibited by prostacyclin. J. Biol. Chem. 1984, 259, 13199–13203. [Google Scholar] [CrossRef]
Daniel, J.L.; Dangelmaier, C.A.; Selak, M.; Smith, J.B. ADP stimulates IP₃ formation in human platelets. FEBS Lett. 1986, 206, 299–303. [Google Scholar] [CrossRef]
Capra, V.; Bäck, M.; Angiolillo, D.J.; Cattaneo, M.; Sakariassen, K.S. Impact of vascular thromboxane prostanoid receptor activation on hemostasis, thrombosis, oxidative stress, and inflammation. J. Thromb. Haemost. 2014, 12, 126–137. [Google Scholar] [CrossRef]
Oury, C.; Toth-Zsamboki, E.; Thys, C.; Tytgat, J.; Vermylen, J.; Hoylaerts, M.F. The ATP-gated P2X₁ ion channel acts as a positive regulator of platelet responses to collagen. Thromb. Haemost. 2001, 86, 1264–1271. [Google Scholar]
Dolan, A.T.; Diamond, S.L. Systems modelling of Ca²⁺ homeostasis and mobilisation in platelets mediated by IP₃ and store-operated Ca²⁺ entry. Biophys. J. 2014, 106, 2049–2060. [Google Scholar] [CrossRef] [PubMed]
Chatterjee, M.S.; Purvis, J.E.; Brass, L.F.; Diamond, S.L. Pairwise agonist scanning predicts cellular signalling responses to combinatorial stimuli. Nat. Biotechnol. 2010, 28, 727–732. [Google Scholar] [CrossRef] [PubMed]
Cheung, H.Y.; Zou, J.; Tantiwong, C.; Fernández, D.I.; Huang, J.; Ahrends, R.; Roest, M.; Cavill, R.; Gibbins, J.M.; Heemskerk, J.W. High-throughput assessment identifying major platelet Ca²⁺ entry pathway via tyrosine kinase-linked and G protein-coupled receptors. Cell Calcium 2023, 112, 102738. [Google Scholar] [CrossRef] [PubMed]
Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artific. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
Shawer, H.; Norman, K.; Cheng, C.W.; Foster, R.; Beech, D.J.; Bailey, M.A. ORAI1 Ca²⁺ channel as a therapeutic target in pathological vascular remodelling. Front. Cell Dev. Biol. 2021, 9, 653812. [Google Scholar] [CrossRef]
Neyshabur, B.; Sedghi, H.; Zhang, C. What is being transferred in transfer learning? In Proceedings of the 34th International Conference on Neural Information Processing Systems, Vancouver BC, Canada, 6–12 December 2020. [Google Scholar]
Dana, D.; Gadhiya, S.V.; Surin, L.G.; Li, D.; Naaz, F.; Ali, Q.; Paka, L.; Yamin, M.A.; Narayan, M.; Goldberg, I.D.; et al. Deep learning in drug discovery and medicine; scratching the surface. Molecules 2018, 23, 2384. [Google Scholar] [CrossRef]
Bennetts, F.M.; Mobbs, J.I.; Ventura, S.; Thal, D.M. The P2X₁ receptor as a therapeutic target. Purinergic Signal. 2022, 18, 421–433. [Google Scholar] [CrossRef]
Gilio, K.; Munnix, I.C.; Mangin, P.; Cosemans, J.M.; Feijge, M.A.; van der Meijden, P.E.; Olieslagers, S.; Chrzanowska-Wodnicka, M.B.; Lillian, R.; Schoenwaelder, S.; et al. Non-redundant roles of phosphoinositide 3-kinase isoforms α and β in glycoprotein VI-induced platelet signalling and thrombus formation. J. Biol. Chem. 2009, 284, 33750–33762. [Google Scholar] [CrossRef]
Jooss, N.J.; De Simone, I.; Provenzale, I.; Fernández, D.I.; Brouns, S.L.; Farndale, R.W.; Henskens, Y.M.; Kuijpers, M.J.; ten Cate, H.; van der Meijden, P.E.; et al. Role of platelet glycoprotein VI and tyrosine kinase Syk in thrombus formation on collagen-like surfaces. Int. J. Mol. Sci. 2019, 20, 2788. [Google Scholar] [CrossRef]
Zou, J.; Zhang, P.; Solari, F.A.; Schönichen, C.; Provenzale, I.; Mattheij, N.J.; Kuijpers, M.J.; Rauch, J.S.; Swieringa, F.; Sickmann, A.; et al. Suppressed ORAI1-STIM1-dependent Ca²⁺ entry by protein kinase C isoforms regulating platelet procoagulant activity. J. Biol. Chem. 2024, 300, 107899. [Google Scholar] [CrossRef]
Xie, H.; Tang, H.; Liao, Y. Time series prediction based on NARX neural networks: An advanced approach. In Proceedings of the 2009 International Conference on Machine Learning and Cybernetics, Hebei, China, 12–15 July 2009. [Google Scholar]
Hewamalage, H.; Bergmeir, C.; Bandara, K. Recurrent neural networks for time series forecasting: Current status and future directions. Int. J. Forecast. 2021, 37, 388–427. [Google Scholar] [CrossRef]
Razavi, S.; Gupta, H.V. What do we mean by sensitivity analysis? The need for comprehensive characterisation of global sensitivity in earth and environmental systems models. Water Resourc. Res. 2015, 51, 3070–3092. [Google Scholar] [CrossRef]
Wold, S.; Sjöström, M.; Eriksson, L. PLS-regression: A basic tool of chemometrics. Chemom. Intell. Lab. Syst. 2001, 58, 109–130. [Google Scholar] [CrossRef]
Abdi, H. Partial least squares regression and projection on latent structure regression (PLS regression). Wiley Interdiscip. Rev. Comput. Stat. 2010, 2, 97–106. [Google Scholar] [CrossRef]

Figure 1. Assignment matrix of variables of 72 numbered experimental conditions. Calibrated curves of Fura-2-loaded platelets from one donor, representative for 5 donors, were used as input data. The conditions highlighted in blue (solid borders) were used as validation set, and those in red (dashed borders) were used as test set. Abbreviations: Col, collagen (μg/mL); CRP, collagen-related peptide (μg/mL); Thr, thrombin (nM); EG, EGTA: 0.1 mM if assigned to 1 or 1 mM CaCl₂ if assigned to 0; AI, apyrase (0.1 U/mL) plus indomethacin (20 μM) if assigned to 1; Thap, thapsigargin (1 μM) if assigned to 1.

Figure 2. Workflow used for the data processing, neural network construction, and scalar model development. For explanation, see text.

Figure 3. Construction of two neural networks. (A) Setup of MLP network as a fully connected feedforward neural network, which was used for magnitude prediction of the [Ca²⁺]_i time curves. (B) Setup of closed-loop non-linear autoregressive network with exogenous inputs (NARX), which was developed as a recurrent neural network for the trend prediction of [Ca²⁺]_i time curves.

Figure 4. Test results of the NARX network to predict curve shapes. (A–G) Testing of trend prediction of 0–1 scaled [Ca²⁺]_i curves. Experiments of the test set are shown in Figure 1. Red solid lines = experimental values, blue dashed lines = predicted values. Indicated per condition are the calculated R² values (a negative R² indicates an explained variance worse than random). (H) For comparison, results from the same test set are given as obtained by the MLP network. Shown here are the target and predicted nM [Ca²⁺]_i levels in log scale.

Figure 5. Combined effects of magnitude and trend prediction of platelet [Ca²⁺]_i curves at varying agonist concentrations. (A) Panels of prediction efficacy of scaled curves per agonist concentration. Lightest grey lines represent basal levels, while darker lines point to predicted curves in the presence of agonist at 1–10% of the maximum concentration in the training set. Columns show conditions with indicated agonists, collagen (Col), CRP, or thrombin (Thr). Rows represent different inhibitor conditions: + or − mean presence or not; from top to bottom: EGTA, apyrase plus indomethacin (AI), and thapsigargin (Thap). (B) Sensitivity characteristics of the scalar curves generated by MLP and NARX. Columns indicate: [Ca²⁺]_i level at log10 base (magnitude), time of [Ca²⁺]_i (tmax), final [Ca²⁺]_i level (ylast), and mean deviation from linear (absdev).

Figure 6. Scalar characteristic of the [Ca²⁺]_i time curves. Scaling resulted in the following parameters: magnitude (nM) points to the maximal minus minimal value of a curve; tmax refers to the time point where the maximal value is reached, scaled by time range (540 s). The parameter ylast indicates the end value, scaled according to the magnitude; absdev represents the mean deviation of the time curve from linear (red line). The deviation from this line (green) was calculated per time point, in which absdev means the average of all deviations.

Figure 7. Loading coefficients of experimental variables in the PLS regression analysis. PLS regression analysis was performed for the prediction of curve magnitude (A), tmax (B), ylast (C), and absdev (D). Colours indicate contributions per variable. Plots show for two principal components the contribution of six experimental variables (collagen dose, CRP dose, thrombin dose, EGTA/CaCl₂, AI, thapsigargin).

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Tantiwong, C.; Cheung, H.Y.F.; Dunster, J.L.; Gibbins, J.M.; Heemskerk, J.W.M.; Cavill, R. Extended Modelling of Molecular Calcium Signalling in Platelets by Combined Recurrent Neural Network and Partial Least Squares Analyses. Int. J. Mol. Sci. 2025, 26, 6820. https://doi.org/10.3390/ijms26146820

AMA Style

Tantiwong C, Cheung HYF, Dunster JL, Gibbins JM, Heemskerk JWM, Cavill R. Extended Modelling of Molecular Calcium Signalling in Platelets by Combined Recurrent Neural Network and Partial Least Squares Analyses. International Journal of Molecular Sciences. 2025; 26(14):6820. https://doi.org/10.3390/ijms26146820

Chicago/Turabian Style

Tantiwong, Chukiat, Hilaire Yam Fung Cheung, Joanne L. Dunster, Jonathan M. Gibbins, Johan W. M. Heemskerk, and Rachel Cavill. 2025. "Extended Modelling of Molecular Calcium Signalling in Platelets by Combined Recurrent Neural Network and Partial Least Squares Analyses" International Journal of Molecular Sciences 26, no. 14: 6820. https://doi.org/10.3390/ijms26146820

APA Style

Tantiwong, C., Cheung, H. Y. F., Dunster, J. L., Gibbins, J. M., Heemskerk, J. W. M., & Cavill, R. (2025). Extended Modelling of Molecular Calcium Signalling in Platelets by Combined Recurrent Neural Network and Partial Least Squares Analyses. International Journal of Molecular Sciences, 26(14), 6820. https://doi.org/10.3390/ijms26146820

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Extended Modelling of Molecular Calcium Signalling in Platelets by Combined Recurrent Neural Network and Partial Least Squares Analyses

Abstract

1. Introduction

2. Results

2.1. Comparing Multiple Agonist-Induced Platelet [Ca²⁺]_i Curves

2.2. Workflow of the Modelling Approaches

2.3. MLP Network for Magnitude Prediction