Next Article in Journal
Sedentary Behavior and the Use of Wearable Technology: An Editorial
Next Article in Special Issue
An Automatic Approach Designed for Inference of the Underlying Cause-of-Death of Citizens
Previous Article in Journal
A WHO Pathfinder Survey of Dental Caries in 6 and 12-Year Old Transylvanian Children and the Possible Correlation with Their Family Background, Oral-Health Behavior, and the Intake of Sweets
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Leveraging Machine Learning Techniques and Engineering of Multi-Nature Features for National Daily Regional Ambulance Demand Prediction

1
School of Computer Science and Engineering, Nanyang Technological University, Singapore 639798, Singapore
2
SingHealth Duke-NUS Emergency Medicine Academic Clinical Program, Duke-National University of Singapore Medical School, Singapore 169857, Singapore
3
SingHealth Emergency Medicine Residency Programme, Duke-National University of Singapore Medical School, Singapore 169608, Singapore
4
Signature Research Programme in Cardiovascular & Metabolic Disorders, Duke-National University of Singapore Medical School, Singapore 169857, Singapore
5
Science, Mathematics and Technology Cluster, Singapore University of Technology and Design (SUTD), Singapore 487372, Singapore
6
SUTD-Massachusetts Institute of Technology International Design Centre, Singapore 487372, Singapore
7
Institute of High Performance Computing, Agency for Science, Technology and Research, Singapore 138632, Singapore
8
Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore 636921, Singapore
9
Faculty of Medicine, Nursing and Health Sciences, Monash University, VIC 3800, Australia
10
Emergency Medicine, Tan Tock Seng Hospital, Singapore 308433, Singapore
11
Home Team Medical Services Division, Ministry of Home Affairs, Singapore 179369, Singapore
12
School of Computing, National University of Singapore, Singapore 117417, Singapore
13
Health Services & Systems Research, Duke-NUS Medical School, Singapore 169857, Singapore
14
Department of Emergency Medicine, Singapore General Hospital, Singapore 169608, Singapore
*
Author to whom correspondence should be addressed.
Joint first author.
Joint last author.
Int. J. Environ. Res. Public Health 2020, 17(11), 4179; https://doi.org/10.3390/ijerph17114179
Submission received: 3 May 2020 / Revised: 29 May 2020 / Accepted: 2 June 2020 / Published: 11 June 2020

Abstract

:
The accurate prediction of ambulance demand provides great value to emergency service providers and people living within a city. It supports the rational and dynamic allocation of ambulances and hospital staffing, and ensures patients have timely access to such resources. However, this task has been challenging due to complex multi-nature dependencies and nonlinear dynamics within ambulance demand, such as spatial characteristics involving the region of the city at which the demand is estimated, short and long-term historical demands, as well as the demographics of a region. Machine learning techniques are thus useful to quantify these characteristics of ambulance demand. However, there is generally a lack of studies that use machine learning tools for a comprehensive modeling of the important demand dependencies to predict ambulance demands. In this paper, an original and novel approach that leverages machine learning tools and extraction of features based on the multi-nature insights of ambulance demands is proposed. We experimentally evaluate the performance of next-day demand prediction across several state-of-the-art machine learning techniques and ambulance demand prediction methods, using real-world ambulatory and demographical datasets obtained from Singapore. We also provide an analysis of this ambulatory dataset and demonstrate the accuracy in modeling dependencies of different natures using various machine learning techniques.

1. Introduction

The accurate prediction of the daily ambulance demands across different regions of a city is of great importance to emergency service providers and its residents. Through the lens of emergency service operators, such information is valuable for a rational and dynamic deployment of ambulances of different types and increases operational effectiveness in fleet management. This in turn ensures that patients have shorter waiting times through location planning and increased ambulance availability when the need arises. This is an important goal for pre-hospital emergency medical services [1,2,3,4] and especially necessary for patients in critical conditions [5,6]. Furthermore, it helps in the efficient staffing for shifts in the hospitals, as well as early identification of any surges in ambulance demands. With the growth in focus towards data collection and analysis over the years, massive datasets of ambulance records are increasingly available for use by healthcare professionals. This encourages a stronger understanding of ambulance demands and the efficient planning of healthcare resources.
However, estimating ambulance demand through human efforts has nevertheless been challenging due to various multi-nature considerations [7]. First, ambulance demand is affected by spatial-related characteristics, such as the region at which demand is estimated. For example, a region is likely to experience a higher demand for ambulance than another region due to a larger elderly population. Moreover, a city is often demarcated by the local government into various development regions, with each having a particular purpose, e.g., financial district, or residential district. Demand is likely to be different depending on the region type. Second, ambulance demand is also affected by high-level temporal attributes, such as day of week and day of month, since the demand may often experience periodicity. Third, it is often correlated with short-term and long-term historical demands in that region. For example, if a region experiences a sudden outbreak of a disease and requires more ambulatory interventions or a sporadic mass sports event that lasts for a few days, the demand for ambulances in that region is likely to be similar over those consecutive few days. The cumulation of these multi-nature features renders ambulance demand a nonlinear dynamical system. While it may be straightforward to infer demand information from historical demands due to their temporal periodicities, it may not be as easy to do so for other features like the region identifier (ID) or day of the week. Given the inherent complexity and chaos within such systems, advanced machine learning methods will be useful for extracting insights that support the prediction of ambulance demand.
Specifically, machine learning methods have been gaining momentum over the years due to their capabilities of modelling complex patterns within data, encouraged by the advancement of computational hardware. They have demonstrated success in various areas of emergency medicine [8], such as predicting in-patient admission [9], postsurgical mortality, and intensive care unit admission [10], and in-hospital mortality of emergency department patients [11,12,13,14], all of which are complex non-linear dynamical systems. In the domain of ambulance-related research, machine learning has been considered for ambulance travel time estimation [15,16], location selection for ambulance stations [17], and demand prediction [17,18]. Despite the progress that has had been made, there is generally a lack of studies that consider such methods for the ambulance demand prediction problem.
Furthermore, existing work may consider machine learning methods for ambulance demand prediction, they generally do not incorporate in a sufficiently comprehensive manner the various types of dependencies affecting ambulance demand. They either consider prediction of the whole city [19] or predict demand at equally sized square grids [7,20], which is not reflective of the actual regionalization by the local government. In other instances, the focus is only on prediction of demands at some, but not all, regions of a city [18]. In this paper, we propose an original and novel approach that leverages a massive dataset of historical ambulance demand records to model the multi-nature dependencies of ambulance demand for predicting the next-day demand at all regions of a city-state. Our approach elicits useful insights that represent each of the different types of dependencies. Then, it utilizes a machine learning model to learn these dependencies for prediction. We evaluate the performance across several state-of-the-art machine learning techniques, using real-world ambulance demand datasets recorded by Singapore Civil Defence Force (SCDF).

2. Materials and Methods

2.1. Data Sources

In this study, we make use of a dataset obtained from the SCDF that includes all the ambulance calls in the city-state, Singapore, from 2006 to 2016. SCDF is the single national emergency ambulance provider which manages a fleet of around 60 ambulances in 2016 [21]. SCDF activates these ambulances by a centralized “995” dispatching system and does not charge for any emergency cases it conveys to hospitals [22]. Each ambulance call in the dataset corresponds to an incident, which has the following characteristics: time of incident, ambulance origin station, incident classification, incident subclass, patient incident subclass, patient’s emergency status, patient’s year of birth, ambulance destination hospital, patient location common name, patient location postal code, patient location street, incident location latitude, incident location longitude, and gender. To obtain the regions and map of Singapore, we make use of the 2010 Planning Area Census by the Singapore government, which consists of every Development Guide Plan (DGP) region (similar conceptually to census tracts). We also leverage datasets obtained from polyclinics in Singapore to extract useful demographical information such as the population count of people above a certain age at each region of the city-state. Finally, we consider additional socioeconomic information from The Census of Population conducted by the Singapore Department of Statistics in 2010, which is the most recent one available. Such census is conducted once every ten years and is based on a person’s place of usual residence.

2.2. Model Overview

Using the above-mentioned datasets, we design an approach that involves a Feature Engineering stage and a prediction stage using a Machine Learning Predictor. Figure 1 shows an overview schematic of the approach. In the following sections, we elaborate each of these stages in details.

2.3. Feature Engineering

Data processing is first carried out on the SCDF dataset to generate the aggregated demand, i.e., number of calls, and several associated features of each region of each day from 2006 to 2016. We denote the engineered dataset as SCDF-Engineered. Each data sample in SCDF-Engineered consists of the aggregated ambulance demand at a particular region of a day within 2006 to 2016, which is the outcome of interest. It also includes several features associated with that region on the particular day. Specifically, these features fall under three classes: Attributes, Short-term Historical Demands, and Long-term Historical Aggregated Demands.
The description of the features in each of the three classes are as follows:
  • Attributes. These are categorical features that provide high-level information about the record. These features are multi-nature and can be further classified into (1) spatial, (2) temporal, and (3) demographic attributes. Specifically, the spatial attributes consist of the region ID, which is a number that uniquely identifies each DGP region. The rationale behind its inclusion is to differentiate the regions within Singapore, since different regions may have different demand characteristics. For example, a region with more elderly people may experience a higher demand than another region with mostly young people. The temporal attributes consist of the following features: day of week, day of month, and month of year. These are included to account for the periodicity of ambulance demands. Finally, since the demand at a region may be higher if it has more people who are older in age, we also consider a demographic attribute: the total no. of people in that region who are aged 50 and above on that particular year
  • Short-term Historical Demands. These features are demands at a region over each of the previous 7 days. These 7 continuous features are considered to account for the correlations between the demands of a particular day with that of the previous days. For example, a sudden spike in the dengue mosquitos’ population at a region may result in the rise of dengue-related cases over a few consecutive days.
  • Long-term Historical Aggregated Demands. These features consist of the total demand at a region over the past 30 days, the total demand over the past 7 days, the total demand of the week up until the sample date, and the total demand of the month up until the sample date. These aggregated demands are included to account for the demand on the broader scale without the higher variances present in short-term demands. For example, a region may experience a high short-term historical demand solely due to a recent occurrence of a large-scale traffic accident but does not typically have high demands as it is not a populous area.
Apart from SCDF-Engineered, we also further build a dataset SCDF-Engineered-Socio. The rationale behind building this dataset is to explore whether ambulance demand has any correlations with the socio-economic characteristics of the people in a region. Similar to SCDF-Engineered, each data sample in SCDF-Engineered-Socio contains all the features in Attributes, Short-term Historical Demands, and Long-term Aggregated Demands. However, additional socioeconomic features of each region obtained from The Census of Population is also included in this dataset. Specifically, SCDF-Engineered-Socio also considers the following additional features: number of residents who travel by buses, number of residents who travel by cars, number of residents who travel by taxis, number of residents who travel by trains, number of residents who are in active employment, number of residents who are unemployed, number of residents who are tenants, and number of residents who are home owners. Since the socioeconomic information is only available for a subset of regions in Singapore, SCDF-Engineered-Socio contains only data samples from these regions.
As observed, the features considered so far are in their entirety, multi-nature. However, each feature represents a piece of information of only a single nature and does not consider the impacts of mixed features. For example, the feature day of week only reveals temporal information about a record, but it does not reveal its relationship with the region at which the record corresponds to. In order to study the impact of mixed features, we further engineer composite features based on the existing features generated. Specifically, we consider spatiotemporal features, and create the following composite features: unique ID that represents (region ID, day of week, day of month, month of year), unique ID that represents (region ID, day of week), unique ID representing (region ID, day of month), and unique ID that represents (region ID, month of year). For evaluation purpose of such composite features, we create a separate dataset SCDF-Engineered-Spatiotemporal (SCDF-Engineered-ST). This dataset includes the same features present in SCDF-Engineered, as well as the engineered spatiotemporal composite features.

2.4. Key Implementation Details of Feature Engineering

The key component of our feature engineering lies in the extracting of the short and long-term demands, since other features can be obtained either directly from the raw dataset in the case of features like the day of week, or mapped easily using a third-party Application Programming Interface (API) in the case of other features like region ID. To extract these demand features, we first use simple aggregations to transform the SCDF dataset into a dataset that records the total daily demand of each region and sort these demands in a chronological order.
Then, we use a sliding-window-based approach to obtain the relevant demand variations for each data sample. Figure 2 demonstrates an example of such a process. The red box represents a sliding window that essentially contains the historical ambulance demand values over each of the past 30 days. Within this window, the relevant short-term historical demand features are extracted. The green box represents a sliding window that considers the historical demand over each of the past 7 days. Within this window the long-term historical and aggregated demand features are extracted. As mentioned, the time frames chosen are 7 and 30 days to account for the weekly and monthly demand periodicities respectively. Once this is completed, the two sliding windows move on to the next time step to extract the similar features for the next day. This process is then conducted for all regions of the city to get all the data samples used in this study.

2.5. Primary Outcome

The outcome of interest is the next-day aggregated ambulance demand at a DGP region of Singapore. This demand may arise from incidents of different emergency statuses, i.e., Dead on Arrival, Emergency Critical, Emergency (Non-ambulatory), Emergency (Ambulatory), and Non-emergency. It is also agnostic to trauma and medical incidents and incidents where assistance were not required. Hence, it is a regression task from the machine learning point of view.

2.6. Machine Learning Methods Considered

Given the engineered dataset, we want to train a machine learning model that predicts the demand by leveraging the above-mentioned features. To this end, we make use of the hold-out evaluation technique. Specifically, the data samples in SCDF-Engineered from 2006–2015 are used for model training, while the samples from 2016 are used for model validating. The similar separation is done for SCDF-Engineered-Socio. Several methods are considered, and they are chosen because they are either typically effective for regression problems or had been previously considered in existing work. The methods are as follows:
  • Regional Moving Average. This method estimates the next-day demand at a region simply by taking the average of the daily demand values over the past 7 days at this region.
  • Linear Regression. This method is a popular regression method that finds the best-fit hyperplane across the multi-feature data samples [23]. It assumes a linear relationship between the dependent variable, i.e., demand, and the independent variables, i.e., features. To model this relationship, the mean square error function is first considered as a loss function. Then, the gradient descent algorithm is used to iteratively find the minimum of this function and also the resulting hyperplane. The coefficients of this hyperplane represent the degree of impact each feature has on the predicted value. To accurately represent the categorical features, i.e., Attributes, one-hot encoding is used during preprocessing. Min-max scaling is also applied on the continuous features. This method is applied using the Python Scikit-Learn library [24].
  • Support Vector Regression (SVR). SVR is a support-vector machine that performs regression by finding a hyperplane, i.e., support vector, that fits as many points as possible within a space that is bounded by two boundary hyperplanes parallel to this support vector. Unlike Linear Regression, SVR typically finds the best-fitting hyperplane in the higher dimensions. To this end, it utilizes a kernel, which is a function that maps lower-dimensional data points to higher-dimension data points. The advantage of doing so is that it allows the method to capture certain non-linear relationships, which may not be possible with Linear Regression. SVR has been demonstrated to be one of the more effective machine learning approaches for predicting ambulance demand in [18]. Similar to Linear Regression, we apply this method using the Python Scikit-Learn library [24] and process the categorical features with one-hot encoding.
  • Multi-layer Perceptron (MLP). This method is an artificial neural architecture that has been explored and demonstrated in [19] to be an improvement over the traditional ambulance demand prediction method. The MLP is a standard neural architecture that is essentially made up of a sequence of linear layers. In this baseline, the size of the hidden layer is equal to that of the input layer, and 3 hidden layers are considered in total. Furthermore, the loss function used for the training of the model is the squared loss function. The learning rate used is 0.01, and the activation function used is the ReLU function. This method is also applied from the Python Scikit-Learn library [24].
  • Radial Basis Function Network (RBFN). We also consider the Radial Basis Function (RBF) network, a variant of the artificial neural network (ANN), for comparison. Unlike a typical MLP network, a RBFN consists of three layers: an input layer, a linear output layer, and a hidden layer that uses the non-linear radial basis function as the activation function. It has been demonstrated to be more effective than traditional MLPs in certain problems [25].
  • Light Gradient Boosting Machine (LightGBM). LightGBM [26] is one of the most efficient and high-performing gradient-boosting decision tree methods. The key idea behind such gradient-boosting methods is that they consider the ensemble of various individual regression trees to fine-tune the accuracy of prediction. This is achieved by sequentially combining the trees such that each tree fits to the residual of the previous tree it is extended from. The input for this method is similar to that of the previous methods, with the exception that attributes are specified as categorical features in the program. Furthermore, the specific key settings considered in this work are as follows. (1) Number of trees, 2000; (2) number of leaves, 31; (3) learning rate, 0.005; (4) feature fraction, 0.8. The boosting approach considered is gradient boosting decision tree. This method can be applied by using the LightGBM library [26] in Python.
The error metrics used in the experiments are weighted absolute percentage error (WAPE), mean absolute error (MAE), and mean squared error (MSE) [27,28]. WAPE is used as an error metric instead of the mean absolute percentage error (MAPE). This is because the ground-truth demand at a region may sometimes be zero, which results in the zero-division error if MAPE is used. Specifically, the formulation for WAPE is as follows:
W A P E = ( | A F | A ) ,
where A denotes a ground-truth demand, and F denotes its corresponding predicted value.
In our implementation, feature engineering is carried out using Python (version 2.7.16, Python Software Foundation, Delaware, USA). To map an incident to its corresponding region, the Shapely library is used [29]. The above-mentioned data-mining regression methods are built using Python Scikit-Learn library (version 0.20.0) [24], and LightGBM library (version 2.2.3) [26]. We also make use of QGIS (Open Source Geospatial Foundation, Beaverton, Oregon, USA) for spatial-related visualizations.

3. Results

Table 1 shows the key characteristics of the SCDF demand dataset. Specifically, it shows different compositions of the dataset, over the following category types: incident year, incident classification, incident subclass, patient incident subclass, patient’s birth year, and patient’s gender. As observed, there is a general increasing trend for ambulance demands from 2006 to 2016. The median age (based on the age of the patient by the year-end of the incident) is 55 and largely between 34 and 73. This reveals that more than half of the incidents occurred to middle-aged and elderly people and that most of the incidents happened to people who were at least young adults.
On the biennial level, the patient ages generally increase from 2006 to 2016. In terms of incident classification, the majority of the incidents were trauma in nature. The analysis of patient incident subclass shows that the majority of calls were due to problems associated with the nervous system. However, there is also a large proportion of calls where the patient was uninjured or did not have any medical complaints. Other major sources of calls were problems associated with the bone/connective tissue, respiratory system, and cardiovascular system. This is in line with the idea that the increasing demand may be due to an increasingly aging population since problems at these parts of the body tend to be associated with the elderly.
Preprocessing and feature engineering are conducted on the SCDF dataset to build SCDF-Engineered. Table 2 shows some of the characteristics of this engineered dataset. The overall mean daily regional demand is 6.33 and ranges largely from 0 to 10. The mean of the total past-7-days regional demand is 44 and largely ranges from 4 to 69. This is in contrast to the total past-30-days demand, where the mean, the first quartile, and the last quartile are 190, 20, and 294, respectively. Within SCDF-Engineered, each record consists of the Attributes, Short-term Historical Demands, and Long-term Historical Aggregated Demands features, as per the descriptions in Section 2. Since each record in SCDF-Engineered is specific to a day and a region; the ground-truth values associated with this record is simply the demand on that day and in that region. These ground-truth values are used as the target variables during model training (resp. validation), using records from 2006–2015 (resp. 2016) in SCDF-Engineered.
Figure 3 shows the variance of the daily demands of each region over the days of 2006–2016. As observed, the demand variances vary across the regions. This highlights differences in demand behaviors across different regions and the importance of considering region ID as a feature to account for such differences.
Table 3 shows the accuracies of the five methods compared on SCDF-Engineered, with the best results highlighted in bold. As observed, the performance of both Linear Regression and LightGBM are the best and comparable with each other, with the former having a slight edge in terms of the MSE metric. The regional moving average is the worst performing method, while the performances of SVR, MLP, and RBFN are somewhere in the middle of all methods compared. Comparing MLP and RBFN, the former also demonstrates a stronger performance for the problem we are solving. Although Linear Regression is one of the best-performing methods, it may be subjected to overfitting, since according to analysis, the mean coefficient value is 3.9 × 1011, and the interquartile range is between −1.55 and 2.7 × 1011. As such, Linear Regression may not be a suitable model due to the instability introduced through the largely varying coefficients that arise from overfitting. This implies that it may not perform as well on other datasets. Since Gradient-boosting Decision Tree is also highly effective for structured data, e.g., table of features as in our case, such methods are preferred in our context. Due to its effectiveness, LightGBM is specifically chosen.
Table 4 shows the gain-based importance of the features derived from the training process of LightGBM, as well as the mean absolute SHapely Additive exPlanations (SHAP) value of each feature. The SHAP value essentially assigns each feature an importance value for each prediction [30]. To obtain the overall importance for each feature, the mean absolute SHAP value is considered, where a larger value represents a greater feature importance. In terms of the relative importance of a feature among all considered features, both the LightGBM’s gain-based importance and mean absolute SHAP value are observed to be in agreement with each other.
The most important features are the total demand over the past 30 days and the total demand over the past 7 days in that region. This highlights the importance of considering long-term historical aggregated demands. What follows is the ID of the region at which the demand is predicted. This demonstrates the importance of differentiating a region from other regions, since they may have vastly different demand characteristics, as shown in Figure 2. The total number of people aged 50 and above in the region of the particular year is also considered important. This is in line with our intuition that people who are older in age are more likely to require emergency assistance than younger ones. The day of month, day of week, and month of year are also significant features, since there are periodicities within the ambulance demands. Finally, the demand at the region on each of the past 7 days contributes to the estimation by a fair extent.
To demonstrate the effects of regional socioeconomic data, we evaluate how the best-performing model of Table 3 performs when these regional socioeconomic features are included. Table 5 compares the accuracy of the prediction when LightGBM is applied on SCDF-Engineered-Socio, with the accuracy when these socioeconomic features are excluded from SCDF-Engineered-Socio. However, as observed, including additional socioeconomic features does not improve the prediction. A reason may be because these features are constant throughout all the years within the dataset. Any insights that these regional socioeconomic features provide may have already been represented by the region ID, which is one of the most important features according to Table 3. This is unlike the regional demographic feature present in the Attributes, where the total number of people aged 50 is different every year.
To evaluate the impact of adding spatiotemporal composite features, we also apply LightGBM on SCDF-Engineered-ST. The resulting accuracies are WAPE = 24.7%, MAE = 2.11, and MSE = 10.4. As observed, these accuracies are worse than when no composite features are used. The reason may be because even though the features consider the spatiotemporal characteristics of a record, they may be noise to LightGBM, which algorithmically considers the mixed effects of different features in a finer-grained manner, by merits of the algorithm. This further highlights the benefits of using gradient-boosting machines for such problems.

4. Discussion

This study analyses a large city-scale ambulance demand dataset using machine learning algorithms to further develop a daily regional demand prediction tool. Our work is novel because it is the first reported study in Singapore to leverage machine learning in the development of tools that assist in the planning of emergency response resources. To this end, it considers various multi-nature dependencies of ambulance demands. This motivates future work in conducting machine learning-based analysis on datasets of similar types.
Our solution considers the engineering of various attributional features, short-term historical demands, and long-term historical aggregated demands. LightGBM is then applied on these features for the prediction of demand. Other methods either do not perform as well, or encounter problems like overfitting, as in the case of Linear Regression. As such, LightGBM remains the top choice in our solution. The reason why an ensemble model like LightGBM performs better than individual ones like linear regression may be that it combines various independent models via the gradient boosting approach. Specifically, each model within LightGBM is a regression tree, which in itself is more suitable than models like Linear Regression in capturing the non-linear dependencies of ambulance demand. The key idea behind gradient boosting is that prediction can be refined by adding these trees one at a time while using a gradient descent procedure.
The proposed features contribute in varying degrees to the model training in LightGBM. The most important features are the ID of the region at which demand is predicted, long-term historical aggregated demand features, day of month, and number of people aged 50 and above. With the results obtained from this study, it provides emergency healthcare resource planners additional insights on how different features affect the demand at a region for effective ambulatory resource planning in the future. For example, understanding that the region of the city-state is one of the greatest determinants of demands allows the planners to dispatch ambulances in a finer-grained manner. Furthermore, understanding that the demand at a region also strongly depends on its long-term historical demand and number of people aged above 50 encourages planners to focus suitable amount of resources to regions based on the historical incidents that occur at the region. It also encourages paying more attention to the demographical changes in each region.
While the accuracy of around 25% is considered satisfactory, it may not be as high when compared to the prediction of other vehicles like taxis/on-demand vehicles [31,32]. We note that the reason for this may be that the regional demand for vehicles like taxis is typically much larger than that of ambulances to begin with, which in our case is only around 6 per day per region. As such, the prediction percentage error in our case is more likely to be larger, due to the relatively much smaller size of the target outcome used in the machine learning training and prediction. The periodicity of ambulance demand is not as strong as other types of vehicles like taxis. While the demands for taxis or private hires may be highly dependent on the days of the week, similar results cannot be inferred for ambulances. This motivates us to consider various external data sources in our future work, e.g., weather conditions, to model the other possible dependencies that may affect the ambulance demand. Furthermore, our solution is a preliminary take on this problem in Singapore, and it predicts the demands only under certain typical conditions. Although there are peaks and troughs in demands every now and then, these values are in no way near the extremes that happen during very large-scale incidents, e.g., epidemics, haze [33,34,35], and diurnal temperature changes. A potential area of improvement is to make use of historical data to model the demand at such extreme cases of large-scale incidents.
We have seen applications of artificial intelligence and machine learning techniques across different disciplines [36,37,38,39]. Our work here focuses on health services research, which has not gained much attention until now. Other than predicting the daily demand, future work involves further optimization to investigate finer-grained demands, e.g., hourly. However, as we look at increasing the granularity of analysis to identify “micro-trends” and pockets of demand, which may potentially be matched with better optimized placements or additional ambulance staffing, the operational limitations of block scheduling need to be considered. It may not be practical to call up a person to work for just 1–2 h instead of the typical 8 to 12 h shifts. The emergency medical services (EMS) systems may also have more rigid shift patterns, and this may limit the flexibility for optimization. Furthermore, given that the mean daily regional demand is low, considering a finer-grained timescale may result in the issue of data sparsity that inhibits accurate model trainings. These considerations are beyond the scope of the current study and will form part of our future investigation.
This study demonstrates the usage of a single-source vehicle, i.e., ambulance, dataset for building of a solution that models and predicts the ambulance demand in the regions of a city-state. For future work, we may additionally consider the insights obtainable from other vehicle datasets, e.g., taxi trajectories or public transportation smart card data. For example, a potential direction is to further consider the accessibility of each region to its respective nearest hospitals or clinics using certain metrics, e.g., average travel distance/duration of trips originating from a region and ending at a hospital. The idea is that if accessibility by other forms of transportation is higher, it gives people more alternatives for traveling to the hospitals instead of focusing solely on ambulance, especially for non-critical incidents. This may in turn affect the demand of ambulances in that particular region. Furthermore, leveraging geospatial datasets from other vehicles also allows us to understand the medical demand of people within a region. If a particular region sees on average a larger number of people traveling to the hospitals/clinics via public transportation or taxis than another region, an assumption can possibly be made that the former region tends to house more people who may require medical care than the latter. While this does not necessarily imply a higher ambulance demand, which focuses on more urgent cases, a potential exploration on the correlations between these two pieces of information can also be considered for future work.

5. Conclusions

In this study, we have utilized a 10-year city-wide emergency ambulance dataset to predict ambulance demand. The forecasting capability presented here is important because it enables informed resource and ambulance demand and is applicable across hospitals and general medical facilities. Several machine learning techniques are compared: Regional Moving Average, Linear Regression, Support Vector Regression, Multi-layer Perceptron, and LightGBM. Based on the preliminary work carried out here, LightGBM is found to perform the best. The most important features are the total demand over the past 30 days and the total demand over the past 7 days in that region.

Author Contributions

Conceptualization, K.H.C., X.X., and M.E.H.O.; Data curation, A.X.L., A.F.W.H., K.H.C., Z.L., W.C., M.L.C., Y.Y.N., X.X. and M.E.H.O.; Formal analysis, A.X.L., A.F.W.H., K.H.C., Z.L., W.C., M.L.C., Y.Y.N., X.X. and M.E.H.O.; Funding acquisition, K.H.C., X.X. and M.E.H.O.; Investigation, A.X.L., A.F.W.H., K.H.C., Z.L., W.C., M.L.C., Y.Y.N., X.X. and M.E.H.O.; Methodology, A.X.L., A.F.W.H., K.H.C., Z.L., W.C., M.L.C., Y.Y.N., X.X. and M.E.H.O.; Project administration, K.H.C., X.X. and M.E.H.O.; Resources, K.H.C., X.X. and M.E.H.O.; Software, A.X.L. and A.F.W.H.; Supervision, K.H.C., X.X. and M.E.H.O; Validation, A.X.L., A.F.W.H., K.H.C., Z.L., W.C., M.L.C., Y.Y.N., X.X. and M.E.H.O; Visualization, A.X.L., A.F.W.H., K.H.C., Z.L., W.C., M.L.C., X.X. and M.E.H.O.; Writing–original draft, A.X.L., A.F.W.H., K.H.C., X.X., M.E.H.O.; Writing–review & editing, A.X.L., A.F.W.H., K.H.C., Z.L., W.C., M.L.C., Y.Y.N., X.X., M.E.H.O. All authors carried out research, analyzed the results, and wrote the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Singapore University of Technology and Design (Grant No. SGPCTRS1804) and National Research Foundation of Singapore through the Virtual Singapore Program (Grant No. NRF2017VSG-AT3DCM001-031).

Acknowledgments

The authors acknowledge the participation of Virtual Singapore team. The authors are also grateful to the Singapore Civil Defence Force for data sharing and domain-expertise input.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Fitch, J. Response times: Myths, measurement & management. J. Emerg. Med. Serv. 2005, 30, 47–56. [Google Scholar]
  2. Henriksen, F.L.; Schorling, P.; Hansen, B.; Schakow, H.; Larsen, M.L. FirstAED emergency dispatch, global positioning of community first responders with distinct roles - a solution to reduce the response times and ensuring an AED to early defibrillation in the rural area Langeland. Int. J. Netw. Virtual Organ. 2016, 16, 86. [Google Scholar] [CrossRef] [Green Version]
  3. Peleg, K.; Pliskin, J.S. A geographic information system simulation model of EMS: Reducing ambulance response time. Am. J. Emerg. Med. 2004, 22, 164–170. [Google Scholar] [CrossRef] [PubMed]
  4. Peters, J.; Hall, G. Assessment of ambulance response performance using a geographic information system. Soc. Sci. Med. 1999, 49, 1551–1566. [Google Scholar] [CrossRef]
  5. Simonsen, S.A.; Andresen, M.; Michelsen, L.; Viereck, S.; Lippert, F.; Iversen, H.K. Evaluation of pre-hospital transport time of stroke patients to thrombolytic treatment. Scand. J. Trauma. Resusc. Emerg. Med. 2014, 22, 65. [Google Scholar] [CrossRef] [Green Version]
  6. Timm, A.; Maegele, M.; Lefering, R.; Wendt, K.; Wyen, H. Pre-hospital rescue times and actions in severe trauma. A comparison between two trauma systems: Germany and the Netherlands. Injury 2014, 45, S43–S52. [Google Scholar] [CrossRef]
  7. Setzler, H.; Saydam, C.; Park, S. EMS call volume predictions: A comparative study. Comput. Oper. Res. 2009, 36, 1843–1851. [Google Scholar] [CrossRef]
  8. Liu, N.; Zhang, Z.; Ho, A.F.W.; Ong, M.E.H. Artificial intelligence in emergency medicine. J. Emerg. Crit. Care Med. 2018, 2, 82. [Google Scholar] [CrossRef]
  9. Rendell, K.; Koprinska, I.; Kyme, A.Z.; Ebker-White, A.; Dinh, M. The Sydney Triage to Admission Risk Tool (START2) using machine learning techniques to support disposition decision-making. Emerg. Med. Australas. 2018, 31, 429–435. [Google Scholar] [CrossRef]
  10. Chiew, C.J.; Liu, N.; Wong, T.H.; Sim, Y.E.; Abdullah, H.R. Utilizing Machine Learning Methods for Preoperative Prediction of Postsurgical Mortality and Intensive Care Unit Admission. Ann. Surg. 2019. [Google Scholar] [CrossRef]
  11. Eken, C.; Bilge, U.; Kartal, M.; Eray, O. Artificial neural network, genetic algorithm, and logistic regression applications for predicting renal colic in emergency settings. Int. J. Emerg. Med. 2009, 2, 99–105. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Harrison, R.; Kennedy, R.L. Artificial Neural Network Models for Prediction of Acute Coronary Syndromes Using Clinical Data From the Time of Presentation. Ann. Emerg. Med. 2005, 46, 431–439. [Google Scholar] [CrossRef]
  13. Silva, A.; Cortez, P.; Santos, M.F.; Gomes, L.; Neves, J. Mortality assessment in intensive care units via adverse events using artificial neural networks. Artif. Intell. Med. 2006, 36, 223–234. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Taylor, R.A.; Pare, J.R.; Venkatesh, A.K.; Mowafi, H.; Melnick, E.R.; Fleischman, W.; Hall, M.K. Prediction of In-hospital Mortality in Emergency Department Patients With Sepsis: A Local Big Data-Driven, Machine Learning Approach. Acad. Emerg. Med. 2016, 23, 269–278. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Boutilier, J.J.; Chan, T.C.Y. Ambulance Emergency Response Optimization in Developing Countries. arXiv 2018, arXiv:1801.05402. [Google Scholar]
  16. Westgate, B.S.; Woodard, D.B.; Matteson, D.S.; Henderson, S. Large-network travel time distribution estimation for ambulances. Eur. J. Oper. Res. 2016, 252, 322–333. [Google Scholar] [CrossRef]
  17. Li, Y.; Zheng, Y.; Ji, S.; Wang, W.; Leong, H.U.; Gong, Z. Location selection for ambulance stations. In Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems-GIS ’15, Association for Computing Machinery (ACM), Seattle, WA, USA, 3–6 November 2015; Volume 85, pp. 1–4. [Google Scholar]
  18. Chen, A.; Lu, T.-Y.; Ma, M.H.-M.; Sun, W.-Z. Demand Forecast Using Data Analytics for the Preallocation of Ambulances. IEEE J. Biomed. Health Inform. 2015, 20, 1178–1187. [Google Scholar] [CrossRef]
  19. Channouf, N.; L’Ecuyer, P.; Ingolfsson, A.; Avramidis, A.N. The application of forecasting techniques to modeling emergency medical system calls in Calgary, Alberta. Heal. Care Manag. Sci. 2007, 10, 25–45. [Google Scholar] [CrossRef]
  20. Zhou, Z.; Matteson, D.S. Predicting Ambulance Demand. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining-KDD ’15, Sydney, NSW, Australia, 10–15 August 2015; Association for Computing Machinery (ACM): New York, NY, USA; pp. 2297–2303. [Google Scholar]
  21. Singapore Civil Defence Force. EMERGENCY MEDICAL SERVICES STATISTICS 2016. Singapore Civil Defence Force. Available online: https://www.scdf.gov.sg/docs/default-source/scdf-library/publications/amb-fire-inspection-statistics/ems-stats-2016 (accessed on 4 June 2020).
  22. Ho, A.F.W.; Chew, D.; Wong, T.H.; Ng, Y.Y.; Pek, P.P.; Lim, S.H.; Anantharaman, V.; Ong, M.E.H. Prehospital Trauma Care in Singapore. Prehospital Emerg. Care 2014, 19, 409–415. [Google Scholar] [CrossRef]
  23. Ho, A.F.W.; To, B.Z.Y.S.; Koh, J.M.; Cheong, K.H. Forecasting Hospital Emergency Department Patient Volume Using Internet Search Data. IEEE Access 2019, 7, 93387–93395. [Google Scholar] [CrossRef]
  24. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar] [CrossRef]
  25. Arnaiz-González, A.; Fernández-Valdivielso, A.; Bustillo, A.; De Lacalle, L.N.L. Using artificial neural networks for the prediction of dimensional error on inclined surfaces manufactured by ball-end milling. Int. J. Adv. Manuf. Technol. 2015, 83, 847–859. [Google Scholar] [CrossRef]
  26. Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. In Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, Long Beach, CA, USA, 4–9 December 2017; pp. 3146–3154. [Google Scholar]
  27. Fildes, R. The evaluation of extrapolative forecasting methods. Int. J. Forecast. 1992, 8, 81–98. [Google Scholar] [CrossRef]
  28. Hyndman, R.J.; Koehler, A.B. Another look at measures of forecast accuracy. Int. J. Forecast. 2006, 22, 679–688. [Google Scholar] [CrossRef] [Green Version]
  29. Gilles, S. Shapely: Manipulation and analysis of geometric objects. Available online: https://github.com/Toblerity/Shapely (accessed on 4 June 2020).
  30. Lundberg, S.; Lee, S.-I. A unified approach to interpreting model predictions. In Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, Long Beach, CA, USA, 4–9 December 2017; pp. 4765–4774. [Google Scholar]
  31. Geng, X.; Li, Y.; Wang, L.; Zhang, L.; Yang, Q.; Ye, J.; Liu, Y. Spatiotemporal Multi-Graph Convolution Network for Ride-Hailing Demand Forecasting. In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19), Honolulu, HI, USA, 27 January–1 February 2019; Volume 33, pp. 3656–3663. [Google Scholar]
  32. Yao, H.; Wu, F.; Ke, J.; Tang, X.; Jia, Y.; Lu, S.; Gong, P.; Ye, J.; Li, Z. Deep Multi-View Spatial-Temporal Network for Taxi Demand Prediction. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), New Orleans, LA, USA, 2–7 February 2018; pp. 2588–2595. [Google Scholar]
  33. Cheong, K.H.; Ngiam, N.J.; Morgan, G.G.; Pek, P.P.; Tan, B.Y.-Q.; Lai, J.W.; Koh, J.M.; Ong, M.E.H.; Ho, A.W.F. Acute Health Impacts of the Southeast Asian Transboundary Haze Problem—A Review. Int. J. Environ. Res. Public Health 2019, 16, 3286. [Google Scholar] [CrossRef] [Green Version]
  34. Ho, A.F.W.; Zheng, H.; Cheong, K.H.; En, W.L.; Pek, P.P.; Zhao, X.; Morgan, G.G.; Earnest, A.; Tan, B.Y.Q.; Ng, Y.; et al. The Relationship Between Air Pollution and All-Cause Mortality in Singapore. Atmosphere 2019, 11, 9. [Google Scholar] [CrossRef] [Green Version]
  35. Ho, A.F.W.; Zheng, H.; Earnest, A.; Cheong, K.H.; Pek, P.P.; Seok, J.Y.; Liu, N.; Kwan, Y.H.; Tan, J.W.C.; Wong, T.H.; et al. Time-Stratified Case Crossover Study of the Association of Outdoor Ambient Air Pollution With the Risk of Acute Myocardial Infarction in the Context of Seasonal Exposure to the Southeast Asian Haze Problem. J. Am. Hear. Assoc. 2019, 8, e011272. [Google Scholar] [CrossRef] [Green Version]
  36. Cheong, K.H.; Koh, J.M. A hybrid genetic-Levenberg Marquardt algorithm for automated spectrometer design optimization. Ultramicroscopy 2019, 202, 100–106. [Google Scholar] [CrossRef]
  37. Cheong, K.H.; Poeschmann, S.; Lai, J.W.; Koh, J.M.; Acharya, U.; Yu, S.C.M.; Tang, K.J.W. Practical Automated Video Analytics for Crowd Monitoring and Counting. IEEE Access 2019, 7, 183252–183261. [Google Scholar] [CrossRef]
  38. Jahmunah, V.; Oh, S.L.; Rajinikanth, V.; Ciaccio, E.J.; Cheong, K.H.; Arunkumar, N.; Acharya, R.U. Automated detection of schizophrenia using nonlinear signal processing methods. Artif. Intell. Med. 2019, 100, 101698. [Google Scholar] [CrossRef]
  39. Koh, J.M.; Cheong, K.H. Automated electron-optical system optimization through switching Levenberg–Marquardt algorithms. J. Electron Spectrosc. Relat. Phenom. 2018, 227, 31–39. [Google Scholar] [CrossRef]
Figure 1. Approach overview schematic.
Figure 1. Approach overview schematic.
Ijerph 17 04179 g001
Figure 2. Sliding window for extraction of demand features.
Figure 2. Sliding window for extraction of demand features.
Ijerph 17 04179 g002
Figure 3. Map of regional variance of daily demand in Singapore from 2006 to 2016.
Figure 3. Map of regional variance of daily demand in Singapore from 2006 to 2016.
Ijerph 17 04179 g003
Table 1. Characteristics of ambulance demand dataset.
Table 1. Characteristics of ambulance demand dataset.
CharacteristicsValue
Incident Year
2006–2007190,608 (13.6%)
2008–2009216,841 (15.5%)
2010–2011237,451 (17.0%)
2012–2013268,596 (19.2%)
2014–2016311,251 (22.3%)
2016172,009 (12.3%)
Patient Age (yrs)55 (34–73)
2006–200751 (32–71)
2008–2009 52 (32–71)
2010–201153 (33–72)
2012–201356 (35–73)
2014–201657 (36–74)
201658 (36–75)
Incident Classification
Medical968,375 (69.3%)
Trauma391,986 (28.1%)
Assistance Not Required35,460 (2.54%)
Patient Incident Subclass
Nervous System381,634 (27.3%)
No Medical Complaint/Un-Injured385,430 (27.6%)
Bone/Connective Tissue116,173 (8.32%)
Alcoholic Intoxication25,865 (1.85%)
Respiratory System132.163 (9.46%)
Reproductive System117,21 (0.839%)
Cardiovascular System115,587 (8.28%)
Digestive System98,129 (7.03%)
Poisoning/Drug Overdose6791 (0.486%)
Ear/Nose/Throat/Eye Condition5601 (0.401%)
Kidney/Urinary System16,433 (1.18%)
Blood Related5590 (0.400%)
Maternity/Childbirth5062 (0.362 %)
Liver/Biliary Tract1438 (0.103%)
Psychiatric Emergencies4413 (0.316%)
Endocrine System31,850 (2.28%)
Infectious Disease/Disorder of Skin4649 (0.333%)
Others35,240 (2.52%)
Unknown9474 (0.678%)
Unclassified2705 (0.194%)
Gender
Male838,737 (60.0%)
Female554,237 (39.7%)
Unclassified 2163 (0.213%)
For continuous variables, data is presented in medians and interquartile ranges. For categorical variables, data is presented in frequencies and percentages.
Table 2. Characteristics of engineered dataset.
Table 2. Characteristics of engineered dataset.
CharacteristicsValue
Daily Regional Demand6.33 (0–10)
Total Regional Demands over Past 7 Days44. (4–69)
Total Regional Demands over Past 30 Days190 (20–294)
Data is presented in means and interquartile ranges.
Table 3. Method accuracy comparisons
Table 3. Method accuracy comparisons
MethodWAPE (%)MAEMSE
Regional Moving Average25.82.2011.2
Linear Regression24.52.0910.1
MLP24.62.1010.1
RBFN25.12.1410.8
SVR25.22.1511.2
LightGBM24.52.0910.2
Bold indicates the best results for each column. WAPE: weighted absolute percentage error; MAE: mean absolute error; MSE: mean squared error; MLP: multilayer perceptron; RBFN: Radial Basis Function network; SVR: Support Vector Regression; LightGBM: Light Gradient Boosting Machine.
Table 4. Feature importance.
Table 4. Feature importance.
FeatureGain-Based ImportanceMean Absolute SHAP Value
Region ID14,121,0710.230
Day of Week844,2830.069
Day of Month2,400,4620.043
Month of Year723,0540.031
Demand 1 Day Ago308,5900.022
Demand 2 Days Ago138,5430.011
Demand 3 Days Ago159,6490.012
Demand 4 Days Ago209,7710.014
Demand 5 Days Ago144,1380.015
Demand 6 Days Ago146,9660.009
Demand 7 Days Ago432,1360.022
Total Demand of the Week up to the Data Sample Day1,368,8480.101
Total Demand of the Month up to the Data Sample Day82,8470.005
Total Demand over Past 30 Days820,758,8934.626
Total Demand over Past 7 Days77,034,3860.466
Total Number of People Aged 50 and Above in the Year2,528,0210.223
SHAP: SHapley Additive exPlanations; ID: identifier.
Table 5. Accuracy comparisons on inclusion/exclusion of regional socioeconomic features.
Table 5. Accuracy comparisons on inclusion/exclusion of regional socioeconomic features.
DatasetWAPE (%)MAEMSE
SCDF-Engineered-Socio22.03.0016.3
SCDF-Engineered-Socio, excluding regional socioeconomic features 22.03.0016.3
WAPE: weighted absolute percentage error; MAE: mean absolute error; MSE: mean squared error; SCDF: Singapore Civil Defence Force.

Share and Cite

MDPI and ACS Style

Lin, A.X.; Ho, A.F.W.; Cheong, K.H.; Li, Z.; Cai, W.; Chee, M.L.; Ng, Y.Y.; Xiao, X.; Ong, M.E.H. Leveraging Machine Learning Techniques and Engineering of Multi-Nature Features for National Daily Regional Ambulance Demand Prediction. Int. J. Environ. Res. Public Health 2020, 17, 4179. https://doi.org/10.3390/ijerph17114179

AMA Style

Lin AX, Ho AFW, Cheong KH, Li Z, Cai W, Chee ML, Ng YY, Xiao X, Ong MEH. Leveraging Machine Learning Techniques and Engineering of Multi-Nature Features for National Daily Regional Ambulance Demand Prediction. International Journal of Environmental Research and Public Health. 2020; 17(11):4179. https://doi.org/10.3390/ijerph17114179

Chicago/Turabian Style

Lin, Adrian Xi, Andrew Fu Wah Ho, Kang Hao Cheong, Zengxiang Li, Wentong Cai, Marcel Lucas Chee, Yih Yng Ng, Xiaokui Xiao, and Marcus Eng Hock Ong. 2020. "Leveraging Machine Learning Techniques and Engineering of Multi-Nature Features for National Daily Regional Ambulance Demand Prediction" International Journal of Environmental Research and Public Health 17, no. 11: 4179. https://doi.org/10.3390/ijerph17114179

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop