Integrating Data Mining and Microsimulation Modelling to Reduce Traffic Congestion: A Case Study of Signalized Intersections in Dhaka, Bangladesh

: A growing body of research has applied intelligent transportation technologies to reduce tra ﬃ c congestion at signalized intersections. However, most of these studies have not considered the systematic integration of tra ﬃ c data collection methods when simulating optimum signal timing. The present study developed a three-part system to create optimized variable signal timing proﬁles for a congested intersection in Dhaka, regulated by ﬁxed-time tra ﬃ c signals. Video footage of tra ﬃ c from the studied intersection was analyzed using a computer vision tool that extracted tra ﬃ c ﬂow data. The data underwent a further data-mining process, resulting in greater than 90% data accuracy. The ﬁnal data set was then analyzed by a local tra ﬃ c expert. Two hybrid scenarios based on the data and the expert’s input were created and simulated at the micro level. The resultant, custom, variable timing proﬁles for the tra ﬃ c signals yielded a 40% reduction in vehicle queue length, increases in average travel speed, and a signiﬁcant overall reduction in tra ﬃ c congestion. of


Introduction
With the continuing growth in both urban populations and the number of personal vehicles, traffic congestion and delays in journey time are critical issues for the overall sustainability of urban transportation systems around the globe [1][2][3][4]. The consequences of traffic congestion (e.g., increases in traffic safety issues, delays, and traffic-related air pollution) have been widely investigated in different city contexts globally [4][5][6]. These issues are of significant interest to transportation planners and managers tasked with overcoming increasing urban transportation challenges. Inefficiencies in traffic signal operation and the resultant deficiencies in traffic management have received critical attention due to their influence on congestion, vehicle speed, journey time, and vehicular emission levels [1,3,5]. Recent studies [4][5][6][7] have suggested that context optimized traffic signals are more efficient and can help reduce congestion, improve vehicle speed, and reduce overall air pollution. There is a growing body of literature that recognizes the importance of real-time/near real-time optimized traffic signaling systems as aids in overcoming traffic congestion, increasing traffic safety, and reducing the level of date, there has not been enough confirmation to support policy decisions by urban planners and traffic managers regarding less-expensive alternate technologies for integrating and automating the signal updating process. The use of technologies such as video analysis tools in conjunction with data mining (e.g., automated traffic counting, classifying vehicles, and localizing vehicles) may not only overcome current cost-based deficiencies in traffic data collection [14], but may also allow data to be integrated in microsimulation modelling to obtain optimized traffic signal timing in congested traffic intersections [1,3].
The present study's purpose was to develop an integrated automated traffic data mining technique based on the video analysis of traffic at an isolated signalized intersection. Additionally, this data was utilized to model the traffic in micro simulations, with the intention of discovering optimized signal timing patterns. This modelling, in turn, allowed the creation of variable timing profiles customized to a given isolated intersection. These variable timing profiles allowed signal timing to vary with different levels of traffic volume throughout the day and night based on the results of the optimized traffic flow simulations. The key novelty of this research is the integration of different technological and processed-based solutions to develop a semi-automated system that can reduce traffic congestion effectively. Additionally, the system developed in this study can be implemented at a lower cost than Urban Sci. 2019, 3, 41 3 of 23 highly sophisticated and expensive systems, such as adaptive or fully actuated city-wide traffic control systems [15].
The present study was conducted on an isolated signalized intersection in the city of Dhaka. Previously, Labib et al. [2] and Khadem [16] found that most intersections in Dhaka have obsolete traffic signal systems with fixed timing cycles that are poorly suited to current traffic patterns and flows. These fixed signal timing cycles result in increases in traffic congestion and vehicular emissions and reduced traffic speed at affected intersections [2,[16][17][18]. The present study illustrates that a combination of low-cost hardware (e.g., CCTV), simulation software, and minimum expert inputs could be integrated and applied to implement optimized signal timing on real-world traffic conditions in high-traffic urban environments. To the authors' knowledge, no such study has been conducted in the context of a city like Dhaka.

Data Mining in Traffic Management: Video Analysis
Within the wider domain of transportation management, data mining refers to processes and systems that extract implicit knowledge and rules from large traffic data that govern the movement of traffic under different real-world conditions [19,20]. The recent developments of integrated Internet, Wi-Fi, sensors, and server technologies, and the emerging development of connected vehicles, provide opportunities to collect a large amount of real-time or near-real-time traffic data (e.g., vehicle volume, speed, intensity, trajectories, vehicle types, and accident events) [11,19,21]. This information constitutes Big-data in the context of traffic management, and extracting meaningful insight (e.g., patterns of traffic traveling in certain corridors, traffic congestion at specific intersections), and traffic managers can use these insights to make informed, quick, and reliable decisions that increase the efficiency of the traffic network [19][20][21]. There are several tools used in traffic Big-data mining. The most common data mining tools are associated with (a) vision-driven data mining (e.g., video log analyses); (b) multi-sensor data mining (e.g., GPS and loop-detectors data analyses); (c) learning-based mining (e.g., using artificial intelligence models and rule extraction); visualization-based mining (e.g., line charts, data images); and (d) geospatial data mining (e.g., GIS tools used to extract spatial information related to traffic networks) [19,21].
In recent years, video image processing has been adopted for real-time traffic data mining because of benefits associated with fast response times; easy installation, operation, and maintenance; and the ability to monitor wide areas [13,22,23]. Among different sources of traffic videos, closed-circuit television (CCTV) has obtained wider attention, and this system is a closed video system that uses video cameras to transmit images that can then be monitored or recorded [22]. Diverse data on large regions can be collected through CCTV, increasing the scope for automatic analysis of urban traffic activity. Processed data extracted from visual imaging using CCTV and other traffic videos (e.g., simulated videos) can provide valuable information regarding traffic parameters including speed, traffic composition, vehicle shapes, vehicle types, vehicle identification numbers, traffic violations or road accidents, and turning movements important to junction design [13,[22][23][24]. This data aids in transportation management and provide advantages over other data collection methods [13,22,23]. Additionally, video image processing makes the traffic data mining process easier by identifying the pattern or rules of traffic movement [13,19].
Vehicle detection using video analysis can be conducted using multiple methods (e.g., frame differencing, optical flow, and background subtraction (BS)) [13,25]. In the case of optical flow, vehicles are detected using pixel level intensity, and this method is more functional if a video capturing device is moving. The method is less efficient in real-time conditions due to massive computational requirements [13,26]. The frame differencing method uses the intensity of pixel variations to identify moving objects in the videos and consider dynamic environmental conditions within the frames of the videos they analyze. Frame differencing is vulnerable to vehicle misclassification when the background Urban Sci. 2019, 3, 41 4 of 23 and a vehicle have a similar color. Frame differencing is also unable to detect static objects within the video frames [13]. In contrast to vehicle detection and frame differencing, BS is a more robust method considering its ability to detect both static and moving objects and the fact that BS is not affected by the color of vehicles [13,27]. Considering the robustness of BS algorithms, many recent studies and applications of vehicle detection from video analysis used this method and obtained accuracies around 95% in simple traffic scenarios and over 70% accuracy in complex scenarios [13,28,29]. Within the BS approach, there are several parametric (e.g., single Gaussian or median filter); non-parametric (e.g., kernel density estimation); and predictive (e.g., Kalman filter) techniques used for detecting vehicles of different categories in a region of interest (ROI) [30,31]. These regions of interest can be detected based on predefined virtual loops (e.g., rectangular box within the video frames) or by using blob tracking where the algorithm also detects the vehicle trajectory within the ROI [32]. Using a method such as optical flow or BS with a virtual loop or blob tracking usually depends on the need of the system design and context of the system where they will be applied. Current literature suggests the BS method, along with parametric techniques using virtual loops, are receiving wider appreciation due to their robustness and simplicity in use.

Signal Time Optimizations Using Simulation Models
Signal time optimization to reduce traffic congestion was first proposed in the 1950s [1]. With the advent of modern technologies, the availability of real-time data, data mining tools, and wider use of micro and macro-simulation models, traffic signal timing management has become more efficient [1,13,19]. The optimization of traffic signal systems is typically achieved through the use of two different types of signal optimization systems, the split cycle and offset optimization technique (SCOOT) [33] and the microprocessor optimised vehicle actuation (MOVA) [34].
SCOOT systems optimize multiple linked signalized traffic junctions at the city scale or within certain zones of a city [1,33]. By contrast, MOVA is used to optimize single signalized traffic junctions at the microscale [34]. SCOOT systems minimize overall traffic congestion, vehicle stops-starts, and journey times by synchronizing green-red signal times based on traffic flow approaching a junction and traffic flow running between consecutive junctions [34,35]. These systems efficiently manage macroscale traffic; however, SCOOT requires considerable investment and management. Simulation software such as TRANSYT and LinSig are popular software environments for SCOOT optimization processes [12,36].
In contrast to SCOOT, MOVA optimizes the signal time cycle based on vehicle flow approaching a single junction. The MOVA process does not account for multiple junctions or the traffic moving between consecutive junctions [37]. MOVA is less resource intensive, works significantly better for high traffic flow, and is focused on increasing junction capacity. Therefore, MOVA systems can be utilized independently in several junctions in a city instead of a city-wide SCOOT system [37,38]. Several modelling software programs are available for MOVA systems. OSCADY [39], PTV-VISSIM, and Vistro [12,40,41] are MOVA systems commonly used for simulation modelling when optimizing and evaluating unconnected single junctions.
Several studies have utilized simulation methods with different optimization algorithms (e.g., genetic algorithm [GA], particle swarm optimization [POS], and artificial neural network [ANN]) to reduce traffic congestion at signalized intersections [8]. Gökçe et al. [42] applied a POS optimization method and a VISSIM simulation to study a signalized roundabout in Turkey. Gökçe et al. found that compared to a fixed signal time, optimized signalling systems can result in significant decreases in average delay times. Ghanim and Abu-Lebdeh [43] utilized both genetic algorithms and ANN to optimize traffic signal timing dynamically within VISSIM. Ghanim and Abu-Lebdeh reported a considerable reduction in delay time based on the optimization provided by the algorithms. Several other studies also applied signal optimization integration with simulation modelling [1,12,17,33]. However, these studies mostly used static traffic data that was collected and operated manually. In addition, these studies seldom integrated any expert knowledge from traffic managers to optimize signal times. The present study addressed these research gaps and produced evidence demonstrating that automatic traffic data collection and simulation modelling can be used to design effective signal timing in signalized traffic intersections in Dhaka.

Studies on Dhaka's Traffic Signals
Dhaka served as the study site for this research. Dhaka City Corporation is the authoritative body responsible for installing and managing traffic signals within the metropolitan area. At the beginning period of the signal installation, an inadequate systematic approach was applied to determine signal timing [16]. Subsequent research was conducted to analyze the condition of the signalized intersections and evaluate operational alternatives. One of the most comprehensive studies to address the operational issues of Dhaka's intersections was conducted by Hadiuzzaman [44]. Hadiuzzaman developed a saturation flow and delay model for selected intersections that estimated delays in the selected intersections using several models, namely the Webster Delay Model, the TRANSYT Model, Ackelik's Model, Reilly's Model, and the Highway Capacity Manual 2000 Model. The estimated values of these models were compared in reference to the traffic conditions of Dhaka. Based on these comparisons, a modified delay model was proposed. At the final stage of the study, an AIMSUN simulation model applied to estimate the vehicles stop in the over-saturated approaches of the selected intersections. The study did not result in any specific suggestions regarding traffic signal timing or automation.
In another study, Roy et al. [17] attempted to improve the level of service of Dhaka's Science Laboratory-Elephant Road intersections using the VISSIM software platform. Using field survey counts, this study modeled the existing signal timing. The study evaluated three different signal timing scenarios with reference to average traffic speed, average delay, and average queue length. Based on this analysis, this study proposed some effective signal timing alternatives. A similar study was conducted by Bhuyian et al. [45] using Webster's model of saturation flow to address the inefficiency of Dhaka's pre-timed intersections. Using existing field traffic counts and estimated traffic calculations, the study proposed alternative signal timing to improve the efficacy of the city's pre-timed signalized system.
Due to emergent concerns about climate change and air pollution in megacities, the efficient management of congested intersections in the developing world is a topic of interest in the transport sector. Labib et al. [2] conducted a comprehensive study to address concerns regarding vehicle CO 2 emissions in selected intersections in Dhaka. Using a combination of field traffic counts and remote sensing data, this study calculated an Emission Bio-Capacity Index for each of the selected intersections in Dhaka. The study identified unsustainable CO 2 emission levels in most of the intersections because of the absence of optimized signal systems.
The aforementioned studies on Dhaka's transportation system provide proof that traffic signal systems can become inadequate and ineffective, and they pointed to the need for further improvement in signal system management. Several transportation infrastructure development projects are currently underway in Dhaka (e.g., the metro rail and the elevated expressway). However, these projects are designed to address issues related to increasing numbers of vehicles [2], congestion, and travel demands. The topic of traffic signal timing has not received adequate attention from city authorities. Therefore, further investigations are necessary to highlight the potential for improvement resulting from this aspect of traffic management.

Case Study Area
A single traffic intersection in Dhaka was selected as a case study area. Dhaka's road network, which is roughly 3000 km in length, comprises 7% of the total city area [46]. There are approximately 650 major intersections in Dhaka, of which, 70 have traffic lights [16,47]. Most of these signal lights are fixed-timed or pretimed [16]. Frequently, fixed-timed patterns cannot cope with Dhaka's high traffic volume conditions, and this causes traffic delays [2,16]. The Dhaka Metropolitan Police (DMP) control the signal timing in some intersections manually. In most of these intersections, operational deficiencies exist [2,16].
For this case study, a three-leg intersection named 'Sheraton Hotel Junction' was selected. This is one of the busiest intersections in Dhaka as it is surrounded by important commercial areas, and the renowned five-star hotel The Intercontinental (previously the Sheraton Hotel) is located at one corner of this intersection. The latitude of the intersection is 23.709 and the longitude is 90.407. The legs of the intersection direct traffic towards the Bangla Motor area on the north side, Shahbag on the south side, and Minto Road on the east side ( Figure 1). This intersection is a pretimed intersection used by a heterogeneous mixture of traffic. The most common vehicles found in the intersection were cars, covered vans (HGV), buses, vans, and motorcycles. Traffic flows in both directions on each leg of the intersection, and each direction consists of three effective lanes.

Traffic Data Collection at Base Level
The base condition traffic data were collected using manual counts. Surveyors were assigned to collect the data in each direction of the junction. There are three links in this intersection, and they were each assigned a number, as shown in Figure 1b. Each of these links supports traffic flow in both directions. Each directional flow represents three lanes. Therefore, each of these links comprises six lanes supporting traffic flow in both directions. The field traffic counts were done in 15 minutes intervals during a single weekday in March of 2015 from 7 to 11 am (Dhaka's peak morning travel hours). Collected data included traffic volume counts in each direction and signal light timing. The vehicle counts were recorded by modes. These base level traffic data were used to simulate existing traffic conditions with fixed-signal timing. Later, this information was used to evaluate how the changes in signal timing affected traffic flow, speed, and congestion levels.

Overview Of The Proposed System
This study proposed an integrated system that combines data mining and simulation processes to update traffic signal timing. The proposed system used video analysis to extract information regarding vehicle types and numbers of vehicles traveling in different links of the studied junction.

Traffic Data Collection at Base Level
The base condition traffic data were collected using manual counts. Surveyors were assigned to collect the data in each direction of the junction. There are three links in this intersection, and they were each assigned a number, as shown in Figure 1b. Each of these links supports traffic flow in both directions. Each directional flow represents three lanes. Therefore, each of these links comprises six lanes supporting traffic flow in both directions. The field traffic counts were done in 15 min intervals during a single weekday in March of 2015 from 7 to 11 am (Dhaka's peak morning travel hours). Collected data included traffic volume counts in each direction and signal light timing. The vehicle counts were recorded by modes. These base level traffic data were used to simulate existing traffic conditions with fixed-signal timing. Later, this information was used to evaluate how the changes in signal timing affected traffic flow, speed, and congestion levels.

Overview of the Proposed System
This study proposed an integrated system that combines data mining and simulation processes to update traffic signal timing. The proposed system used video analysis to extract information regarding vehicle types and numbers of vehicles traveling in different links of the studied junction. These data were then used to run simulation models to identify optimum signal timing patterns that could reduce congestion levels. Figure 2 illustrates the overall design of the proposed system. The process starts with streaming videos from live or simulated sources. Streamed videos act as sources of vehicle-related information required to run the next process. The videos enter the 'Video Analysis Unit' (Subsystem 1), where the virtual loop detectors use an algorithm (discussed in the next section) to detect and classify different vehicles and related information such as the lane, link connection, and location on the road links. These data are then stored in the data storage system (i.e., a server). In the next step of the system (Subsystem 2), an automatic query process run from the server produces a summary of information at each 15 min interval. This information includes the numbers and types of vehicles detected in different links of the junction, how many vehicles interchanges from one link to another link, and vice versa. All this information is essential for preparing a simulation model as necessary inputs for the third subsystem of the proposed system. process starts with streaming videos from live or simulated sources. Streamed videos act as sources of vehicle-related information required to run the next process. The videos enter the 'Video Analysis Unit' (Subsystem 1), where the virtual loop detectors use an algorithm (discussed in the next section) to detect and classify different vehicles and related information such as the lane, link connection, and location on the road links. These data are then stored in the data storage system (i.e., a server). In the next step of the system (Subsystem 2), an automatic query process run from the server produces a summary of information at each 15 minutes interval. This information includes the numbers and types of vehicles detected in different links of the junction, how many vehicles interchanges from one link to another link, and vice versa. All this information is essential for preparing a simulation model as necessary inputs for the third subsystem of the proposed system. As illustrated in Figure 2, the video analysis and query results produce key inputs required for the 'Simulation Unit' (Subsystem 3). The Simulation Unit enabled experimentation with different signal time scenarios that changed vehicle flow and signal timing speed. These scenarios were then compared to the results of the existing signal timing. There are two approaches that this proposed system can implement to reduce congestion by achieving the optimum signal time, either using computer algorithms to obtain the optimum signal time [8] or using expert knowledge of the local traffic conditions (e.g., peak/off-peak flow, driver behavior, and vehicle mix). With these approaches, different signal timing combinations can be explored in the simulation, and the best results can be implemented to reduce congestion. Once the simulation unit produced an optimal signal timing that reduces congestion and increases vehicle speed at the junction, the signal timing parameters can be sent to the traffic light control system. Actual changes in traffic congestion and speed at the junction can then be observed.
It should be noted that the video analysis, data-mining, and simulation subunits developed in this proposed system have already been applied in other studies focused on transportation management [12,13,22,40]. However, the system proposed in this study integrated these three As illustrated in Figure 2, the video analysis and query results produce key inputs required for the 'Simulation Unit' (Subsystem 3). The Simulation Unit enabled experimentation with different signal time scenarios that changed vehicle flow and signal timing speed. These scenarios were then compared to the results of the existing signal timing. There are two approaches that this proposed system can implement to reduce congestion by achieving the optimum signal time, either using computer algorithms to obtain the optimum signal time [8] or using expert knowledge of the local traffic conditions (e.g., peak/off-peak flow, driver behavior, and vehicle mix). With these approaches, different signal timing combinations can be explored in the simulation, and the best results can be implemented to reduce congestion. Once the simulation unit produced an optimal signal timing that reduces congestion and increases vehicle speed at the junction, the signal timing parameters can be sent to the traffic light control system. Actual changes in traffic congestion and speed at the junction can then be observed.
It should be noted that the video analysis, data-mining, and simulation subunits developed in this proposed system have already been applied in other studies focused on transportation management [12,13,22,40]. However, the system proposed in this study integrated these three subsystems and partially automated the process of integration. Previous studies either utilized one of these subsystems or applied a more manual approach to management solutions (e.g., standalone use of vehicle tracking, 13). This study not only combined subsystems (e.g., running an automatic query to produce summary information) but also introduced the option to integrate local experts' knowledge. These steps were taken to provide more efficacy and effectiveness when managing the traffic signal timing.

Micro-Simulation Model
In order to replicate the actual traffic conditions at the selected junction within the simulation environment, a business-as-usual or base simulation model was created using VISSIM microsimulation model software. The base model served two purposes in this study. The first was to mimic the actual traffic conditions in a simulation system, thus allowing for the evaluation of changes after adjusting the signal timing. The second purpose was to produce 3D videos (vehicles traveling) of different links that would allow the proposed automation system to be tested when updating the signal timing.
In order to generate the base model, the three-legged traffic junction (i.e., Sheraton Hotel Junction) was drawn in VISSIM, incorporating the actual geometric measurements of different traffic lanes obtained from field measurements. From the field survey, data on the traffic flow per hour, the modal share of different vehicles, the interchanges among different links, and the existing signal timing were included in the base model. Additionally, traffic behavior parameters (i.e., car following, lane changing, and lateral values) were selected for the base model (supp 1). The parameters in Table 1 represent the values of some key variables required to run stochastic microsimulations based on the Wiedemann car-following model. This car-following model data, along with data on lane and lateral changing behavior, were necessary inputs for the fundamental algorithms that run the microsimulation in VISSIM [48]. A detailed discussion of these parameters and the car-following model is beyond the scope of this paper, but a comprehensive discussion can be found in Fellendorf and Peter [48]. Signal time used in the base model were collected during the field survey. For this traffic intersection, the signal cycle was 165 s. As illustrated in Figure 3, Link-1 had 80 s of green time, Link-2 had 25 s of green time, and Link-3 had 40 s of green time.

Lateral Change
Consider Next turning direction Not Marked Over take on same lane Not marked Min. lateral distance 1 at 0 km/hr 1 at 50 km/ hr Signal time used in the base model were collected during the field survey. For this traffic intersection, the signal cycle was 165 seconds. As illustrated in Figure 3, Link-1 had 80 seconds of green time, Link-2 had 25 seconds of green time, and Link-3 had 40 seconds of green time.

Video Analysis Unit
Developing and applying an automated vehicle count system from traffic videos was one of the crucial aspects of this study. The video analysis unit is essential to the proposed system as it analyzes any traffic-related videos (e.g., live traffic or simulation videos) to extract the necessary information automatically. Traffic videos will be streamed into this system from a pre-defined hard disk drive (HDD) where the videos are stored. For each road link of the intersection (e.g., Link-1), a fixed video frame will be utilized. To process the video frames in the video processing unit, a script has been developed using the Python programming language. The processing of stored video frames will be performed with one of the Open Source Computer Vision Library-openCV algorithms [29,49] (e.g., example code, Table 2). Code block-1 initiates the script and libraries (e.g., numpy, cv2), imports and resizes the video(s), and indicates the methods that will be utilized in detecting the vehicle classes [50]. Block-2 declares the data file as comma-separated values (CSV) on which the detected information will be stored, selects the method, and runs the basic algorithm to detect different vehicle types. Additionally, block-2 subtracts the background, delineates different frames/predefined virtual loops and their attributes, and declares image moments to detect different vehicles as objects within the virtual loops. Finally, block-3 detects different vehicle types (e.g., cars, buses) based on image moments and threshold values for different vehicles.
In this study, within the Python script for vehicle identification, a "Template matching algorithm" was developed utilizing openCV [49]. In the first step, background subtraction has been performed for the videos entered in the system (Code Block-2, Table 2). This study utilized simulated 3D videos due to the unavailability of CCTV traffic videos of the selected intersection. In the next step, an image sample was extracted from a frame of the stored video stream. The ideal image is a lower-sized image (i.e., an image size of 50:50 pixels where the video frame is 640:480 pixels). Thus, vehicle images are obtained by searching in the frame using a "template." These templates pass through the entire frame covering every pixel (2D convolution), and each pixel is assigned a value based on the normalized sum of differences of corresponding pixel values between a portion of the frame consisting of neighboring values for each pixel of frame and template image [49]. If the matching value of each vehicle exceeded the threshold value of sample vehicle sizes (determined based on experimenting with different vehicle types, Code Block-3), then that type of vehicle would be assumed to be detected. Thus, matching for each type of vehicle is conducted individually throughout the whole frame. Finally, the data is written in a text file and sent to the database as CSV file format.
In addition to identifying different vehicle types and their positions on the links, the script also detected vehicle occlusion conditions during the red signal. When links showed red signals, vehicle occlusions formed [50], and within the virtual loop template, the matching algorithm was unable to detect any individual vehicles. The script marked that condition as 'Red Signal' and recorded this in the database. This process allowed the system to explain the occlusions in different links and therefore, aid in understanding vehicle queue length during congestion periods. However, this also introduced problems related to false detection during vehicle occlusion and underestimating the actual numbers of vehicles traveling through the virtual loop [50]. Careful attention was given when setting the virtual loop position within the frame. Based on the recommendation by Xia et al. [13], a decision was made to position the loop close to the bottom view in order to reduce the potential for false detections due to vehicle occlusion. Overall, the script recorded not only the vehicle types and their locations but also the signal timing. By utilizing this data when running a query in the database, a comprehensive summary of the junction was obtained.

Data Mining: Running Automatic Query
The video analysis unit detected the vehicles, their position, and interchanges from one lane to another. However, the database also needed to summarize output such as total vehicle volume and mode share (e.g., percentage of different vehicles) to provide useful inputs for the simulation modelling. Producing a summary that can be directly integrated into a simulation unit requires running queries at a certain interval (e.g., hourly). To automate the summary production, this study developed and ran query scripts rather than manually running a query each time. The script has been written in PHP code so that the query can be presented in any web browser. The script imported the data produced from the 'Video Analysis Unit' subsystem and produced a summary based on those observations. The script has two major sections. The first code block converts and exports the data from the video analysis (saved as a CSV file) into a MySQL database (Example Block-1, Table 3). The second code block (Example Block-2, Table 3) runs the query from the MySQL database and produces the summary required for simulation model inputs.

Signal Timing Updates (Tuning) and Related Changes
Updating fixed signal timing to reduce traffic congestion is the main objective of this study. The video analysis (computer vision technique) and query produced the information traffic patterns. However, these methods cannot inform decisions regarding how signal timing should be updated to reduce congestion. To optimize signal timing, the consequent effects of these patterns needed to be checked in simulation models. Signal timing optimization can be accomplished using two approaches, either using computer algorithms and artificial intelligence (discussed in Section 2.2) or utilizing expert knowledge based on previous experience and context-specific information. Usually, previous studies have utilized computer algorithms without considering context-specific expert knowledge. By contrast, in this study, expert knowledge and context-specific information have been applied to update signal timing and observe if such changes actually reduce traffic congestion. The expert-based approach produced two scenarios: (1) changing the signal cycle time as well as green time for different links-Scenerio1 (SC1), and (2) keeping the original signal cycle time but changing the green time for different links-Scenerio2 (SC2).
There are numerous possible scenarios that can be evaluated. However, for Scenario-1, the previous experience of morning peak hour traffic in Dhaka suggested that either changing the signal cycle by shortening it and allocating more green time for congested link/s may influence the congestion. Additionally, several traffic manuals also suggested that shorter signal cycles often are more effective at managing traffic flow at intersections [51]. In contrast, for Scenario-2, keeping the existing signal cycle length (165 s) but increasing the green time for the congested link (Link-1) seems more logical, considering that more traffic will be added in the queue with less green time allocated. Previous studies also indicated that extending the green time for congested links may reduce overall congestion for an intersection [43,52]. Based on these arguments, two new signal timing cycles were proposed and tested for Scenario-1 and Scenario-2 ( Figure 4). allocated. Previous studies also indicated that extending the green time for congested links may reduce overall congestion for an intersection [43,52]. Based on these arguments, two new signal timing cycles were proposed and tested for Scenario-1 and Scenario-2 ( Figure 4). In addition to signal timing and cycle length, expert observations of the driving behavior in Dhaka led to changes in the traffic behavior parameter values applied when running the experimental scenarios. The new parameter values are listed in Table 4. In Dhaka, vehicles usually are in closer proximity, both when stopped and when moving, compared to the VISSIM default. The expert noted that most drivers do not consider it necessary to provide a safe distance of more than 1.00 meter between vehicles, and field observation validated this assertion. Therefore, the new car following parameters reduced average standstill and safety distances (Table 4). Vehicles in Dhaka also usually do not follow exact lane markers based on field observations. Vehicles frequently take any position in the lane rather than the usual standard of being at the middle of the lane. Thus, 'any position' was selected for the desired position. Vehicles in Dhaka generally maintain shorter lateral distances (e.g., 0.3 meters) when changing direction or overtaking (or attempting to overtake) other vehicles on the same lane. Therefore, these parameters were also changed. The context-specific, expert knowledge of Dhaka's traffic not only aided in producing more realistic simulations, but the information also introduced new insights into the model that usual computer algorithms may not have added without prior training using artificial intelligence or data-driven methods. For these reasons, using expert knowledge was considered a more viable option for this study.  In addition to signal timing and cycle length, expert observations of the driving behavior in Dhaka led to changes in the traffic behavior parameter values applied when running the experimental scenarios. The new parameter values are listed in Table 4. In Dhaka, vehicles usually are in closer proximity, both when stopped and when moving, compared to the VISSIM default. The expert noted that most drivers do not consider it necessary to provide a safe distance of more than 1.00 m between vehicles, and field observation validated this assertion. Therefore, the new car following parameters reduced average standstill and safety distances (Table 4). Vehicles in Dhaka also usually do not follow exact lane markers based on field observations. Vehicles frequently take any position in the lane rather than the usual standard of being at the middle of the lane. Thus, 'any position' was selected for the desired position. Vehicles in Dhaka generally maintain shorter lateral distances (e.g., 0.3 m) when changing direction or overtaking (or attempting to overtake) other vehicles on the same lane. Therefore, these parameters were also changed. The context-specific, expert knowledge of Dhaka's traffic not only aided in producing more realistic simulations, but the information also introduced new insights into the model that usual computer algorithms may not have added without prior training using artificial intelligence or data-driven methods. For these reasons, using expert knowledge was considered a more viable option for this study.

Evaluation of Tuned Signal Scenarios
Reduction in traffic congestion and increasing a vehicle's average speed are the indicators considered for evaluating the effectiveness of the tuned signal time scenarios. These two indicators are usually outputs of the VISSIM microsimulation model [45]. Reduction in congestion is explained in terms of reduction in the queue length in each link of the intersection. Specifically, the objective of this study was to reduce queue length in the congested link of the intersection in comparison to the base level condition. Additionally, possible increases in vehicle speed approaching and crossing the intersection are considered a positive change [1,43,45].
In addition to evaluating the effect of signal changes, the accuracy of the simulated model traffic volume and the video analysis traffic volume was evaluated using "GEH Statistic" [53,54]. Geoffrey E. Havers developed a continuous volume tolerance formula while working as a transport planner in London, England in the 1970s. Although its mathematical form is similar to a chi-squared test, it is not a true statistical test. Rather, it is an empirical formula that has proven useful for a variety of traffic analysis purposes: where, M = the hourly traffic volume from the traffic model (or new count) C = the real-world hourly traffic count (Image detection traffic count or the forecasted traffic) For traffic modelling work in the "baseline" scenario, a GEH less than 5.0 is considered a good match between the modeled and observed hourly volumes, a GEH greater than 5.0 is considered for further investigation, and a GEH greater than 8.0 is considered to be unsuitable [53,54].

Overview of Existing Traffic Conditions
The most dominant vehicle using the intersection was cars, followed by motorcycles, buses, and HGVs (Table 5). It is noticeable that Link-1 had approximately three-times the traffic volume than Link-2 and Link-3. Link-1 was also the most congested link in the intersection. This was because Link-1 is connected to Dhaka's Central Business District (CBD), Motijheel. In the morning peak hours, traffic is normally directed towards CBD, and this study period took place during the morning peak hours. Link-1 collects huge morning peak traffic from preceding intersections, namely the 'Bangla Motor Intersection' and 'Frame-gate Intersection'. Because of this high volume of traffic, Link-1 has a longer green signal time (80 s) than the other links.
In comparison to a standard urban three-lane road, the hourly volume of traffic is relatively high in Link-1, and this result was also observed in the previous study by Hadiuzzaman [44] for the Sheraton Hotel Intersection. Due to this higher hourly volume of traffic, both intersection delay time and queue length increased in this study area. Higher traffic volume and longer queue length created extreme congestion and a near saturation traffic flow situation in Link-1. The result is a considerable delay in travel during the morning peak hours and, consequently, extra fuel consumption and CO 2 emissions in the study area. The base simulated model also summarizes the traffic condition of the studied junction. The base model tried to mimic the field data and traffic condition with the existing signal timing. The results of the base model are presented in Table 6. As presented in Table 6, the simulated models found very similar traffic volume for all three links. GEH values less than five indicated that the simulated volume is representative of the actual field survey volume. The base model shows the queue length for each link and indicates that the major arterial link directed towards the CBD was the most congested link (Link-1) with the highest queue length (i.e., 131 m) and an average speed of 26.3 km/h. The overview indicates that during the morning peak time, this intersection has a relatively low travel speed, and the numbers of stops on Link-1 are nearly three times the number of stops on Link-2 and Link-3, which are off-peak during morning hours (Table 6). Thus, it can be argued that, in addition to high traffic volume during morning-peak, the signal timing in this intersection is demonstrating poor performance in managing congestion and travel speed. The simulated result of the base model successfully mimics the existing condition of the intersection and therefore produced reliable 3D traffic videos, which were used for automatic traffic data counts during the video analysis unit.
The video analysis of the 3D simulated video extracted necessary traffic data required to further update the signal timing in the proposed scenarios. The visual outcome of the video analysis script is illustrated in Figure 5. Figure 5a shows the virtual loop setting on Link-1. There are few virtual loops positioned to not only detect vehicle types and count them but also to produce vehicle interchange information required for simulation modelling. After placing the virtual loops before running the template matching algorithm background, subtraction was conducted in the script, and the subtracted background is illustrated in Figure 5b. When running the script, these interfaces remain visible, and related information is displayed in the Python editor. See the text in Figure 5a The overall system developed for the video analysis unit detected the vehicle types, and later, the query section produced a traffic condition summary for the intersection presented in Table 7. Table 7 presents the comparison between the actual vehicle volume observed in the 3D traffic video and the vehicle volume extracted by the video analysis unit. The numbers indicate that the video analysis unit underestimated the numbers of vehicles. The error percentages indicate that, overall, the video analysis demonstrated, approximately, a 5% error when detecting and counting vehicle volume. However, the GEH values for all links are below five. This finding indicates that despite miscounting the vehicle numbers, the video analysis volume was very close to representing the actual volume. Therefore, it can be argued that the video analysis outcomes are reliable representations of the traffic conditions in the intersection.

Results of Tuned Signal Timing
The video analysis traffic data, along with the proposed variable signal timing for Scenario-1 and Scenario-2, produced two simulation outputs and provided opportunities to evaluate the effectiveness of signal time tuning for the Sheraton Hotel Intersection. The results of these two simulated scenarios are presented in Table 8. As illustrated in Table 8, for Scenario-1, the simulated volume was slightly overestimated compared to the video analysis volume. However, the GEH values for all links were less than 5. The acceptable GEH values mean that the simulated volumes in Scenario-1 reflect the actual traffic volume. The simulation results show that for Scenario-1, the updated signal time has considerably reduced the queue length for Link-1.
In the base model there was a 165 second signal cycle (i.e., Link-1 green time: 80 s, Link-2 green time: 25 s, Link-3 green time: 60 s), and the queue length was 131 meters. In Scenario-1, the signal cycle was 145 s seconds (i.e., Link-1 green time: 75 s, Link-2 green time: 25 s, Link-3 green time: 45 s), and the queue length became 81 meters. Scenario-1 represented a 38.16% reduction in queue length for the congested link (Link-1) of the intersection. There was also a considerable reduction (38%) in queue length for Link-2. For Link-3, the queue length was only reduced by 3 meters. Figure 6 illustrates the queue length (in approximate numbers of cars) changes for different scenarios compared to the base condition. It is clear from Figure 6 that the congestion level, measured by the The overall system developed for the video analysis unit detected the vehicle types, and later, the query section produced a traffic condition summary for the intersection presented in Table 7. Table 7 presents the comparison between the actual vehicle volume observed in the 3D traffic video and the vehicle volume extracted by the video analysis unit. The numbers indicate that the video analysis unit underestimated the numbers of vehicles. The error percentages indicate that, overall, the video analysis demonstrated, approximately, a 5% error when detecting and counting vehicle volume. However, the GEH values for all links are below five. This finding indicates that despite miscounting the vehicle numbers, the video analysis volume was very close to representing the actual volume. Therefore, it can be argued that the video analysis outcomes are reliable representations of the traffic conditions in the intersection.

Results of Tuned Signal Timing
The video analysis traffic data, along with the proposed variable signal timing for Scenario-1 and Scenario-2, produced two simulation outputs and provided opportunities to evaluate the effectiveness of signal time tuning for the Sheraton Hotel Intersection. The results of these two simulated scenarios are presented in Table 8. As illustrated in Table 8, for Scenario-1, the simulated volume was slightly overestimated compared to the video analysis volume. However, the GEH values for all links were less than 5. The acceptable GEH values mean that the simulated volumes in Scenario-1 reflect the actual traffic volume. The simulation results show that for Scenario-1, the updated signal time has considerably reduced the queue length for Link-1.
In the base model there was a 165 s signal cycle (i.e., Link-1 green time: 80 s, Link-2 green time: 25 s, Link-3 green time: 60 s), and the queue length was 131 m. In Scenario-1, the signal cycle was 145 s seconds (i.e., Link-1 green time: 75 s, Link-2 green time: 25 s, Link-3 green time: 45 s), and the queue length became 81 m. Scenario-1 represented a 38.16% reduction in queue length for the congested link (Link-1) of the intersection. There was also a considerable reduction (38%) in queue length for Link-2. For Link-3, the queue length was only reduced by 3 m. Figure 6 illustrates the queue length (in approximate numbers of cars) changes for different scenarios compared to the base condition. It is clear from Figure 6 that the congestion level, measured by the numbers of cars in the queues, was significantly reduced for Scenario-1 compared to base level, and the highest reductions were observed for Link-1 and Link-2. For Scenario-2, the simulated traffic volume is also slightly overestimated; however, the GEH values for all links were less than five. Thus, the simulated scenario can be accepted as an accurate representation of the observed volume. In Scenario-2, the signal cycle length remained the same as the base condition, but the green time for the links changed. Scenario-2's updated signal time was still 165 s, but the Link-1 green time was 100 s, the Link-2 green time was 25 s, and the Link-3 green time was 40 s. The additional 20 s of green time for Link-1 (compared to the base condition) resulted in a considerable reduction of queue length. Compared to the queue lengths for the base condition (i.e., 131 m for Link-1) and Scenario-1 (i.e., 81 m for Link-1), the queue length for Link-1 in Scenario-2 was 75 m ( Table 8). The updated signal time of Scenario-2 resulted in a 42.7% reduction in queue length for the congested link compared to the base condition and an additional 4.7% queue length reduction compared to Scenario-1. For Link-2, there was a 30% reduction in queue length (compared to the base condition), but compared to Scenario-1, there was an 8% increase observed in the queue length for Link-2. Additionally, for Link-3, the queue length was only reduced by 2 m ( Table 8).
The increase of queue length in Link-2 (compared to Scenario-1) and insignificant reduction in queue length for Link-3 were the logical outcomes of the signal timing used when modelling Scenario-2. In this case, more green time was allocated for the congested Link-1. Thus, while there was a slight increase in queue length for Link-2 compared to Scenario-1, there was a considerable reduction in queue length compared to the base condition. For Link-3, there was almost no congestion observed, and the vehicle flow was relatively free from congestion. Thus, the changes in queue length with tuned signal timing were not significant. Considering all the results, it can be argued that the signal timing in Scenario-2 was most effective in reducing congestion in the intersection. In addition, the overall comparison in the visual representation of queue length in Figure 6 confirms that the lowest queue length (measured in numbers of cars) for Link-1 was observed in Scenario-2. Therefore, Scenario-2 more effectively fulfills the objective to reduce the congestion in the intersection.
For Link-1, the numbers of stops significantly dropped for both Scenario-1 (Stops 278) and Scenario-2 (Stops 272, Table 8) compared to the base condition (Stops 615, Table 6). This variance indicates that under the updated signal condition, vehicle operation was smooth in the simulation. Additionally, this finding means the vehicles had reduced delay time on the congested link. However, the increase in average travel speed was not substantially high compared to the base condition. For Link-1, the numbers of stops significantly dropped for both Scenario-1 (Stops 278) and Scenario-2 (Stops 272, Table 8) compared to the base condition (Stops 615, Table 6). This variance indicates that under the updated signal condition, vehicle operation was smooth in the simulation. Additionally, this finding means the vehicles had reduced delay time on the congested link. However, the increase in average travel speed was not substantially high compared to the base condition. The average vehicle speed was 26.96 km/hr at the base condition, but for Scenario-1 the average speed was increased by 0.2 km/hr and became 27.16 km/hr. For Scenario-2, the average speed increased by 0.58 km/hr, becoming 27.54 km/hr. For the congested link (Link-1), the average speed increased by 1km/hr in Scenario-2. Thus, it can be argued that the change in signal time had more influence on the average speed of the congested link than other links in the intersection. The possible reason for the minor increase in vehicle speed may be attributed to the signal capacity and presence of near saturation flow at the congested links and the connecting intersections (Bangla Motor and Shahbagh).

Evatulation of Proposed Traffic Signal System and Solutions
The purpose of the present study was to develop an expert system that automatically extracts traffic-related information from intersection video footage and utilizes the extracted data to optimally model traffic signal timing to reduce traffic congestion. This study's aim was achieved by the development and integration of three subsystems into a semi-automatic traffic management system. The first subsystem extracted data from traffic videos, the second subsystem performed mining processes on the extracted data, and the third subsystem used the data to run several microsimulations based on expert input to the data sets (see Section 3.3).

Evatulation of Proposed Traffic Signal System and Solutions
The purpose of the present study was to develop an expert system that automatically extracts traffic-related information from intersection video footage and utilizes the extracted data to optimally model traffic signal timing to reduce traffic congestion. This study's aim was achieved by the development and integration of three subsystems into a semi-automatic traffic management system. The first subsystem extracted data from traffic videos, the second subsystem performed mining processes on the extracted data, and the third subsystem used the data to run several microsimulations based on expert input to the data sets (see Section 3.3).
The present study applied openCV algorithms to video analysis of traffic footage (e.g., BS and template matching) to achieve vehicle detection. Test results demonstrated that the video analysis system could detect and classify vehicles with more than 90% accuracy. This result complies with other previous research that used video analysis for vehicle detection and classification in which 70-95% accuracy was obtained in various traffic scenarios (e.g., free-flow or congested) [13,28,55]. Thus, for the present study, the video analysis system was both significantly accurate and reliable. However, it should be noted that the video analysis vehicle count is not identical to the manual count, and the number of vehicles detected by video analysis was lower than the manual observation ( Figure 7). The reason for this was that the video analysis used the background subtraction method, and during congestion or vehicle occlusion (red signal), the script was not able to detect vehicles properly due to lack of motion. Similar outcomes were observed in other studies [13,15,50]. This issue resulted in lower numbers of vehicles detected and misclassification of vehicle types. Nonetheless, the GEH statistics (discussed in detail in Section 3.8) indicated the differences between the manual count and the video count for all three links were insignificant (GEH values are less than five, Figure 7). This indicates that the video count can be accepted as a reflection of real traffic volume. As a result, it was applicable to practical traffic management applications.
However, it should be noted that the video analysis vehicle count is not identical to the manual count, and the number of vehicles detected by video analysis was lower than the manual observation ( Figure  7). The reason for this was that the video analysis used the background subtraction method, and during congestion or vehicle occlusion (red signal), the script was not able to detect vehicles properly due to lack of motion. Similar outcomes were observed in other studies [13,15,50]. This issue resulted in lower numbers of vehicles detected and misclassification of vehicle types. Nonetheless, the GEH statistics (discussed in detail in section 3.8) indicated the differences between the manual count and the video count for all three links were insignificant (GEH values are less than five, Figure 7). This indicates that the video count can be accepted as a reflection of real traffic volume. As a result, it was applicable to practical traffic management applications. In addition to video-based vehicle information extraction, the system proposed in this study improved the process of using the extracted data by integrating a newly developed MySQL-based automatic query system to facilitate the process of data-mining. Substantial current research has utilized vehicle tracking utilizing video analysis [13,25,28,29]. However, applying data mining by a query to produce useful summaries of the results of such tracking has rarely been explored to date. To the authors' best knowledge such approaches have not yet been applied to traffic studies in the context of cities such as Dhaka. The use of such a query-based approach enhanced the efficiency of the proposed system and assisted in producing useful information from large traffic databases. These methods can be replicated in other studies that utilize big traffic data from different sources (e.g., GPS, CCTV video) [20]. The query process used in this study produced the necessary information required for microsimulations (e.g., the vehicle flow summary and the vehicle turning rate) and allowed modelling of two different hybrid signal timing scenarios that reduced traffic congestion compared to the base scenario of a single fixed timing rate for an isolated intersection. In addition to video-based vehicle information extraction, the system proposed in this study improved the process of using the extracted data by integrating a newly developed MySQL-based automatic query system to facilitate the process of data-mining. Substantial current research has utilized vehicle tracking utilizing video analysis [13,25,28,29]. However, applying data mining by a query to produce useful summaries of the results of such tracking has rarely been explored to date. To the authors' best knowledge such approaches have not yet been applied to traffic studies in the context of cities such as Dhaka. The use of such a query-based approach enhanced the efficiency of the proposed system and assisted in producing useful information from large traffic databases. These methods can be replicated in other studies that utilize big traffic data from different sources (e.g., GPS, CCTV video) [20]. The query process used in this study produced the necessary information required for microsimulations (e.g., the vehicle flow summary and the vehicle turning rate) and allowed modelling of two different hybrid signal timing scenarios that reduced traffic congestion compared to the base scenario of a single fixed timing rate for an isolated intersection.
Ultimately, reducing traffic congestion was the true goal of the present study. The results indicated that the selected case study intersection (i.e., the Sheraton Hotel Intersection) had high traffic volume during peak hours and poorly designed fixed timing of the traffic signals. These factors caused considerable unnecessary congestion in the intersection. It should be noted that this finding was in accord with several recent studies on Dhaka's traffic including studies by Labib et al. [2], Hadiuzzaman [44], and Roy et al. [17]. These studies all noted that traffic volume was high on all the major roads and intersections in Dhaka and that the existing fixed-timing system was obsolete and did not provide effective traffic management.
The present study proposed and tested two variable timing signal scenarios. These scenarios were based on simulations that incorporated a combination of extracted traffic data and expert knowledge (hybrid system). In both scenarios, a significant reduction in the queue length of different links was observed. Additionally, the average travel speed was increased by a small margin. The combination of expert knowledge with microsimulations based on intersection-specific data provided an opportunity to effectively address intersection congestion in cities such as Dhaka, where traffic is not only more heterogeneous than in developed world contexts but also exhibits more random driver behaviors [44,56,57]. The proposed traffic management system and the resulting solutions developed for the present study significantly reduced the congestion of the intersection under study.
The solutions in this paper reduced queue length by 42% for the most congested link (Link-1) and 30% for Link-3. Thus, an overall significant reduction in delays in vehicle operation was achieved. In previous studies, Tian and Zhang [12] applied adaptive control for three intersections in Hefei, China using TRYNSYT optimization and VISSIM simulation. Their study observed 1.5 to 13.9% less delay in their intersections. Ren et al.'s [58] adaptive control system decreased vehicle delay by 40%. Liu et al. [59] achieved a 10% reduction in average delay with an optimized actuated control system. In comparison with these studies, it can be argued that the performance of the proposed system and solutions are satisfactory.

Policy Implications
Dhaka is not only the world's densest metro area, but it is also one of the fastest growing megacities with increasing population [47,60,61]. Due to a combination of continual increases in population and accompanying economic activity, traffic volumes and congestion continue to increase [2,44]. The present study demonstrated that traffic management with intersection flow optimization based on hybrid-variable signal timing is relatively effective, even in budget-constrained, developing-world urban areas. As a result, such urban areas can benefit from optimized traffic management systems that do not require extensive and costly modifications to urban infrastructure such as those often required when using actuated signal optimization methods. As of the time of writing, traffic congestion in Dhaka consumes 3.2 million person-hours per day, and this figure is rising as congestion increases. Technically sophisticated and costly ITS systems (e.g., actuated, adaptive control systems) have long been deployed in developed world cities to optimize traffic management. However, the present study suggests that, for a city like Dhaka, it is possible to establish an intelligent signal system that can reduce congestion with a low-cost integrated traffic signal management system. The findings from this study can be useful for policy makers and traffic planners seeking alternative options for reducing traffic congestion.

Limitations of the Study
The present study successfully developed and tested a new system to update fixed traffic signal timing using video analysis, data mining, and microsimulation modelling. It demonstrated the use of this system in a case study utilizing real-world data. However, the present study and its methodological approach were not free from limitations. The first limitation was that the present study only considers updating signal timing with a pre-programmed variable timing profile. The present study could not explore how much difference continuous real-time traffic data would make when changing signal timing. It must be noted that real-time traffic video in Dhaka was not available at the time of the present study. A second limitation was that the optimization scenarios all considered the intersection as an isolated entity. Thus, different results might be possible under conditions where data from multiple intersections are used (see Section 2.2). A third limitation was that the present study assumed that the geometric design of the intersection was fixed. Modifications of the geometric design of the intersection might also aid traffic flow optimization in conjunction with various timing scenarios. The fourth limitation was that only two scenarios were tested combining expert knowledge and contextual information. The present study's approach to data analysis based on the selected data extraction and data mining methods were not the only possible approaches to data analysis. Utilizing AI and highly sophisticated optimization algorithms (e.g., POS, GA, ANN) many possible scenarios could be tested quickly using the present study's analytic approach. However, this approach was beyond either the financial or temporal scope of the present study.
The final limitation was the cost of the software. In this study VISSIM micro-simulation software was utilized, and licensing is costly. Therefore, the overall cost related to this study was not significantly low. However, it should be noted that several pieces of open source traffic simulation software (e.g., Simulation of Urban Mobility-SUMO) [62] are available and can be used instead of VISSIM; however, the integrated application of these open source software packages was not tested in this paper. For future studies, such applications can be integrated and tested to design a system with a minimum overall cost.

Conclusions
Improving traffic conditions in fast-growing urban areas of the developing world is a matter of ongoing concern to urban planners, traffic managers, and environmental advocates to both uplift travel efficiency and the quality of the natural world. Elimination of nearly obsolete fixed-time signals at busy intersections in favor of variable timing signal systems geared to extant traffic conditions can make a significant difference to traffic congestion, safety, and air-borne pollution. The present study demonstrated the integration of various available and relatively low-cost technologies to create a more efficient traffic signal management system. The proposed system combined video data analysis, data mining, and expert opinion to optimize variable signal timing profiles and reduce congestion at a signalized intersection based on microsimulations.
The proposed system was tested on real-world data collected from a congested intersection in Dhaka, and results suggested that the system could improve traffic outcomes more cost-effectively than ITS systems through the use of real-time traffic videos. Additionally, the inclusion of the expert opinion-based scenarios averted the need for sophisticated and data-intensive AI systems and infrastructure and addressed the gap in the contemporary research on signal tuning and optimization. The video analysis algorithm used in this study successfully detected the location and composition of the vehicles in the intersection with minor errors for vehicle occlusion. Both scenarios developed utilizing microsimulations of extant real-world traffic conditions resulted in improvements in traffic conditions characterized by significantly reduced vehicle queue lengths (e.g., approximately 40% reduction on congested links) and an increase in the average traffic speed. The significant reduction of the queue length illustrated the effectiveness of the application of the custom variable timing profile on traffic congestion at the intersection under study and validated the outcome of the proposed system supporting its testing at other intersections in Dhaka and other urban areas.
The results of the present study suggest several areas that might prove useful for future research. The first of these is to study the effect of changes on intersection infrastructure and site geometry on traffic optimization based on variable signal timing in response to traffic conditions. Future studies might include detection of pedestrians and bicyclists crossing the intersection and incorporate these considerations into overall vehicular volumes to better optimize signal timing in dimensions that include these participants and their safety in intersection activity. Finally, given the low cost and effectiveness of the technologies utilized in the present study and the rapid spread of data networks, both fixed and wireless, further research can explore the possibility of designing a networked system integrating all signalized intersections in a city such as Dhaka.