1. Introduction
Crowdsourcing is a platform that manages dissemination of tasks by a recruiter—the entity operating the crowdsourcing platform for compensated workers to perform. With the rise of the Internet and personal mobile devices (PMDs), crowdsourcing can help solve complex tasks that machines cannot, at scale [
1]. The scaling ability depends on the emergence of the Internet of things (IoT) and distributed data storage architectures. The increase in mobile devices, higher data bandwidth, and ultra reliable and low-latency communications in current and next generation cellular networks provide more opportunities to develop powerful mobile crowdsourcing frameworks. On the other hand, researchers have recently designed data gathering and dissemination infrastructure for intelligent transportation systems (ITS), such as roadside units (RSU), vehicular social networks (VSN) [
2], and even unmanned aerial vehicles (UAV) [
3] that strive to apply current and future widespread broadband connectivity to the Internet for data-driven improvement of transportation systems.
ITS-based crowdsourcing that takes advantage of the aforementioned infrastructure and its dense connectivity has the potential to provide a massive influx of new data for ITS operators to work with. The data influx will aid in a variety of verticals such as infrastructure design and improvement decisions, real-time navigation, safety system operations (e.g., traffic and collision monitoring), and will improve the interconnectivity of vehicles. Consider Waze—a traffic crowdsourcing system that allows users to provide information about traffic conditions, providing other users the ability to make more informed data-driven decisions regarding how to navigate along their trajectory safely and efficiently. While Waze is effective, it is limited by human data inputs, which may be inaccurate, suffer from a very limited useful time-frame, and require passengers to actually input the data to avoid unsafe distracted driving. By leveraging ITS and sensors on smart vehicles, data collection can be automated, increasing collection efficiency, volume, and data quality. Vehicular social networks can improve connectivity amongst ITS infrastructure, providing new channels for intelligent data dissemination [
2,
4]. The increased connectivity provides more conduits for data to be collected, improving performance of crowdsourcing systems. The improvement in ITS communications technology has lead to increased potential for the amount of useful information to be disseminated. This evolution in technological progress, visualized in
Figure 1, started with an era of no communication between vehicles, to radio-based communications devices such as cellular phones, to integrating sensors within vehicles, then adding the potential for connecting the vehicle sensors via broadband connection to infrastructure, and formation of ad-hoc networks, with the final stage being autonomous vehicles that are capable of passively gathering data.
The current work in engineering these data systems is motivated by the need to develop robust, resilient, secure, future-proof, and scalable infrastructure. Unstructured data is generated on the scale of exabytes on a daily basis [
5]. This will be further compounded with the expected rapid growth in IoT device usage in many places, including the transportation sector. The combination of the rapid growth in these intertwined domains along with the impending ocean of data from ITS crowdsourcing systems potentiates the need for new avenues of recruitment of workers for mobile, spatial, and passive collection crowdsourcing tasks in harmony with already existing technology and methods.
In this paper, we first present the ITS crowdsourcing architecture, and then discuss them in further detail by highlighting the different crowdsourcing types: mobile (i.e., using mobile devices such as smartphones), spatial (i.e., tasks are tied to a specific location or region), and passive sensing (i.e., the data is collected automatically by smart devices) crowdsourcing, along with promising applications of each. Next, we discuss the technology and techniques that will lead to enabling efficient ITS crowdsourcing applications, such as advances in ITS communications technologies and cloud computing. We accompany the discussion with selected simulation results showing how these techniques bring about benefits for users of some crowdsourcing-based ITS systems. Afterwards, we provide a high-level overview of the state of the art in ITS crowdsourcing, by reviewing the different recent work led by researchers to address shortcomings in this area. From there, we consider the logical future directions needed for improvement, development, and deployment of automated real-time crowdsourcing systems. These systems will provide the basis for many useful applications, such as advanced traffic monitoring, highly granular roadway mapping, and other systems that will facilitate the emergence of autonomous vehicle technology on the roadway.
The rest of this paper is organized as following.
Section 2 outlines the main crowdsourcing architectures that are utilized in ITS.
Section 3 discusses the technology that enables functionality of these systems supported by selected simulation results.
Section 4 overviews the state of the art.
Section 5 discusses potential future research directions and challenges. Finally,
Section 6 concludes the paper.
2. Its Crowdsourcing: Architecture and Variants
This section takes a high-level look at some of the different kinds of ITS-based crowdsourcing applications in mobile, spatial, and passive sensing crowdsourcing. In ITS crowdsourcing, the task requesters are oftentimes drivers or people attempting to make transportation decisions such as what route to take, where to park, etc. In the future, they may also be smart/autonomous vehicles. The information they request serves to aid in their decision-making. Workers are usually other people through mobile phones or automated vehicles/UAVs/IoT devices that perform tasks in diverse locations across the transportation system at the request of the aforementioned task requesters.
Figure 2 illustrates a basic example of a crowdsourcing system architecture. We use the example of a mobile crowdsourcing system due to its prevalence today.
Figure 3 shows an example of how all three crowdsourcing variants, discussed in this section, could work in parallel in an urban setting.
2.1. Mobile Crowdsourcing
According to Statista (
https://www.statista.com/statistics/330695/number-of-smartphone-users-worldwide/), just under 2.5 billion people globally use smartphones. The explosive growth of smartphone use over the last decade has put mobile computing into the hands of nearly half of the world’s population in just over a decade. The large user-base of smartphone technology leads to an ever growing set of potential crowdsourcing opportunities in many domains, including transportation.
Mobile crowdsourcing is when an assigned task is performed by workers on mobile platforms (e.g., a smartphone). Waze and Uber are both contemporary examples of mobile crowdsourcing applied for transportation; Waze allows for users to input road conditions for the benefit of others, and Uber is a ride-share service operated through mobile technology. Other novel systems aim to leverage social media data to predict incidents [
6], which may have promise in disseminiation of information related to car accidents and traffic. Many people driving in unfamiliar places rely on applications such as Google Maps or Apple Maps to navigate. The authors of [
7] developed a mobile crowdsourcing platform called CrowdNavi that addresses a weakness in these maps applications—they often give incorrect navigation instructions near the destination. CrowdNavi works by tracking how drivers performing as the workers of the crowdsourcing framework exit off of a road toward their destination. By utilizing the behavior of multiple drivers entering from different approaches, the system improves upon the directions provided by the traditional Maps applications, improving the experience for other drivers.
Whenever drivers encounter congestion, they typically attempt to change to an alternate route to avoid the congestion, hence making them poorer potential workers for crowdsourcing frameworks to determine congestion in real-time. In urban settings, where some of the worst congestion can occur, there is often a large fixed-route bus system as part of the public transportation system. Since these buses operate on fixed routes, their drivers cannot re-route the bus’ path to avoid congestion. The authors of [
4] proposed a crowdsourcing platform that takes advantage of the fixed paths of the bus system and delegates the task of measuring the bus’ trajectory. If the bus has an unusual trajectory (i.e., is too slow), then it provides the system operators notice that there may be congestion. This information can also be disseminated quicker to other drivers, providing them more time to re-route and achieve a better user-equilibrium. Other methodologies have been developed to measure and predict traffic congestion based on mobile crowdsourcing techniques. The authors of [
8,
9] developed systems that utilize data mining techniques and social media to predict traffic. As mentioned before, the authors of [
10] developed a relationship-based data dissemination architecture. In conjunction with automated data collection, this type of system could be used to pass the congestion information to drivers that frequently traverse along the congested route to make a detour. One commonality amongst the systems described above is that they are designed to assign one type of task to the workers, possibly through heuristics such as the ones developed by the authors of [
11].
2.2. Spatial Crowdsourcing
Spatial crowdsourcing is when a crowdsourcing task requires workers to be at a specified location while completing the task [
12,
13]. Mobile and spatial crowdsourcing have a significant amount of overlap. PMDs provide the power of connectivity and computation anywhere the user brings the device, assuming a strong enough network connection. Indeed, the mobility of smartphones along with ubiquitous connectivity lends itself as an effective platform for spatial crowdsourcing tasks. While mobile devices are the primary setting for spatial crowdsourcing now, the rise of embedded IoT technology, especially in vehicles allows for the growth of new spatial crowdsourcing systems that work more passively than mobile crowdsourcing tasks [
14].
In [
11,
15], parking location monitoring systems are an example of spatial crowdsourcing. In the envisioned platform, workers report open parking spaces in certain locations. This information can assist drivers in finding street parking, and provide other information regarding their destination, reducing wasted time searching for a parking space. Information regarding the location of parking in proximity to an end-user’s destination will reduce energy and time wasted looking for parking, and will aid drivers in making efficient navigation decisions.
Spatial crowdsourcing may also be utilized for panoramic tour construction [
16,
17]. By stitching together multiple images from a allocation and creating a panoramic tour, platforms such as Google Maps may be able to update their street-level views much more rapidly, removing the need for dedicated vehicles and workers to perform the task of taking pictures along fixed, assigned routes. Beyond stitching crowdsourced images for panoramic tours, spatial crowdsourcing may be utilized for generating 3D maps for autonomous vehicles [
16]. With the utilization of crowdsourced data for this application, costs associated with 3D map generation can be drastically reduced, enabling more detailed, useful maps for future autonomous vehicle navigation.
2.3. Automatic Sensing Crowdsourcing
Automatic sensing crowdsourcing takes advantage of sensors embedded into smart vehicles to passively collect data while operating that is sent to a central system for processing and use. One of the main objectives of passive collection tasks is to schedule the collection based on the vehicle trajectories [
11]. Other passive collection tasks may be utilized to track vehicle fleets in real-time [
18].
Although this example utilizes a mobile phone for proof-of-concept, the system developed in [
19] is an example of automated sensing crowdsourcing. The system leverages smartphones (playing the role of future embedded sensors) mounted on the dashboard of a vehicle to monitor the quality of pavement. In areas with high volumes of traffic, a system like this can collect a very large amount of data (including redundant data that can smooth out individual observations). This can be leveraged by ITS operators to make more informed decisions on how to improve existing roadways. In addition, trip data can be incorporated with this to determine the best ways to re-route traffic when the roadway does require physical repairs. Another example is that of automated lane imaging [
20], which can be utilized to gain information on the width and condition of roads automatically, and can be leveraged into machine learning algorithms that aid in navigation for autonomous vehicles [
21].
By automating data collection, the system becomes more reliable and safer, as less bias would exist in the data collected, and humans would not be distracted from vehicle operation by the data collection processes. The abundance of passively-collected data may even bring the future of fully automated transportation and logistics systems to fruition by providing researchers and practitioners more robust datasets to train machine learning models that aid in future traffic control systems. These developments would make roadways safer and more efficient for end-users, and would promote increased economic productivity in the regions they serve.
3. What Enables Its Crowdsourcing?
Advances in communications and cloud computing are resulting in those technologies becoming the main technological underpinning of ITS crowdsourcing systems. Increased connectivity increases the size and number of gateways to the main cloud systems, which act as scalable storage and computing power centers. The adaptability of these technologies in concert lead the technologies to act almost as a living system, that can respond in real time to shifts in use and demand. In this section, we examine technologies that enable automated ITS crowdsourcing and investigate some practical use-cases.
3.1. Its Technologies Enabling Crowdsourcing
ITS technology powered by IoT devices such as embedded sensors in smart vehicles and new communications infrastructure stand to provide an untapped basis for mobile crowdsourcing applications from the exponential growth of inter-connectivity between all nodes of the overlaid communications network. The rise of 5G networks can lead to all sorts of high-bandwidth network topologies connecting smart vehicles in a VSN [
2] (V2X), such as vehicle-to-vehicle (V2V), vehicle-to-infrastructure (V2I), vehicle-to-pedestrian (V2P), and vehicle-to-drone (V2D). See
Figure 4 for an illustration of all of these communication paradigms working in concert together. A common challenge associated to the effective exploitation of ITS technologies to enable automated crowdsourcing tasks is its efficient infrastructure planning. In [
22], we developed a smart placement of RSUs assisting crowdsourcing ITS infrastructure in a way to maximize the effective coverage of transmission architecture, while considering solar-powered RSUs and an amortized budget. We also developed heuristics designed to find a suboptimal configuration faster than solving a mixed integer programming problem [
23] and to adjust RSU coverage to unexpected events [
24].
Figure 5 shows RSU placement through our optimization framework. The aim of the framework is to determine an effective placement of RSUs in order to maximize the sum of the “demand” of all the covered points, where the “demand” could refer to any metric of interest for ITS planners, given an amortized budget consisting of periodic operational costs and amortized capital expenditures. In other words, this framework was purposely designed to be a general problem, so the “demand” metric can be freely changed based on the planner’s choice. In our example, we defined the “demand” as a normalized linear combination of randomly sampled data meant to represent traffic fluctuations, along with car accident data sourced from NYC OpenData (
https://data.cityofnewyork.us/Public-Safety/Motor-Vehicle-Collisions-Crashes/h9gi-nx95). In our case, overall coverage efficiency is 92.5% given the provided by budget. The framework places RSUs to cover highly demand points of interest. The coverage of low demand points is a consequence of them existing in the same general region as high demand points. The placement of RSUs aids in intelligent infrastructure planning, providing increased connectivity. The improvement in the number and quality of connection gateways, along with smart vehicles armed with a dizzying array of sensors and increased autonomous driving capabilities, provide new means of collecting transportation-related data for ITS mobile crowdsourcing applications.
Automated sensors on vehicles, UAVs, and RSUs can collect data that could not be manually collected by humans in parallel with human-produced data, and provide wireless transmission access for the data to the cloud. When the data is aggregated, it could be applied to aid ITS planners for decision-making, or can be used by end-users on the roadways to make more informed decisions while in transit.
3.2. Vehicular Social Networks
A VSN is a type of ad-hoc, dynamic connection between users of smart vehicles in an ITS based on common factors such as destination, route, and location [
2,
4]. In the context of ITS crowdsourcing, VSNs may be leveraged for operational applications. VSNs can be used to gather information on traffic congestion, how vehicles interact with each other on the roadways, and may be used to relay other information from one user to another. VSNs offer the advantage of connectivity based on shared interests, much like ordinary social networks (OSN). The social aspect allows for more granular sharing aspects between users, providing a mechanism for more accurate and useful information dissemination.
We illustrate a benefit of mobile and automated sensing crowdsourcing for ride-share recommendation systems in
Figure 6. The proposed recommendation system utilizes automated sensing and mobile-crowdsourcing techniques to improve traditional ride-hailing and regular curbside ride-sharing services by introducing collaboration among drivers, improving quality of service by reducing customer wait time and wasted searching for new fares [
25,
26]. In this comparison, we evaluate the performance of the proposed recommendation system for both regular and ride-hailing taxi services. There are four cases: Two traditional taxi services without recommendations (Regular Trad. and Ride-hailing Trad.) and two taxi services based on our proposed recommendation system (Regular Recom. (VSN), and Ride-hailing Recom.). With VSN, the taxis can instantaneously share their data through the vehicular social networks, which allow drivers to have more knowledge about the customers’ demand and the traffic situation. We demonstrate around 10–15% reductions in customer waiting times for traditional curbside services, 20–25% reductions in waiting time for ride-hailing services, 60% reductions in extra distance driven by workers while searching for new fares in curbside services, and 25% reductions in extra driven distance by ride-share workers. This is a significant savings for both the customers and the taxi drivers that are exploiting crowdsourcing services through VSN.
3.3. Cloud Techniques Enabling Crowdsourcing
The cloud is a network of computers that work in coordinated fashion to process data or store it in parallel. Cloud computing infrastructure stands to serve as the basis for many ITS crowdsourcing platforms; IoT and communications technology discussed in the previous section go hand-in-hand with the scalability offered by advances in cloud computing technology and techniques. Cloud computing technologies and techniques such as edge computing [
27], distributed file processing, storage, task management [
28,
29,
30], worker reliability metrics [
31], and trust systems [
32] can streamline the system. Edge computing distributes work away from the centralized control system by leasing computations out to devices on the edge of the network (e.g., workers’ PMDs or other embedded computers in the network, such as smart vehicles, RSUs, or UAVs) to optimize the transmission of data across the network for prevention of bandwidth issues. Distributed file-systems allow for cloud storage and resources to scale up or down in response to demand, and worker reliability metrics will ensure that the system is not polluted with useless data. In this section, we touch on some key characteristics of the cloud that enable ITS crowdsourcing.
Figure 7 conveys a basic image of how these cloud systems mat work in practice.
3.3.1. Edge Computing
Computers “on the edge” of a cloud computing network are far away from the centralized data centers that make up the heart of the system’s computing and storage capabilities [
33]. While edge computing devices are less powerful than the powerful servers in the datacenter, they exist to offset the main bottleneck now present in cloud computing: data transmission across the network.
Figure 8 illustrates the trade-off in action. In
Figure 8, “slow edge GPU” means the edge computers in the simulated network have a benchmark speed of processing 250 MB/s of image data, which is analogous to LiDAR point set data—this benchmark speed is based on the Nvidia Jetson System on Chip (SoC). The “med.” and “fast” speeds correspo nd to 500 MB/s and 1000 MB/s, respectively, which would be present in more powerful edge computing devices that may consist of crowdsourced computing power similar to the Folding@Home project (
https://foldingathome.org).The figure shows that in the “fast” case, 60% of the tasks are assigned to the the edge computers, as their speed offsets the high latency associated with sending all tasks to the cloud, leading to overall lower system latency even though the cloud computing center is much more powerful than the edge computers [
34]. Computing tasks can be performed on the edge to minimize the amount of network bandwidth taken up by transmission of data to the center of the network, eschewing a traditional server-client framework for a much more scalable, flexible network topology [
27].
ITS crowdsourcing frameworks likely will involve many spatial and mobile tasks being performed in parallel amongst the many users of the future transportation systems where transmission of information across the many connected entities will be massive. Edge computing will allow for distribution of computing and storage tasks to be outsourced away from the centralized servers to devices much closer to data sources, reducing bandwidth consumed by transmissions, reducing loads on the central servers, and speeding up the latency of a task’s completion.
3.3.2. Distributed Storage
Technologies such as Apache Hadoop allow for a distributed filesystem to run on a connected network of computers in the cloud, essentially acting as one massive data storage system. This distributed storage allows for “easy” large-order scalability simply by adding (or subtracting) computational resources, acts as a redundancy system in the case of the failure of a node in the network, and distributes data across in the system in a way to reduce latency.
A cloud-based data pipeline that shares data amongst members of a VSN based on factors such as similarity of driving patterns was proposed in [
10]. In conjunction with edge computing and scaling systems such as Hadoop, this type of architecture can further reduce bandwidth consumption by data in the network, along with developing more personalized and spatially segmented services.
An example of an ITS crowdsourcing technology that requires high computational power and storage is the use of shared LIDAR for autonomous vehicles. Elevated LiDAR (ELiD) [
35] is touted as an alternative or at least complementary system to LiDAR units placed on-board, in order to reduce the amount of computational and cost overhead required to safely operate an autonomous vehicle. One challenge for ELiD systems is to allocate their real-time generated data to edge and cloud servers for processing of the raw point cloud data in order to minimize the latency and speed up their operation to provide vehicles instantaneous information, e.g., 3D mapping. In
Figure 8, we provide the performance of a backhaul latency-minimizing framework that considers low communication latency to edge computers with less-powerful resources (leading to larger processing latency), and longer communication latency but shorter processing times to the cloud.
Figure 8 visualizes this trade-off—as the processing speed of the edge computers increases, average processing latency for each LiDAR decreases. Optimizing the task-server allocation for ITS crowdsourcing applications is important to enable smooth operation of the system.
3.4. Automated Crowdsourcing Management
The complexity of ITS crowdsourcing frameworks, large set of tasks, and the numerous potential workers requires systems in place to allocate tasks across the platform. Spatial tasks involve their own set of unique challenges regarding task allocation [
36]. The authors of [
37] developed an automated assignment system for mobile crowdsourcing frameworks based on relationship metrics, and network service parameters.
Automated management systems would go hand-in-hand with the cloud systems mentioned earlier. By automatically optimizing the task allocation, the tasks can be divided spatially and results can be passed through the network at optimized access points and data can be pipelined, collected, and stored in locations that are geographically optimize to be best-suited for use based on the areas it was collected at.
Recent developments of cloud infrastructure is laying the groundwork for ITS’ to act as a massive crowdsourcing platform. Incorporating traditional transportation infrastructure into the IoT will make transportation systems into cloud computers, providing ITS operators a much greater amount of information for decisions that would improve the quality of the transportation experience for anyone who utilizes the roadways in the future.
3.5. Reliability Scoring
When humans are in the loop of crowdsourcing systems, oftentimes there is uncertainty on how reliable human-crowdsourced responses are. Until automated systems are robust enough, humans stand to play roles in ITS crowdsourcing. With ITS data, reliability is a key metric of focus, as faulty information can lead to potentially fatal mistakes.
Researchers developed a probabilistic model for predicting the reliability of human crowdsourcing workers [
31]. Development of reliability systems such as this stand to not only assist in the improvement of the quality of human-produced data, but also automatically collected data—probabilistic systems can be used as a type of anomaly detection to filter out any corrupted input data from transmission errors. While not specifically a cloud technology, this type of reliability metric can be utilized in conjunction with other cloud infrastructure to minimize bandwidth consumption by faulty data or by malicious actors performing denial of service attacks.
The work in [
38] addresses the reliability scoring and automated crowdsourcing management problems in crowdsourcing-based real-time vehicle navigation where routes are continuously updates according to other workers’ feedback. Oftentimes, there is uncertainty surrounding the accuracy of data reported by workers in crowdsourcing systems.
Figure 9 visualizes how accounting for uncertainty may influence navigation decisions. The red path corresponds to a traditional navigation following the shortest path taken by a driver, which has an obstruction that forces the driver to reroute. In [
38], an algorithm designed to account for these types of blockages, automatically using other road users’ feedback, finds a new optimal path for the driver (blue path), and the green path is the optimal path when also attempting to minimize the risk associated with traversing along paths where the data collection is not as reliable. Hence, the crowdsourcing platform proposes to the driver to follow a longer route to avoid the risk to be trapped in a congested route due to its limited knowledge.
3.6. Security and Privacy
Whenever data is processed in the cloud/edge and transmitted over the internet, measures must be taken in order to protect the identities and data of crowdsourcing workers. Part of the aim of privacy and security researchers in this area is to ensure “privacy-by-design” [
39]. End-to-end (E2E) encryption [
40,
41] is vital to harden communications across the network, so malicious actors cannot passively collect information in transit. In addition to E2E encryption, obfuscation techniques related to the location metadata of the data collection agent [
42,
43,
44,
45] are necessary when the data is being processed at the end of its journey through the communications network. These obfuscation techniques have also been explored in the edge as well [
46].
4. Overview of the State of the Art
In this section, we briefly review the current state of ITS crowdsourcing. While doing so, we categorize work based on the concept explored or problem addressed, and summarize the general contribution made. In addition, we mark which crowdsourcing variant that each topic is most closely associated with. The summary of our findings are shown in
Table 1.
The literature we reviewed can be broadly broken down into five main categories. The Architecture/Planning category is mainly survey papers that define and explore one of the crowdsourcing models in depth, or are papers that focus on the planning/infrastructure that underpins crowdsourcing systems. We consider which class of crowdsourcing each paper contributes most to. The ITS crowdsourcing applications category contains papers that are systems that apply one or more of the crowdsourcing architectures to solve an ITS problem. The task recruitment category contains work that mainly focuses on efficiently selecting workers, assigning tasks relevant to the system, and rating their performance. The edge computing category focuses on the edge computing networks along with distributed storage that plays a vital role in the operation of a crowdsourcing system in real time. We conclude with the security/privacy category, which focuses on work that is designed to keep these crowdsourcing systems safe for people to participate, and to ensure that data is collected ethically.
The architecture/planning set of papers mainly focuses on the definitions of the ITS crowdsourcing systems as well as planning systems for the implementation of the crowdsourcing architecture and systems. In order to understand the core aspects of crowdsourcing systems, we considered surveys of mobile [
1] and spatial crowdsourcing [
12,
13], which gave us broad insight into how these systems operated, and how they may work in ITS. After, we reviewed papers pertaining to the current technologies that will underpin these crowdsourcing systems in ITS. These include VSN [
2,
4], which will provide the main communications backbone for ITS crowdsourcing systems. In addition, we considered VSN components. We looked at the planning of RSU installation [
22,
23,
24], along with UAVs as flying RSUs [
3], trusted cloud computing systems [
32], automated sensing systems [
14,
35], and decentralized control/access systems [
28,
29].
After reviewing the systems that serve as the basis for ITS crowdsourcing, we review several promising applications. These include street parking monitoring and reporting [
15], lane data collection [
20] and detection [
21] social-media based traffic [
8,
9] and incident detection [
6], advanced route planning based on on-board sensors [
47] and considering uncertainty [
38] and crowdsourced last-mile navigation data [
7], and 3D mapping [
16,
17]. We chose these application examples due to their potential capability of significantly improving the performance of existing transportation systems.
Crowdsourcing applications cannot work well without efficient task recruitment and verification processes. We reviewed techniques developed by authors to address this. Work has been dome to develop mullti-objective recruitment systems [
11], task allocation solvers [
30,
36,
37], ratings systems to reward good workers and incentivize poor performers to improve [
31], and a system to take into account uncertainty associated with data collection [
38].
We then moved on to looking at more cutting-edge advances related to ITS crowdsourcing. Edge computing is a recent development that aims to utilize computers closer to the data collection sources to minimize bandwidth utilization in the network [
27]. We reeviewed papers that related on applying edge computing in crowdsourcing applications [
33,
48], and how data processing on the edge is more efficient [
10,
49].
Finally, we considered privacy and security measures that are being undertaken to protect crowdsourced worker’s identities. Security and privacy must be built into the system while it is being developed [
39]. End-to-end encryption is utilized to safeguard communications in transit [
40,
41], and obfuscation methods are used to anonymize data during mission-critical processing tasks [
42,
43,
44,
46].
Our review encompasses all of these areas related to ITS crowdsourcing to get an idea of how it is a system of systems. The architecture survey papers provide the system-level view. Application papers show the broad set of subsystems that will be powered by ITS crowdsourcing systems. The recruitment papers consider the human aspect of the system. The edge computing papers highlight technological advancements that continue to improve the viability of ITS communications systems. Finally, we consider security and privacy papers, as these are a critical subsystem that ensures the systems are safe in practice.