Article

Service Benefit Aware Multi-Task Assignment Strategy for Mobile Crowd Sensing

1 School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
2 Key Laboratory of Optical Communication and Networks, Chongqing 400065, China
3 Key Laboratory of Ubiquitous Sensing and Networking, Chongqing 400065, China
* Author to whom correspondence should be addressed.
Sensors 2019, 19(21), 4666; https://doi.org/10.3390/s19214666
Submission received: 20 September 2019 / Revised: 23 October 2019 / Accepted: 25 October 2019 / Published: 27 October 2019
(This article belongs to the Section Internet of Things)

Abstract

Mobile crowd sensing (MCS) systems usually attract numerous participants with widely varying sensing costs and interest preferences to perform tasks. Accurate task assignment plays an indispensable role in such systems but faces many challenges (e.g., how to simplify the complicated task assignment process and improve the matching accuracy between tasks and participants while guaranteeing the credibility of submitted data). To overcome these challenges, we propose a service benefit aware multi-task assignment (SBAMA) strategy in this paper. Firstly, the service benefits of participants are modeled based on task difficulty, task history, sensing capacity, and sensing positivity to meet the differentiated requirements of various task types. Subsequently, users are clustered by an enhanced fuzzy clustering method. Finally, a gradient descent algorithm is designed to match task types to the participants achieving the maximum service benefit. Simulation results verify that the proposed task assignment strategy not only effectively reduces matching complexity but also improves the task completion rate.

1. Introduction

The diversification and popularization of embedded mobile devices enable innumerable user-centric mobile crowd sensing (MCS) applications (e.g., traffic monitoring, pollution monitoring, and indoor positioning) [1,2,3,4,5,6,7,8,9]. A typical MCS system includes two entities (i.e., users and platform), where users not only publish tasks to acquire information from the platform but also collect sensing data for the platform. As a bridge between task publishers and participants, the platform selects suitable participants to complete tasks for publishers. Generally, sensing tasks in an MCS system are allocated to multiple participants and accomplished cooperatively [10,11,12,13,14].
It is crucial for an MCS system to provide task publishers with reliable services. The sensing process of an MCS system relies on a large number of participants whose sensing positivity and sensing capacity vary across tasks. If a task is randomly assigned to participants, the quality of its results may be severely affected, the credibility of collected sensing data may be reduced, and the corresponding computing resource consumption may increase [15,16,17,18,19,20]. Therefore, task assignment strategies should be reasonably designed to eliminate the above uncertainties, which can dramatically improve collaboration among MCS participants [21,22,23].
Technically, how to objectively evaluate the matching accuracy of task assignment strategies is a major challenge and remains in dispute. In response to this challenge, many studies presented solutions using varying methods and with different emphases. Jin, H et al. assigned MCS tasks to participants according to their capabilities to maximize sensing coverage [24]. The authors of [25,26] assigned tasks based on the history of submitted high-quality results. However, they had different definitions of data quality. Yue, W et al. mainly considered the coverage quality of sensing results [25], while Sabrina, K.N.M et al. measured the data quality based on the actual evaluation of data [26]. Liu, S et al. evaluated a participant's service quality based on its context and cost, and developed a Modified Thompson Sampling Worker Selection (MTS-WS) algorithm to select workers in a reinforcement learning manner [27]. Addressing the problem of performance maximization in MCS, a context-aware hierarchical online learning algorithm was proposed in [28]. In detail, a local controller (LC) in the mobile device of a participant regularly observed the participant's context, based on which the participant's context-specific performance could be estimated and the participants could be selected. Although the authors of [24,25,26,27,28] considered multiple task-related factors, some limitations remain: factors that may intuitively affect MCS performance (e.g., task difficulty, task history, sensing capacity, and sensing positivity) were omitted. Besides, the quality of data uploaded for different task types was not clearly modeled in these studies. Consequently, it may be impossible for the platform to assign tasks reasonably and accurately, resulting in low matching accuracy and task completion rate, high computing resource consumption, and low data credibility.
To address the above limitations, we propose a service benefit aware multi-task assignment (SBAMA) strategy for MCS. The service benefit of participants is first modeled. Subsequently, we propose an enhanced Fuzzy C-Means (FCM) algorithm to dynamically cluster users in terms of their task preferences. Finally, an iterative participant search method based on gradient descent is designed to match participants with the best service benefit in each cluster quickly and accurately. The proposed SBAMA strategy offers a tradeoff between the interests of participants and the platform given a fixed budget and a certain movement distance. Simulation results verify that SBAMA can quickly and accurately find the most appropriate participants for all types of tasks, so the task completion rate achieved by the proposed strategy is high. The main contributions of our paper can be summarized as follows:
(1) A service benefit evaluation model is established, from several different perspectives of tasks and participants, to comprehensively interpret impacts of task difficulty, task history, sensing capacity, and sensing positivity on service benefit received by the platform.
(2) An enhanced FCM algorithm is designed to cluster users. Specifically, a task preference threshold allows participants to join more than one cluster if such participants have similar task preferences. The generated clusters can effectively reduce the time consumption of the optimization problem while increasing participant matching accuracy.
(3) An iterative gradient descent algorithm is proposed to tune the tradeoff between interests of participants and platform. Particularly, it decouples the service benefit from movement distance such that the most appropriate participants for tasks can be found accurately and quickly.
The rest of this paper is organized as follows. Related works are introduced in Section 2. Section 3 presents the task assignment framework. The service benefit model is proposed in Section 4. Section 5 elaborates on the task assignment strategy. Simulation results validating our proposed SBAMA are given in Section 6. Finally, Section 7 concludes this paper and discusses future work.

2. Related Works

In recent years, task assignment for MCS systems has been attracting increasing research attention. Wei, G et al. proposed a heterogeneous multi-task allocation mechanism based on spatiotemporal correlation [29]. The reference and non-reference tasks were distinguished through utilizing granularity settings based on which the best triple (i.e., worker-cycle-region) was obtained. Besides, to improve the task assignment efficiency, a decomposition and combination framework was designed in [29] for large-scale scenarios. In [30], a location-based online task assignment method was proposed under constraints of distance and budget, incorporating quality/progress-based, task-density-based, travel-distance-balance-based, and bio-inspired-travel-distance-balance-based algorithms, to search for the optimal participants maximizing the overall task quality. Wang, J et al. studied the deterministic model and random model of trajectories in vehicle-based MCS and proposed an effective vehicle recruitment algorithm to minimize the overall recruitment cost [31]. However, the authors of [29,30,31] only considered either the mobility of participants or the spatiotemporal correlation among tasks, where impacts of task requirements and service benefits were ignored. A failure to consider these critical factors together may result in an inaccurate participant matching during the task assignment process, which reduces the task completion rate.
The criterion of participant selection in task assignment has been extensively studied. Considering various factors affecting the task participant selection, a task assignment framework was proposed in [32]. Specifically, a unified estimation function was employed to calculate the feasibility of task assignments and the optimal task assignment using a greedy algorithm was obtained. Wang, L et al. and Alsayasneh, M et al. focused on the context information of participants to enhance the task completion rate and MCS quality [28,33]. In particular, a diverse task composition scheme was studied in terms of participant personalities to dramatically improve user experience [33]. Mavridis, P et al. inferred skills required for tasks from available skill sets and modeled a hierarchical skill tree to match participants with tasks, which was however computationally intensive and therefore inapplicable to scenarios with massive users and tasks [34]. Besides, the credibility of data submitted by participants could not be guaranteed only based on their skills. Although the authors of [28,32,33,34] assigned tasks to relatively appropriate participants to ensure the data credibility, when evaluating the participant selection, key factors such as participant positivity and task difficulty should be carefully considered for the matching accuracy. Therefore, in this paper, we comprehensively evaluate the service benefits of participants as the matching criterion and propose the SBAMA multi-task assignment strategy to enhance MCS performance.

3. Task Assignment Framework

The proposed SBAMA framework is shown in Figure 1. Since tasks arrive randomly, tasks assigned by the platform can be divided into $n$ identical time intervals, denoted as $t = [t_1, t_2, \ldots, t_\eta]$. At the beginning of $t_\eta$, task publishers $R = \{R_1, R_2, \ldots, R_m\}$ submit tasks of different types $A = \{A_1, A_2, \ldots, A_m\}$ to the platform. Each publisher can only submit one type of task within each time interval. Note that each task type contains $q$ subtasks, i.e., $A_i = \{a_1^i, a_2^i, \ldots, a_q^i\}$. Specifically, a task has several requirements (i.e., deadline, location, data format, and ID), denoted as $T_{doc} = \{T_d; l_{A_i}; \varpi; ID\}$, where $T_d$ refers to the time range from task start $t_s$ to task end $t_e$ (i.e., $T_d = \{t_s, t_e\}$) and participants must submit sensing results before this deadline. The specific locations for subtasks of task $A_i$ are denoted by $l_{A_i} = \{l_{i1}, l_{i2}, \ldots, l_{iq}\}$. Due to different content types for each task, the format of collected data also varies. Without loss of generality, the data format is identified by $\varpi = \{\varpi_1, \varpi_2, \ldots, \varpi_k\}$, where $\varpi_k$ specifies the content type acceptable for each task.
The platform then publishes tasks to all potential participants $U = \{u_1, u_2, \ldots, u_n\}$ satisfying task requirements $T_{doc}$. Task candidates then submit a subset of the received tasks $S_A = \{A_1, A_2, \ldots, A_k\}$, where $k \le m$, to the platform indicating their task preferences. Subsequently, the platform employs the task assignment strategy to select participants with high service benefits and low costs to perform tasks. In other words, the optimal task participants $W = \{W_1, W_2, \ldots, W_y\}$ ($y \le n$) are found.
When tasks are completed, the platform compensates the selected participants according to their costs and task difficulties. Finally, upon receiving feedback from the platform, task publishers score the service quality of the participants. Apparently, the scores should be exploited by the platform as an important reference to evaluate and update the service benefit, which further serves as an indicator for the next rounds of task assignments.
Generally, MCS tasks are location-dependent and participants have to travel a certain distance to perform the tasks. Therefore, movement distances are an inevitable cost for participants, i.e., the cost $C_{ij}$ of participant $u_j$ performing task $A_i$ should be a function of movement distance $d_{ij}$ and sensing cost $c_{ij}$. Evidently, $C_{ij}$ increases with the distance traveled by participants (i.e., $C_{ij}(\tilde{d}_{ij}) \ge C_{ij}(d_{ij})$ for $\tilde{d}_{ij} \ge d_{ij}$) and its growth rate should also increase with the distance (i.e., $\frac{\mathrm{d}^2 C_{ij}(d_{ij})}{\mathrm{d} d_{ij}^2} \ge 0$). The cost $C_{ij}$ is defined as follows
$$C_{ij} = a_{ij} d_{ij}^2 + b_{ij} d_{ij} + c_{ij} \tag{1}$$
In Equation (1), $a_{ij} > 0$ and $b_{ij} > 0$ are pre-defined system parameters and $c_{ij}$ is constant. Participants expect to get rewards from the platform after completing tasks. Specifically, the reward depends on the task difficulty and movement distance. Given a difficult task, its price per meter should be high, and prices of different types of tasks are denoted by $P = [P_1, P_2, \ldots, P_m]$. Besides, the reward should not exceed the budget. Note that participants can perform different subtasks in the same time interval to maximize their incomes only when their locations are not in conflict. Thus, the income of a participant can be calculated by deducting the cost from the reward; there holds
$$\psi_j(d_{ij}) = \sum_{i=1}^{m} \left( P_i \cdot d_{ij} - C_{ij} \right) \tag{2}$$
For the platform, its profit mainly comes from service benefits contributed by participants. Intuitively, participants with greater service benefits generate more profits for the platform and therefore should be prioritized. However, the growth rate of the profit should slowly attenuate because the marginal service benefit of a participant becomes smaller after the participant is selected. The profit obtained from completing a task can be calculated as follows
$$f_i(s_{ij}, d_{ij}) = \sum_{j=1}^{n} w_i \cdot \ln(1 + s_{ij}) - \sum_{j=1}^{n} P_i \cdot d_{ij} \tag{3}$$
where the system coefficient $w_i > 0$ is determined by the platform for each task type. Both the platform and participants want to maximize their profits or incomes. Hence, a reasonable task assignment strategy should select participants with low sensing costs and high service benefits, where a tradeoff between the platform and participants must also be made under constraints of the maximum movement distance $r_j^{\max}$ and the budget of each type of task $B = \{B_1, B_2, \ldots, B_m\}$. Rationally, the reward should exceed the cost to motivate participants to travel within $r_j^{\max}$. Eventually, the task assignment can be formulated as the following optimization problem.
$$\max\ \psi_j(d_{ij}) = \sum_{i=1}^{m} \left( P_i \cdot d_{ij} - C_{ij} \right) \tag{4}$$
$$\max\ f_i(s_{ij}, d_{ij}) = \sum_{j=1}^{n} w_i \cdot \ln(1 + s_{ij}) - \sum_{j=1}^{n} P_i \cdot d_{ij} \tag{5}$$
$$\text{s.t.} \quad \begin{cases} \sum_{i=1}^{m} d_{ij} \le r_j^{\max},\ d_{ij} > 0 \\ \sum_{j=1}^{n} P_i \cdot d_{ij} \le B_i \\ 0 < s_{ij} \le 1 \end{cases} \tag{6}$$
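The quantities in Equations (1)–(6) can be illustrated with a minimal Python sketch. The function names and all numeric values below are hypothetical and chosen only for illustration; they are not the parameters used in the simulations.

```python
import math

def movement_cost(d, a, b, c):
    """Quadratic cost of Equation (1): C = a*d^2 + b*d + c."""
    return a * d * d + b * d + c

def participant_income(prices, distances, coeffs):
    """Income of one participant (Equation (2)): reward minus cost, summed over tasks."""
    return sum(p * d - movement_cost(d, *abc)
               for p, d, abc in zip(prices, distances, coeffs))

def platform_profit(w, price, benefits, distances):
    """Profit for one task type (Equation (3)): log-shaped benefit minus rewards paid."""
    gain = sum(w * math.log(1.0 + s) for s in benefits)
    paid = sum(price * d for d in distances)
    return gain - paid

def is_feasible(user_distances, r_max, task_payment, budget):
    """Distance and budget constraints of Equation (6) for one user and one task type."""
    return (all(d > 0 for d in user_distances)
            and sum(user_distances) <= r_max
            and task_payment <= budget)

# Illustrative numbers only (assumed, not from the paper).
income = participant_income([3.0, 2.0], [1.0, 2.0],
                            [(0.5, 1.0, 0.2), (0.1, 0.5, 0.1)])
profit = platform_profit(w=10.0, price=3.0, benefits=[0.5, 0.8], distances=[1.0, 0.5])
feasible = is_feasible([1.0, 2.0], r_max=5.0, task_payment=4.5, budget=6.0)
```

Because the benefit term is logarithmic while the payment term is linear in distance, the profit grows more and more slowly as service benefits increase, which is the attenuation described above.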

4. Service Benefit Evaluation

Service benefit is an important indicator for the platform to estimate the potential profit gained from a certain participant, which often relates to task requirements. In this paper, task difficulty, task history, sensing capacity, and sensing positivity are employed, from perspectives of tasks, participants, and publishers, to comprehensively evaluate the service benefit of all types of tasks, so as to achieve accurate participant matching and reliable MCS data collection.

4.1. Sensing Positivity

Sensing positivity refers to the motivation of participants in performing sensing tasks, which is a dynamic process. Given the same sensing capacity, a higher sensing positivity signifies a greater contribution to the platform. Interactions between participants and the platform are employed to measure the sensing positivity, where interaction frequency and task performance are two major observable indicators for these interactions. Specifically, performance $p$ is a function of response time $\Delta t$ and cost $C_{ij}$. If a participant has a relatively low $\Delta t$ and $C_{ij}$, the platform deems his/her performance positive. Due to the restrictive task deadline and movement distance, $p$ decreases with growing $\Delta t$ and $C_{ij}$, and then gradually stabilizes. Therefore, performance $p_j^i$ of participant $u_j$ in task $A_i$ can be calculated by
$$p_j^i = \log_2\left( \frac{1}{C_{ij} \cdot \Delta t} + 1 \right) \tag{7}$$
where $\Delta t = t_q^i - t_s^i$ and $\Delta t \le T_d$. Here, $t_q^i$ is the time at which participant $u_j$ starts task $A_i$; obviously $\Delta t = 0$ if $t_q^i$ equals the task start time $t_s^i$, which yields the maximum performance of 1. Conversely, the minimum performance of 0 is obtained when $\Delta t = T_d$. Besides, when $C_{ij}$ is infinitely large, $p_j^i$ reaches 0 (i.e., $\log_2 1 = 0$), as bounded by
$$p_j^i = \begin{cases} 0, & \Delta t = T_d \\ \log_2\left( \frac{1}{C_{ij} \cdot \Delta t} + 1 \right), & 0 < \Delta t < T_d \\ 1, & \Delta t = 0 \end{cases} \tag{8}$$
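The piecewise performance definition can be written as a short Python function. This is only a sketch under the assumption that the interior branch has the form $\log_2(1/(C_{ij}\cdot\Delta t) + 1)$; the function name and the sample values are hypothetical, and $T_d$ is treated here as the length of the allowed time window.

```python
import math

def performance(cost, dt, deadline):
    """Piecewise performance p of Equation (8), under the assumed interior form."""
    if dt <= 0:
        return 1.0   # participant starts exactly at the task start time
    if dt >= deadline:
        return 0.0   # participant starts only when the deadline is reached
    return math.log2(1.0 / (cost * dt) + 1.0)

# Performance drops as the response time grows (cost fixed at an assumed value of 2.0).
early = performance(cost=2.0, dt=1.0, deadline=10.0)
late = performance(cost=2.0, dt=5.0, deadline=10.0)
```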
Interaction frequency is another important indicator for sensing positivity. A high interaction frequency signifies a stable and positive sensing behavior, and the interaction frequency therefore has the same weight as the task performance in sensing positivity. The sensing positivity of participant $u_j$ in task $A_i$ can be obtained as
$$\chi_j^i = \frac{f_j^m}{\sum_{j=1}^{n} \sum_{i=1}^{m} f_j^i} \cdot p_j^i \tag{9}$$
In Equation (9), $f_j = \{f_j^1, f_j^2, \ldots, f_j^m\}$, where $f_j^m = 1, 2, \ldots, h$ refers to the tasks participated in by $u_j$, and $h$ denotes the latest task. The sensing positivity for different types of tasks is represented by a vector $X_j = \{\chi_j^1, \chi_j^2, \chi_j^3, \ldots, \chi_j^m\}$.

4.2. Task Difficulty

The task difficulty challenges the sensing capacity of participants, and we utilize a difficulty coefficient to measure it in this paper, where a small coefficient signifies a difficult task. However, dynamic MCS tasks are large in number and rich in type. Evaluating the task difficulty in real time will inevitably consume massive computing power of the platform, which is prohibitively expensive. Therefore, we employ an offline method evaluating the completion rate in task history to obtain the difficulty coefficient. Specifically, the completion rate is defined as the ratio of completed subtasks $A^i_{completed}$ to all published subtasks $q$, i.e., $\gamma_i = \frac{A^i_{completed}}{q}$ with $A^i_{completed} \le q$. Note that the completion rate of different types of tasks is denoted by a vector $\gamma = \{\gamma_1, \gamma_2, \ldots, \gamma_m\}$. Intuitively, a high completion rate signifies a simple task. Besides, the completion time must be before the deadline. Here, we exploit a theoretical completion time $\vartheta = \{\vartheta_1, \vartheta_2, \ldots, \vartheta_m\}$ to evaluate the actual completion time $\tilde{\vartheta} = \{\tilde{\vartheta}_1, \tilde{\vartheta}_2, \ldots, \tilde{\vartheta}_m\}$, and they are calculated by $\tilde{\vartheta}_i = \sum_{q=1}^{A^i_{completed}} a_q^i \cdot t_{makespan}^{actual}$ and $\vartheta_i = q \cdot \sum_{a_q \in A_i} t_{makespan}^{theoretical}$, where $q$ is the total number of published subtasks. If $\tilde{\vartheta}_i$ is within $\vartheta_i$, this task can be completed. Eventually, the difficulty coefficient can be obtained as
$$D_i = \begin{cases} \gamma_i \times \dfrac{\vartheta_i - \tilde{\vartheta}_i}{\max(\vartheta_i - \tilde{\vartheta}_i)}, & \vartheta_i \ge \tilde{\vartheta}_i \\ \gamma_i \times \left( 1 - \dfrac{\tilde{\vartheta}_i - \vartheta_i}{\max(\tilde{\vartheta}_i - \vartheta_i)} \right), & \vartheta_i < \tilde{\vartheta}_i,\ \vartheta_i \le T_d \end{cases} \tag{10}$$
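It is also worth noting that the task difficulty is relative. If the sensing capacity of a participant is low, the platform will not assign a difficult task to him/her. We denote the varying sensing capacity of participants with a vector $K = \{\kappa_{1j}, \kappa_{2j}, \ldots, \kappa_{ij}\}$, where $\kappa_{ij}$ is a constant determined by hardware specifications of sensing devices. Therefore, the relative difficulty coefficient $\mathcal{C}_{ij}$ of participant $u_j$ performing task $A_i$ can be calculated and normalized using a logarithmic function, as shown in the following
$$\mathcal{C}_{ij} = \begin{cases} \log_2\left( \kappa_{ij} \cdot D_i + 1 \right), & \vartheta_i \ge \tilde{\vartheta}_i \\ \log_2\left( \kappa_{ij} \cdot D_i + 1 \right), & \vartheta_i < \tilde{\vartheta}_i,\ \vartheta_i \le T_d \end{cases} \tag{11}$$
where $D_i$ takes the corresponding case of Equation (10). Relative difficulty coefficients are further denoted by matrix $C_{n \times m}$, as shown in Equation (12), where rows represent participants and columns indicate tasks.
$$C_{n \times m} = \begin{bmatrix} \mathcal{C}_{1,1} & \mathcal{C}_{1,2} & \cdots & \mathcal{C}_{1,m} \\ \mathcal{C}_{2,1} & \mathcal{C}_{2,2} & \cdots & \mathcal{C}_{2,m} \\ \vdots & \vdots & \ddots & \vdots \\ \mathcal{C}_{n,1} & \mathcal{C}_{n,2} & \cdots & \mathcal{C}_{n,m} \end{bmatrix} \tag{12}$$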

4.3. Task History

Generally, subjective feedback from task publishers is an effective benchmark for the credibility of data submitted by participants. However, due to insufficient labeled MCS data, it is challenging to objectively evaluate the data credibility. We exploit historical records of participants, including ID, collected data format $\varpi_j$ ($\varpi_j \subseteq \varpi$), and reward $\mathcal{R}_j$ ($\mathcal{R}_j = P_i \cdot d_{ij}$), as denoted by $Z_j = \{\#, \varpi_j, \mathcal{R}_j\}$, to evaluate data credibility. The collected data format $\varpi_j$ is compared with the task requirements $\varpi$ defined by $T_{doc}$, and a small gap signifies complete data, which can be indicated by $I_j = \frac{\varpi - \varpi_j}{\max(\varpi - \varpi_j)}$. Besides, the more data formats the platform receives, the higher data credibility a task can obtain. Generally, the value of data is defined as the quotient of the frequency and the residual of $\varpi_j$ in the task history of publisher $R_i$. The value of the historical data is estimated by the linear regression model $Y = X\omega + \delta$, where $Y$ and $\delta$ are both $U$-dimensional vectors, $U$ denotes the number of participants, $X$ is a $U \times Z_i$ matrix, and $\omega$ is a $Z_i$-dimensional vector. Therefore, the residual between the actual and the estimated data value is $\hat{\delta} = Y - \hat{Y} = (1 - M)Y$, where $M = X (X^T X)^{-1} X^T$ is the hat matrix, and a small residual indicates high data credibility. Consequently, the data credibility of participant $u_j$ for task $A_i$ can be obtained as $reliable_{ij} = I_j \cdot \frac{f_{value}^{j}}{\hat{\delta}}$. Moreover, a publisher gives a high score to participants requiring relatively low task payments, and the score given to participants can be obtained as follows
$$n_{ij} = \frac{reliable_{ij}}{\mathcal{R}_j} \tag{13}$$
We further adopt the logarithm to normalize the score, as shown in the following
$$n_{ij} = \log_2\left( 1 + \frac{reliable_{ij}}{\mathcal{R}_{ij}} \right) \tag{14}$$
The score matrix $N_{n \times m}$ can thus be obtained as follows
$$N = \begin{bmatrix} n_{11} & n_{12} & n_{13} & \cdots & n_{1m} \\ n_{21} & n_{22} & n_{23} & \cdots & n_{2m} \\ n_{31} & n_{32} & n_{33} & \cdots & n_{3m} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ n_{n1} & n_{n2} & n_{n3} & \cdots & n_{nm} \end{bmatrix} \tag{15}$$
where rows represent task publishers and columns represent participants.
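The residual obtained from the hat matrix, $\hat{\delta} = (1 - M)Y$ with $M = X (X^T X)^{-1} X^T$, can be checked numerically. The NumPy sketch below uses synthetic data; the function name, the data, and the dimensions are assumptions made purely for illustration.

```python
import numpy as np

def hat_matrix_residual(X, Y):
    """Least-squares residual (I - M) Y, where M = X (X^T X)^{-1} X^T is the hat matrix."""
    M = X @ np.linalg.solve(X.T @ X, X.T)   # solve() avoids forming the explicit inverse
    return (np.eye(len(Y)) - M) @ Y

# Synthetic records for 8 participants and 3 features (assumed values).
rng = np.random.default_rng(0)
X = rng.normal(size=(8, 3))
Y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=8)
residual = hat_matrix_residual(X, Y)
```

A useful sanity check is that the residual is orthogonal to every column of $X$, so `X.T @ residual` should be numerically zero.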

4.4. Service Benefit

Mathematically, the service benefit of participants is a function of task score $n_{ij}$, relative task difficulty $\mathcal{C}_{ij}$, and sensing positivity $\chi_j^i$, where $\chi_j^i$ serves as the weight of the service benefit indicating the motivation of participants. In addition, the service benefit grows monotonically with increasing task score and difficulty. As the marginal benefits of submitted data gradually decrease, the growth rate of service benefits drops and stabilizes. Thus, the service benefit of $u_j$ to $A_i$ can be formulated through the following inverse trigonometric function
$$s_{ij} = \frac{\chi_j^i}{\pi} \times \arctan\left( n_{ij} \cdot \mathcal{C}_{ij} \right) + \frac{1}{2} \tag{16}$$
However, the evaluation of service benefits depends on task history, which is inapplicable to new participants. Therefore, we propose to set the default value of service benefits to 0.5, indicating an uncertain service benefit for unknown participants. Besides, 0.5 also serves as a threshold to distinguish participants with low service benefits. Based on their task histories, we can reformulate the service benefit of participants as
$$s_{ij} = \begin{cases} \dfrac{\chi_j^i}{\pi} \times \arctan\left( n_{ij} \cdot \mathcal{C}_{ij} \right) + \dfrac{1}{2}, & \text{task history} \\ 0.5, & \text{no task history} \end{cases} \tag{17}$$
Similarly, service benefits of participants for different types of tasks can also be denoted by matrix $S_{n \times m}$.
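The service benefit with its default value for new participants can be sketched in a few lines of Python. This assumes the arctangent form with the sensing positivity divided by $\pi$ and an offset of 0.5; the function and argument names are hypothetical.

```python
import math

def service_benefit(positivity, score, rel_difficulty, has_history=True):
    """Service benefit s_ij; participants without task history get the default 0.5."""
    if not has_history:
        return 0.5
    return positivity / math.pi * math.atan(score * rel_difficulty) + 0.5

# Assumed inputs: positivity 0.8, task score 2.0, relative difficulty 1.5.
known = service_benefit(0.8, 2.0, 1.5)
newcomer = service_benefit(0.8, 2.0, 1.5, has_history=False)
```

With non-negative inputs and a positivity of at most 1, the arctangent term lies in $[0, 0.5)$, so the value stays between 0.5 and 1 and saturates as the product of score and difficulty grows.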

5. Service Benefit Aware Multi-Task Assignment

Technically, the optimization goal of MCS task assignment is to select participants with high service benefits and low costs, so as to balance the interests of participants and the platform, given constraints of movement distance and budget. To solve this optimization problem, we first cluster users (i.e., task candidates) according to their task preferences, and then exploit a gradient descent algorithm to find the optimal participants in each cluster.

5.1. User Clustering Based on Task Preference

In MCS scenarios with massive users, the matching accuracy of optimization algorithms always suffers from the large search range of task candidates. Therefore, we propose to employ the similarity among task preferences to cluster task candidates. Specifically, task preferences indicate the interest of users in certain tasks, which can be reflected by task acceptance rate and task performance. The task acceptance rate is defined as the proportion of tasks submitted by user $u_j$ to the total number of tasks submitted by all the selected participants, calculated by $p_{acc,j}^{m} = \frac{S_{A_i}^{u_j}}{\sum_{u_j \in W} \sum_{i=1}^{k} S_{A_i}^{u_j}}$, which further serves as the weight of the task preference. The acceptance rate of participant $u_j$ for different types of tasks can be denoted by vector $P_{acc,j}^{i} = \{p_{acc,j}^{1}, p_{acc,j}^{2}, \ldots, p_{acc,j}^{m}\}$. The task performance of users, $p_j = \{p_j^1, p_j^2, \ldots, p_j^m\}$, is also perceived by the platform and can be calculated as in Equation (8). Therefore, task preferences can be denoted by the product of the task acceptance rate and task performance (i.e., $h_{ij} = p_{acc,j}^{i} \times p_j^i$), and the corresponding matrix is
$$H = \begin{bmatrix} h_{11} & h_{12} & h_{13} & \cdots & h_{1m} \\ h_{21} & h_{22} & h_{23} & \cdots & h_{2m} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ h_{n1} & h_{n2} & h_{n3} & \cdots & h_{nm} \end{bmatrix} \tag{18}$$
where rows represent users and columns represent tasks.
The number of MCS clusters depends on the number of published tasks in each time interval, which varies dynamically with the task preference. Note that a user may be interested in multiple tasks and therefore belong to more than one cluster, which makes the Fuzzy C-Means (FCM) algorithm a suitable clustering method for this scenario. In terms of the task preference defined above, we employ cosine similarity to replace the Euclidean distance in standard FCM and modify it into similarity FCM (SFCM). The cosine-based distance between the task preferences of cluster center $o_k$ and user $u_j$ in SFCM can be calculated by
$$d_{kj} = 1 - \cos(h_k, h_j) = 1 - \frac{\sum_{i=1}^{m} h_{ji} \times h_{ki}}{\sqrt{\sum_{i=1}^{m} h_{ji}^2} \sqrt{\sum_{i=1}^{m} h_{ki}^2}} \tag{19}$$
In FCM, the fuzzy weighted exponent $m$ is commonly employed to determine the fuzzy degree of clustering results, and its optimal value usually lies in $1.5 \le m \le 2.5$. We take $m = 2$ and the objective function of SFCM can be obtained as follows
$$J(U, O) = \sum_{j=1}^{n} \sum_{k=1}^{\Lambda} \mu_{kj}^2 d_{kj}^2 \quad \text{s.t.} \quad \begin{cases} 0 \le \mu_{kj} \le 1, & k \in [1, \Lambda],\ j \in [1, n] \\ \sum_{k=1}^{\Lambda} \mu_{kj} = 1, & j \in [1, n] \\ 0 < \sum_{j=1}^{n} \mu_{kj} < n, & k \in [1, \Lambda] \end{cases} \tag{20}$$
In Equation (20), $\mu_{kj}$ represents the membership degree of user $u_j$ to cluster $O_k$. The membership matrix can then be denoted by $U_{\Lambda \times n}$ and the cluster center matrix is $O$, which can be calculated in the following
$$o_k = \frac{\sum_{j=1}^{n} \mu_{kj}^2 h_{ji}}{\sum_{j=1}^{n} \mu_{kj}^2}, \quad k \in [1, \Lambda] \tag{21}$$
$$\mu_{kj} = \frac{1}{\sum_{i=1}^{\Lambda} \left( \dfrac{d_{kj}}{d_{ij}} \right)^2}, \quad k \in [1, \Lambda],\ j \in [1, n] \tag{22}$$
We set the iteration times to $l$ and the stop parameter to $\xi$, respectively. Given the user preference matrix $H$, SFCM randomly generates an initial membership matrix $U^0$ and calculates $\Lambda$ initial cluster centers $o_k$, $k \in \{1, 2, \ldots, \Lambda\}$. According to cluster center matrix $O$, both the cosine-based distance $d_{kj}$ and membership matrix $U_{\Lambda \times n}$ can be obtained. For instance, if $d_{kj} = 0$, the membership degree of $u_j$ to $O_k$ is 1. Finally, the iteration is stopped if $|J^{l+1} - J^{l}| \le \xi$, generating the clustering results $U_{\Lambda \times n}$ and the cluster center matrix $O$. Otherwise, iterations continue to update $U_{\Lambda \times n}$ and $O$ until reaching iteration times $l$ or stop parameter $\xi$.
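The SFCM iteration described above (random initial memberships, center update, cosine-based distance, membership update, and a stop test on the objective) can be sketched with NumPy as follows. This is a minimal illustration with fuzzy exponent 2, not the authors' MATLAB implementation; the function names, the default parameters, and the toy preference matrix are assumptions.

```python
import numpy as np

def cosine_distance(centers, H, eps=1e-12):
    """d[k, j] = 1 - cos(o_k, h_j) between every cluster center and every user."""
    cn = np.linalg.norm(centers, axis=1, keepdims=True)
    hn = np.linalg.norm(H, axis=1, keepdims=True)
    return 1.0 - (centers @ H.T) / np.maximum(cn * hn.T, eps)

def sfcm(H, n_clusters, max_iter=100, xi=1e-6, seed=0):
    """Fuzzy C-Means with cosine-based distance and fuzzy exponent m = 2."""
    rng = np.random.default_rng(seed)
    U = rng.random((n_clusters, H.shape[0]))
    U /= U.sum(axis=0, keepdims=True)            # memberships of each user sum to 1
    J_prev = np.inf
    for _ in range(max_iter):
        W = U ** 2
        centers = (W @ H) / W.sum(axis=1, keepdims=True)     # center update
        d = np.maximum(cosine_distance(centers, H), 1e-12)   # guard against d = 0
        inv = 1.0 / d ** 2
        U = inv / inv.sum(axis=0, keepdims=True)             # membership update
        J = float(np.sum(U ** 2 * d ** 2))                   # objective value
        if abs(J_prev - J) <= xi:
            break
        J_prev = J
    return U, centers

# Toy preference matrix: users 0-1 prefer task type 0, users 2-3 prefer task type 1.
H = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
U, centers = sfcm(H, n_clusters=2)
```

The small floor on the distance keeps the membership update finite when a user coincides with a cluster center, in which case that membership approaches 1.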
Generally, FCM constructs clusters according to the membership matrix (i.e., $O_k = \{k \mid \mu_{kj} = \max_i \mu_{ij},\ 1 \le j \le \Lambda,\ u_j \in O_k\}$). However, a user clustered by FCM can only belong to one cluster according to her/his highest task preference, which is against the intuition that users with similar preferences for several tasks may simultaneously belong to multiple clusters. Therefore, we define a clustering threshold $\Theta$ to establish these overlapping clusters. Specifically, the maximum membership value of a user is compared with the user's other membership values, all comparison values are then sorted, and the largest one is taken as the threshold, which is calculated by $\Theta = \arg\max \frac{E_j}{\sum_{j=1}^{U_k} E_j}$, where $E_j = \sum_{i=1}^{m} \frac{\min \mu_{ij}}{\max \mu_{ij}}$. The true label is obtained by the maximum average preference value among different types of tasks in a cluster; there holds
$$\bar{H}_i = \frac{\sum_{j=1}^{U_k} h_{ji}}{U_k} \tag{23}$$
where $U_k$ is the total number of users in cluster $O_k$.
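One way to turn a membership matrix into overlapping clusters is sketched below. The joining rule used here (a user joins every cluster whose membership is at least a fraction `theta` of the user's largest membership) is a hypothetical illustration of how a threshold can be applied; the exact rule behind $\Theta$ is not spelled out in the text, and the names and values are assumptions.

```python
def overlapping_clusters(U, theta):
    """U[k][j] is the membership of user j in cluster k; returns one user list per cluster."""
    clusters = [[] for _ in U]
    for j in range(len(U[0])):
        column = [row[j] for row in U]
        best = max(column)
        for k, mu in enumerate(column):
            if mu / best >= theta:   # hypothetical joining rule
                clusters[k].append(j)
    return clusters

# User 1 has equal memberships and therefore joins both clusters.
memberships = [[0.9, 0.5, 0.1],
               [0.1, 0.5, 0.9]]
groups = overlapping_clusters(memberships, theta=0.8)
```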

5.2. Optimization Problem Based on Lagrange Duality

5.2.1. Problem Reformulation

Since the task price per meter is fixed, the income of participants can be maximized by reducing the movement distances, whereas the platform maximizes its profit by selecting participants with high service benefits. Therefore, the optimization problems in Equations (4)–(6) can be rewritten as follows
$$\max \sum_{i=1}^{m} \varphi_i - \sum_{j=1}^{n} C_j \quad \text{s.t.} \quad \begin{cases} \sum_{i=1}^{m} d_{ij} \le r_j^{\max},\ d_{ij} > 0, & j = 1, 2, \ldots, n \\ \sum_{j=1}^{n} P_i \cdot d_{ij} \le B_i, & i = 1, 2, \ldots, m \\ 0 < s_{ij} \le 1 \end{cases} \tag{24}$$
In Equation (24), $\varphi_i = \sum_{j=1}^{n} w_i \cdot \ln(1 + s_{ij})$ and $C_j = \sum_{i=1}^{m} \left( a_{ij} \cdot d_{ij}^2 + b_{ij} \cdot d_{ij} + c_{ij} \right)$ represent the service benefits obtained by the platform and the costs consumed by participants, respectively.

5.2.2. Lagrange Duality

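Equation (24) shows that the objective function is convex with respect to $d_{ij}$ and $s_{ij}$. Hence, the Lagrange multiplier method can be employed to convert it into an unconstrained dual problem; there holds
$$\begin{aligned} L(d_{ij}, \alpha_{ij}, \beta_{ij}) &= \sum_{i=1}^{m} \varphi_i - \sum_{j=1}^{n} C_j + \sum_{j=1}^{n} \alpha_{ij} \cdot \left( \sum_{i=1}^{m} d_{ij} - r_j^{\max} \right) + \sum_{i=1}^{m} \beta_{ij} \cdot \left( \sum_{j=1}^{n} P_i \cdot d_{ij} - B_i \right) \\ &= \sum_{i=1}^{m} \left[ \varphi_i + \beta_{ij} \cdot \left( \sum_{j=1}^{n} P_i \cdot d_{ij} - B_i \right) \right] + \sum_{j=1}^{n} \left[ \alpha_{ij} \cdot \left( \sum_{i=1}^{m} d_{ij} - r_j^{\max} \right) - C_j \right] \end{aligned} \tag{25}$$
In Equation (25), the Lagrange multipliers are denoted by matrices $\beta_{m \times n} = [\beta_{1,\cdot}, \beta_{2,\cdot}, \ldots, \beta_{m,\cdot}]$ and $\alpha_{n \times m} = [\alpha_{1,\cdot}, \alpha_{2,\cdot}, \ldots, \alpha_{n,\cdot}]$, with $\alpha_{ij} \ge 0$. Since the service benefit of participants is already evaluated, the dual problem can be defined by
$$\min_{\alpha_{j} \ge 0;\ \beta_i} D(\alpha, \beta) = \min_{\alpha_{ij} \ge 0;\ \beta_i} \max_{d_{ij} > 0} L(d_{ij}, \alpha_{ij}, \beta_{ij}) = \min_{\alpha_{ij} \ge 0;\ \beta_i} \max_{d_{ij} > 0} \left[ \sum_{i=1}^{m} \Phi_i(\beta_{ij}) + \sum_{j=1}^{n} \Psi_j(\alpha_{ij}) \right] \tag{26}$$
In Equation (26), $\Phi_i(\beta_{ij}) = \max_{d_{ij} \ge 0} \left[ \varphi_i + \beta_{ij} \cdot \left( \sum_{j=1}^{n} P_i \cdot d_{ij} - B_i \right) \right]$ and $\Psi_j(\alpha_{ij}) = \max_{d_{ij} \ge 0} \left[ \alpha_{ij} \cdot \left( \sum_{i=1}^{m} d_{ij} - r_j^{\max} \right) - C_j \right]$. Because the original objective function is convex, strong duality holds provided that the Slater condition is satisfied, in which case solving the dual problem yields the optimal solution.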

5.2.3. Optimization Algorithm

We employ a gradient descent algorithm to iteratively solve the dual problem. The variables of the dual problem can be updated as follows
$$\left( \alpha_{ij}^{l+1}, \beta_{ij}^{l+1} \right) = \left[ \left( \alpha_{ij}^{l}, \beta_{ij}^{l} \right) - \lambda \cdot \left( \nabla D_{\alpha_{ij}}, \nabla D_{\beta_{ij}} \right) \right]^{+} = \left[ \left( \alpha_{ij}^{l}, \beta_{ij}^{l} \right) - \lambda \left( d_{ij}^{l} - r_j^{\max},\ P_i \cdot d_{ij}^{l} - B_i \right) \right]^{+} \tag{27}$$
In Equation (27), $d_{ij}^{l}$ is the variable of the original optimization problem in the $l$th iteration, $\alpha_{ij}^{l}$ and $\beta_{ij}^{l}$ are the variables of the dual problem in the $l$th iteration, and $\lambda$ is the learning step size. Participants with the best service benefits and optimal movement distances can be obtained iteratively by the platform. First, in the iteration of service benefits, participants with the best benefits in the $l$th iteration are obtained. Then, in the gradient descent algorithm, dual variables $\alpha_{ij}^{l}$ and $\beta_{ij}^{l}$ of the $l$th iteration are obtained. Finally, $s_{ij}^{l}$, $\alpha_{ij}^{l}$, and $\beta_{ij}^{l}$ are substituted into Equation (28) to generate the optimal movement distance. The iteration process does not stop until the convergence conditions are met. The updates used in Algorithms 1 and 2 are given by Equations (29) and (28), respectively.
$$\forall u_j \in U, \quad d_{ij}^{l} = \arg\max \left[ \varphi_i + \beta_{ij}^{l} \cdot \left( \sum_{j=1}^{n} P_i \cdot d_{ij} - B_i \right) + \alpha_{ij}^{l} \cdot \left( \sum_{i=1}^{m} d_{ij} - r_j^{\max} \right) - C_j \right] \tag{28}$$
$$\forall A_i \in A, \quad s_{ij}^{l} = \arg\max \varphi_i \tag{29}$$
Specifically, the complexity of Algorithm 1 is $O(n)$, where $n$ is the number of task candidates. The complexity of Algorithm 2 is $O(m)$, where $m$ is the number of task clusters. The complexity of the overall assignment strategy is $O(m \times n)$.
When the best participants are selected for each type of task, the platform pays their task reward, updates their service benefits, and exploits scores from task publishers for the next round participant selection.
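The interplay between the inner maximization over distances and the projected update of a dual variable can be illustrated on a toy problem with one participant and a single distance constraint. The sketch below is an assumption-laden simplification, not the paper's algorithm: it uses the standard sign convention in which the multiplier grows when the constraint is violated (which may differ from the signs printed in Equations (25) and (27)), omits the budget constraint and the constant cost term, and all names and numbers are hypothetical.

```python
def solve_dual(prices, a, b, r_max, step=0.1, iters=500, tol=1e-9):
    """Toy dual method: maximise sum_i (P_i*d_i - a_i*d_i^2 - b_i*d_i) s.t. sum_i d_i <= r_max."""
    alpha = 0.0
    d = [0.0] * len(prices)
    for _ in range(iters):
        # Inner maximisation has a closed form: d_i = max(0, (P_i - b_i - alpha) / (2 a_i)).
        d = [max(0.0, (p - bi - alpha) / (2.0 * ai))
             for p, ai, bi in zip(prices, a, b)]
        # Projected step on the multiplier; max(0, .) plays the role of [.]^+.
        new_alpha = max(0.0, alpha + step * (sum(d) - r_max))
        converged = abs(new_alpha - alpha) <= tol
        alpha = new_alpha
        if converged:
            break
    return alpha, d

# Assumed toy data: two tasks, distance limit 2.0.
alpha, d = solve_dual(prices=[3.0, 2.0], a=[0.5, 0.5], b=[1.0, 0.5], r_max=2.0)
```

For these numbers the unconstrained distances would sum to 3.5, so the multiplier rises until the distances shrink to the limit; the fixed point is a multiplier of 0.75 with distances 1.25 and 0.75.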
Algorithm 1 Service benefits.
  • Input: potential participant set $U$; service benefit matrix $S$; task set $A$
  • Output: the optimal participants $W$ for Task $A_i$; profit $f_i(s_{ij}, d_{ij})$
  • Initialize
  • $W = \emptyset$, the number of iterations $l = 0$
  • Select participant $u_0$ randomly, $W \leftarrow u_0$
  • Receive $d_{ij}^{l}$ from user
  • Calculate the profit of each task of $u_0$ through Equation (5)
  • while $|f_i^{l+1}(W) - f_i^{l}(W)| \ge 10 \times e^{-k}$ or $l \le 0$ do
  • if $s_{ij} = \arg\max \varphi_i$ then
  •   Select participant $u_j$ and corresponding $s_{ij}$
  •   Calculate the profit of the platform through Equation (5)
  •   return $s_{ij}$, $f_i^{l}(s_{ij}, d_{ij})$
  •   break
  • else
  •    $l = l + 1$
  •   update Equation (29)
  •   return to Line 5
  • end if
  • end while
  • return $s_{ij}$, $f^{l}(W)$
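The selection rule in Equation (29) can be pictured with a small Python sketch: for each task type, the platform scans the candidates and keeps the one with the largest service benefit. The matrix layout (one row per task type, one column per candidate), the function name, and the sample values are assumptions made for illustration only.

```python
def select_best_participants(benefit):
    """Return, for each task type (row), the index of the candidate (column)
    with the largest service benefit. Ties go to the lowest index."""
    winners = []
    for row in benefit:
        best = 0
        for j in range(1, len(row)):
            if row[j] > row[best]:
                best = j
        winners.append(best)
    return winners


# Hypothetical service benefit values in [0, 1]: 2 task types, 3 candidates.
sample = [[0.2, 0.9, 0.4],
          [0.7, 0.1, 0.3]]
winners = select_best_participants(sample)
```

For a matrix with m rows and n columns, the scan touches each of the m × n entries once.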
Algorithm 2 Iterative process of $d_{ij}$.
  • Initialization
  • for $l = 0, 1, 2, 3, \ldots$ do
  •  Receive $s_{ij}^{l}$ from platform
  •  Update Equation (28)
  •  Compute the new value of $\alpha_{ij}^{l+1}$ and $\beta_{ij}^{l+1}$ using Equation (27)
  • if $|\alpha_{ij}^{l+1} - \alpha_{ij}^{l}| \ge \rho$ and $|\beta_{ij}^{l+1} - \beta_{ij}^{l}| \ge \rho$, where $\rho$ is a small tunable real number then
  •   Return to Line 3
  • else
  •   return $d_{ij}^{l}$, $\psi_j(d_{ij})$
  •   break
  • end if
  • end for
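The control flow of Algorithm 2 can be sketched as a loop that repeats an update until the dual variables stop changing by more than a tolerance $\rho$. The sketch below assumes one reading of the listing, namely that the loop continues while both changes are at least $\rho$. It takes the update as a caller-supplied function and uses a hypothetical contraction mapping as a stand-in for the real update of Equations (27) and (28), which is not reproduced here. The cap `max_iter` is also an added safeguard, not part of the listing.

```python
def iterate_until_stable(update, alpha, beta, rho=1e-6, max_iter=1000):
    """Repeat (alpha, beta) = update(alpha, beta) while both variables still
    change by at least rho; return the final values and the iteration count."""
    for l in range(max_iter):
        new_alpha, new_beta = update(alpha, beta)
        keep_going = (abs(new_alpha - alpha) >= rho
                      and abs(new_beta - beta) >= rho)
        alpha, beta = new_alpha, new_beta
        if not keep_going:
            return alpha, beta, l + 1
    return alpha, beta, max_iter


# Stand-in update (hypothetical): a contraction with fixed point (2.0, 4.0).
def toy_update(a, b):
    return 0.5 * a + 1.0, 0.5 * b + 2.0


alpha, beta, iterations = iterate_until_stable(toy_update, 0.0, 0.0)
```

A smaller $\rho$ gives values closer to the fixed point at the cost of more iterations.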

6. Experiment

Gowalla, employed in this study to validate the proposed SBAMA, is a dataset from a real-world location-based social network in which users share check-in information, including ID, access time, longitude, latitude, and location tags. The dataset contains all public check-in data collected between February 2009 and October 2010 and comprises 196,591 nodes and 950,327 edges. Gowalla is mainly used to study human mobility [35]. Specifically, 500 locations and 1000 users were extracted from Gowalla as task locations and task candidates, respectively. These 1000 task candidates were then clustered into five groups, where each group maintains a task preference matrix and a corresponding service benefit matrix covering five types of tasks. In addition, the SFCM clustering algorithm and the optimization algorithm in SBAMA were compared on the MATLAB platform with the original FCM algorithm and the greedy algorithm in the Dynamic Trust-Based Recruitment Framework (DTRF) [18], respectively. Simulation parameters are given in Table 1.

6.1. Advantages of SFCM

The objective function iteration and clustering accuracy of SFCM were compared with those of FCM to verify the effectiveness of SFCM. For a fair comparison, FCM and SFCM adopt the same initial membership matrix. The simulation was repeated 100 times, and seven runs were randomly selected for observation.
The numbers of iterations needed by FCM and SFCM for the objective function to converge are shown in Figure 2. Compared with FCM, SFCM consistently requires fewer iterations, around 30. SFCM therefore converges quickly and has a significantly shorter clustering time. Figure 3 illustrates how the objective function values evolve over the iterations. The initial value of SFCM is much smaller than that of FCM because SFCM replaces the Euclidean distance used in FCM with cosine similarity, which reduces the membership value.
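To make the distance swap concrete, the sketch below computes standard fuzzy c-means memberships for one user under two dissimilarity measures: Euclidean distance and one minus cosine similarity. It is a minimal illustration, not the SFCM formulation itself: the use of `1 - cosine similarity` as the dissimilarity, the fuzzifier value of 2, and the preference vectors are all assumptions, and the vectors are assumed to be non-zero.

```python
import math


def euclidean(x, c):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, c)))


def cosine_dissimilarity(x, c):
    """One minus cosine similarity; assumes both vectors are non-zero."""
    dot = sum(a * b for a, b in zip(x, c))
    norms = math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in c))
    return 1.0 - dot / norms


def memberships(x, centers, dist, fuzzifier=2.0):
    """Standard FCM membership of point x in each cluster for a given distance."""
    d = [max(dist(x, c), 1e-12) for c in centers]  # avoid division by zero
    power = 2.0 / (fuzzifier - 1.0)
    return [1.0 / sum((d[i] / d[k]) ** power for k in range(len(d)))
            for i in range(len(d))]


# Hypothetical task-preference vector and two cluster centers.
user = [5.0, 1.0, 0.0]
centers = [[4.0, 1.0, 0.5], [0.5, 1.0, 4.0]]
u_euclid = memberships(user, centers, euclidean)
u_cosine = memberships(user, centers, cosine_dissimilarity)
```

For non-negative vectors the cosine dissimilarity stays between 0 and 1, whereas the Euclidean distance grows with the magnitude of the vectors, which is one way to see why a distance-weighted objective can start from a smaller value.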
Figure 4 shows the clustering accuracy of FCM and SFCM when each user is assigned to the cluster with the maximum membership value; accuracy is measured against the labels of the original dataset. The clustering accuracy of SFCM is generally higher than 95%, whereas the worst case of FCM is only 74.5%. Similarly, given the maximum membership value, randomly selected clustering results of FCM and SFCM are shown in Figure 5, where SFCM gives a significantly better clustering result. Figure 6 shows the final membership matrix values of users from a random test. Cluster labels can be determined by Equation (22). For example, the cluster for Task A1 consists mostly of Users 40–80. However, according to their membership values, Users 34, 84, 100, 103, and 156 have similar membership values for different types of tasks. As shown in Table 2, the membership values of User 34 for Tasks A3 and A5 differ by only approximately 0.074. Likewise, the membership differences of Users 103 and 156 are within 0.1 for Tasks A2/A4 and Tasks A2/A5, respectively. Therefore, the task preference threshold is set to 0.1 for overlapping clustering, and the clustering result of SFCM based on this threshold is shown in Figure 7. Compared with Figure 5, the clusters overlap, and User 34 belongs to the clusters of Tasks A3 and A5 simultaneously, which is more practical for real-world MCS scenarios. In short, SFCM with a membership threshold can cluster users with similar task preferences, which provides an effective basis for the subsequent optimization problem.
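The threshold-based overlap can be checked on the membership values listed in Table 2. The sketch below assumes one plausible rule, which the text does not spell out: a user joins every cluster whose membership lies within the threshold of that user's largest membership. The function and variable names are hypothetical.

```python
TASKS = ["A1", "A2", "A3", "A4", "A5"]

# Membership values copied from Table 2, ordered A1..A5 for each user.
MEMBERSHIP = {
    34:  [0.046355202, 0.034177025, 0.494624410, 0.003762370, 0.421080991],
    84:  [0.423835317, 0.046220128, 0.003243961, 0.517018454, 0.009682137],
    100: [0.0043460630, 0.0661477153, 0.4977484533, 0.4254306807, 0.0063270875],
    103: [0.021614770, 0.388445033, 0.009119740, 0.458955766, 0.121864688],
    156: [0.087140617, 0.479426570, 0.002899209, 0.039708579, 0.390825023],
}


def overlapping_clusters(values, threshold=0.1):
    """Return every task whose membership is within `threshold` of the maximum
    (assumed overlap rule)."""
    top = max(values)
    return [task for task, v in zip(TASKS, values) if top - v <= threshold]


assignments = {user: overlapping_clusters(vals)
               for user, vals in MEMBERSHIP.items()}
```

Under this assumed rule, User 34 is placed in the clusters of both A3 and A5, which agrees with the overlap described for Figure 7.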

6.2. Analysis of Optimization Algorithm

Figure 8 shows how the platform profit gained from each type of task varies with the number of iterations. The platform profit converges to the optimal value after about 45 iterations, which implies that the platform can stably match appropriate participants to tasks. In addition, the platform profit increases as the average service benefit increases. Figure 9 depicts the impact of the number of iterations on participant income. For a fixed task price $P_i$, the participant income first grows sharply as the number of iterations increases and then stabilizes. The income of Participant 3 is significantly higher than that of the others because this participant's task is more difficult to perform and requires a stronger sensing capacity, and therefore receives a higher payment from the platform. Together, Figures 8 and 9 validate the fast convergence of the proposed gradient descent algorithm in finding the best task participants.
Figure 10 compares the task completion rates of DTRF and SBAMA. In addition to the sensing quality considered by DTRF, the proposed SBAMA also takes into account the service benefits of participants, task preferences, and real-time feedback from task publishers. Hence, SBAMA achieves an 8% higher task completion rate.
In short, by narrowing the search range of task candidates, SBAMA effectively improves the matching accuracy of task assignment with fast algorithm convergence.

7. Conclusions

In this paper, we propose SBAMA to quickly and accurately match MCS tasks with the most appropriate participants, improving the task completion rate and data credibility. First, the service benefit of participants is modeled based on their task difficulty, task history, sensing capacity, and sensing positivity to improve the accuracy of task assignment. Then, task candidates are clustered according to their task preferences to narrow the search range. Finally, a gradient descent algorithm is designed to select the optimal participants in each cluster. Simulation results verify that the proposed SBAMA can quickly find the most appropriate participants to meet the requirements of multiple concurrent task types in a massive-user scenario such as crowded road condition monitoring. Although SBAMA can be applied to scenarios with massive users and numerous concurrent tasks, it has limitations when participants are sparse, which can lead to low matching accuracy between tasks and participants. In the future, we will focus on the sparse-participant scenario and study the associated task assignment strategy.

Author Contributions

This work was a collaborative development by all authors. Z.L., H.L. and R.W. proposed the idea, were involved in the theoretical performance analysis, designed and optimized the algorithm, and wrote the paper.

Funding

This research was funded by the National Natural Science Foundation of China under Grants 61901078, 61771082 and 61871062; in part by the Science and Technology Research Program of Chongqing Municipal Education Commission under grant KJQN201900609; and in part by the Program for Innovation Team Building at Institutions of Higher Education in Chongqing under Grant CXTDX201601020.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Guo, B.; Wang, Z.; Yu, Z.; Wang, Y.; Yen, N.Y.; Huang, R.; Zhou, X. Mobile crowd sensing and computing: The review of an emerging human-powered sensing paradigm. ACM Comput. Surv. 2015, 48, 1–31.
  2. Gao, H.; Liu, C.H.; Wang, W. A survey of incentive mechanisms for participatory sensing. IEEE Commun. Surv. Tutor. 2017, 17, 918–943.
  3. Wang, X.; Zhang, J.; Tian, X.; Gan, X.; Guan, Y.; Wang, X. Crowdsensing-based consensus incident report for road traffic acquisition. IEEE Trans. Intell. Transp. Syst. 2018, 19, 2536–2547.
  4. Zheng, Y.; Liu, F.; Hsieh, H. U-Air: When urban air quality inference meets big data. In Proceedings of the 19th ACM SIGKDD, Chicago, IL, USA, 11–14 August 2013; pp. 1436–1444.
  5. Yan, J.; Wu, D.; Wang, H.; Wang, R. User Centric Content Sharing Based on D2D Cellular Networks. IEEE Trans. Veh. Technol. 2018, 67, 11208–11218.
  6. Yan, J.; Wu, D.; Wang, R. Socially Aware Trust Framework for Multimedia Delivery in D2D Cooperative Communication. IEEE Trans. Multimed. 2019, 21, 625–635.
  7. Wu, D.; Liu, Q.; Li, Y.; McCann, J.A.; Regan, A.C.; Venkatasubramanian, N. Adaptive lookup of open wifi using crowdsensing. IEEE/ACM Trans. Netw. 2016, 24, 3634–3647.
  8. Li, Z.; Jiang, Y.; Gao, Y.; Sang, L.; Yang, D. On Buffer-Constrained Throughput of a Wireless-Powered Communication System. IEEE J. Sel. Areas Commun. 2019, 37, 283–297.
  9. Zhang, P.; Kang, X.; Wu, P.; Wang, R. High-Accuracy Entity State Prediction Method Based on Deep Belief Network Toward IoT Search. IEEE Wirel. Commun. Lett. 2019, 8, 492–495.
  10. Wu, D.; Shi, H.; Wang, H.; Wang, R.; Hua, F. A Feature based Learning System for Internet of Things Applications. IEEE Internet Things J. 2019, 6, 1928–1937.
  11. Xiao, M.; Wu, J.; Huang, L.; Cheng, R.; Wang, Y. Online task assignment for crowdsensing in predictable mobile social networks. IEEE Trans. Mob. Comput. 2017, 16, 2306–2320.
  12. Wu, D.; Deng, L.; Wang, H.; Liu, K.; Wang, R. Similarity Aware Safety Multimedia Data Transmission Mechanism for Internet of Vehicles. Future Gener. Comput. Syst. 2019, 7, 1788–1798.
  13. Li, Z.; Chen, J.; Zhang, Z. Socially Aware Caching in D2D Enabled Fog Radio Access Networks. IEEE Access 2019, 7, 84293–84303.
  14. Wu, D.; Liu, Q.; Wang, H.; Yang, Q.; Wang, R. Cache Less for More: Exploiting Cooperative Video Caching and Delivery in D2D Communications. IEEE Trans. Multimed. 2019, 21, 1788–1798.
  15. Marjanović, M.; Antonić, A.; Podnar Žarko, I. Edge computing architecture for mobile crowdsensing. IEEE Access 2018, 6, 10662–10674.
  16. Yang, B.; Wu, D.; Wang, R. CUE: An Intelligent Edge Computing Framework. IEEE Netw. 2019, 33, 18–25.
  17. Yang, S.; Wu, F.; Tang, S.; Gao, X.; Yang, B.; Chen, G. On designing data quality-aware truth estimation and surplus sharing method for mobile crowdsensing. IEEE J. Sel. Areas Commun. 2017, 35, 832–847.
  18. Gao, Y.; Li, X.; Li, J.; Gao, Y. A dynamic-trust-based recruitment framework for mobile crowd sensing. In Proceedings of the IEEE International Conference on Communications, Paris, France, 21–25 May 2017; pp. 1–6.
  19. Jiangtao, W.; Lee, W.; Yasha, W. Task allocation in mobile crowd sensing: State of the art and future opportunities. IEEE Internet Things J. 2018, 5, 3747–3757.
  20. Yonglong, Y.; Haiyan, Z. Truthful mechanism for crowdsourcing task assignment. J. Tsinghua Univ. Nat. Sci. Engl. Ed. 2018, 23, 645–659.
  21. Wei, X.; Wang, Y.; Tan, J.; Gao, S. Data quality aware task allocation with budget constraint in mobile crowdsensing. IEEE Access J. 2018, 6, 48010–48020.
  22. Peng, D.; Wu, F.; Chen, G. Data quality guided incentive mechanism design for crowdsensing. IEEE Trans. Mob. Comput. 2018, 17, 307–319.
  23. He, P.; Han, X.; Liu, H. Chain Modeling of Molecular Communications for Body Area Network. Sensors 2019, 19, 395.
  24. Jin, H.; Su, L.; Nahrstedt, K. CENTURION: Incentivizing multi-requester mobile crowd sensing. In Proceedings of the IEEE Conference on Computer Communications, Atlanta, GA, USA, 1–4 May 2017; pp. 307–319.
  25. Yue, W.; Fan, L.; Liran, M. A context-aware multi-armed bandit incentive mechanism for mobile crowd sensing systems. IEEE Internet Things J. 2019, 6, 7648–7658.
  26. Sabrina, K.N.M.; Tekin, C.; Mihaela, V.D.S.; Anja, K. Context-aware hierarchical online learning for performance maximization in mobile crowdsourcing. IEEE/ACM Trans. Netw. 2018, 26, 1334–1347.
  27. Liu, S.; Zheng, Z.; Wu, F.; Tang, S.; Chen, G. Context-aware data quality estimation in mobile crowdsensing. In Proceedings of the IEEE Conference on Computer Communications, Atlanta, GA, USA, 1–4 May 2017; pp. 1334–1347.
  28. Wang, L.; Yu, Z.; Zhang, D.; Guo, B.; Liu, C.H. Heterogeneous multi-task assignment in mobile crowdsensing using spatiotemporal correlation. IEEE Trans. Mob. Comput. 2018, 18, 84–97.
  29. Wei, G.; Baoxian, Z.; Cheng, L. Location-based online task assignment and path planning for mobile crowdsensing. IEEE Trans. Veh. Technol. 2018, 68, 1772–1783.
  30. Wang, X.; Wu, W.; Qi, D. Mobility-aware participant recruitment for vehicle-based mobile crowdsensing. IEEE Trans. Veh. Technol. 2018, 67, 4415–4426.
  31. Wang, J.; Wang, F.; Wang, Y.; Zhang, D.; Lim, B.; Wang, L. Allocating heterogeneous tasks in participatory sensing with diverse participant-side factors. IEEE Trans. Mob. Comput. 2018, 14, 1979–1991.
  32. Ikeda, K.; Hoashi, K. Crowdsourcing GO: Effect of Worker Situation on Mobile Crowdsourcing Performance. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, New York, NY, USA, 6–11 May 2017; pp. 1142–1153.
  33. Alsayasneh, M.; Amer-Yahia, S.; Gaussier, E. Personalized and Diverse Task Composition in Crowdsourcing. IEEE Trans. Knowl. Data Eng. 2018, 30, 128–141.
  34. Mavridis, P.; Gross-Amblard, D.; Miklós, Z. Using Hierarchical Skills for Optimized Task Assignment in Knowledge-Intensive Crowdsourcing. In Proceedings of the 25th International Conference on World Wide Web, Geneva, Switzerland, 2016; pp. 843–853.
  35. Cho, E.; Myers, S.A.; Leskovec, J. Friendship and Mobility: User Movement in Location-Based Social Networks. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, 21–24 August 2011.
Figure 1. SBAMA framework.
Figure 2. Comparison of the number of iterations.
Figure 3. Comparison of the iterative process.
Figure 4. The accuracy of clustering.
Figure 5. Comparison of user clustering.
Figure 6. Membership matrix values of SFCM.
Figure 7. Result of SFCM cluster based on threshold.
Figure 8. Changes in platform profit.
Figure 9. Changes in participant income.
Figure 10. Comparison of task completion rates for different tasks.
Table 1. Parameter settings.

| Parameter | Value |
| --- | --- |
| target region (km²) | 600 × 50 |
| types of task | 5 |
| number of unit tasks for each task | 20, 200 |
| participant service benefit value | 0, 1 |
| user preference value | 0, 1 |
| $a_{ij}$ | 0, 1 |
| $b_{ij}$ | 0, 1 |
| $c_{ij}$ | 1, 2 |
| inherent sensing cost $c_{ij}$ | (1, 4) |
| maximum movement distance $r_j^{\max}$ (km) | 10 |
| task effective time (min) | 20, 60 |
| task price per meter $p_{ij}$ ($/m) | 0.2, 0.5 |
| number of candidate users | 200 |
| $\xi$ | $10^{-5}$ |
| $w_i$ | 100, 300 |
| $B_i$ | 2000, 5000 |
Table 2. Similar membership values of randomly tested users.

| Task \ User | 34 | 84 | 100 | 103 | 156 |
| --- | --- | --- | --- | --- | --- |
| A1 | 0.046355202 | 0.423835317 | 0.0043460630 | 0.021614770 | 0.087140617 |
| A2 | 0.034177025 | 0.046220128 | 0.0661477153 | 0.388445033 | 0.479426570 |
| A3 | 0.494624410 | 0.003243961 | 0.4977484533 | 0.009119740 | 0.002899209 |
| A4 | 0.003762370 | 0.517018454 | 0.4254306807 | 0.458955766 | 0.039708579 |
| A5 | 0.421080991 | 0.009682137 | 0.0063270875 | 0.121864688 | 0.390825023 |

Share and Cite

MDPI and ACS Style

Li, Z.; Liu, H.; Wang, R. Service Benefit Aware Multi-Task Assignment Strategy for Mobile Crowd Sensing. Sensors 2019, 19, 4666. https://doi.org/10.3390/s19214666

