开放科学(资源服务)标志码(OSID):
Nowadays, with the proliferation of smart phones, bracelets, and other mobile devices, Mobile Crowd Sensing(MCS)[1] has been widely used in traffic prediction[2], air pollution monitoring[3], and so on.As a critical part of MCS, the allocation of sensing tasks is extensively studied[4].In the sensing task allocation stage, it needs to choose an appropriate strategy to allocate tasks to workers in order to reduce costs or improve data quality.In this paper, authors classify workers into opportunistic[5] and participatory workers[6], and assign dynamic tasks(tasks with different start time and deadlines) in each round.To minimize the total cost under the constraint that all tasks are completed on time, three challenges must be addressed.First, how to accurately predict the path of opportunistic workers based on the past trajectory; Second, how to combine the task allocation of opportunistic workers and participatory workers organically; Third, how to find the appropriate workers to reduce the overall system cost.
To address the above challenges, this paper puts forward a reliable hybrid two-phase task allocation algorithm.Specifically, it focuses on the mixed task allocation problem in which all tasks should be completed before the deadline with a minimum cost.The proposed algorithm considers the opportunistic worker phase and the participatory worker phase in each round.In the former phase, this paper first predicts the probability of workers visiting each location, then introduces the greedy algorithm to choose a group of candidate workers based on geographical entropy.In the later phase, it divides uncompleted tasks into urgent tasks and non-urgent tasks.This paper employs hierarchical clustering[7] to cluster the urgent tasks and assigns them to participatory workers by extending the Kuhn-Munkres (KM) algorithm in the current round.Non-urgent tasks are added to the task set of next round, waiting for possible opportunistic workers to complete the tasks to save cost.
1 Related workThere has been large numbers of studies on the task allocation problem in MCS, which mainly focus on maximizing task quality or task coverage under the budget constraints[8].YANG et al.proposed an efficient budgeted informativeness maximization algorithm to recruit workers and realized informativeness maximization under the budget constraint[9].GONG et al.focused on the scenarios where users and tasks arrived dynamically and designed four online task assignment algorithms to maximize total task quality[10].GUO et al. proposed a worker selection framework for time-sensitive tasks that seek to minimize the total distance and for delay-tolerant tasks that seek to minimize the total number of workers[11].
Some studies also focused on selecting appropriate workers to minimize the overall system cost.XIAO et al.introduced a highly reliable multi-task allocation framework that considered the offline and online stage[12].They combined the reverse auction mechanism with the network flow model to effectively minimize cost and maximize coverage.WANG et al.proposed a sparse mobile crowdsensing paradigm to reduce the number of tasks according to the spatial and temporal correlation among the data, thus lowering total sensing cost[13].KANG et al. proposed a hitchhiking working style that encourages crowd workers to commit tasks on their way so as to expand space coverage while reducing costs[14].
Most of the above studies have given incomplete views.Some of them consider spatial tasks but ignore time constraints, while some take deadline into account but neglect the dynamic changes of tasks and workers.In this paper, authors consider the real-time requirements and heterogeneity of sensing tasks with different time of arrival and deadlines.In addition, the active time of opportunistic workers and participatory workers may also be different.
2 Problem modelIn this section, the task allocation problem is definesd and formulatesd, in order to minimize the total cost under the constraint of completing all tasks before each task expires.
2.1 Problem definitionIn the mobile crowd sensing system, tasks are generally heterogeneous in reality.There may be short-time tasks such as puddle detection, or non-urgent tasks such as data acquisition of old buildings in the city.Moreover, workers in the MCS are diverse.They can be busy employees, students with a fixed route, or part-time workers recruited by the platform.This paper studies how to assign heterogeneous tasks to mixed workers, in order to minimize the cost while ensuring that all tasks are completed.
In this paper, time is divided into numerous rounds, expressed as
For workers in the MCS model, this paper divides them into two categories based on their historical activity patterns.One is opportunistic workers, denoted as
With the above definition of opportunistic and participatory workers, the round
![]() |
Download:
|
Fig. 1 Hybrid two-phase task allocation framework |
The objective of this paper is to minimize the total cost[16].Therefore, the costs of both types of workers need to be considered.For opportunistic workers, this paper denotes their individual cost as
$ {c}_{\mathrm{p}}={D}_{j}^{k}\times {I}_{\mathrm{d}\mathrm{i}\mathrm{s}}+{q}_{j}^{k}\times {I}_{\mathrm{t}\mathrm{a}\mathrm{s}\mathrm{k}} $ | (1) |
In summary, the total cost of employing workers in round
$ {C}^{k}=\left|{S}^{k}\right|\times {I}_{\mathrm{b}\mathrm{a}\mathrm{s}\mathrm{i}\mathrm{c}}+\sum\limits_{{p_j} \in {P^k}} {\left( {D_j^k \times {I_{{\rm{dis}}}} + q_j^k \times {I_{{\rm{task}}}}} \right)} $ | (2) |
where
At the same time, the constraints need to be met that all tasks are completed by the deadline and participatory workers have an upper limit of tasks:
$ \mathrm{m}\mathrm{i}\mathrm{n}\sum\limits_{k=1}^{t}{C}^{k} $ | (3) |
$ \mathrm{s}.\mathrm{t}.\mathrm{ }\mathrm{C}\mathrm{o}\mathrm{m}\mathrm{p}\mathrm{l}\mathrm{e}\mathrm{t}\mathrm{e}\left({T}_{s}\right), \forall {T}_{s}\in \Gamma $ | (4) |
$ {n}_{j}^{k}\le \eta , \forall {p}_{j}\in {P}^{k}, \mathrm{ }\forall {R}_{k}\in R $ | (5) |
For opportunistic workers, this paper predicts the future location of
The probability of oi arriving at location
$ {P}_{s}^{k}\left({o}_{i}\right)=\sum\limits_{h=1}^{\mathrm{\infty }}\frac{{\lambda }_{i, s}^{h}\times {\mathrm{e}}^{-{\lambda }_{i, s}}}{h!}=1-{\mathrm{e}}^{-{\lambda }_{i, s}} $ | (6) |
where
Based on the geographical entropy[18], the priority of each task is:
$ W\left({T}_{s}\right)=\frac{1/\left(\mathrm{E}\mathrm{n}\mathrm{t}\mathrm{r}\mathrm{o}\mathrm{p}\mathrm{y}\right({T}_{s})+1)}{\sum\limits_{i=1}^{w}1/\left(\mathrm{E}\mathrm{n}\mathrm{t}\mathrm{r}\mathrm{o}\mathrm{p}\right({T}_{i})+1)} $ | (7) |
The greater the weight of a location is, the lower the probability that it will be visited by participatory workers, and the higher the expected reward is required by participatory workers to complete tasks nearby.Since the cost of an opportunistic worker is certain, more cost can be saved by allocating the tasks to opportunistic workers.
The utility of each opportunity worker to overall system is defined as:
$ {U}^{k}\left({S}^{k}\right)=\sum\limits_{{T}_{s}\in \Gamma , {o}_{i}\in {S}^{k}}{P}_{s}^{k}\left({o}_{i}\right)\times w\left({T}_{s}\right) $ | (8) |
In the search of opportunistic workers, authors seek to find an oi that maximizes the overall increase in utility for which this paper sets a threshold
Algorithm1 Gain-based OW selection greedy algorithm
Input set of opportunistic workers
Output set of candidate opportunistic workers
Initialize
while
MaxU = 0;
for
if
Result:
Based on the set of candidate opportunistic workers
$ {J}^{k}\left({T}_{s}\right)=\left\{\begin{array}{l}1, {\tau }_{s}^{k}-\left|{R}_{k}\right|-\left|{R}_{k}^{\mathrm{o}}\right| < \mu \\ 0, {\tau }_{s}^{k}-\left|{R}_{k}\right|-\left|{R}_{k}^{\mathrm{o}}\right|\ge \mu \end{array}\right. $ | (9) |
Where
Then this paper introduces the improved KM algorithm[19] to achieve task allocation in the
$ {W}_{j, i}=\left\{\begin{array}{l}1-\frac{{D}_{j, i}}{\underset{{p}_{j}\in {P}^{k}, \mathrm{c}{\mathrm{t}}_{i}\in \mathrm{C}{\mathrm{\Gamma }}^{k}}{\mathrm{m}\mathrm{a}\mathrm{x}}\left({D}_{j, i}\right)+1}, \frac{D}{{v}_{j}} < \mathrm{m}\mathrm{i}\mathrm{n}({\tau }_{s}^{k}, |{R}_{k}\left|\right)-\left|{R}_{k}^{\mathrm{o}}\right|\\ 0, \frac{D}{{v}_{j}}\ge \mathrm{m}\mathrm{i}\mathrm{n}({\tau }_{s}^{k}, |{R}_{k}\left|\right)-\left|{R}_{k}^{\mathrm{o}}\right|\mathrm{ }\mathrm{o}\mathrm{r}\mathrm{ }\mathrm{c}{\mathrm{t}}_{i}\in \mathrm{V}{\mathrm{\Gamma }}^{k}\end{array}\right. $ | (10) |
![]() |
Download:
|
Fig. 2 Bipartite graph structure |
The initial mark value of the left node is the maximal value of the weights for the edges connected to it.The initial mark value of the right node is 0.
Where
With the KM-based PW-task match algorithm(as shown in Algorithm2), the result of task allocation can be obtained and its cost can be calculated.Finally, this paper estimates the total cost of both opportunistic workers and participatory workers through adding up their costs during each round.
Algorithm2 KM-based PW-task match algorithm
Input cluster task set
Output total cost C
Initialize C = 0
for
introduce virtual nodes to construct bipartite graph;
initialize
for
while
if
Break;
else
update the mark values of conflicted nodes;
for
calculate the distance D and the number of tasks q in
Result: C
3 Performance evaluationIn this section, the experimental results and analyses are presented.All experiments are carried out by Java on Windows platform, realized on an Intel® CoreTM i5-8265U CPU and 8 GB memory laptop computer.
3.1 Baseline methodsIn order to verify the performance of the proposed algorithm, this paper selects three typical different baseline methods to carry out comparison experiments.To be specific, the baseline methods this paper selected include the KM algorithm with pure participatory workers (BasicKM), the Greedy-genetic algorithm, and the Hy-greedy algorithm based on the hybrid mode.BasicKM takes the participatory workers and tasks as the node sets, and constructs the inverse function of the distance between them to carry out the maximum weight matching.The Greedy-genetic algorithm[20] initializes the population by taking the greedy solution of task allocation as the initial solution.Hy-greedy algorithm greedily selects the opportunistic workers whose incremental utility is greater than the threshold, and assigns each remaining task to the nearest participatory worker to deal with.
3.2 Dataset processingIn the experiment, authors use the Brightkite dataset[4, 21].The dataset includes a total collection of 4 491 143 check-in information of users.This paper selects an area of 60 km×80 km near
![]() |
Download:
|
Fig. 3 Distribution of workers in the dataset |
This paper uses the real dataset to compare the performance of the proposed algorithm with the baseline methods.The experimental results are shown in Fig.4.
![]() |
Download:
|
Fig. 4 Average cost of tasks vs. threshold for the incremental utility |
Fig.4(a) shows the change of the optimal
Fig.5 shows the performance of each algorithm under the condition of different numbers of participatory workers and tasks.It is illustrated that the proposed algorithm is much more efficient than Hy-greedy and BasicKM algorithm.And even the greedy-genetic algorithm is at a disadvantage compared to the proposed algorithm.
![]() |
Download:
|
Fig. 5 Average cost of tasks vs. the number of participatory workers |
In Fig.6, authors pick out the best-performing greedy-genetic algorithm to conduct the further comparison experiments.It can be seen that the proposed algorithm delivers better performance under various conditions than baseline methods.
![]() |
Download:
|
Fig. 6 The proposed algorithm vs. Greedy-genetic algorithm |
This paper puts forward a reliable hybrid two-phase task allocation algorithm.In the opportunistic workers phase, it predicts the probability of workers arriving at each location, and uses the greedy algorithm to calculate the utility to choose a group of candidate workers based on the geographical entropy.In the participatory workers phase, it employs hierarchical clustering to cluster the urgent tasks and assigns them to workers through the improved KM algorithm.Experiments based on the real-world dataset show that the proposed algorithm outperforms Hy-greedy, BasicKM and Greedy-genetic algorithms.
In this paper, the accurate geographic information of participants is assumed to be obtained in advance, which might not be achieved for privacy-sensitive participants[22-23].In future work, authors will conduct further research on how to realize dynamic task allocation based on user privacy protection.
[1] |
ZHAO J, ZHANG X T, LI J, et al. Research review of crowd intelligence 2.0[J]. Computer Engineering, 2019, 45(12): 1-7. ( ![]() |
[2] |
CHEN Y Y, LÜ P, GUO D K, et al. A survey on task and participant matching in mobile crowd sensing[J]. Journal of Computer Science and Technology, 2018, 33(4): 768-791. DOI:10.1007/s11390-018-1855-y ( ![]() |
[3] |
XIONG H Y, ZHANG D Q, WANG L Y, et al. EMC 3: energy-effcient data transfer in mobile crowdsensing under full coverage constraint[J]. IEEE Transactions on Mobile Computing, 2015, 14(7): 1355-1368. DOI:10.1109/TMC.2014.2357791 ( ![]() |
[4] |
GONG W, ZHANG B X, LI C. Task assignment in mobile crowdsensing: present and future directions[J]. IEEE Network, 2018, 32(4): 100-107. DOI:10.1109/MNET.2018.1700331 ( ![]() |
[5] |
WANG C W, LI C S, QIN C, et al. Maximizing spatial-temporal coverage in mobile crowd-sensing based on public transports with predictable trajectory[J]. International Journal of Distributed Sensor Networks, 2018, 14(8): 1-10. ( ![]() |
[6] |
YUAN S, ZHOU C R, YANG Z Q, et al. Task allocation based on whale optimization algorithm in crowdsensing systems[J]. Computer Engineering and Design, 2020, 41(7): 2031-2037. ( ![]() |
[7] |
JOHNSON S C. Hierarchical clustering schemes[J]. Psychometrika, 1967, 32(3): 241-254. DOI:10.1007/BF02289588 ( ![]() |
[8] |
WANG J T, WANG L Y, WANG Y S, et al. Task allocation in mobile crowd sensing: state-of-the-art and future opportunities[J]. IEEE Internet of Things Journal, 2018, 5(5): 3747-3757. DOI:10.1109/JIOT.2018.2864341 ( ![]() |
[9] |
YANG S, WU F, TANG S J, et al. Selecting most informative contributors with unknown costs for budgeted crowdsensing [C]//Proceedings of the 24th IEEE/ACM International Symposium on Quality of Service. Washington D. C., USA: IEEE Press, 2016: 1-6.
( ![]() |
[10] |
GONG W, ZHANG B X, LI C. Location-based online task assignment and path planning for mobile crowdsensing[J]. IEEE Transactions on Vehicular Technology, 2019, 68(2): 1772-1783. DOI:10.1109/TVT.2018.2884318 ( ![]() |
[11] |
GUO B, LIU Y, WU W L, et al. ActiveCrowd: a framework for optimized multitask allocation in mobile crowdsensing systems[J]. IEEE Transactions on Human-Machine Systems, 2017, 47(3): 392-403. DOI:10.1109/THMS.2016.2599489 ( ![]() |
[12] |
XIAO J L, LI P, NIE L. A reliable multi-task allocation based on reverse auction for mobile crowdsensing [C]//Proceedings of WASA'20. Berlin, Germany: Springer, 2020: 529-541.
( ![]() |
[13] |
WANG L Y, ZHANG D Q, WANG Y S, et al. Sparse mobile crowdsensing: challenges and opportunities[J]. IEEE Communications Magazine, 2016, 54(7): 161-167. DOI:10.1109/MCOM.2016.7509395 ( ![]() |
[14] |
KANG Y R, MIAO X, LIU K B, et al. Quality-aware online task assignment in mobile crowdsourcing[C]//Proceedings of MASS'15. Washington D. C., USA: IEEE Press, 2015: 127-135.
( ![]() |
[15] |
AKTER S, THU N T, YOON S. Deadline sensitive task assignment in mobile crowd sensing: a greedy approach [C]//Proceedings of ICTC'19. Washington D. C., USA: IEEE Press, 2019: 354-356.
( ![]() |
[16] |
YU H, MIAO C Y, SHEN Z Q, et al. Quality and budget aware task allocation for spatial crowdsourcing[C]//Proceedings of AAMAS'15. Washington D. C., USA: IEEE Press, 2015: 1689-1690.
( ![]() |
[17] |
GAO H, LIU C H, TIAN Y, et al. Ensuring high-quality data collection for mobile crowd sensing [C]//Proceedings of WCNC'17. Washington D. C., USA: IEEE Press, 2017: 1-6.
( ![]() |
[18] |
WANG J T, WANG F, WANG Y S, et al. HyTasker: hybrid task allocation in mobile crowd sensing[J]. IEEE Transactions on Mobile Computing, 2020, 19(3): 598-611. DOI:10.1109/TMC.2019.2898950 ( ![]() |
[19] |
GAO G J, XIAO M J, ZHAO Z H. Optimal multi-taxi dispatch for mobile taxi-hailing systems[C]//Proceedings of ICPP'16. Washington D. C., USA: IEEE Press, 2016: 294-303.
( ![]() |
[20] |
XING Q, SUN X M, YUAN C M. Online task allocation strategy for spatial crowd-sourcing based on prediction algorithm[J]. Application Research of Computers, 2020, 37(3): 868-871. ( ![]() |
[21] |
CHO E, MYERS S A, LESKOVEC J. Friendship and mobility: user movement in location-based social networks [C]//Proceedings of KDD'11. New York, USA: ACM Press, 2011: 1082-1090.
( ![]() |
[22] |
JIN X, WANG T C, LÜ C M, et al. Privacy preservation algorithm of original data in mobile crowd sensing[J]. Journal of Computer Applications, 2020, 40(11): 3249-3254. ( ![]() |
[23] |
LI M H, LI Y, FANG L M. ELPPS: an enhanced location privacy preserving scheme in mobile crowd-sensing network based on edge computing[C]//Proceedings of TrustCom'20. Washington D. C., USA: IEEE Press, 2021: 475-482.
( ![]() |