Submit to this Journal Review for this Journal Propose a Special Issue

Article Menu

Share Help Cite Discuss in SciProfiles

Open AccessArticle

Peer-Review Record

Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation

Information 2023, 14(11), 597; https://doi.org/10.3390/info14110597

by Suleiman Abahussein¹, Dayong Ye^1,*, Congcong Zhu², Zishuo Cheng³, Umer Siddique⁴

and Sheng Shen³

Reviewer 1: Anonymous

Reviewer 2:

José Luis Hernández-Hernández

Reviewer 3:

Habib Hamam

Information 2023, 14(11), 597; https://doi.org/10.3390/info14110597

Submission received: 14 August 2023 / Revised: 17 October 2023 / Accepted: 1 November 2023 / Published: 3 November 2023

(This article belongs to the Section Artificial Intelligence)

Round 1

Reviewer 1 Report

Comments and Suggestions for Authors

1 The article is not highly related to agriculture.

2 The objectives of the research need to be clearly defined.

3 The abstract did not explain the results and draw conclusions.

4 The evaluation cannot be based solely on number of food delivery orders.

5 The algorithm can guide the couriers to the areas with a high demand for food delivery orders, which seems to imply a higher waiting time for the areas with a low demand for food delivery orders.

6 The article needs to clarify the purpose of privacy protection, which is to prevent who obtains private information.

7 There seems to be no significant correlation between privacy protection and area filtering. They are more inclined towards two independent topics.

8 The article does not propose a new algorithm, it is only an application of existing algorithms.

Comments on the Quality of English Language

Language is expected to be improved and polished.

Author Response

Comment 1: The article is not highly related to agriculture.

Response: We appreciate your comment, and we apologize for any confusion. Our research aligns with the journal's areas of interest, as our work focus on the application of artificial intelligence (specifically, multi-agent reinforcement learning) and data security. Our study addresses two critical challenges: improving the performance of online food delivery services and safeguarding user privacy within these platforms by preventing the disclosure of location information.

Comment 2: The objectives of the research need to be clearly defined.

Response: Thank you for your valuable comment. In this research we have two main objectives as the follow:

Introducing a method based on multi-agent reinforcement learning for online food delivery services, employing two multi-agent reinforcement learning algorithms. The principal aim of this approach is to increase the number of food delivery orders received, consequently enhancing the long-term income for couriers.

We propose a privacy-preserving defense method for safeguarding user privacy in online food delivery services. Our approach is designed to protect customer location information within online food delivery services using the differential privacy Laplace mechanism, achieved by inject Laplace noise to both customer locations and courier trajectories. The privacy parameter ε, which influences the level of noise injected, depends on two key factors: the size of the city area and the frequency of customer online food delivery orders.

Comment 3: The abstract did not explain the results and draw conclusions.

Response: Thank you for this insightful comment. We update the paper to comply with this comment.

Comment 4: The evaluation cannot be based solely on number of food delivery orders.

Response: Thank you for raising this issue. In this research, we propose two solutions: one for enhancing food delivery services to increase the number of orders and consequently boost long-term income. The results are based on

Multi agent reinforcement learning (MARL) rewards as the MARL get rewards when achieve the goal.
The average number of collected food delivery orders as this is the main objective.

The second solution was proposed a method to solve the privacy issue in online food delivery (location information) and we used multiple method to present our results:

Hausdorff Distance to evaluate the data utility and differences after adding Laplace
noise and use our method to show the similarity between the two datasets of location (the original datasets and obfuscated datasets).
privacy parameter ε distribution generated by PULM algorithm

Comment 5: The algorithm can guide the couriers to the areas with a high demand for food delivery orders, which seems to imply a higher waiting time for the areas with a low demand for food delivery orders.

Response: Thank you for your comment. Directing couriers to high-demand areas can effectively reduce average waiting times. This approach ensures that a majority of couriers are dispatched to regions with a high demand for services. In contrast, sending couriers primarily to low-demand areas can result in extended waiting times for customers in high-demand areas, while couriers may find themselves with minimal tasks to perform.

Comment 6: The article needs to clarify the purpose of privacy protection, which is to prevent who obtains private information.

Response: We much appreciate your insightful comments. Thousands of food delivery orders are received by these food delivery services every single day, resulting in the collection of vast amounts of user data. This data could potentially be hosted by a third party and further processed for training and analysis. Additionally, the IT department might be outsourced to a third party. Various access permissions are granted to different types of data, creating opportunities for unauthorized access to customer information. Furthermore, adversaries could employ various attacks, such as inference attacks, to potentially infer sensitive information, thus posing a significant threat to customer privacy, including the disclosure of their location.

Comment 7: There seems to be no significant correlation between privacy protection and area filtering. They are more inclined towards two independent topics.

Response: We greatly appreciate your insightful feedback. Online food delivery services are essential to many people who rely on this solution for their meals. Our solution focuses on enhancing this service and we found there is potential threat that can come of using this service, leading us to propose a defence method.

Comment 8: The article does not propose a new algorithm, it is only an application of existing algorithms.

Response: Thank you for your comments. This research proposes two solution one for the online food delivery services and how increase the number of order and we propose PULM method to protect customer location information.

Reviewer 2 Report

Comments and Suggestions for Authors

Multi agent reinforcement learning for online food delivery with location privacy preserving

1. Very interesting research entitled “Multi agent reinforcement learning for online food delivery with location privacy preserving”.

2. Correct the structure of the article (see attached file).

** Check "Microsoft Word template" from information-MDPI.

https://www.mdpi.com/files/word-templates/information-template.dot

3. I suggest restructuring the article according to the journal template (link above).

4. Place the title of table 1 on a single line. Avoid using only capital letters. Use the format indicated below. (see attached file).

5. The equation of line 377 is not numbered. Correct.

6. In the middle of lines 431-438, is figure 2. The figure must be outside the paragraph.

7. In the middle of lines 570-573, there are figures 5 and 6. The figures must be outside the paragraph.

8. In the middle of lines 593-597, there are figures 7 and 8. The figures must be outside the paragraph.

9. The title of the figures (3, 4, 5, 6, 7 and 8) must be short. The previous paragraph should explain the figure.

10. The paragraph on lines 70-75 should be deleted, it is not appropriate to anticipate what will be dealt with in later sections.

11. How "Multi-agent reinforcement learning" is applied to train and guide the agent to the area with high demand for food delivery orders. Explain in detail.

12. How noise is injected for “preserving the privacy of customer location“, using PULM method. Explain the procedure in detail.

13. Consider future work on this research.

14. Very good bibliography.

The article has good content and very interesting.

Authors are requested to make all indicated corrections.

Comments for author File: Comments.pdf

Author Response

Comment 1: Very interesting research entitled “Multi agent reinforcement learning for online food delivery with location privacy preserving”.

Response: Thank you for your feedback, and we appreciate this feedback.

Comment 2: Correct the structure of the article (see attached file).

Response: Thank you for your comments. We convert our paper to the structure you suggested but we add two more parts: Preliminary and Experiment Design

Comment 3: I suggest restructuring the article according to the journal template (link above).

Response: Thank you for your comments. We convert our paper to the structure you suggested but we add two more parts: Preliminary and Experiment Design

Comment 4: Place the title of table 1 on a single line. Avoid using only capital letters. Use the format indicated below. (see attached file).