Next Article in Journal
Personalised and Coordinated Demand-Responsive Feeder Transit Service Design: A Genetic Algorithms Approach
Previous Article in Journal
Clock Recovery Challenges in DSP-Based Coherent Single-Mode and Multi-Mode Optical Systems
Article Menu
Issue 7 (July) cover image

Export Article

Open AccessArticle
Future Internet 2018, 10(7), 60; https://doi.org/10.3390/fi10070060

A Novel Two-Layered Reinforcement Learning for Task Offloading with Tradeoff between Physical Machine Utilization Rate and Delay

1
School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China
2
School of Computer and Information, Hefei University of Technology, Hefei 230000, China
*
Author to whom correspondence should be addressed.
Received: 28 March 2018 / Revised: 24 May 2018 / Accepted: 22 June 2018 / Published: 1 July 2018
Full-Text   |   PDF [4497 KB, uploaded 1 July 2018]   |  

Abstract

Mobile devices could augment their ability via cloud resources in mobile cloud computing environments. This paper developed a novel two-layered reinforcement learning (TLRL) algorithm to consider task offloading for resource-constrained mobile devices. As opposed to existing literature, the utilization rate of the physical machine and the delay for offloaded tasks are taken into account simultaneously by introducing a weighted reward. The high dimensionality of the state space and action space might affect the speed of convergence. Therefore, a novel reinforcement learning algorithm with a two-layered structure is presented to address this problem. First, k clusters of the physical machines are generated based on the k-nearest neighbors algorithm (k-NN). The first layer of TLRL is implemented by a deep reinforcement learning to determine the cluster to be assigned for the offloaded tasks. On this basis, the second layer intends to further specify a physical machine for task execution. Finally, simulation examples are carried out to verify that the proposed TLRL algorithm is able to speed up the optimal policy learning and can deal with the tradeoff between physical machine utilization rate and delay. View Full-Text
Keywords: mobile device; task offloading; tradeoff; mobile cloud computing; two layered reinforcement learning mobile device; task offloading; tradeoff; mobile cloud computing; two layered reinforcement learning
Figures

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).
SciFeed

Share & Cite This Article

MDPI and ACS Style

Quan, L.; Wang, Z.; Ren, F. A Novel Two-Layered Reinforcement Learning for Task Offloading with Tradeoff between Physical Machine Utilization Rate and Delay. Future Internet 2018, 10, 60.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Future Internet EISSN 1999-5903 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top