Application-Aware Resource Allocation Based on Benefit–Cost Ratio in Computing Power Network with Heterogeneous Computing Resources
Abstract
:1. Introduction
2. Problem Statement
2.1. Applications in CPNs with Heterogeneous Resources
2.2. Resource Allocation for Applications
3. Application-Aware Resource Allocation Model
3.1. Network Model
3.2. AARA Algorithm
Algorithm 1: AARA |
Input: Give ; [1, 4], ; grade , {CPU, GPU} |
Output: , <> |
Initialized system computing nodes and network parameters; 1. While there are undeployed applications do 2. for each undeployed application r do 3. if m is CPU/GPU-intensive then 4. proportionally for CPU and GPU, {} 5. for each computing node do 6. Exclude node with insufficient computing resource 7. for each sufficient computing node do 8. Search for path set can satisfy by using KSP algorithm 9. if is the empty set then blocking 10. Search for grades set 11. for g in G do 12. Use Equations (1)–(8) compute BCR 13. end for 14. end if 15. end for 16. end for 17. end if 18. else if m is general then 19. Compute CPU or GPU computing resource required 20. The same as Step 5–16 21. end if 22. Find a computing node and grade with the max BCR; 23. Deploy application to computing node 24. Update computing node and network parameters. Remove tasks r from undeployed tasks set R 25. end for 26. end while 27. Use Equation (10) to compute . |
4. Simulation Setup and Results
4.1. Simulation Setup
4.2. Results and Analysis
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Lei, B.; Zhou, G. Exploration and practice of Computing Power Network (CPN) to Realize Convergence of Computing and Network. In Proceedings of the 2022 Optical Fiber Communications Conference and Exhibition (OFC), San Diego, CA, USA, 6–10 March 2022. [Google Scholar]
- Tang, X.; Cao, C.; Wang, Y.; Zhang, S.; Liu, Y.; Li, M.; He, T. Computing Power Network: The architecture of Convergence of Computing and Networking towards 6G Requirement. China Commun. 2021, 18, 175–185. [Google Scholar] [CrossRef]
- Ji, Y.; Zhang, J.; Xiao, Y.; Liu, Z. 5G flexible optical transport networks with large-capacity, low-latency and high-efficiency. China Commun. 2019, 16, 19–32. [Google Scholar] [CrossRef]
- Zeng, Q.; Du, Y.; Huang, K.; Leung, K. Energy-Efficient Resource Management for Federated Edge Learning With CPU-GPU Heterogeneous Computing. IEEE Trans. Wirel. Commun. 2021, 20, 7947–7962. [Google Scholar] [CrossRef]
- Goyat, S.; Kant, S.; Dhariwal, N. Dynamic Heterogeneous scheduling of GPU-CPU in Distributed Environment. In Proceedings of the 2019 International Conference on Smart Systems and Inventive Technology (ICSSIT), Tirunelveli, India, 27–29 November 2019. [Google Scholar]
- Chen, C.; Li, K.; Ouyang, A.; Zeng, Z.; Li, K. GFlink: An in-memory computing architecture on heterogeneous CPU-GPU clusters for big data. IEEE Trans. Parallel Distrib. Syst. 2018, 29, 1275–1288. [Google Scholar] [CrossRef]
- Heldens, S.; Varbanescu, A.L.; Iosup, A. Dynamic Load Balancing for High-Performance Graph Processing on Hybrid CPU-GPU Platforms. In Proceedings of the 2016 6th Workshop on Irregular Applications: Architecture and Algorithms (IA3), Salt Lake City, UT, USA, 13 November 2016. [Google Scholar]
- Jain, R.; Cheng, S.; Kalagi, V.; Sanghavi, V.; Kaul, S.; Arunachalam, M.; Maeng, K.; Jog, A.; Sivasubramaniam, A.; Kandemir, M.T.; et al. Optimizing CPU Performance for Recommendation Systems At-Scale. In Proceedings of the 50th Annual International Symposium on Computer Architecture (ISCA ′23), New York, NY, USA, 17–21 June 2023. [Google Scholar]
- Buber, E.; Diri, B. Performance Analysis and CPU vs GPU Comparison for Deep Learning. In Proceedings of the 2018 6th International Conference on Control Engineering & Information Technology (CEIT), Istanbul, Turkey, 25–27 October 2018. [Google Scholar]
- Qi, H.; Sparks, E.R.; Talwalkar, A. Paleo: A Performance Model for Deep Neural Networks. In Proceedings of the 2017 International Conference on Learning Representations, Toulon, France, 24–26 April 2017. [Google Scholar]
- Alsubaihi, S.; Gaudiot, J.-L. A Runtime Workload Distribution with Resource Allocation for CPU-GPU Heterogeneous Systems. In Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Lake Buena Vista, FL, USA, 29 May–2 June 2017. [Google Scholar]
- Peng, J.; Chen, J.; Kong, S.; Liu, D.; Qiu, M. Resource Optimization Strategy for CPU Intensive Applications in Cloud Computing Environment. In Proceedings of the 2016 IEEE 3rd International Conference on Cyber Security and Cloud Computing (CSCloud), Beijing, China, 25–27 June 2016. [Google Scholar]
- Melançon, G. Living Flows: Enhanced Exploration of Edge-Bundled Graphs Based on GPU-Intensive Edge Rendering. In Proceedings of the 2010 14th International Conference Information Visualisation, London, UK, 26–29 July 2010. [Google Scholar]
- Akutsu, H.; Naruko, T.; Suzuki, A. GPU-Intensive Fast Entropy Coding Framework for Neural Image Compression. In Proceedings of the 2021 International Conference on Visual Communications and Image Processing (VCIP), Munich, Germany, 5–8 December 2021. [Google Scholar]
- Feng, G.; Garg, S.; Buyya, R.; Li, W. Revenue Maximization Using Adaptive Resource Provisioning in Cloud Computing Environments. In Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing, Beijing, China, 20–23 September 2012. [Google Scholar]
- Samanta, A.; Chang, Z. Adaptive Service Offloading for Revenue Maximization in Mobile Edge Computing with Delay-Constraint. IEEE IoT J. 2019, 6, 3864–3872. [Google Scholar] [CrossRef]
- Yadav, A.; Rathod, S.B. Priority based task scheduling by mapping conflict-free resources and optimized workload utilization in cloud computing. In Proceedings of the 2016 International Conference on Computing Communication Control and automation (ICCUBEA), Pune, India, 12–13 August 2016. [Google Scholar]
- Yang, G.; Shin, C.; Lee, J.; Yoo, Y. Prediction of the Resource Consumption of Distributed Deep Learning Systems. Proc. ACM Meas. Anal. Comput. Syst. 2022, 6, 1–25. [Google Scholar]
- Cao, J.; Fu, J.; Li, M.; Chen, J. CPU load prediction for cloud environment based on a dynamic ensemble model. Softw. Pract. Exper. 2014, 44, 793–804. [Google Scholar] [CrossRef]
- Duan, L.; Zhan, D.; Hohnerlein, J. Optimizing Cloud Data Center Energy Efficiency via Dynamic Prediction of CPU Idle Intervals. In Proceedings of the 2015 IEEE 8th International Conference on Cloud Computing, New York, NY, USA, 27 June–2 July 2015. [Google Scholar]
- Liu, C.; Li, K.; Li, K. Minimal Cost Server Configuration for Meeting Time-Varying Resource Demands in Cloud Centers. IEEE Trans. Parallel Distrib. Syst. 2018, 29, 2503–2513. [Google Scholar] [CrossRef]
- Cardellini, V.; Valerio, V.D.; Presti, F.L. Game-Theoretic Resource Pricing and Provisioning Strategies in Cloud Systems. IEEE Trans. Serv. Comput. 2020, 13, 86–98. [Google Scholar] [CrossRef]
- Hu, J.; Li, K.; Liu, C.; Li, K. A Game-Based Price Bidding Algorithm for Multi-Attribute Cloud Resource Provision. IEEE Trans. Serv. Comput. 2021, 14, 1111–1122. [Google Scholar] [CrossRef]
- Ma, H.; Zhang, J.; Gu, Z.; Yu, H.; Taleb, T.; Ji, Y. DeepDefrag: Spatio-Temporal Defragmentation of Time-Varying Virtual Networks in Computing Power Network based on Model-Assisted Reinforcement Learning. In Proceedings of the 2022 European Conference on Optical Communication (ECOC), Basel, Switzerland, 18–22 September 2022. [Google Scholar]
- Lin, X.; Wu, W.; Zhu, Y.; Qiu, T.; Mi, Z. SARS: A novel QoE based service-aware resource scheduling scheme in wireless network. In Proceedings of the 2016 IEEE International Conference on Ubiquitous Wireless Broadband (ICUWB), Nanjing, China, 16–19 October 2016. [Google Scholar]
- Santos, A.S.; de Santi, J.; Figueiredo, G.B.; Mukherjee, B. Application-Aware Service Degradation in Elastic Optical Networks. IEEE Trans. Netw. Serv. Manag. 2022, 19, 949–961. [Google Scholar] [CrossRef]
- Suarez-Varela, J.; Mestres, A.; Yu, J.; Kuang, L.; Feng, H.; Barlet-Ros, P.; Cabellos-Aparicio, A. Feature Engineering for Deep Reinforcement Learning Based Routing. In Proceedings of the ICC 2019—2019 IEEE International Conference on Communications (ICC), Shanghai, China, 20–24 May 2019. [Google Scholar]
- Tang, X.; Fu, Z. CPU–GPU Utilization Aware Energy-Efficient Scheduling Algorithm on Heterogeneous Computing Systems. IEEE Access 2020, 8, 58948–58958. [Google Scholar] [CrossRef]
- Nag, A.; Tornatore, M.; Mukherjee, B. Optical Network Design with Mixed Line Rates and Multiple Modulation Formats. J. Light. Technol. 2010, 28, 466–475. [Google Scholar] [CrossRef]
- Lim, W.; Kourtessis, P.; Milosavljevic, M.; Senior, J.M. Dynamic Subcarrier Allocation for 100 Gbps, 40 km OFDMA-PONs with SLA and CoS. J. Light. Technol. 2013, 31, 1055–1062. [Google Scholar] [CrossRef]
Category | Parameters | Value |
---|---|---|
Network parameters | [1, 8, 12] | |
[5, 10] | ||
4TFLOPS | ||
8TFLOPS | ||
[50, 100, 150, 200] GFLOPS | ||
[100, 200, 300, 400] GFLOPS | ||
50GFLOPS | ||
100GFLOPS | ||
80 | ||
100 Gbps | ||
5 μs/km | ||
Computing task | [1, 2, 3, 4] | |
[4, 20] FLOPs | ||
[3, 8] Gbps | ||
[10, 30] ms | ||
20% | ||
35% | ||
45% | ||
1:4/4:1/1:3/3:1 | ||
Cost value | 0.2/(CUs · time unit) | |
0.4/(CUs · time unit) | ||
0.00375/(Gbps · km · time unit) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, Y.; Li, Y.; Guo, J.; Fan, Y.; Chen, L.; Zhang, B.; Wang, W.; Zhao, Y.; Zhang, J. Application-Aware Resource Allocation Based on Benefit–Cost Ratio in Computing Power Network with Heterogeneous Computing Resources. Photonics 2023, 10, 1273. https://doi.org/10.3390/photonics10111273
Wang Y, Li Y, Guo J, Fan Y, Chen L, Zhang B, Wang W, Zhao Y, Zhang J. Application-Aware Resource Allocation Based on Benefit–Cost Ratio in Computing Power Network with Heterogeneous Computing Resources. Photonics. 2023; 10(11):1273. https://doi.org/10.3390/photonics10111273
Chicago/Turabian StyleWang, Yahui, Yajie Li, Jiaxing Guo, Yingbo Fan, Ling Chen, Boxin Zhang, Wei Wang, Yongli Zhao, and Jie Zhang. 2023. "Application-Aware Resource Allocation Based on Benefit–Cost Ratio in Computing Power Network with Heterogeneous Computing Resources" Photonics 10, no. 11: 1273. https://doi.org/10.3390/photonics10111273
APA StyleWang, Y., Li, Y., Guo, J., Fan, Y., Chen, L., Zhang, B., Wang, W., Zhao, Y., & Zhang, J. (2023). Application-Aware Resource Allocation Based on Benefit–Cost Ratio in Computing Power Network with Heterogeneous Computing Resources. Photonics, 10(11), 1273. https://doi.org/10.3390/photonics10111273