PFDCT: An Enhanced Transport Layer Protocol for Precise Flow Control in Data Centers
Abstract
:1. Introduction
- In order to provide long-term stable throughput for the link, PFDCT only performs ECN marking on long flows, and PFDCT optimizes the traditional ECN fixed marking threshold to a traffic-aware dynamic marking threshold to cope with complex and frequently changing data center network environments.
- PFDCT has developed a priority-based queue management mechanism according to the urgency of data packet transmission, data transfer rate, and data flow size. It simultaneously guarantees the transmission quality of long-flow packets, short-flow packets, and retransmission packets, which constitute almost all traffic in the data center network. The above measures make the ECN marking algorithm of PFDCT more fair and efficient during data transfer.
- Considering the complexity of the model and measurement accuracy, PFDCT combines two kinds of feedback information, the readily available Round-Trip Time (RTT) from the network and the widely supported ECN marking, to obtain the most accurate link status while saving link overhead. The above measures can ensure that the work of PFDCT is more reasonable and efficient.
- We tested the performance of the PFDCT protocol in terms of end-to-end delay, flow completion time, and packet loss rate, and compared it with the scheme under the fixed threshold, finding that the above indicators of PFDCT are better than the fixed-threshold one to varying degrees. Extensive evaluations showed that PFDCT performs much better on FCT compared to DCTCP, ICTCP, and L2DCT in the simple-scale topology. For a link load of 100%, the throughput of PFDCT is 0.41 MB higher than that of L2DCT, which is better than other similar schemes. In the large-scale topology, PFDCT also maintains a steady increase in throughput, with a stable fairness index of 0.55, which is more than 20% higher than ICTCP.
2. Related Work
3. PFDCT Design
3.1. Basic Idea
3.2. Protocol Details
3.2.1. Packet Types
3.2.2. Switch Service Model
3.2.3. Discriminating Priority Assignment
Algorithm 1 PFDCT Switch |
Input: SocketPriorityTag priorityTag Output: prioband
|
3.2.4. Congestion Window Update
3.2.5. Dynamic ECN Marking Threshold Setting
Algorithm 2 PFDCT Sender |
Input: Flowsize Urgencydegree Output: The updated threshold
|
4. Impact of the Threshold of PFDCT
5. Evaluation
5.1. Simulation Setup
5.2. Simple-Scale Experiments
5.3. Large-Scale Experiments
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Alipio, M.; Tiglao, N.M.; Bokhari, F.; Khalid, S. TCP incast solutions in data center networks: A classification and survey. J. Netw. Comput. Appl. 2019, 146, 102421. [Google Scholar] [CrossRef]
- Alasmar, M.; Parisis, G.; Crowcroft, J. SCDP: Systematic Rateless Coding for Efficient Data Transport in Data Centres (Complete Version). arXiv 2019, arXiv:1909.08928. [Google Scholar]
- Devkota, P.; Reddy, A.N. Performance of quantized congestion notification in TCP incast scenarios of data centers. In Proceedings of the 2010 IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, Miami Beach, FL, USA, 17–19 August 2010; IEEE: Piscataway, NJ, USA, 2010; pp. 235–243. [Google Scholar] [CrossRef]
- Mittal, R.; Lam, V.T.; Dukkipati, N.; Blem, E.; Wassel, H.; Ghobadi, M.; Vahdat, A.; Wang, Y.; Wetherall, D.; Zats, D. TIMELY: RTT-based congestion control for the datacenter. ACM SIGCOMM Comput. Commun. Rev. 2015, 45, 537–550. [Google Scholar] [CrossRef]
- Rezaei, H.; Chaudhry, M.U.; Almasi, H.; Vamanan, B. Icon: Incast congestion control using packet pacing in datacenter networks. In Proceedings of the 2019 11th International Conference on Communication Systems & Networks (COMSNETS), Bengaluru, India, 7–11 January 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 125–132. [Google Scholar] [CrossRef]
- Le, L.; Aikat, J.; Jeffay, K.; Smith, F.D. Differential congestion notification: Taming the elephants. In Proceedings of the 12th IEEE International Conference on Network Protocols, ICNP 2004, Washington, DC, USA, 5–8 October 2004; IEEE: Piscataway, NJ, USA, 2004; pp. 118–128. [Google Scholar]
- Abdelmoniem, A.M.; Bensaou, B. Curbing timeouts for TCP-incast in data centers via a cross-layer faster recovery mechanism. In Proceedings of the IEEE INFOCOM 2018—IEEE Conference on Computer Communications, Honolulu, HI, USA, 16–19 April 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 675–683. [Google Scholar] [CrossRef]
- Wu, H.; Farha, F.; Hong, T.; Xu, Y.; Zhang, T. CWND: A Coarse, But Simple, Efficient Metric to Reduce Short Flow Completion Time in Data Centers. IEEE Access 2019, 7, 172496–172504. [Google Scholar] [CrossRef]
- Bai, W.; Chen, K.; Wu, H.; Lan, W.; Zhao, Y. PAC: Taming TCP incast congestion using proactive ACK control. In Proceedings of the 2014 IEEE 22nd International Conference on Network Protocols, Raleigh, NC, USA, 21–24 October 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 385–396. [Google Scholar] [CrossRef]
- Huang, J.; He, T.; Huang, Y.; Wang, J. ARS: Cross-layer adaptive request scheduling to mitigate TCP incast in data center networks. In Proceedings of the IEEE INFOCOM 2016—The 35th Annual IEEE International Conference on Computer Communications, San Francisco, CA, USA, 10–14 April 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1–9. [Google Scholar] [CrossRef]
- Lee, C.; Park, C.; Jang, K.; Moon, S.; Han, D. Accurate latency-based congestion feedback for datacenters. In Proceedings of the 2015 USENIX Annual Technical Conference (USENIX ATC 15), Santa Clara, CA, USA, 8–10 July 2015; pp. 403–415. [Google Scholar]
- Wu, H.; Feng, Z.; Guo, C.; Zhang, Y. ICTCP: Incast congestion control for TCP in data-center networks. IEEE/ACM Trans. Netw. 2012, 21, 345–358. [Google Scholar] [CrossRef]
- Ramakrishnan, K.; Floyd, S.; Black, D. The Addition of Explicit Congestion Notification (ECN) to IP; Technical Report; IETF: Fremont, CA, USA, 2001. [Google Scholar]
- Pan, R.; Prabhakar, B.; Laxmikantha, A. QCN: Quantized congestion notification. In Proceedings of the IEEE802, Geneva, Switzerland, 29 May 2007; Volume 1, pp. 52–83. [Google Scholar]
- Luo, J.; Yang, X.; Zhang, C. CCMA: A Dynamical Concurrent-Connection Management Agent to Mitigate TCP Incast in Datacenters. IEEE Access 2019, 7, 63303–63320. [Google Scholar] [CrossRef]
- Suryavanshi, M.; Kumar, A.; Yadav, J. An application layer technique to overcome TCP incast in data center network using delayed server response. Int. J. Inf. Technol. 2021, 13, 703–711. [Google Scholar] [CrossRef]
- Wang, H.; Shen, H. Proactive incast congestion control in a datacenter serving web applications. In Proceedings of the IEEE INFOCOM 2018—IEEE Conference on Computer Communications, Honolulu, HI, USA, 16–19 April 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 19–27. [Google Scholar]
- Zou, S.; Huang, J.; Wang, J.; He, T. Flow-aware adaptive pacing to mitigate TCP incast in data center networks. IEEE/ACM Trans. Netw. 2020, 29, 134–147. [Google Scholar] [CrossRef]
- Kumar, G.; Dukkipati, N.; Jang, K.; Wassel, H.M.; Wu, X.; Montazeri, B.; Wang, Y.; Springborn, K.; Alfeld, C.; Ryan, M.; et al. Swift: Delay is simple and effective for congestion control in the datacenter. In Proceedings of the Annual Conference of the ACM Special Interest Group on Data Communication on the Applications, Technologies, Architectures, and Protocols for Computer Communication, Virtual Event, 10–14 August 2020; pp. 514–528. [Google Scholar]
- Huang, J.; Huang, Y.; Wang, J.; He, T. Adjusting packet size to mitigate TCP incast in data center networks with COTS switches. IEEE Trans. Cloud Comput. 2018, 8, 749–763. [Google Scholar] [CrossRef]
- Cheng, P.; Ren, F.; Shu, R.; Lin, C. Catch the whole lot in an action: Rapid precise packet loss notification in data center. In Proceedings of the 11th USENIX Symposium on Networked Systems Design and Implementation (NSDI 14), Seattle, WA, USA, 2–4 April 2014; pp. 17–28. [Google Scholar]
- Floyd, S.; Jacobson, V. Random early detection gateways for congestion avoidance. IEEE/ACM Trans. Netw. 1993, 1, 397–413. [Google Scholar] [CrossRef]
- Alizadeh, M.; Greenberg, A.; Maltz, D.A.; Padhye, J.; Patel, P.; Prabhakar, B.; Sengupta, S.; Sridharan, M. Data center tcp (dctcp). In Proceedings of the ACM SIGCOMM 2010 Conference, New Delhi, India, 30 August–3 September 2010; pp. 63–74. [Google Scholar] [CrossRef]
- Bai, W.; Chen, L.; Chen, K.; Wu, H. Enabling ECN in multi-service multi-queue data centers. In Proceedings of the 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI 16), Santa Clara, CA, USA, 16–18 March 2016; pp. 537–549. [Google Scholar]
- Shan, D.; Ren, F. Improving ECN marking scheme with micro-burst traffic in data center networks. In Proceedings of the IEEE INFOCOM 2017—IEEE Conference on Computer Communications, Atlanta, GA, USA, 1–4 May 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1–9. [Google Scholar] [CrossRef]
- Wang, S.; Zhang, J.; Huang, T.; Pan, T.; Liu, J.; Liu, Y. A-ECN minimizing queue length for datacenter networks. IEEE Access 2020, 8, 49100–49111. [Google Scholar] [CrossRef]
- Vasudevan, V.; Phanishayee, A.; Shah, H.; Krevat, E.; Andersen, D.G.; Ganger, G.R.; Gibson, G.A.; Mueller, B. Safe and effective fine-grained TCP retransmissions for datacenter communication. ACM SIGCOMM Comput. Commun. Rev. 2009, 39, 303–314. [Google Scholar] [CrossRef]
- Xu, Y.; Shukla, S.; Guo, Z.; Liu, S.; Tam, A.S.W.; Xi, K.; Chao, H.J. RAPID: Avoiding TCP incast throughput collapse in public clouds with intelligent packet discarding. IEEE J. Sel. Areas Commun. 2019, 37, 1911–1923. [Google Scholar] [CrossRef]
- Abdelmoniem, A.M.; Bensaou, B. T-RACKs: A faster recovery mechanism for TCP in data center networks. IEEE/ACM Trans. Netw. 2021, 29, 1074–1087. [Google Scholar] [CrossRef]
- Sreekumari, P.; Jung, J.I.; Lee, M. A simple and efficient approach for reducing TCP timeouts due to lack of duplicate acknowledgments in data center networks. Clust. Comput. 2016, 19, 633–645. [Google Scholar] [CrossRef]
- Zhang, J.; Ren, F.; Tang, L.; Lin, C. Modeling and solving TCP incast problem in data center networks. IEEE Trans. Parallel Distrib. Syst. 2014, 26, 478–491. [Google Scholar] [CrossRef]
- Zhuang, J.; Jiang, X.; Jin, G.; Zhu, J.; Chen, H. PTCP: A priority-driven congestion control algorithm to tame TCP incast in data centers. IEEE Access 2019, 7, 38880–38889. [Google Scholar] [CrossRef]
- Alizadeh, M.; Greenberg, A.; Maltz, D.; Padhye, J.; Patel, P.; Prabhakar, B.; Sengupta, S.; Sridharan, M. DCTCP: Efficient Packet Transport for the Commoditized Data Center; ACM SIGCOMM: Amsterdam, The Netherlands, 1973; pp. 1–15. [Google Scholar]
- Xu, L.; Xu, K.; Jiang, Y.; Ren, F.; Wang, H. Throughput optimization of TCP incast congestion control in large-scale datacenter networks. Comput. Netw. 2017, 124, 46–60. [Google Scholar] [CrossRef]
- Munir, A.; Qazi, I.A.; Uzmi, Z.A.; Mushtaq, A.; Ismail, S.N.; Iqbal, M.S.; Khan, B. Minimizing flow completion times in data centers. In Proceedings of the 2013 Proceedings IEEE INFOCOM, Turin, Italy, 14–19 April 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 2157–2165. [Google Scholar] [CrossRef]
Parameters | Description |
---|---|
Current RTT | |
The minimum RTT | |
Current congestion window | |
The ideal threshold | |
The dynamical threshold | |
Congestion indicator | |
The current queue length | |
g | Weight factor for congestion factor |
The percentage of packets that are tagged | |
h | Weight factor for |
Flow Numbers | L2DCT (s) | ICTCP (s) | DCTCP (s) | PFDCT (s) |
---|---|---|---|---|
10 | 4.168 | 4.311 | 4.166 | 4.267 |
20 | 3.898 | 3.904 | 3.774 | 3.810 |
30 | 3.402 | 3.419 | 3.243 | 3.185 |
40 | 2.890 | 2.910 | 2.724 | 2.658 |
50 | 2.459 | 2.448 | 2.433 | 2.367 |
60 | 2.499 | 2.441 | 2.346 | 2.295 |
70 | 2.487 | 2.463 | 2.412 | 2.300 |
80 | 2.481 | 2.472 | 2.441 | 2.324 |
90 | 2.518 | 2.472 | 2.364 | 2.300 |
100 | 2.510 | 2.488 | 2.440 | 2.305 |
Avg.FCT | 2.931 | 2.933 | 2.834 | 2.781 |
Time (s) | (Mbps) | (Mbps) | (Mbps) | (Mbps) |
---|---|---|---|---|
0–5 | 131.48 | 22.88 | 201.31 | 166.47 |
5–10 | 611.81 | 216.14 | 757.68 | 748.86 |
10–15 | 1028.46 | 568.59 | 1275.68 | 1286.10 |
15–20 | 1444.95 | 1058.17 | 1694.90 | 1816.22 |
Avg.throughput | 797.20 | 464.44 | 972.69 | 998.89 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, Z.; Wang, H.; Yi, X.; Zhang, X.; Wu, J.; Zhang, Y.; Luo, P.; Liu, K. PFDCT: An Enhanced Transport Layer Protocol for Precise Flow Control in Data Centers. Electronics 2023, 12, 1890. https://doi.org/10.3390/electronics12081890
Li Z, Wang H, Yi X, Zhang X, Wu J, Zhang Y, Luo P, Liu K. PFDCT: An Enhanced Transport Layer Protocol for Precise Flow Control in Data Centers. Electronics. 2023; 12(8):1890. https://doi.org/10.3390/electronics12081890
Chicago/Turabian StyleLi, Zhuo, Huiyan Wang, Xiangdong Yi, Xinyi Zhang, Jiawen Wu, Yubin Zhang, Peng Luo, and Kaihua Liu. 2023. "PFDCT: An Enhanced Transport Layer Protocol for Precise Flow Control in Data Centers" Electronics 12, no. 8: 1890. https://doi.org/10.3390/electronics12081890
APA StyleLi, Z., Wang, H., Yi, X., Zhang, X., Wu, J., Zhang, Y., Luo, P., & Liu, K. (2023). PFDCT: An Enhanced Transport Layer Protocol for Precise Flow Control in Data Centers. Electronics, 12(8), 1890. https://doi.org/10.3390/electronics12081890