Practical Packet Deflection in Datacenters

Author:

Abdous Sepehr1ORCID,Sharafzadeh Erfan1ORCID,Ghorbani Soudeh2ORCID

Affiliation:

1. Johns Hopkins University, Baltimore, MD, USA

2. Johns Hopkins University & Meta, Baltimore, MD, USA

Abstract

Bursts, sudden surges in network utilization, are a significant root cause of packet loss and high latency in datacenters. Packet deflection, re-routing packets that arrive at a local hotspot to neighboring switches, is shown to be a potent countermeasure against bursts. Unfortunately, existing deflection techniques cannot be implemented in today's datacenter switches. This is because, to minimize packet drops and remain effective under extreme load, existing deflection techniques rely on certain hardware primitives (e.g., extracting packets from arbitrary locations in the queue) that datacenter switches do not support. In this paper, we address the implementability hurdles of packet deflection. This paper proposes heuristics for approximating state-of-the-art deflection techniques in programmable switches. We introduce Simple Deflection which deflects excess traffic to randomly selected, non-congested ports and Preemptive Deflection (PD) in which switches identify the packets likely to be selected for deflection and preemptively deflect them before they are enqueued. We implement and evaluate our techniques on a testbed with Intel Tofino switches. Our testbed evaluations show that Simple and Preemptive Deflection improve the 99th percentile response times by 8× and 425×, respectively, compared to a baseline drop-tail queue under 90% load. Using large-scale network simulations, we show that the performance of our algorithms is close to the deflection techniques that they intend to approximate, e.g., PD achieves 4% lower 99th percentile query completion times (QCT) than Vertigo, a recent deflection technique that cannot be implemented in off-the-shelf switches, and 2.5× lower QCT than ECMP under 95% load.

Funder

Facebook

NSF

Intel Fast Forward

Publisher

Association for Computing Machinery (ACM)

Reference84 articles.

1. 2020. INET Framework. https://inet.omnetpp.org/. 2020. INET Framework. https://inet.omnetpp.org/.

2. 2020. OMNeT++ Simulator. https://omnetpp.org/. 2020. OMNeT++ Simulator. https://omnetpp.org/.

3. 2020. Open Tofino. https://github.com/barefootnetworks/Open-Tofino. 2020. Open Tofino. https://github.com/barefootnetworks/Open-Tofino.

4. 2023. Intel Tofino 1 products. https://www.intel.com/content/www/us/en/products/details/network-io/intelligent-fabric-processors/tofino/products.html. 2023. Intel Tofino 1 products. https://www.intel.com/content/www/us/en/products/details/network-io/intelligent-fabric-processors/tofino/products.html.

5. 2023. Tofino 2. https://www.intel.com/content/www/us/en/products/network-io/programmable-ethernet-switch/tofino-2-series.html. 2023. Tofino 2. https://www.intel.com/content/www/us/en/products/network-io/programmable-ethernet-switch/tofino-2-series.html.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3