Path persistence in the cloud

Author:

Reda Waleed1,Bogdanov Kirill2,Milolidakis Alexandros2,Ghasemirahni Hamid2,Chiesa Marco2,Maguire Gerald Q.2,Kostić Dejan2

Affiliation:

1. KTH Royal Institute of Technology / Universite catholique de Louvain

2. KTH Royal Institute of Technology

Abstract

A commonly held belief is that traffic engineering and routing changes are infrequent. However, based on our measurements over a number of years of traffic between data centers in one of the largest cloud provider's networks, we found that it is common for flows to change paths at ten-second intervals or even faster. These frequent path and, consequently, latency variations can negatively impact the performance of cloud applications, specifically, latency-sensitive and geo-distributed applications. Our recent measurements and analysis focused on observing path changes and latency variations between different Amazon aws regions. To this end, we devised a path change detector that we validated using both ad hoc experiments and feedback from cloud networking experts. The results provide three main insights: (1) Traffic Engineering (TE) frequently moves (TCP and UDP) flows among network paths of different latency, (2) Flows experience unfair performance, where a subset of flows between two machines can suffer large latency penalties (up to 32% at the 95th percentile) or excessive number of latency changes, and (3) Tenants may have incentives to selfishly move traffic to low latency classes (to boost the performance of their applications). We showcase this third insight with an example using rsync synchronization. To the best of our knowledge, this is the first paper to reveal the high frequency of TE activity within a large cloud provider's network. Based on these observations, we expect our paper to spur discussions and future research on how cloud providers and their tenants can ultimately reconcile their independent and possibly conflicting objectives. Our data is publicly available for reproducibility and further analysis at http://goo.gl/25BKte.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Software

Reference39 articles.

1. All measurement data used in this paper can be found via this link goo.gl/25BKte All measurement data used in this paper can be found via this link goo.gl/25BKte

2. Avoiding traceroute anomalies with Paris traceroute

3. Measuring load-balanced paths in the internet

4. Multipath tracing with Paris traceroute

Cited by 14 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Control-Plane Observability Through Data-Plane Performance;2024 16th International Conference on COMmunication Systems & NETworkS (COMSNETS);2024-01-03

2. Roracle: Enabling Lookahead Routing for Scalable Traffic Engineering with Supervised Learning;2023 IEEE 31st International Conference on Network Protocols (ICNP);2023-10-10

3. Reinforcement Learning-based Traffic Engineering for QoS Provisioning and Load Balancing;2023 IEEE/ACM 31st International Symposium on Quality of Service (IWQoS);2023-06-19

4. LARRI: Learning-based Adaptive Range Routing for Highly Dynamic Traffic in WANs;IEEE INFOCOM 2023 - IEEE Conference on Computer Communications;2023-05-17

5. SDN-enabled Resource Provisioning Framework for Geo-Distributed Streaming Analytics;ACM Transactions on Internet Technology;2023-02-23

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3