LEAF: Navigating Concept Drift in Cellular Networks

Author:

Liu Shinan1ORCID,Bronzino Francesco2ORCID,Schmitt Paul3ORCID,Bhagoji Arjun Nitin1ORCID,Feamster Nick1ORCID,Crespo Hector Garcia4ORCID,Coyle Timothy5ORCID,Ward Brian6ORCID

Affiliation:

1. University of Chicago, Chicago, IL, USA

2. Univ Lyon, EnsL, UCBL, CNRS, LIP, Lyon, France

3. University of Hawaii, Manoa, Honolulu, HI, USA

4. Verizon, North Richland Hills, TX, USA

5. Verizon, Chicopee, MA, USA

6. Verizon, Fort Worth, TX, USA

Abstract

Operational networks commonly rely on machine learning models for many tasks, including detecting anomalies, inferring application performance, and forecasting demand. Yet, model accuracy can degrade due to concept drift, whereby the relationship between the features and the target to be predicted changes. Mitigating concept drift is an essential part of operationalizing machine learning models in general, but is of particular importance in networking's highly dynamic deployment environments. In this paper, we first characterize concept drift in a large cellular network for a major metropolitan area in the United States. We find that concept drift occurs across many important key performance indicators (KPIs), independently of the model, training set size, and time interval---thus necessitating practical approaches to detect, explain, and mitigate it. We then show that frequent model retraining with newly available data is not sufficient to mitigate concept drift, and can even degrade model accuracy further. Finally, we develop a new methodology for concept drift mitigation, Local Error Approximation of Features (LEAF). LEAF works by detecting drift; explaining the features and time intervals that contribute the most to drift; and mitigates it using forgetting and over-sampling. We evaluate LEAF against industry-standard mitigation approaches (notably, periodic retraining) with more than four years of cellular KPI data. Our initial tests with a major cellular provider in the US show that LEAF consistently outperforms periodic and triggered retraining on complex, real-world data while reducing costly retraining operations.

Funder

NSF

France and Chicago Collaborating in the Sciences program

ANR

Publisher

Association for Computing Machinery (ACM)

Reference65 articles.

1. REFERENCES [1] 2023. LTE cellular network performance indicators daily measurements dataset. https://forms.gle/ g5pbB5qRHeBsEmZJ6. REFERENCES [1] 2023. LTE cellular network performance indicators daily measurements dataset. https://forms.gle/ g5pbB5qRHeBsEmZJ6.

2. AutoGluon AI. accessed July , 2021 . AutoGluon: Auto ML for Text, Image, and Tabular Data . https://auto.gluon.ai/stable/ index.html. AutoGluon AI. accessed July, 2021. AutoGluon: AutoML for Text, Image, and Tabular Data. https://auto.gluon.ai/stable/ index.html.

3. David Alvarez-Melis and Tommi S Jaakkola . 2018. On the robustness of interpretability methods. arXiv preprint arXiv:1806.08049 ( 2018 ). David Alvarez-Melis and Tommi S Jaakkola. 2018. On the robustness of interpretability methods. arXiv preprint arXiv:1806.08049 (2018).

4. Visualizing the effects of predictor variables in black box supervised learning models

5. Interpretable Feedback for AutoML and a Proposal for Domain-customized AutoML for Networking

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3