Affiliation:
1. School of Electrical and Computer Engineering, National Technical University of Athens, 15780 Athens, Greece
Abstract
Today, video analytics are becoming extremely popular due to the increasing need for extracting valuable information from videos available in public sharing services through camera-driven streams in IoT environments. To avoid data communication overheads, a common practice is to have computation close to the data source rather than Cloud offloading. Typically, video analytics are organized as separate tasks, each with different resource requirements (e.g., computational- vs. memory-intensive tasks). The serverless computing paradigm forms a promising approach for mapping such types of applications, enabling fine-grained deployment and management in a per-function, and per-device manner. However, there is a tradeoff between QoS adherence and resource efficiency. Performance variability due to function co-location and prevalent resource heterogeneity make maintaining QoS challenging. At the same time, resource efficiency is essential to avoid waste, such as unnecessary power consumption and CPU reservation. In this paper, we present Darly, a QoS-, interference- and heterogeneity-aware Deep Reinforcement Learning-based Scheduler for serverless video analytics deployments on top of distributed Edge nodes. The proposed framework incorporates a DRL agent that exploits performance counters to identify the levels of interference and the degree of heterogeneity in the underlying Edge infrastructure. It combines this information along with user-defined QoS requirements to improve resource allocations by deciding the placement, migration, or horizontal scaling of serverless functions. We evaluate Darly on a typical Edge cluster with a real-world workflow composed of commonly used serverless video analytics functions and show that our approach achieves efficient scheduling of the deployed functions by satisfying multiple QoS requirements for up to 91.6% (Profile-based) of the total requests under dynamic conditions.
Reference66 articles.
1. A review on computational intelligence techniques in cloud and edge computing;Asim;IEEE Trans. Emerg. Top. Comput. Intell.,2020
2. Lu, Y., Chowdhery, A., and Kandula, S. (2016, January 5–7). Optasia: A relational platform for efficient large-scale video analytics. Proceedings of the 7th ACM Symposium on Cloud Computing, Santa Clara, CA, USA.
3. Retail in high definition: Monitoring customer assistance through video analytics;Musalem;Manuf. Serv. Oper. Manag.,2021
4. Baresi, L., Filgueira Mendonça, D., and Garriga, M. (2017, January 27–29). Empowering low-latency applications through a serverless edge computing architecture. Proceedings of the Service-Oriented and Cloud Computing: 6th IFIP WG 2.14 European Conference, ESOCC 2017, Oslo, Norway. Proceedings 6.
5. Lyu, X., Cherkasova, L., Aitken, R., Parmer, G., and Wood, T. (2022, January 5–8). Towards efficient processing of latency-sensitive serverless dags at the edge. Proceedings of the 5th International Workshop on Edge Systems, Analytics and Networking, Rennes, France.