A Guide to Data Collection for Computation and Monitoring of Node Energy Consumption
-
Published:2023-07-11
Issue:3
Volume:7
Page:130
-
ISSN:2504-2289
-
Container-title:Big Data and Cognitive Computing
-
language:en
-
Short-container-title:BDCC
Author:
del Rio Alberto1ORCID, Conti Giuseppe1ORCID, Castano-Solis Sandra2ORCID, Serrano Javier1ORCID, Jimenez David13ORCID, Fraile-Ardanuy Jesus3ORCID
Affiliation:
1. GATV Research Group, Signals, Systems and Radiocommunications Department, Universidad Politécnica de Madrid, 28040 Madrid, Spain 2. Escuela Técnica Superior de Ingeniería y Diseño Industrial (ETSIDI), Universidad Politécnica de Madrid, 28040 Madrid, Spain 3. Information Processing and Telecommunications Center (IP&T Center), Universidad Politécnica de Madrid, 28040 Madrid, Spain
Abstract
The digital transition that drives the new industrial revolution is largely driven by the application of intelligence and data. This boost leads to an increase in energy consumption, much of it associated with computing in data centers. This fact clashes with the growing need to save and improve energy efficiency and requires a more optimized use of resources. The deployment of new services in edge and cloud computing, virtualization, and software-defined networks requires a better understanding of consumption patterns aimed at more efficient and sustainable models and a reduction in carbon footprints. These patterns are suitable to be exploited by machine, deep, and reinforced learning techniques in pursuit of energy consumption optimization, which can ideally improve the energy efficiency of data centers and big computing servers providing these kinds of services. For the application of these techniques, it is essential to investigate data collection processes to create initial information points. Datasets also need to be created to analyze how to diagnose systems and sort out new ways of optimization. This work describes a data collection methodology used to create datasets that collect consumption data from a real-world work environment dedicated to data centers, server farms, or similar architectures. Specifically, it covers the entire process of energy stimuli generation, data extraction, and data preprocessing. The evaluation and reproduction of this method is offered to the scientific community through an online repository created for this work, which hosts all the code available for its download.
Subject
Artificial Intelligence,Computer Science Applications,Information Systems,Management Information Systems
Reference47 articles.
1. Miraz, M.H., Ali, M., Excell, P.S., and Picking, R. (2018). Internet of Nano-Things, Things and Everything: Future Growth Trends. Future Internet, 10. 2. Tyagi, H. (2023, March 14). Data Repositories for Almost Every Type of Data Science Project. Medium. Available online: https://towardsdatascience.com/data-repositories-for-almost-every-type-of-data-science-project-7aa2f98128b. 3. Research, G.V. (2023, March 14). AI Training Dataset Market Size, Share & Trends Analysis Report By Type (Text, Image/Video, Audio), by Vertical (IT, Automotive, Government, Healthcare, BFSI), by Regions, and Segment Forecasts, 2022–2030. Grand View Research. Available online: https://www.grandviewresearch.com/industry-analysis/ai-training-dataset-market#:~:text=Report%20Overview,e%2Dcommerce%2C%20and%20healthcare. 4. The importance of open data and software: Is energy research lagging behind?;Pfenninger;Energy Policy,2017 5. A critical review of state-of-the-art non-intrusive load monitoring datasets;Iqbal;Electr. Power Syst. Res.,2021
|
|