Automatic P2P Energy Trading Model Based on Reinforcement Learning Using Long Short-Term Delayed Reward-Reference-Cited by-同舟云学术

Automatic P2P Energy Trading Model Based on Reinforcement Learning Using Long Short-Term Delayed Reward

Published:2020-10-14 Issue:20 Volume:13 Page:5359
ISSN:1996-1073
Container-title:Energies
language:en
Short-container-title:Energies

Author:

Kim Jin-Gyeom,Lee Bowon^ORCID

Abstract

Automatic peer-to-peer energy trading can be defined as a Markov decision process and designed using deep reinforcement learning. We consider prosumer as an entity that consumes and produces electric energy with an energy storage system, and define the prosumer’s objective as maximizing the profit through participation in peer-to-peer energy trading, similar to that of the agents in stock trading. In this paper, we propose an automatic peer-to-peer energy trading model by adopting a deep Q-network-based automatic trading algorithm originally designed for stock trading. Unlike in stock trading, the assets held by a prosumer may change owing to factors such as the consumption and generation of energy by the prosumer in addition to the changes from trading activities. Therefore, we propose a new trading evaluation criterion that considers these factors by defining profit as the sum of the gains from four components: electricity bill, trading, electric energy stored in the energy storage system, and virtual loss. For the proposed automatic peer-to-peer energy trading algorithm, we adopt a long-term delayed reward method that evaluates the delayed reward that occurs once per month by generating the termination point of an episode at each month and propose a long short-term delayed reward method that compensates for the issue with the long-term delayed reward method having only a single evaluation per month. This long short-term delayed reward method enables effective learning of the monthly long-term trading patterns and the short-term trading patterns at the same time, leading to a better trading strategy. The experimental results showed that the long short-term delayed reward method-based energy trading model achieves higher profits every month both in the progressive and fixed rate systems throughout the year and that prosumer participating in the trading not only earns profits every month but also reduces loss from over-generation of electric energy in the case of South Korea. Further experiments with various progressive rate systems of Japan, Taiwan, and the United States as well as in different prosumer environments indicate the general applicability of the proposed method.

Funder

Korea Electric Power Corporation

Ministry of Science and ICT, South Korea

Publisher

MDPI AG

Subject

Energy (miscellaneous),Energy Engineering and Power Technology,Renewable Energy, Sustainability and the Environment,Electrical and Electronic Engineering,Control and Optimization,Engineering (miscellaneous)

Link

https://www.mdpi.com/1996-1073/13/20/5359/pdf

Reference40 articles.

1. Automated Negotiation for Peer-to-Peer Electricity Trading in Local Energy Markets

2. Peer to Peer Energy Trading with Electric Vehicles

3. Using peer-to-peer energy-trading platforms to incentivize prosumers to form federated power plants

4. Transforming Energy Networks via Peer-to-Peer Energy Trading: The Potential of Game-Theoretic Approaches

5. Peer-to-peer (P2P) electricity trading in distribution systems of the future

Cited by 29 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Centralised rehearsal of decentralised cooperation: Multi-agent reinforcement learning for the scalable coordination of residential energy flexibility;Applied Energy;2025-01

2. P2P power trading based on reinforcement learning for nanogrid clusters;Expert Systems with Applications;2024-12

3. Optimal load forecasting and scheduling strategies for smart homes peer-to-peer energy networks: A comprehensive survey with critical simulation analysis;Results in Engineering;2024-06

4. A Review of Peer-to-Peer Energy Trading Markets: Enabling Models and Technologies;Energies;2024-04-02

5. Network analysis in a peer-to-peer energy trading model using blockchain and machine learning;Computer Standards & Interfaces;2024-03