Peer-to-peer energy trading in a community based on deep reinforcement learning

Published:2023-11-01 Issue:6 Volume:15 Page:
ISSN:1941-7012
Container-title:Journal of Renewable and Sustainable Energy
language:en

Author:

Wang Yiqun¹, Yang Qingyu², Li Donghe³

Affiliation:

1. Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China

2. SKLMSE Laboratory, Faculty of Electronic and Information Engineering, and the MOE Key Laboratory for Intelligent Networks and Network Security, Xi'an Jiaotong University, Xi'an 710049, China

3. Faculty of Electronic and Information Engineering and the MOE Key Laboratory for Intelligent Networks and Network Security, Xi'an Jiaotong University, Xi'an 710049, China

Abstract

With the large-scale integration of distributed energy resources, an increasing number of users have become prosumers who produce, store, and consume electric energy. Peer-to-peer (P2P) energy trading, a new mechanism that allows direct energy transactions between prosumers, is becoming increasingly widespread. A crucial open problem is how to determine a trading strategy for prosumers participating in P2P energy trading that satisfies multiple optimization objectives simultaneously. To this end, this paper introduces a demand response mechanism and applies a dissatisfaction function to represent the electricity consumption of prosumers. The mid-market rate price is adopted to attract more prosumers to participate in P2P energy trading. The P2P energy trading process among multiple prosumers in the community is formulated as a Markov decision process. We design a deep reinforcement learning (DRL) method to solve for the optimal trading policy of prosumers. DRL autonomously learns optimal strategies through continual interaction with the environment. Additionally, the deep deterministic policy gradient algorithm is well suited to the continuous and intricate decision problems that arise in the P2P energy trading market. Through careful construction of the reinforcement learning environment, this paper achieves multi-objective collaborative optimization. Simulation results show that the proposed algorithm and model reduce costs by 16.5% compared with direct transactions between prosumers and the grid, and can effectively decrease the dependence of prosumers on the main grid.
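As an illustration of the mid-market rate pricing mentioned in the abstract, the following is a minimal sketch of one common formulation from the P2P trading literature. It is not necessarily the exact variant used in the paper, which this record does not specify. The function name `mid_market_rate` and its parameters are hypothetical. The sketch assumes a grid import price `grid_buy`, a grid export price `grid_sell`, and community-wide generation and demand totals for one time step.

```python
import math


def mid_market_rate(grid_buy, grid_sell, total_gen, total_demand):
    """Return (p2p_buy_price, p2p_sell_price) for one time step.

    Assumed textbook rule, not taken from the paper:
    - balanced community: both prices equal the midpoint of the grid prices;
    - surplus generation: buyers pay the midpoint, sellers receive a blend of
      the midpoint (energy sold to peers) and grid_sell (energy exported);
    - deficit: sellers receive the midpoint, buyers pay a blend of the
      midpoint (energy bought from peers) and grid_buy (energy imported).
    """
    mid = (grid_buy + grid_sell) / 2.0
    if math.isclose(total_gen, total_demand):
        return mid, mid
    if total_gen > total_demand:
        surplus = total_gen - total_demand
        sell = (total_demand * mid + surplus * grid_sell) / total_gen
        return mid, sell
    deficit = total_demand - total_gen
    buy = (total_gen * mid + deficit * grid_buy) / total_demand
    return buy, mid


# Hypothetical prices in currency units per kWh, energy in kWh.
buy_price, sell_price = mid_market_rate(0.2, 0.1, 20.0, 10.0)
```

Under this rule the P2P buy price never exceeds the grid import price and the P2P sell price never falls below the grid export price, which is the usual argument for why such a scheme attracts prosumers to trade with each other.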

Funder

National Natural Science Foundation of China

China Postdoctoral Science Foundation

Publisher

AIP Publishing

Subject

Renewable Energy, Sustainability and the Environment

Link

https://pubs.aip.org/aip/jrse/article-pdf/doi/10.1063/5.0172713/18267490/065501_1_5.0172713.pdf

References (41 articles)

1. A review of photovoltaic systems: Design, operation and maintenance; Sol. Energy, 2019

2. A survey on smart grid technologies and applications; Renewable Energy, 2020

3. Smart grid architecture model for control, optimization and data analytics of future power networks with more renewable energy; J. Cleaner Prod., 2021

4. Empowering smart grid: A comprehensive review of energy storage technology and application with renewable energy integration; J. Energy Storage, 2021

5. Distributed energy and microgrids (DEM); Appl. Energy, 2018