Synthetic Data Generation Using Generative Adversarial Network for Burst Failure Risk Analysis of Oil and Gas Pipelines

Author:

Mazumder Ram Krishna1,Modanwal Gourav2,Li Yue3

Affiliation:

1. Arcadis U.S. Inc., 222 S. Main Street , Akron, OH 44308

2. Emory University, 201 Dowman Dr., Atlanta, GA 30322

3. Leonard Case Jr. Professor in Engineering, Department of Civil and Environmental Engineering, Case Western Reserve University , Cleveland, OH 44106

Abstract

Abstract Despite the pipeline network being the safest mode of oil and gas transportation systems, the pipeline failure rate has increased significantly over the last decade, particularly for aging pipelines. Predicting failure risk and prioritizing the riskiest asset from a large set of pipelines is one of the demanding tasks for the utilities. Machine learning (ML) application in pipeline failure risk prediction has recently shown promising results. However, due to safety and security concerns, obtaining sufficient operation and failure data to train ML models accurately is a significant challenge. This study employed a Generative Adversarial Network (GAN) based framework to generate synthetic pipeline data (DSyn) using a subset (70%) of experimental burst test results data (DExp) compiled from the literature to overcome the limitation of accessing operational data. The proposed framework was tested on (1) real data, and (2) combined real and generated synthetic data. The burst failure risk of corroded oil and gas pipelines was determined using probabilistic approaches, and pipelines were classified into two classes depending on their probability of failure: (1) low failure risk (Pf: 0–0.5) and (2) high failure risk (Pf: >0.5). Two random forest (RF) models (MExp and MComb) were trained using a subset of 70% of actual experimental pipeline data, (DExp) and a combination of 70% of actual experimental and 100% of synthetic data, respectively. These models were validated on the remaining subset (30%) of experimental test data. The validation results reveal that adding synthetic data can further improve the performance of the ML models. The area under the ROC Curve was found to be 0.96 and 0.99 for real model (MExp) and combined model (MComb) data, respectively. The combined model with improved performance can be used in strategic oil and gas pipeline resilience improvement planning, which sets long-term critical decisions regarding maintenance and potential replacement of pipes.

Publisher

ASME International

Subject

Mechanical Engineering,Safety Research,Safety, Risk, Reliability and Quality

Reference72 articles.

1. Building Portfolio Fragility Functions to Support Scalable Community Resilience Assessment;Sustainable Resilient Infrastruct.,2016

2. Machine Learning for Risk and Resilience Assessment in Structural Engineering: Progress and Future Trends;J. Struct. Eng.,2022

3. A Review of Failure Prediction Models for Oil and Gas Pipelines;J. Pipeline Syst. Eng. Pract.,2020

4. Applications of Machine Learning in Pipeline Integrity Management: A State-of-the-Art Review;Int. J. Pressure Vessels Piping,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3