A predictive noise correction methodology for manufacturing process datasets

Published:2020-10-17 Issue:1 Volume:7
ISSN:2196-1115
Container-title:Journal of Big Data
language:en
Short-container-title:J Big Data

Author:

Oleghe Omogbai

Abstract

In manufacturing processes, datasets intended for data-driven decisions are mostly generated from time-sequenced sensor readings. Industrial sensor systems are prone to transmitting inaccurate readings, which result in noisy datasets. Noisy datasets inhibit machine learning and knowledge discovery. Using a multi-stage, multi-output process dataset as an experimental case, this article reports a methodology for replacing erroneous sensor values with their predicted likely values. In the methodology, invalid values specified by process owners are first converted to missing values. Then, the ReliefF algorithm is used to select the most relevant features to carry forward to prediction modelling, and also to boost the performance of the prediction model. A Random Forest classifier model is built to predict replacement values for the missing values. Finally, predicted values are inserted into the dataset to fill in the missing entries. Because many attributes have a significant number of erroneous values, invalid values are replaced one attribute at a time. To do this systematically, the process flow direction and the stages in the manufacturing process are exploited to partition the dataset into subsets for model building. The results indicate that the methodology is able to replace erroneous values with likely true values with a very high degree of accuracy. There is a paucity of this type of methodology for dealing with invalid entries in process datasets. The methodology is useful for both missing and invalid value correction in process datasets. In the future, the plan is to inject the prediction models into streaming data to simultaneously enable erroneous value correction and predictive process monitoring in real time.
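
The data flow described in the abstract (mark invalid values as missing, fit a model on the complete rows, insert predictions one attribute at a time) can be sketched roughly as follows. This is a minimal illustration, not the article's implementation: the column names, the valid range, and the sample rows are hypothetical, ReliefF feature selection is omitted, and a 1-nearest-neighbour lookup stands in for the Random Forest classifier so that the sketch needs only the standard library.

```python
# Hypothetical valid range, standing in for limits a process owner would specify.
VALID_RANGES = {"temp": (0.0, 100.0)}


def mask_invalid(rows, valid_ranges):
    """Return a copy of rows with out-of-range values replaced by None."""
    masked = []
    for row in rows:
        clean = dict(row)
        for name, (low, high) in valid_ranges.items():
            value = clean.get(name)
            if value is not None and not (low <= value <= high):
                clean[name] = None  # invalid value becomes a missing value
        masked.append(clean)
    return masked


def fill_attribute(rows, target, predictors):
    """Fill missing values of a single attribute.

    Stand-in model (an assumption, not the article's Random Forest):
    copy the target value of the nearest complete row, measured by
    squared distance over the predictor columns.
    """
    train = [
        r for r in rows
        if r[target] is not None and all(r[p] is not None for p in predictors)
    ]
    filled = []
    for row in rows:
        new_row = dict(row)
        usable = all(row[p] is not None for p in predictors)
        if row[target] is None and usable and train:
            nearest = min(
                train,
                key=lambda t: sum((t[p] - row[p]) ** 2 for p in predictors),
            )
            new_row[target] = nearest[target]
        filled.append(new_row)
    return filled


# Hypothetical sample: -999.0 plays the role of a sensor error code.
rows = [
    {"speed": 1.0, "temp": 20.0},
    {"speed": 2.0, "temp": 40.0},
    {"speed": 2.1, "temp": -999.0},
    {"speed": 3.0, "temp": 60.0},
]

masked = mask_invalid(rows, VALID_RANGES)
repaired = fill_attribute(masked, "temp", ["speed"])
```

In a fuller version, `fill_attribute` would be called once per attribute, in an order that follows the process stages, with the predictor columns chosen by a feature-selection step.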

Publisher

Springer Science and Business Media LLC

Subject

Information Systems and Management,Computer Networks and Communications,Hardware and Architecture,Information Systems

Link

https://link.springer.com/content/pdf/10.1186/s40537-020-00367-w.pdf

References (65 articles)

1. Shao J, et al. Automatic weld defect detection based on potential defect tracking in real-time radiographic image sequence. NDT and E Int. 2012;46:14–21.

2. Kim S, et al. Dealing with noise in defect prediction. In: 2011 33rd International Conference on Software Engineering (ICSE). IEEE. 2011.

3. Kaggle. Multi-Stage Continuous-Flow Manufacturing Process. 2020. https://www.kaggle.com/supergus/multistage-continuousflow-manufacturing-process. Accessed 20 Mar 2020.

4. Müller H, Freytag J-C. Problems, methods, and challenges in comprehensive data cleansing. Professoren des Inst. Für Informatik. 2005.

5. Peres RS, et al. Multistage quality control using machine learning in the automotive industry. IEEE Access. 2019;7:79908–16.

Cited by 5 articles.

1. Stochastic deep Koopman model for quality propagation analysis in multistage manufacturing systems;Journal of Manufacturing Systems;2023-12

2. Comprehensive Architecture for Data Quality Assessment in Industrial IoT;2023 19th International Conference on Distributed Computing in Smart Systems and the Internet of Things (DCOSS-IoT);2023-06

3. Path Enhanced Bidirectional Graph Attention Network for Quality Prediction in Multistage Manufacturing Process;IEEE Transactions on Industrial Informatics;2022-02

4. Optimization of Dry Electrical Discharge Machining of Stainless Steel using Big Data Analytics;Procedia CIRP;2022

5. Preparing Datasets of Surface Roughness for Constructing Big Data from the Context of Smart Manufacturing and Cognitive Computing;Big Data and Cognitive Computing;2021-10-25