Manifold Learning and Clustering for Automated Phase Identification and Alignment in Data Driven Modeling of Batch Processes-Reference-Cited by-同舟云学术

Manifold Learning and Clustering for Automated Phase Identification and Alignment in Data Driven Modeling of Batch Processes

Published:2020-11-27 Issue: Volume:2 Page:
ISSN:2673-2718
Container-title:Frontiers in Chemical Engineering
language:
Short-container-title:Front. Chem. Eng.

Author:

Muñoz López Carlos André,Bhonsale Satyajeet,Peeters Kristin,Van Impe Jan F. M.

Abstract

Processing data that originates from uneven, multi-phase batches is a challenge in data-driven modeling. Training predictive and monitoring models requires the data to be in the right shape to be informative. Only then can a model learn meaningful features that describe the deterministic variability of the process. The presence of multiple phases in the data, which display different correlation patterns and have an uneven duration from batch to batch, reduces the performance of the data-driven modeling methods significantly. Therefore, phase identification and alignment is a critical step and can lead to an unsuccessful modeling exercise if not applied correctly. In this paper, a novel approach is proposed to perform unsupervised phase identification and alignment based on the correlation patterns found in the data. Phase identification is performed via manifold learning using t-Distributed Stochastic Neighbor Embedding (t-SNE), which is a state-of-the-art machine learning algorithm for non-linear dimensionality reduction. The application of t-SNE to a reduced cross-correlation matrix of every batch with respect to a reference batch results in data clustering in the embedded space. Models based on support vector machines (SVMs) are trained to, 1) reproduce the manifold learning obtained via t-SNE, and 2) determine the membership of the data points to a process phase. Compared to previously proposed clustering approaches for phase identification, this is an unsupervised, non-linear method. The perplexity parameter of the t-SNE algorithm can be interpreted as the estimated duration of the shortest phase in the process. The advantages of the proposed method are demonstrated through its application on an in-silico benchmark case study, and on real industrial data from two unit-operations in the large scale production of an active pharmaceutical ingredients (API). The efficacy and robustness of the method are evidenced in the successful phase identification and alignment obtained for these three distinct processes, displaying smooth, sudden and repetitive phase changes. Additionally, the low complexity of the method makes feasible its online implementation.

Publisher

Frontiers Media SA

Reference45 articles.

1. Theoretical foundations of the potential function method in pattern recognition learning;Aizerman;Autom. Rem. Contr.,1964

2. Cluster analysis for autocorrelated and cyclic chemical process data;Beaver;Ind. Eng. Chem. Res.,2007

3. A modular simulation package for fed-batch fermentation: penicillin production;Birol;Comput. Chem. Eng.,2002

4. A tutorial on support vector machines for pattern recognition;Burges;Data Min. Knowl. Discov.,1998

5. Rank revealing QR factorizations;Chan;Lin. Algebra Appl.,1987

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Data-Driven Modeling of the Spray Drying Process. Process Monitoring and Prediction of the Particle Size in Pharmaceutical Production;ACS Omega;2024-06-07

2. Data-Driven Process Monitoring and Fault Diagnosis: A Comprehensive Survey;Processes;2024-01-24

3. Recognizing Phases in Batch Production via Interactive Feature Extraction;2022 2nd International Conference on Robotics, Automation and Artificial Intelligence (RAAI);2022-12-09

4. Open benchmarks for assessment of process monitoring and fault diagnosis techniques: A review and critical analysis;Computers & Chemical Engineering;2022-09

5. Online monitoring and fault diagnosis for uneven length batch process based on multi‐way orthogonal enhanced neighborhood preserving embedding;Asia-Pacific Journal of Chemical Engineering;2022-03-16