Affiliation:
1. Texas A&M University at Qatar,Qatar
2. Texas A&M University at Qatar, Qatar
Abstract
Measured process data are a valuable source of information about the processes they are collected from. Unfortunately, measurements are usually contaminated with errors that mask the important features in the data and degrade the quality of any related operation. Wavelet-based multiscale filtering is known to provide effective noise-feature separation. Here, the effectiveness of multiscale filtering over conventional low pass filters is illustrated though their application to chemical and biological systems. For biological systems, various online and batch multiscale filtering techniques are used to enhance the quality of metabolic and copy number data. Dynamic metabolic data are usually used to develop genetic regulatory network models that can describe the interactions among different genes inside the cell in order to design intervention techniques to cure/manage certain diseases. Copy number data, however, are usually used in the diagnosis of diseases by determining the locations and extent of variations in DNA sequences. Two case studies are presented, one involving simulated metabolic data and the other using real copy number data. For chemical processes it is shown that multiscale filtering can greatly enhance the prediction accuracy of inferential models, which are commonly used to estimate key process variables that are hard to measure. In this chapter, we present a multiscale inferential modeling technique that integrates the advantages of latent variable regression methods with the advantages of multiscale filtering, and is called Integrated Multiscale Latent Variable Regression (IMSLVR). IMSLVR performance is illustrated via a case study using synthetic data and another using simulated distillation column data.
Reference70 articles.
1. Alqallaf, A., & Tewfik, A. (2007). DNA copy number detection and sigma filter. IEEE International Workshop on Genomic Signal Processing and Statistics, GENSIPS, 1–4. IEEE Press.
2. Amd, P., & Morozov, A. R. (2001). Markov chain Monte Carlo computation of confidence intervals for substitution-rate variation in proteins. Proceedings of Pacific Symposium of Biocomputing, 203–214. Singapore: World Scientific Publishing.
3. Wavelet based fractal analysis of DNA sequences.;A.Arneodo;Physica D. Nonlinear Phenomena,1996
4. What can we learn with wavelets about DNA sequences?
5. Kernel independent component analysis.;F. R.Bach;Journal of Machine Learning Research,2003