Journal of Process Control, Vol.32, 25-37, 2015
Robust semi-supervised mixture probabilistic principal component regression model development and application to soft sensors
Traditional data-based soft sensors are constructed with equal numbers of input and output data samples, meanwhile, these collected process data are assumed to be clean enough and no outliers are mixed. However, such assumptions are too strict in practice. On one hand, those easily collected input variables are sometimes corrupted with outliers. On the other hand, output variables, which also called quality variables, are usually difficult to obtain. These two problems make traditional soft sensors cumbersome. To deal with both issues, in this paper, the Student's t distributions are used during mixture probabilistic principal component regression modeling to tolerate outliers with regulated heavy tails. Furthermore, a semi-supervised mechanism is incorporated into traditional probabilistic regression so as to deal with the unbalanced modeling issue. For simulation, two case studies are provided to demonstrate robustness and reliability of the new method. (C) 2015 Elsevier Ltd. All rights reserved.
Keywords:Soft sensor;Outliers;Semi-supervised learning;Student's t distribution;Mixture latent variable models