Chemical Engineering Communications, Vol.194, No.3, 382-397, 2007
Analysis and detection of outliers and systematic errors in industrial plant data
This article describes the analysis of industrial process data to detect outliers and systematic errors. Data reconciliation is an important step in adjusting mathematical models to plant data. The quality of the data directly affects the quality of adjustment of the model for modeling, simulation, and optimization purposes. To detect these errors in a multivariable system is not an easy task. If the origin of the abnormal values is known, these values can be immediately discarded. On the other hand, if an error or an extreme observation is not clearly justified, the decision whether or not to discard these values must be based on statistical analysis. In this work, in addition to process knowledge, the methodology employed involves an approach based on statistical analysis, first-principle equations, neural network models, and a composite of these. The neural network based approach was used to represent the process in order to classify similar inputs and outputs, i.e., to identify clusters. The elimination of gross errors was performed by the similarity principle or by hypothesis testing for means. The system studied is the Isoprene Production Unit of BRASKEM, the largest Brazilian petrochemical plant. The analysis of the process was undertaken by using a one-year database. The frequency of the data collection of the monitoring variables was 15 minutes.