Journal of Process Control, Vol.81, 54-64, 2019
K-means Bayes algorithm for imbalanced fault classification and big data application
Fault classification is an important part of process monitoring in industrial processes. Most conventional fault classification methods are under the assumption that the amount of data in different classes are similar. However, in practice, most of the data collected from industrial process are normal data (majority) and only a few of them are fault data (minority). In other words, fault classification can be seen as an imbalanced data classification problem, which has not been considered in this area to date. In this paper, a K-means Bayes algorithm is proposed to deal with the imbalanced fault classification problem. After that, a MapReduce approach is further introduced to implement the method for fault classification in the big data case. Effectiveness of the proposed method is verified through experiments based on Tennessee Eastman (TE) benchmark process. (C) 2019 Elsevier Ltd. All rights reserved.