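Chemical Engineering and Materials Research Information Center (CHERIC)
Society: Korean Institute of Chemical Engineers (KIChE)
Conference: Fall 2005 (October 21-22, Inha University)
Volume/Issue: Vol. 11, No. 2, p. 1778
Session: Process Systems
Title: Supervised framework of gene selection and multiclass classification for high-dimensional biology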
Abstract: Recent advances in microarray technologies have produced large genome-wide gene expression data sets. To extract new knowledge from such high-dimensional biological data, various data mining methods have been explored. In particular, gene selection, classification, and clustering problems have been studied extensively, and the performance of these methods has been demonstrated in binary classification. However, multiclass classification remains a challenging problem in high-dimensional biology. In this study, discriminant partial least squares (DPLS) is applied to select class-relevant genes, and fuzzy c-means clustering with supervised information is then applied to group samples into classes. This paper is particularly interested in incorporating gene selection information from wrapper approaches into clustering (classification) methods. A stepwise procedure that combines weighted fuzzy c-means with a maximal discriminant feature selection method is proposed. Supervised information is provided as feature weights, which are calculated from the variable importance in the projection (VIP) of the DPLS model.
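The two-step procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names `pls_vip` and `weighted_fcm`, the synthetic data, the SVD-based PLS2 fit used as a stand-in for DPLS, and the choice of feature weights as normalized squared VIP scores are all assumptions, since the abstract does not state how the weights are derived from the VIP values.

```python
import numpy as np

def pls_vip(X, Y, n_components=2):
    """VIP scores from a PLS2 fit on class-indicator Y (a simple stand-in for DPLS)."""
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    n_features = X.shape[1]
    W = np.zeros((n_features, n_components))
    ss = np.zeros(n_components)  # Y variation explained by each component
    for a in range(n_components):
        # Unit-norm weight vector: dominant left singular vector of X'Y.
        u, _, _ = np.linalg.svd(X.T @ Y, full_matrices=False)
        w = u[:, 0]
        t = X @ w                    # sample scores
        tt = t @ t
        p = X.T @ t / tt             # X loadings
        q = Y.T @ t / tt             # Y loadings
        X = X - np.outer(t, p)       # deflate X and Y
        Y = Y - np.outer(t, q)
        W[:, a] = w
        ss[a] = (q @ q) * tt
    return np.sqrt(n_features * (W ** 2 @ ss) / ss.sum())

def weighted_fcm(X, n_clusters, feature_weights, m=2.0, n_iter=100, seed=0):
    """Fuzzy c-means with a feature-weighted squared Euclidean distance."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)            # memberships sum to 1 per sample
    for _ in range(n_iter):
        Um = U ** m
        centers = (Um.T @ X) / Um.sum(axis=0)[:, None]
        diff = X[:, None, :] - centers[None, :, :]
        d2 = (diff ** 2 * feature_weights).sum(axis=2) + 1e-12
        inv = d2 ** (-1.0 / (m - 1.0))
        U = inv / inv.sum(axis=1, keepdims=True)
    return U, centers

# Synthetic example (hypothetical data): 60 samples, 50 "genes", 3 classes.
rng = np.random.default_rng(0)
labels = np.repeat([0, 1, 2], 20)
X = rng.normal(size=(60, 50))
X[labels == 1, 0:3] += 3.0   # genes 0-2 separate class 1
X[labels == 2, 3:6] += 3.0   # genes 3-5 separate class 2
Y = np.eye(3)[labels]        # one-hot class indicator matrix

vip = pls_vip(X, Y, n_components=2)
weights = vip ** 2 / np.sum(vip ** 2)   # assumed weighting scheme
U, centers = weighted_fcm(X, n_clusters=3, feature_weights=weights)
predicted = U.argmax(axis=1)            # hard assignment from fuzzy memberships
```

Because the weights enter only through the distance, genes with low VIP contribute little to the memberships, which is one way the supervised information from the DPLS step can steer an otherwise unsupervised clustering.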
Authors: 이민영, 유창규, 이창규, 이인범
Affiliation: Pohang University of Science and Technology (POSTECH)
Keywords: supervised clustering; fuzzy c-means clustering; multiclass classification; gene selection; microarray; DPLS