화학공학소재연구정보센터
학회 한국화학공학회
학술대회 2016년 봄 (04/27 ~ 04/29, 부산 BEXCO)
권호 22권 1호, p.330
발표분야 분리기술
제목 On nonlinear machine learning methods for Quantitative structure-retention relationships modeling in proteomics
초록 RP-LC-MS/MS is a powerful method widely used in proteomics. Here, proteins are broken into peptides and their spectra are matched with theoretical ones. Retention time is dependent on molecular structure. Its prediction is gaining increasing attention for simultaneous qualitative and quantitative profiling, with Quantitative Structure-Retention Relationships (QSRR) used for their prediction. Since more than 4000 molecular descriptors can be calculated, variable selection is crucial. It was shown that a Genetic Algorithm (GA) coupled with Partial Least Squares (PLS) was superior for developing QSRR models. However, it gave inadequate predictions for large peptides for which relationship between retention time and molecular structure is non-linear. In this work, machine learning methods: Support Vector Regression (SVR), Artificial Neural Networks (ANN), and kernel Partial Least Squares (kPLS) were compared in respect to their predictive ability. GA was used for variable selection. Final models: GA-SVR, GA-ANN, and GA-kPLS were constructed out of subsets of ten variables and compared to previously obtained results. They were also thoroughly validated and their applicability domain was defined.
저자 Petar Zuvela1, 유 준1, Katarzyna Macur2, Tomasz Bączek2
소속 1부경대, 2Medical Univ. of Gdańsk
키워드 크로마토그래피
E-Mail
VOD VOD 보기
원문파일 초록 보기