학회 |
한국화학공학회 |
학술대회 |
2016년 봄 (04/27 ~ 04/29, 부산 BEXCO) |
권호 |
22권 1호, p.330 |
발표분야 |
분리기술 |
제목 |
On nonlinear machine learning methods for Quantitative structure-retention relationships modeling in proteomics |
초록 |
RP-LC-MS/MS is a powerful method widely used in proteomics. Here, proteins are broken into peptides and their spectra are matched with theoretical ones. Retention time is dependent on molecular structure. Its prediction is gaining increasing attention for simultaneous qualitative and quantitative profiling, with Quantitative Structure-Retention Relationships (QSRR) used for their prediction. Since more than 4000 molecular descriptors can be calculated, variable selection is crucial. It was shown that a Genetic Algorithm (GA) coupled with Partial Least Squares (PLS) was superior for developing QSRR models. However, it gave inadequate predictions for large peptides for which relationship between retention time and molecular structure is non-linear. In this work, machine learning methods: Support Vector Regression (SVR), Artificial Neural Networks (ANN), and kernel Partial Least Squares (kPLS) were compared in respect to their predictive ability. GA was used for variable selection. Final models: GA-SVR, GA-ANN, and GA-kPLS were constructed out of subsets of ten variables and compared to previously obtained results. They were also thoroughly validated and their applicability domain was defined. |
저자 |
Petar Zuvela1, 유 준1, Katarzyna Macur2, Tomasz Bączek2
|
소속 |
1부경대, 2Medical Univ. of Gdańsk |
키워드 |
크로마토그래피 |
E-Mail |
|
VOD |
VOD 보기 |
원문파일 |
초록 보기 |