IMIBIO-SL   20937
INSTITUTO MULTIDISCIPLINARIO DE INVESTIGACIONES BIOLOGICAS DE SAN LUIS
Unidad Ejecutora - UE
artículos
Título:
Application of k-means Clustering, Linear Discriminant Analysis and Multivariate Linear Regression for the development of a predictive QSAR model on 5-lipoxygenase inhibitors
Autor/es:
ANDRADA MATIAS; ESTEBAN GABRIEL VEGA HISSI; ESTRADA MARIO R.; GARRO MARTINEZ, JUAN C.
Revista:
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS
Editorial:
ELSEVIER SCIENCE BV
Referencias:
Año: 2015 vol. 143 p. 122 - 129
ISSN:
0169-7439
Resumen:
In this work, we performed a quantitative structure activity relationship (QSAR) model for a family of 5-lipoxygenase (5-LOX) inhibitors using k-means clustering and linear discriminant analysis (LDA) for theselection of training and test sets and multivariate linear regression (MLR) for the independent variable selection.With the k-means clustering method, the total set of compounds (58 derivatives of 5-Benzylidene-2-phenylthiazolinones) was divided in two clusters according to a simple discriminant function. We found thatpiID (conventional bond order ID number) molecular descriptor discriminates correctly 100% of the compoundsof each clusters. Thirty different models divided in three series were analyzed and the series with representativetraining and test sets (series 3) had the most predictive models. The statistical parameters of the best modelare Rtrain = 0.811 and Rtest = 0.801. We found that a rational selection in the setting-up of training and testsets allows to obtain the most predictive models and the random selection is sometimes unsuitable, especially,when the total set of compounds can be classified in different clusters according to structural features.