화학공학소재연구정보센터
Industrial & Engineering Chemistry Research, Vol.48, No.21, 9708-9712, 2009
Quantitative Structure-Property Relationship (QSPR) Prediction of Liquid Viscosities of Pure Organic Compounds Employing Random Forest Regression
A quantitative structure-property relationship (QSPR) approach was used to develop a predictive model for viscosities of pure organic liquids using a set of 403 compounds that belong to diverse classes of organic chemicals. A pool of 116 descriptors that encode topostructural, topochemical, electrotopological, geometrical, and quantum chemical properties of the organic compounds was used to develop QSPR models, based on the robust Random Forest (RF) regression algorithm. The performance of the algorithm, in terms of correlation coefficients and mean square errors, was determined to be good. The capability of the algorithm to build models and select the most-informative features simultaneously is very useful for several quantitative structure-activity/property relationship tasks. The eight most-dominant features selected by the RF regression algorithm primarily contained predictors that encode characteristics of atoms and groups that form hydrogen bonds, as well as factors involving molecular shape and size.