International Journal of Molecular Sciences, Vol.12, No.7, 4348-4364, 2011
Predictivity Approach for Quantitative Structure-Property Models. Application for Blood-Brain Barrier Permeation of Diverse Drug-Like Compounds
The goal of the present research was to present a predictivity statistical approach applied on structure-based prediction models. The approach was applied to the domain of blood-brain barrier (BBB) permeation of diverse drug-like compounds. For this purpose, 15 statistical parameters and associated 95% confidence intervals computed on a 2 x 2 contingency table were defined as measures of predictivity for binary quantitative structure-property models. The predictivity approach was applied on a set of compounds comprised of 437 diverse molecules, 122 with measured BBB permeability and 315 classified as active or inactive. A training set of 81 compounds (similar to 2/3 of 122 compounds assigned randomly) was used to identify the model and a test set of 41 compounds was used as the internal validation set. The molecular descriptor family on vertices cutting was the computation tool used to generate and calculate structural descriptors for all compounds. The identified model was assessed using the predictivity approach and compared to one model previously reported. The best-identified classification model proved to have an accuracy of 69% in the training set (95% CI [58.53-78.37]) and of 73% in the test set (95% CI [58.32-84.77]). The predictive accuracy obtained on the external set proved to be of 73% (95% CI [67.58-77.39]). The classification model proved to have better abilities in the classification of inactive compounds (specificity of similar to 74% [59.20-85.15]) compared to abilities in the classification of active compounds (sensitivity of similar to 64% [48.47-77.70]) in the training and external sets. The overall accuracy of the previously reported model seems not to be statistically significantly better compared to the identified model (similar to 81% [71.45-87.80] in the training set, similar to 93% [78.12-98.17] in the test set and similar to 79% [70.19-86.58] in the external set). In conclusion, our predictivity approach allowed us to characterize the model obtained on the investigated set of compounds as well as compare it with a previously reported model. According to the obtained results, the reported model should be chosen if a correct classification of inactive compounds is desired and the previously reported model should be chosen if a correct classification of active compounds is most wanted.
Keywords:in silico prediction;partition-coefficient;blood-brain barrier (BBB);permeation;structure-property relationship (SPR);molecular descriptors family on vertices cutting (MDFV)