Abstract
In this paper we present a methodology for evaluating the confidence in the prediction of a physicochemical or biological property. Identifying unreliable compounds’ predictions is crucial for the modern drug discovery process.This task is accomplished by the combination of the method of prediction with a self-organizing map. In this way, the method is able to segregate unconfident predictions as well as confident predictions. We applied the method to four different data sets, and we obtained significant differences in the average predictions of our segregation. This approach constitutes a novel way for evaluating confidence, since it not only looks for extrapolation situations but also it identifies interpolation problems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Augen, J.: The Evolving Role of Information Technology in the Drug Discovery Process. Drug Discov. Today 7, 315–323 (2002)
Barril, X., Hubbard, R.E., Morley, S.D.: Virtual Screening in Structure-Based Drug Discovery. Mini Rev. Med. Chem. 4, 779–791 (2004)
van de Waterbeemd, H., Gifford, E.: ADMET in Silico Modelling: Towards Prediction Paradise? Nat. Rev. Drug Discov. 2, 192–204 (2003)
Todeschini, R., Consonni, V.: Handbook of Molecular Descriptors. Wiley–VCH, Weinheim (2000)
Yap, C.W., Li, H., Ji, Z.L., Chen, Y.Z.: Regression Methods for Developing QSAR and QSPR Models to Predict Compounds of Specific Pharmacodynamic, Pharmacokinetic and Toxicological Properties. Mini Rev. Med. Chem. 7, 1097–1107 (2007)
Tetko, I.V., Bruneau, P., Mewes, H.-W., Rohrer, D., Poda, G.: Can We Estimate the Accuracy of ADME-Tox Predictions? Drug Discov. Today 11, 700–707 (2006)
Jónsdóttir, S.Ó., Jørgensen, F.S., Brunak, S.: Prediction Methods and Databases within Chemoinformatics: Emphasis on Drugs and Drug Candidates. Bioinformatics 21, 2145–2160 (2005)
Jaworska, J., Nikolova-Jeliazkova, N., Aldenberg, T.: QSAR Applicabilty Domain Estimation by Projection of the Training Set Descriptor Space: A Review. Altern. Lab. Anim. 33, 445–459 (2005)
Konovalov, D.A., Sim, N., Deconinck, E., Heyden, Y.V., Coomans, D.: Statistical Confidence for Variable Selection in QSAR Models Via Monte Carlo Cross-Validation. J. Chem. Inf. Model 48, 370–383 (2008)
Yaffe, D., Cohen, Y., Espinosa, G., Arenas, A., Giralt, F.: Fuzzy ARTMAP and Back-Propagation Neural Networks Based Quantitative Structure - Property Relationships (QSPRs) for Octanol-Water Partition Coefficient of Organic Compounds. J. Chem. Inf. Comput. Sci. 42, 162–183 (2002)
The Physical Properties Database (PHYSPROP) is marketed by Syracuse Research Corporation (SRC), http://www.syrres.com/esc/
Soto, A.J., Cecchini, R.L., Vazquez, G.E., Ponzoni, I.: A Wrapper-Based Feature Selection Method for ADMET Prediction Using Evolutionary Computing. In: Marchiori, E., Moore, J.H. (eds.) EvoBIO 2008. LNCS, vol. 4973, pp. 188–199. Springer, Heidelberg (2008)
Kohonen, T.: Self-Organizing Maps, vol. II. Springer, Heidelberg (1997)
Winkler, D.A.: Neural Networks in ADME and Toxicity Prediction. Drug. Future 29, 1043–1057 (2004)
De Maesschalck, R., Jouan-Rimbaud, D., Massart, D.L.: The Mahalanobis Distance. Chemometr. Intell. Lab. Syst. 50, 1–18 (2002)
Qin, S.J.: Statistical Process Monitoring: Basics and Beyond. J. Chemometr. 17, 480–502 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Soto, A.J., Ponzoni, I., Vazquez, G.E. (2009). Segregating Confident Predictions of Chemicals’ Properties for Virtual Screening of Drugs. In: Omatu, S., et al. Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living. IWANN 2009. Lecture Notes in Computer Science, vol 5518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02481-8_153
Download citation
DOI: https://doi.org/10.1007/978-3-642-02481-8_153
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02480-1
Online ISBN: 978-3-642-02481-8
eBook Packages: Computer ScienceComputer Science (R0)