Abstract
We present a novel application of inductive logic programming (ILP) in the area of quantitative structure-activity relationships (QSARs). The activity we want to predict is the biodegradability of chemical compounds in water. In particular, the target variable is the half-life in water for aerobic aqueous biodegradation. Structural descriptions of chemicals in terms of atoms and bonds are derived from the chemicals’ SMILES encodings. Definition of substructures are used as background knowledge. Predicting biodegradability is essentially a regression problem, but we also consider a discretized version of the target variable. We thus employ a number of relational classification and regression methods on the relational representation and compare these to propositional methods applied to different propositionalisations of the problem. Some expert comments on the induced theories are also given.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Blockeel, H. and De Raedt, L. 1998. Top-down induction of first order logical decision trees. Artificial Intelligence, 101(1–2): 285–297.
Boethling, R.S., and Sabljic, A. 1989. Screening-level model for aerobic biodegradability based on a survey of expert knowledge. Environ. Sci. Technol. 23: 672–679.
Breiman, L., Friedman, J.H., Olshen, R.A., and Stone, C.J. 1984. Classification and Regression Trees. Wadsworth, Belmont.
Cambon, B., and Devillers, J. 1993. New trends in structure-biodegradability relationships. Quant. Struct. Act. Relat. 12(1): 49–58.
Clark, P., and Boswell, R. 1991. Rule induction with CN2: some recent improvements. In Proc. 5th European Working Session on Learning, pages 151–163. Springer, Berlin.
Cohen W. 1995. Fast effective rule induction. In Proc. 12th Intl. Conf. on Machine Learning, pages 115–123. Morgan Kaufmann, San Mateo, CA.
Dehaspe, L., and Toivonen, H. 1999. Frequent query discovery: a unifying ILP approach to association rule mining. Data Mining and Knowledge Discovery.
De Raedt, L., and Van Laer, W. 1995. Inductive constraint logic. In Proc. 6th Intl. Workshop on Algorithmic Learning Theory, pages 80–94. Springer, Berlin.
Gamberger, D., Sekuak, S., and Sabljič, A. 1993. Modelling biodegradation by an example-based learning system. Informatica 17: 157–166.
Howard, P.H., Boethling, R.S., Jarvis, W.F., Meylan, W.M., and Michalenko, E.M. 1991. Handbook of Environmental Degradation Rates. Lewis Publishers.
Howard, P.H., Boethling, R.S., Stiteler, W.M., Meylan, W.M., Hueber, A.E., Beauman, J.A., and Larosche, M.E. 1992. Predictive model for aerobic biodegradability developed from a file of evaluated biodegradation data. Environ. Toxicol. Chem. 11: 593–603.
Howard, P. and Meylan, W. 1992. User’s Guide for the Biodegradation Probability Program, Ver. 3. Syracuse Res. Corp., Chemical Hazard Assessment Division, Environmental Chemistry Center, Syracuse, NY 13210, USA.
King, R.D., Muggleton, S.H., Lewis, R.A., and Sternberg, M.J.E. 1992. Drug design by machine learning: the use of inductive logic programming to model the structure-activity relationship of trimethoprim analogues binding to dihydrofolate reductase. Proc. National Academy of Sciences USA, 89: 11322–11326.
Kompare, B. 1995. The use of artificial intelligence in ecological modelling. Ph.D. Thesis, Royal Danish School of Pharmacy, Copenhagen, Denmark.
Kramer, S. 1996. Structural regression trees. In Proc. 13th Natl. Conf. on Artificial Intelligence, pages 812–819. AAAI Press/The MIT Press.
Quinlan, J.R. 1993a. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA.
Quinlan, J.R. 1993b. Combining instance-based and model-based learning. In Proc. 10th Intl. Conf. on Machine Learning, pages 236–243. Morgan Kaufmann, San Mateo, CA.
Quinlan, J.R. 1996. Learning first-order definitions of functions. Journal of Artificial Intelligence Research, 5:139–161.
Srinivasan, A., Muggleton, S.H., Sternberg, M.J.E., and King, R.D. 1996. Theories for mutagenicity: A study in first-order and feature-based induction. Artificial Intelligence 85(1–2): 277–299.
Srinivasan, A., King, R.D., Muggleton, S.H., and Sternberg, M.J.E. 1997. The predictive toxicology evaluation challenge. In Proc. 15th Intl. Joint Conf. on Artificial Intelligence, pages 4–9. Morgan Kaufmann, San Mateo, CA.
Wang, Y., and Witten, I.H. 1997. Inducing model trees for continuous classes. In Poster Papers-9th European Conf. on Machine Learning, pages 128–137. Prague, Czech Republic. URL: http://www.cs.waikato.ac.nz/~ml/publications.html.
Weininger D. 1988. SMILES, a Chemical and Information System. 1. Introduction to Methodology and Encoding Rules. J. Chem. Inf. Comput. Sci. 28(1): 31–6.
Zitko, V. 1991. Prediction of biodegradability of organic chemicals by an artificial neural network. Chemosphere, Vol. 23, No. 3: 305–312.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Džeroski, S., Blockeel, H., Kompare, B., Kramer, S., Pfahringer, B., Van Laer, W. (1999). Experiments in Predicting Biodegradability. In: Džeroski, S., Flach, P. (eds) Inductive Logic Programming. ILP 1999. Lecture Notes in Computer Science(), vol 1634. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48751-4_9
Download citation
DOI: https://doi.org/10.1007/3-540-48751-4_9
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66109-2
Online ISBN: 978-3-540-48751-7
eBook Packages: Springer Book Archive