Abstract
Automatic ontology building is a vital issue in many fields where they are currently built manually. This paper presents a user-centred methodology for ontology construction based on the use of Machine Learning and Natural Language Processing. In our approach, the user selects a corpus of texts and sketches a preliminary ontology (or selects an existing one) for a domain with a preliminary vocabulary associated to the elements in the ontology (lexicalisations). Examples of sentences involving such lexicalisation (e.g. ISA relation) in the corpus are automatically retrieved by the system. Retrieved examples are validated by the user and used by an adaptive Information Extraction system to generate patterns that discover other lexicalisations of the same objects in the ontology, possibly identifying new concepts or relations. New instances are added to the existing ontology or used to tune it. This process is repeated until a satisfactory ontology is obtained. The methodology largely automates the ontology construction process and the output is an ontology with an associated trained leaner to be used for further ontology modifications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Fensel, D., F. van Harmelen, I. Horrocks, D.L. McGuiness, P.F. Patel-Schneider, (2001) OIL: An Ontology Infrastructure for the Semantic Web, IEEE Intelligent Systems (16)
Brewster, C. (2002) Techniques for Automated Taxonomy Building: Towards Ontologies for Knowledge Management, in Proceeding of the 5 th Annual CLUK Research Colloquium, Leeds
Ericsson, K. A. & H. A. Simon, (1984) Protocol Analysis: verbal reports as data. Cambridge, Mass.: MIT Press
Brewster, C., F. Ciravegna, and Y. Wilks, (2001) Knowledge Acquisition for Knowledge Management: Position Paper, in Proceeding of the IJCAI-2001 Workshop on Ontology Learning held in conjuction with the 17th International Conference on Artificial Intelligence (IJCAI-01), Seattle, August, 2001
Grefenstette, Gregory, (1994), Explorations in Automatic Thesaurus Discovery, Amsterdam: Kluwer.
Scott, Michael, (1998) Focusing on the Text and Its Key Words, TALC 98 Proceedings, Oxford, Humanities Computing Unit, Oxford University.
Brown, Peter F., Vincent J. Della Pietra, Petere V. DeSouza, Jenefer C. Lai, Robert L. Mercer, (1992) Class-based n-gram models of natural language, Comp. Linguistics, (18)
Sanderson, Mark, and Bruce Croft, (1999) Deriving concept hierarchies from text, in Proceedings of the 22nd ACM SIGIR Conference
Wilks, Y. (2000) Artificial Intelligence and Information Retrieval, in Proceedings of the IEE Symposium on Information Retrieval and Artificial Intelligence, Glasgow.
Gomez-Perez, A., (1999) Evaluation of Taxonomic Knowledge in Ontologies and Knowledge Bases, in Proceedings of the 12 th Banff Knowledge Acquisition for Knowledge-Based Systems Workshop, Banff, Alberta, Canada
Hearst, M.A., (1992) Automatic Acquisition of Hyponyms from Large Text Corpora, in Proceedings of COLING 92, Nantes
Morin, Emmanuel, (1999b), Using Lexico-Syntactic patterns to Extract Semantic Relations between Terms from Technical Corpus, in Proceedings of TKE 99, Innsbruck, Austria
Ciravegna, Fabio, Alexiei Dingli, Daniela Petrelli, (2002) Document Annotation via Adaptive Information Extraction, Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval August 11-15, 2002, Tampere, Finland.
GATE, http://www.gate.ac.uk/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Brewster, C., Ciravegna, F., Wilks, Y. (2002). User-Centred Ontology Learning for Knowledge Management. In: Andersson, B., Bergholtz, M., Johannesson, P. (eds) Natural Language Processing and Information Systems. NLDB 2002. Lecture Notes in Computer Science, vol 2553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36271-1_18
Download citation
DOI: https://doi.org/10.1007/3-540-36271-1_18
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00307-6
Online ISBN: 978-3-540-36271-5
eBook Packages: Springer Book Archive