Dokumentenmodell und automatische Klassifikation im Bürodokumentenarchiv MULTOS

Eirund, Helmut; Kreplin, Klaus

doi:10.1007/978-3-642-72617-0_6

Helmut Eirund⁴ &
Klaus Kreplin⁴

Part of the book series: Informatik-Fachberichte ((INFORMATIK,volume 136))

77 Accesses
1 Citations

Zusammenfassung

Für ein Bürodokumentenarchiv wurde ein Dokumentenmodell entwickelt, das die Beschreibung der im Dokument vorkommenden Konzepte vorsieht. Durch Gruppierung und Spezialisierung dieser Beschreibungen gelangt man zu einer Menge hierarchisch angeordneter Dokumenttypen, vergleichbar mit einem erweiterten Datenbankschema. Die Typ-Dokument-Zuordnung bildet eine Zugriffsstruktur, die die Bearbeitung von Anfragen über semantische Einheiten, Konzepte, anstelle von syntaktischen Elementen ermöglicht. Zur Unterstützung des Einfügens von Dokumenten dient eine wissensbasierte Klassifikationskomponente, die die konzeptuelle Beschreibung eines Dokuments automatisch erzeugt und das Dokument einem passenden Typ zuordnet. Die Klassifikation wird durch die Typhierarchie gesteuert, wobei der relevante Inhalt jeder konzeptuellen Komponente über einen Satz von inhaltsbeschreibenden Prädikaten definiert wird.

Abstract

To describe the conceptual components of documents in an office document archive, a document model is presented. By grouping and generalizing these descriptions we get a set of hierarchically structured document types that can be compared with an extended data base schema. With the type-document relation an additional access structure is established that provides the evaluation of queries on semantic units (concepts) rather than on syntactic elements. A knowledge based classification system automatically generates the conceptual description of a document to be stored by means of content analysis and associates the document to an appropriate type. This task is conducted by the type hierarchy where the relevant content for each conceptual component of a type is defined by a set of content description predicates.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 49.99; Price excludes VAT (USA)

Softcover Book: USD 59.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Literaturverzeichnis

F. Barbie and F. Rabitti, “The Type Concept in Office Document Retrieval,” Proc. llth Conference on Very Large Data Bases, Stockholm, 1985.
Google Scholar
E. Bertino, S. Gibbs, F. Rabitti, C. Thanos, and D. Tsichritzis, “A Multimedia File Server,” Proc. 6th Advanced Database Symposium, Information Processing Society of Japan, 1986.
Google Scholar
R. J. Brachman and J. G. Schmölze, “An Overview of the KL-ONE Knowledge Representation System,” Cognitive Science, vol. 9, 2, 1985.
Article Google Scholar
S. Christodoulakis, “Framework for the Development of an Experimental Mixed-Mode Message System,” Proc. 3rd Joint BCS and ACM Symposium Research and Development in Information Retrieval, Cambridge University Press, Cambridge, 1984.
Google Scholar
W. B. Croft, “User-specific Domain Knowledge for Document Retrieval,” Proc. ACM Conf. on Research and Development in Information Retrieval, ACM, Pisa, 1986.
Google Scholar
ECMA, Office Document Architecture, Standard 101, European Com-puter Manufacturers Association, September 1985.
Google Scholar
C. Faloutsos, “Access Methods for Text,” Computing Surveys, vol. 17, 1, ACM, 1985.
Article Google Scholar
R. Furuta, J. Scofield, and A. Shaw, “Document Formatting Systems: Survey, Concepts, and Issues,” ACM Computing Surveys, vol. 14,3, ACM, 1982.
Google Scholar
S. Gallelli, C. Iacobelli, and P. Marchisio, “An Approach to Multimedia Information Management,” Proc. ACM Conf. on Research and Development in Information Retrieval, ACM, Pisa, 1986.
Google Scholar
S. Gibbs and D. Tsichritzis, “Document Presentation and Query- Formulation in MUSE,” Proc. ACM Conf. on Research and Development in Information Retrieval, ACM, Pisa, 1986.
Google Scholar
F. Guenther and H. Lehmann, “Verarbeitung natürlicher Sprache - ein Überblick,” Informatik Spektrum, vol. 9, no. 3, Springer, 1986.
Google Scholar
G. Heyer and B. Schneider, “Extending Prolog for Processing Natural Language Semantics,” Technical Report T5.3, TA Triumph- Adler AG, Nürnberg, Nov. 1986.
Google Scholar
G. Knorz, “Kooperatives (Referenz-)Retrieval,” Forschungsbericht Projekt AIR, TH Darmstadt, FB Informatik, 1984.
Google Scholar
W. Lamersdorf, Semantische Repräsentation komplexer Objektstruk-turen, Informatik Fachberichte 100, Springer, 1985.
Google Scholar
P. C. Lockemann and H. C. Mayr, Rechnergestützte Informationssysteme, Springer, 1978.
Book MATH Google Scholar
D. Maier, J. D. Ullman, and M. Y. Vardi, “On the Foundations of the Universal Relation Model,” TODS, vol. 9, 2, ACM, 1984.
Article MathSciNet Google Scholar
N.J. Nilsson, Principles of Artifical Intelligence, Palo Alto, CA, 1980.
Google Scholar
L. Rostek,Methoden des partiellen Parsing für das automatische Indexing - Syntaxgraphen zur Analyse von Sprachmustern, Datenbanken , Datenbasen, Netzwerke, vol. 1, Saur Verlag, München, 1979 .
Google Scholar
G. M. Sacco, “OTTER - An Information Retrieval System for Office Automation,” Proc. 2nd ACM SIGOA Conference on Office Information Systems, Toronto, 1984.
Google Scholar
G. Salton and M.J. McGill, Introduction to Modern Information Retrieval, McGraw Hill, 1983.
MATH Google Scholar
H.-J. Schek and M. H. Scholl, “An Algebra for the Relational Model with Relation-Valued Attributes,” Information Systems, vol. 11, 2, 1986. Technical Report DVSI-1984-T1, TH Darmstadt, FB Informatik
Google Scholar
J. M. Smith and D. C. P. Smith, “Database Abstractions: Aggregation,” CACM, vol. 20,6, ACM, 1977.
Google Scholar
R. M. Tong, V. N. Askman, J. F. Cunningham, and C. J. Tollander, “RUBRIC - An Environment for Full Text Information Retrieval” Proc. 8th international ACM SIGIR conf. on Research and Development in Information Retrieval, ACM, Montreal, 1985.
Google Scholar
D. Tsichritzis and S. Christodoulakis, “Message Files,” ACM TOOIS, vol. 1,1, ACM, 1983.
Article Google Scholar
D. Tsichritzis, Office Automation, Springer, 1985.
Book MATH Google Scholar
S.B. Yao, A.R. Hevner, Z. Shi, and D. Luo, “Formanager: An Office Forms Management System,” TOOIS, vol. 2,3, ACM, 1984.
Google Scholar
H. H. Zimmermann, Ein Verfahren zur computergestützten Texterschließung, Forschungsbericht ID 83-006 BMFT, Universität des Saarlandes, Saarbrücken, 1983.
Google Scholar

Download references

Author information

Authors and Affiliations

Neue Technologien / Basisentwicklung, TA Triumph-Adler AG, Fürther Str. 212, 8500, Nürnberg 80, Deutschland
Helmut Eirund & Klaus Kreplin

Authors

Helmut Eirund
View author publications
You can also search for this author in PubMed Google Scholar
Klaus Kreplin
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Fachgebiet Datenverwaltungssysteme I, Technische Hochschule Darmstadt, Alexanderstr.24, 6100, Darmstadt, Deutschland
H.-J. Schek
Fachbereich Informatik, Technische Hochschule Darmstadt, Alexanderstr.24, 6100, Darmstadt, Deutschland
H.-J. Schek
Fachbereich Mathematik und Informatik, FernUniversität Hagen, Postfach 940, 5800, Hagen, Deutschland
G. Schlageter

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Eirund, H., Kreplin, K. (1987). Dokumentenmodell und automatische Klassifikation im Bürodokumentenarchiv MULTOS. In: Schek, HJ., Schlageter, G. (eds) Datenbanksysteme in Büro, Technik und Wissenschaft. Informatik-Fachberichte, vol 136. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-72617-0_6

Download citation

DOI: https://doi.org/10.1007/978-3-642-72617-0_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-17736-4
Online ISBN: 978-3-642-72617-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics