Skip to main content

Dokumentenmodell und automatische Klassifikation im Bürodokumentenarchiv MULTOS

  • Conference paper
Datenbanksysteme in Büro, Technik und Wissenschaft

Part of the book series: Informatik-Fachberichte ((INFORMATIK,volume 136))

Zusammenfassung

Für ein Bürodokumentenarchiv wurde ein Dokumentenmodell entwickelt, das die Beschreibung der im Dokument vorkommenden Konzepte vorsieht. Durch Gruppierung und Spezialisierung dieser Beschreibungen gelangt man zu einer Menge hierarchisch angeordneter Dokumenttypen, vergleichbar mit einem erweiterten Datenbankschema. Die Typ-Dokument-Zuordnung bildet eine Zugriffsstruktur, die die Bearbeitung von Anfragen über semantische Einheiten, Konzepte, anstelle von syntaktischen Elementen ermöglicht. Zur Unterstützung des Einfügens von Dokumenten dient eine wissensbasierte Klassifikationskomponente, die die konzeptuelle Beschreibung eines Dokuments automatisch erzeugt und das Dokument einem passenden Typ zuordnet. Die Klassifikation wird durch die Typhierarchie gesteuert, wobei der relevante Inhalt jeder konzeptuellen Komponente über einen Satz von inhaltsbeschreibenden Prädikaten definiert wird.

Abstract

To describe the conceptual components of documents in an office document archive, a document model is presented. By grouping and generalizing these descriptions we get a set of hierarchically structured document types that can be compared with an extended data base schema. With the type-document relation an additional access structure is established that provides the evaluation of queries on semantic units (concepts) rather than on syntactic elements. A knowledge based classification system automatically generates the conceptual description of a document to be stored by means of content analysis and associates the document to an appropriate type. This task is conducted by the type hierarchy where the relevant content for each conceptual component of a type is defined by a set of content description predicates.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 49.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 59.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Literaturverzeichnis

  1. F. Barbie and F. Rabitti, “The Type Concept in Office Document Retrieval,” Proc. llth Conference on Very Large Data Bases, Stockholm, 1985.

    Google Scholar 

  2. E. Bertino, S. Gibbs, F. Rabitti, C. Thanos, and D. Tsichritzis, “A Multimedia File Server,” Proc. 6th Advanced Database Symposium, Information Processing Society of Japan, 1986.

    Google Scholar 

  3. R. J. Brachman and J. G. Schmölze, “An Overview of the KL-ONE Knowledge Representation System,” Cognitive Science, vol. 9, 2, 1985.

    Article  Google Scholar 

  4. S. Christodoulakis, “Framework for the Development of an Experimental Mixed-Mode Message System,” Proc. 3rd Joint BCS and ACM Symposium Research and Development in Information Retrieval, Cambridge University Press, Cambridge, 1984.

    Google Scholar 

  5. W. B. Croft, “User-specific Domain Knowledge for Document Retrieval,” Proc. ACM Conf. on Research and Development in Information Retrieval, ACM, Pisa, 1986.

    Google Scholar 

  6. ECMA, Office Document Architecture, Standard 101, European Com-puter Manufacturers Association, September 1985.

    Google Scholar 

  7. C. Faloutsos, “Access Methods for Text,” Computing Surveys, vol. 17, 1, ACM, 1985.

    Article  Google Scholar 

  8. R. Furuta, J. Scofield, and A. Shaw, “Document Formatting Systems: Survey, Concepts, and Issues,” ACM Computing Surveys, vol. 14,3, ACM, 1982.

    Google Scholar 

  9. S. Gallelli, C. Iacobelli, and P. Marchisio, “An Approach to Multimedia Information Management,” Proc. ACM Conf. on Research and Development in Information Retrieval, ACM, Pisa, 1986.

    Google Scholar 

  10. S. Gibbs and D. Tsichritzis, “Document Presentation and Query- Formulation in MUSE,” Proc. ACM Conf. on Research and Development in Information Retrieval, ACM, Pisa, 1986.

    Google Scholar 

  11. F. Guenther and H. Lehmann, “Verarbeitung natürlicher Sprache - ein Überblick,” Informatik Spektrum, vol. 9, no. 3, Springer, 1986.

    Google Scholar 

  12. G. Heyer and B. Schneider, “Extending Prolog for Processing Natural Language Semantics,” Technical Report T5.3, TA Triumph- Adler AG, Nürnberg, Nov. 1986.

    Google Scholar 

  13. G. Knorz, “Kooperatives (Referenz-)Retrieval,” Forschungsbericht Projekt AIR, TH Darmstadt, FB Informatik, 1984.

    Google Scholar 

  14. W. Lamersdorf, Semantische Repräsentation komplexer Objektstruk-turen, Informatik Fachberichte 100, Springer, 1985.

    Google Scholar 

  15. P. C. Lockemann and H. C. Mayr, Rechnergestützte Informationssysteme, Springer, 1978.

    Book  MATH  Google Scholar 

  16. D. Maier, J. D. Ullman, and M. Y. Vardi, “On the Foundations of the Universal Relation Model,” TODS, vol. 9, 2, ACM, 1984.

    Article  MathSciNet  Google Scholar 

  17. N.J. Nilsson, Principles of Artifical Intelligence, Palo Alto, CA, 1980.

    Google Scholar 

  18. L. Rostek,Methoden des partiellen Parsing für das automatische Indexing - Syntaxgraphen zur Analyse von Sprachmustern, Datenbanken , Datenbasen, Netzwerke, vol. 1, Saur Verlag, München, 1979 .

    Google Scholar 

  19. G. M. Sacco, “OTTER - An Information Retrieval System for Office Automation,” Proc. 2nd ACM SIGOA Conference on Office Information Systems, Toronto, 1984.

    Google Scholar 

  20. G. Salton and M.J. McGill, Introduction to Modern Information Retrieval, McGraw Hill, 1983.

    MATH  Google Scholar 

  21. H.-J. Schek and M. H. Scholl, “An Algebra for the Relational Model with Relation-Valued Attributes,” Information Systems, vol. 11, 2, 1986. Technical Report DVSI-1984-T1, TH Darmstadt, FB Informatik

    Google Scholar 

  22. J. M. Smith and D. C. P. Smith, “Database Abstractions: Aggregation,” CACM, vol. 20,6, ACM, 1977.

    Google Scholar 

  23. R. M. Tong, V. N. Askman, J. F. Cunningham, and C. J. Tollander, “RUBRIC - An Environment for Full Text Information Retrieval” Proc. 8th international ACM SIGIR conf. on Research and Development in Information Retrieval, ACM, Montreal, 1985.

    Google Scholar 

  24. D. Tsichritzis and S. Christodoulakis, “Message Files,” ACM TOOIS, vol. 1,1, ACM, 1983.

    Article  Google Scholar 

  25. D. Tsichritzis, Office Automation, Springer, 1985.

    Book  MATH  Google Scholar 

  26. S.B. Yao, A.R. Hevner, Z. Shi, and D. Luo, “Formanager: An Office Forms Management System,” TOOIS, vol. 2,3, ACM, 1984.

    Google Scholar 

  27. H. H. Zimmermann, Ein Verfahren zur computergestützten Texterschließung, Forschungsbericht ID 83-006 BMFT, Universität des Saarlandes, Saarbrücken, 1983.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1987 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Eirund, H., Kreplin, K. (1987). Dokumentenmodell und automatische Klassifikation im Bürodokumentenarchiv MULTOS. In: Schek, HJ., Schlageter, G. (eds) Datenbanksysteme in Büro, Technik und Wissenschaft. Informatik-Fachberichte, vol 136. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-72617-0_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-72617-0_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-17736-4

  • Online ISBN: 978-3-642-72617-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics