Abstract
The (automatic) extraction of significant characteristics of files is an important feature of all long term preservation activities. We propose, however, that for the necessary automatic evaluation of the outcomes of certain preservation actions – notably migration – an approach is necessary, which follows other traditions in the abstraction of format descriptions. To implement a strategy for the automatic evaluation of various actions within a preservation environment, we define two formal, XML base languages: One allowing to define the content of a specific file, the other describing a file format in such a way, that it can be handled by multi-purpose software.
The results presented in this paper are part of the project Planets, cofunded by the European commission under contract FP6-2005-033789.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
National Library of New Zealand: Metadata Extraction Tool Software Architecture, Version 3.0, p. 2 (June 17, 2003), http://meta-extractor.sourceforge.net/meta-extractor-software-architecture-v3.pdf
National Library of New Zealand: Metadata Standards Framework – Preservation Metadata (Revised) (June 2003), http://www.natlib.govt.nz/catalogues/library-documents/preservation-metadata-revised
Harvard University Library, http://hul.harvard.edu/jhove/
HDF4 User’s Guide / HDF4 Release 2.2 (December 2007), ftp://ftp.hdfgroup.org/HDF/Documentation/HDF4.2r2/HDF42r2_UserGd.pdf
Data Format Description Language, http://forge.gridforum.org/projects/dfdl-wg/
Binary Format Description Language, http://collaboratory.emsl.pnl.gov/sam/bfd/
Planets, http://www.planets-project.eu/
The current specification of the XCD language is, http://planetarium.hki.uni-koeln.de/public/XCL/xcl/XCLDocumentation/xcdlDocu.html
Fisher, K., Mandelbaum, Y., Walker, D.: The Next 700 Data Description Languages. ACM Sigplan Notices 41(1), 2–15 (2006)
The current specification of the XCE language, http://planetarium.hki.uni-koeln.de/public/XCL/xcl/XCLDocumentation/xcelDocu.html
Hardy, M.R.B.: The Mars Project - PDF in XML. In: DocEng 2007, pp. 161–170. ACM Press, New York (2007)
Gruhl, D., Meredith, D., Pieper, J.: A case study on alternate representations of data structures in XML. In: DocEng 2005, pp. 217–219. ACM Press, New York (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Thaller, M., Heydegger, V., Schnasse, J., Beyl, S., Chudobkaite, E. (2008). Significant Characteristics to Abstract Content: Long Term Preservation of Information. In: Christensen-Dalsgaard, B., Castelli, D., Ammitzbøll Jurik, B., Lippincott, J. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2008. Lecture Notes in Computer Science, vol 5173. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87599-4_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-87599-4_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87598-7
Online ISBN: 978-3-540-87599-4
eBook Packages: Computer ScienceComputer Science (R0)