Skip to main content

An ETL Framework for Online Analytical Processing of Linked Open Data

  • Conference paper
Web-Age Information Management (WAIM 2013)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7923))

Included in the following conference series:

Abstract

Growing amount of data are being published online in machinereadable formats, and LOD (Linked Open Data) has emerged as a way to share such data across Web resources. Since LOD data often contain numerical data, such as statistics, there is a growing demand to make OLAP (Online Analytical Processing) analysis over such data. To make it possible to apply off-the-shelf OLAP systems for analyzing LOD data, we propose a framework to streamline the Extract, Transform, and Load (ETL) process from LOD to multidimensional data models for OLAP. Unlike other related approaches, our framework does not require RDF vocabularies dedicated for specifying multidimensional model for OLAP. Instead, given an LOD dataset, we exploit the relationships among entities and external information in the referenced LOD to generate an OLAP data model. In a case study, we demonstrate that our framework can extract OLAP data models from different kinds of real LOD datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G.: DBpedia: A Nucleus for a Web of Open Data. In: Aberer, K., et al. (eds.) ISWC/ASWC 2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  2. Berners-Lee, T.: Linked Data - Design Issues, http://www.w3.org/DesignIssues/LinkedData.html

  3. Carroll, J.J., Klyne, G.: Resource Description Framework (RDF): Concepts and Abstract Syntax. W3C recommendation, W3C (February 2004), http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/

  4. Codd, E., Codd, S., Salley, C., Codd & Date, Inc.: Providing OLAP (On-line Analytical Processing) to User-analysts: An IT Mandate. Codd & Associates (1993)

    Google Scholar 

  5. Cyganiak, R., Reynolds, D.: The RDF Data Cube Vocabulary. W3C working draft, W3C (April 2012), http://www.w3.org/TR/vocab-data-cube/

  6. Etcheverry, L., Vaisman, A.A.: Enhancing OLAP Analysis with Web Cubes. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 469–483. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  7. Etcheverry, L., Vaisman, A.A.: QB4OLAP: A Vocabulary for OLAP Cubes on the Semantic Web. In: COLD. CEUR Workshop Proceedings, vol. 905. CEUR-WS.org (2012)

    Google Scholar 

  8. Han, J., Kamber, M.: Data Warehouse and OLAP Technology: An Overview. In: Data Mining: Concepts and Techniques, 2nd edn., pp. 105–156. Morgan Kaufmann (2006)

    Google Scholar 

  9. Iqbal, A., Capadisli, S., Cyganiak, R., Hausenblas, M.: Eurostat - Linked Data, http://eurostat.linked-statistics.org/

  10. Kämpgen, B., Harth, A.: Transforming statistical linked data for use in OLAP systems. In: I-SEMANTICS. ACM International Conference Proceeding Series, pp. 33–40. ACM (2011)

    Google Scholar 

  11. McGuinness, D.L., van Harmelen, F.: OWL Web Ontology Language Overview. W3C recommendation, W3C (February 2004), http://www.w3.org/TR/2004/REC-owl-features-20040210/

  12. Niemi, T., Toivonen, S., Niinimäki, M., Nummenmaa, J.: Ontologies with Semantic Web/Grid in Data Integration for OLAP. Int. J. Semantic Web Inf. Syst. 3(4), 25–49 (2007)

    Article  Google Scholar 

  13. Patni, H., Henson, C.A., Sheth, A.P.: Linked Sensor Data. In: CTS, pp. 362–370 (2010)

    Google Scholar 

  14. Vatant, B., Wick, M.: GeoNames Ontology, http://www.geonames.org/ontology/

  15. Wilkinson, K., Sayers, C., Kuno, H.A., Reynolds, D.: Efficient RDF Storage and Retrieval in Jena2. In: SWDB, pp. 131–150 (2003)

    Google Scholar 

  16. Wilkinson, K.: Jena Property Table Implementation. In: SSWS (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Inoue, H., Amagasa, T., Kitagawa, H. (2013). An ETL Framework for Online Analytical Processing of Linked Open Data. In: Wang, J., Xiong, H., Ishikawa, Y., Xu, J., Zhou, J. (eds) Web-Age Information Management. WAIM 2013. Lecture Notes in Computer Science, vol 7923. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38562-9_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-38562-9_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38561-2

  • Online ISBN: 978-3-642-38562-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics