Semantically Annotating CEUR-WS Workshop Proceedings with RML

  • Conference paper
Semantic Web Evaluation Challenges (SemWebEval 2015)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 548)

Abstract

In this paper, we present our solution for the first task of the second Semantic Publishing Challenge. The task requires extracting and semantically annotating information about CEUR-WS workshops, their chairs and conference affiliations, as well as their papers and authors, from a set of HTML-encoded workshop proceedings volumes. Our solution builds on last year’s submission: we address a number of its shortcomings, assess the quality of the generated dataset, and publish the queries as SPARQL query templates. To this end, we use the RDF Mapping Language (RML) to define the mappings, the RMLProcessor to execute them, RDFUnit both to validate the mapping documents and to assess the quality of the generated dataset, and The DataTank to publish the SPARQL query templates. The result is a dataset of overall improved quality, which is reflected in the query results.
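
To illustrate what a SPARQL query template might look like in practice, here is a minimal Python sketch. It is not taken from the paper: the placeholder name `volume`, the helper `fill_template`, and the use of `dcterms:isPartOf` to link papers to a proceedings volume are all assumptions made for illustration. The templates actually published by the authors may be structured differently.

```python
from string import Template

# Hypothetical template: the placeholder name and the dcterms:isPartOf
# pattern are illustrative assumptions, not the paper's published templates.
PAPERS_IN_VOLUME = Template("""\
PREFIX dcterms: <http://purl.org/dc/terms/>
SELECT ?paper ?title WHERE {
  ?paper dcterms:isPartOf <$volume> ;
         dcterms:title ?title .
}
""")

# Characters that must not appear inside an IRI reference (<...>).
UNSAFE_IRI_CHARS = set('<>"{}|^`\\ \n\t')


def fill_template(template: Template, **params: str) -> str:
    """Return the query text with every placeholder substituted.

    Values containing characters that could break out of an IRI
    reference are rejected, as a minimal guard against query injection.
    """
    for name, value in params.items():
        if any(ch in UNSAFE_IRI_CHARS for ch in value):
            raise ValueError(f"unsafe value for {name!r}: {value!r}")
    return template.substitute(**params)


query = fill_template(PAPERS_IN_VOLUME, volume="http://ceur-ws.org/Vol-1/")
print(query)
```

Keeping the query text fixed and exposing only named parameters is the general idea behind a query template: a client supplies a value such as a volume IRI and never edits the SPARQL itself.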

Notes

  1. http://ceur-ws.org/

  2. http://challenges.2014.eswc-conferences.org/index.php/SemPub/

  3. https://github.com/ceurws/lod/wiki/SemPub2015

  4. https://github.com/ceurws/lod/wiki/Task1

  5. http://rml.io

  6. http://www.w3.org/TR/selectors/

  7. http://jquery.com

  8. http://purl.org/ontology/bibo/

  9. http://purl.org/dc/terms/

  10. http://xmlns.com/foaf/0.1/

  11. http://www.w3.org/2000/01/rdf-schema

  12. http://purl.org/spar/fabio/

  13. http://purl.org/NET/c4dm/event.owl

  14. http://swrc.ontoware.org/ontology

  15. http://rml.io/data/spc2015/mappings

  16. http://jodd.org/doc/csselly/

  17. This tool is available at http://rml.io/data/spc2015/reformat_tool.

  18. http://jtidy.sourceforge.net/

  19. The valid HTML pages are available at http://rml.io/data/spc2015/valid_html.

  20. http://rml.io/data/spc2015/sparql_templates

  21. https://github.com/mmlab/RMLProcessor

  22. https://github.com/antidot/db2triples/

  23. http://thedatatank.com/

  24. http://www.openknowledge.be/

Acknowledgements

The described research activities were funded by Ghent University, iMinds, the Institute for the Promotion of Innovation by Science and Technology in Flanders (IWT), the Fund for Scientific Research Flanders (FWO Flanders), and the European Union.

Corresponding author

Correspondence to Pieter Heyvaert.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Heyvaert, P., Dimou, A., Verborgh, R., Mannens, E., Van de Walle, R. (2015). Semantically Annotating CEUR-WS Workshop Proceedings with RML. In: Gandon, F., Cabrio, E., Stankovic, M., Zimmermann, A. (eds) Semantic Web Evaluation Challenges. SemWebEval 2015. Communications in Computer and Information Science, vol 548. Springer, Cham. https://doi.org/10.1007/978-3-319-25518-7_14

  • DOI: https://doi.org/10.1007/978-3-319-25518-7_14

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25517-0

  • Online ISBN: 978-3-319-25518-7

  • eBook Packages: Computer Science, Computer Science (R0)
