A More Specific Events Classification to Improve Crawling Techniques

  • Conference paper
On the Move to Meaningful Internet Systems: OTM 2010 Workshops (OTM 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6428))

Abstract

Data quality is attracting growing attention in linked data. Applications that consume linked data need to be aware of changes in a dataset. Links may be updated, removed or created over time, so these changes must be detected in order to update local data dependencies; change detection systems annotate such events. Updated or removed links can be detected with a syntactic change similarity measure, for example simply by using the Levenshtein distance. However, no content-based subclassification exists for the updated and removed events that existing change detection systems produce. An approach combining semantic signatures with the Maximum Similarity Measure (MaSiMe) is developed to create a more specific subclassification of an initial updated or removed event when the meaning of the resource has changed. The approach enriches resources by annotating them with the new subclassification, and it also records the author who created each annotation, adding provenance information. The modification time is annotated on the linked data resource as well, and a study of the average time between these specific events could improve crawling techniques for a domain.
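
As a rough illustration of the syntactic step mentioned in the abstract, the sketch below computes the Levenshtein distance between two versions of a resource's content and turns it into a coarse event label. It is a minimal sketch, not the authors' implementation: the function names `levenshtein`, `similarity` and `classify_change`, the normalisation by the longer string, and the use of `None` to represent a missing version are all assumptions made for the example. The semantic subclassification with semantic signatures and MaSiMe is not covered here.

```python
from typing import Optional


def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions and substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Normalised similarity in [0, 1]; 1.0 means identical strings.

    Dividing by the longer length is an assumption of this sketch.
    """
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


def classify_change(old: Optional[str], new: Optional[str]) -> str:
    """Hypothetical coarse event label for two versions of a resource."""
    if old is None and new is None:
        return "unchanged"
    if old is None:
        return "created"
    if new is None:
        return "removed"
    return "unchanged" if similarity(old, new) == 1.0 else "updated"
```

For instance, `classify_change("http://example.org/a", "http://example.org/b")` yields `"updated"`, and `similarity` on the same pair indicates how large the syntactic change was. Whether the meaning also changed is the question the semantic subclassification addresses.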

References

  1. Berners-Lee, T.: Linked Data. World Wide Web Consortium (2006), http://www.w3.org/DesignIssue/LinkedData.html (retrieved August 8, 2008)

  2. Umbrich, J., Hausenblas, M., Hogan, A., Polleres, A., Decker, S.: Toward Dataset Dynamics: Change Frequency of Linked Open Data Sources. In: LDOW 2010, Raleigh, USA (2010)

  3. Harth, A., Umbrich, J., Decker, S.: A Pipelined Architecture for Crawling and Indexing Semantic Web Data. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 258–271. Springer, Heidelberg (2006)

  4. Auer, S., Dietzold, S., Lehmann, J., Hellmann, S., Aumueller, D.: Triplify: light-weight linked data publication from relational databases. In: WWW 2009. ACM, New York (2009)

  5. Morishima, A., Nakamizo, A., Iida, T., Sugimoto, S., Kitagawa, H.: Bringing your dead links back to life: a comprehensive approach and lessons learned. In: Proceedings of the 20th ACM Conference on Hypertext and Hypermedia, HT 2009, pp. 15–24. ACM, New York (2009)

  6. Van de Sompel, H., Sanderson, R., Nelson, M.L.: An HTTP-Based Versioning Mechanism for Linked Data. In: LDOW 2010, Raleigh, North Carolina, USA. ACM, New York (2010)

  7. Popitsch, N.P., Haslhofer, B.: DSNotify: Handling Broken Links in the Web of Data. In: WWW 2010, Raleigh, North Carolina, USA. ACM, New York (2010)

  8. Urdiales-Nieto, D., Martínez-Gil, J., Aldana-Montes, J.F.: MaSiMe: a customized similarity measure and its application for tag cloud refactoring. In: Meersman, R., Herrero, P., Dillon, T. (eds.) OTM 2009 Workshops. LNCS, vol. 5872, pp. 937–946. Springer, Heidelberg (2009)

  9. Levenshtein, V.: Binary Codes Capable of Correcting Deletions, Insertions and Reversals. Soviet Physics-Doklady 10, 707–710 (1966)

  10. Ziegler, P., Kiefer, C., Sturm, C., Dittrich, K.R., Bernstein, A.: Detecting Similarities in Ontologies with the SOQA-SimPack Toolkit. In: Ioannidis, Y., Scholl, M.H., Schmidt, J.W., Matthes, F., Hatzopoulos, M., Böhm, K., Kemper, A., Grust, T., Böhm, C. (eds.) EDBT 2006. LNCS, vol. 3896, pp. 59–76. Springer, Heidelberg (2006)

  11. Ukkonen, E.: Approximate String Matching with q-grams and Maximal Matches. Theor. Comput. Sci. 92(1), 191–211 (1992)

  12. http://textwise.com/api/semantic-signatures

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Urdiales-Nieto, D., Aldana-Montes, J.F. (2010). A More Specific Events Classification to Improve Crawling Techniques. In: Meersman, R., Dillon, T., Herrero, P. (eds) On the Move to Meaningful Internet Systems: OTM 2010 Workshops. OTM 2010. Lecture Notes in Computer Science, vol 6428. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16961-8_86

  • DOI: https://doi.org/10.1007/978-3-642-16961-8_86

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16960-1

  • Online ISBN: 978-3-642-16961-8

  • eBook Packages: Computer Science, Computer Science (R0)
