Skip to main content

A Logic-Based Approach to Named-Entity Disambiguation in the Web of Data

  • Conference paper
  • First Online:
AI*IA 2015 Advances in Artificial Intelligence (AI*IA 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9336))

Included in the following conference series:

Abstract

Semantic annotation aims at linking parts of rough data (e.g., text, video, or image) to known entities in the Linked Open Data (LOD) space. When several entities could be linked to a given object, a Named-Entity Disambiguation (NED) problem must be solved. While disambiguation has been extensively studied in Natural Language Understanding (NLU), NED is less ambitious—it does not aim to the meaning of a whole phrase, just to correctly link objects to entities—and at the same time more peculiar since the target must be LOD-entities. Inspired by semantic similarity in NLU, this paper illustrates a way to solve disambiguation based on Common Subsumers of pairs of RDF resources related to entities recognized in the text. The inference process proposed for resolving ambiguities leverages on the DBpedia structured semantics. We apply it to a TV-program description enrichment use case, illustrating its potential in correcting errors produced by automatic text annotators (such as errors in assigning entity types and entity URIs), and in extracting a description of the main topics of a text in form of commonalities shared by its entities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alfonseca, E., Manandhar, S.: An unsupervised method for general named entity recognition and automated concept discovery. In: Proc. of the 1st Int. Conf. on General WordNet, Mysore, India, pp. 34–43 (2002)

    Google Scholar 

  2. Bellot, P., Bonnefoy, L., Bouvier, V., Duvert, F., Kim, Y.M.: Large scale text mining approaches for information retrieval and extraction. In: Innovations in Intelligent Machines-4, pp. 3–45. Springer (2014)

    Google Scholar 

  3. Bunescu, R.C., Pasca, M.: Using encyclopedic knowledge for named entity disambiguation. In: Proc. of the 11th Conf. of the European Chapter of the Association for Computational Linguistics (EACL-06), vol. 6, pp. 9–16 (2006)

    Google Scholar 

  4. Cambria, E., White, B.: Jumping NLP curves: A review of natural language processing research. IEEE Computational Intelligence Magazine 9(2), 48–57 (2014)

    Article  Google Scholar 

  5. Chen, L., Ortona, S., Orsi, G., Benedikt, M.: Aggregating semantic annotators. Proc. of the VLDB Endowment 6(13), 1486–1497 (2013)

    Article  Google Scholar 

  6. Chieu, H.L., Ng, H.T.: Named entity recognition: a maximum entropy approach using global information. In: Proc. of the 19th Int. Conf. on Computational linguistics, vol. 1, pp. 1–7. ACL (2002)

    Google Scholar 

  7. Cimiano, P., Völker, J.: Towards large-scale, open-domain and ontology-based named entity classification. In: Proc. of the Int. Conf. on Recent Advances in Natural Language Processing (RANLP) (2005)

    Google Scholar 

  8. Colucci, S., Donini, F.M., Di Sciascio, E.: Common subsumbers in RDF. In: Baldoni, M., Baroglio, C., Boella, G., Micalizio, R. (eds.) AI*IA 2013. LNCS, vol. 8249, pp. 348–359. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  9. Colucci, S., Giannini, S., Donini, F.M., Di Sciascio, E.: A deductive approach to the identification and description of clusters in Linked Open Data. In: Proc. of the 21th European Conf. on Artificial Intelligence (ECAI 2014). IOS Press (2014)

    Google Scholar 

  10. Cucerzan, S.: TAC entity linking by performing full-document entity extraction and disambiguation. In: Proc. of the Text Analysis Conference, vol. 2011 (2011)

    Google Scholar 

  11. Dredze, M., McNamee, P., Rao, D., Gerber, A., Finin, T.: Entity disambiguation for knowledge base population. In: Proc. of the 23rd Int. Conf. on Computational Linguistics, pp. 277–285. ACL, Beijing, August 2010

    Google Scholar 

  12. Etzioni, O., Cafarella, M., Downey, D., Popescu, A.M., Shaked, T., Soderland, S., Weld, D.S., Yates, A.: Unsupervised named-entity extraction from the web: An experimental study. Artificial Intelligence 165(1), 91–134 (2005)

    Article  Google Scholar 

  13. Fetahu, B., Dietze, S., Pereira Nunes, B., Antonio Casanova, M., Taibi, D., Nejdl, W.: What’s all the data about?: creating structured profiles of linked data on the web. In: Proc. of the Companion Publication of the 23rd Int. Conf. on World Wide Web Companion, pp. 261–262. International World Wide Web Conferences Steering Committee (2014)

    Google Scholar 

  14. Gangemi, A.: A comparison of knowledge extraction tools for the semantic web. In: Cimiano, P., Corcho, O., Presutti, V., Hollink, L., Rudolph, S. (eds.) ESWC 2013. LNCS, vol. 7882, pp. 351–366. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  15. Hayes, P.: RDF Semantics, W3C Recommendation (2004). http://www.w3.org/TR/2004/REC-rdf-mt-20040210/

  16. Hellmann, S., Lehmann, J., Auer, S., Brümmer, M.: Integrating NLP using linked data. In: Alani, H., et al. (eds.) ISWC 2013, Part II. LNCS, vol. 8219, pp. 98–113. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  17. Hobbs, J.R., Stickel, M., Martin, P., Edwards, D.: Interpretation as abduction. In: Proc. of the 26th Annual Meeting on Association for Computational Linguistics, pp. 95–103. ACL (1988)

    Google Scholar 

  18. Hoffart, J., Yosef, M.A., Bordino, Ilaria Fürstenau and H., Pinkal, M., Spaniol, M., Taneva, B., Thater, S., Weikum, G.: Robust disambiguation of named entities in text. In: Proc. of the Conf. on Empirical Methods in Natural Language Processing, pp. 782–792. ACL, Edinburgh, July 2011

    Google Scholar 

  19. Maccatrozzo, V.: Burst the filter bubble: using semantic web to enable serendipity. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012, Part II. LNCS, vol. 7650, pp. 391–398. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  20. Mihalcea, R., Csomai, A.: Wikify!: linking documents to encyclopedic knowledge. In: Proc. of the 16th ACM Conf. on Information and Knowledge Management, pp. 233–242. ACM (2007)

    Google Scholar 

  21. Milne, D., Witten, I.H.: Learning to link with wikipedia. In: Proc. of the 17th ACM Conf. on Information and Knowledge Management, pp. 509–518. ACM (2008)

    Google Scholar 

  22. Moro, A., Raganato, A., Navigli, R.: Entity linking meets word sense disambiguation: a unified approach. Transactions of the Association for Computational Linguistics 2, 231–244 (2014)

    Google Scholar 

  23. Nakashole, N., Theobald, M., Weikum, G.: Scalable knowledge harvesting with high precision and high recall. In: Proc. of the Fourth ACM Int. Conf. on Web Search and Data Mining, pp. 227–236. ACM, Hong Kong, February 2011

    Google Scholar 

  24. Navigli, R., Ponzetto, S.P.: BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artificial Intelligence 193, 217–250 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  25. Navigli, R., Velardi, P.: Structural semantic interconnections: a knowledge-based approach to word sense disambiguation. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(7), 1075–1086 (2005)

    Article  Google Scholar 

  26. Niu, C., Li, W., Ding, J., Srihari, R.K.: A bootstrapping approach to named entity classification using successive learners. In: Proc. of the 41st Annual Meeting on Association for Computational Linguistics, vol. 1, pp. 335–342. ACL (2003)

    Google Scholar 

  27. Prokofyev, R., Demartini, G., Cudré-Mauroux, P.: Effective named entity recognition for idiosyncratic web collections. In: Proc. of the 23rd Int. Conf. on World Wide Web, pp. 397–408. International World Wide Web Conferences Steering Committee (2014)

    Google Scholar 

  28. Rizzo, G., Erp, M.V., Troncy, R.: Benchmarking the extraction and disambiguation of named entities on the semantic web. In: Proc. of the 9th Int. Conf. on Language Resources and Evaluation (LREC 2014). European Language Resources Association (ELRA), Reykjavik, May 2014

    Google Scholar 

  29. Studer, R., Burghart, C., Stojanovic, N., Thanh, T., Zacharias, V.: New dimensions in semantic knowledge management. In: Towards the Internet of Services: The THESEUS Research Program, pp. 37–50. Springer (2014)

    Google Scholar 

  30. Van Erp, M., Rizzo, G., Troncy, R.: Learning with the web: spotting named entities on the intersection of NERD and machine learning. In: # MSM, pp. 27–30. Citeseer (2013)

    Google Scholar 

  31. Zhang, L., Pan, Y., Zhang, T.: Focused named entity recognition using machine learning. In: Proc. of the 27th Annual Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, pp. 281–288. ACM (2004)

    Google Scholar 

  32. Zheng, Z., Si, X., Li, F., Chang, E.Y., Zhu, X.: Entity disambiguation with freebase. In: Proc. of the The 2012 IEEE/WIC/ACM Int. Joint Conf. on Web Intelligence and Intelligent Agent Technology, vol. 01, pp. 82–89. IEEE Computer Society (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Simona Colucci .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Giannini, S., Colucci, S., Donini, F.M., Di Sciascio, E. (2015). A Logic-Based Approach to Named-Entity Disambiguation in the Web of Data. In: Gavanelli, M., Lamma, E., Riguzzi, F. (eds) AI*IA 2015 Advances in Artificial Intelligence. AI*IA 2015. Lecture Notes in Computer Science(), vol 9336. Springer, Cham. https://doi.org/10.1007/978-3-319-24309-2_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-24309-2_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24308-5

  • Online ISBN: 978-3-319-24309-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics