
Adapting Gloss Vector Semantic Relatedness Measure for Semantic Similarity Estimation: An Evaluation in the Biomedical Domain

  • Conference paper
Semantic Technology (JIST 2013)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8388)


Abstract

Automatic methods of ontology alignment are essential for establishing interoperability across web services. To discover reliable correspondences, these methods need to measure the semantic similarity between the entities of two ontologies. While existing similarity measures suffer from some difficulties, semantic relatedness measures tend to yield better results, even though they are not completely appropriate for the ‘equivalence’ relationship (e.g. “blood” and “bleeding” are related but not similar). We attempt to adapt the Gloss Vector relatedness measure for similarity estimation. Generally, Gloss Vector uses the angles between entities’ gloss vectors to calculate relatedness. We first optimize the entities’ gloss vectors by using Pearson’s chi-squared test to statistically eliminate insignificant features, and then enrich them by considering the concepts’ taxonomy for better similarity measurement. The discussed measures are evaluated in the biomedical domain using MeSH, MEDLINE and a dataset of 301 concept pairs. We conclude that Adapted Gloss Vector similarity results are more correlated with human judgment of similarity than those of the other measures.
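
The two computational steps named in the abstract, the angle between gloss vectors and the chi-squared elimination of insignificant features, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the sparse-dictionary vector representation, the 2x2 contingency tables, the toy counts, the 0.05 significance level, and the helper names `cosine`, `chi_squared_2x2` and `prune` are all assumptions made for illustration, and the taxonomy-based enrichment step is not shown.

```python
import math

# Critical value of the chi-squared distribution with 1 degree of freedom
# at p = 0.05. The significance level is an assumption for this sketch.
CRITICAL_VALUE = 3.841


def cosine(u, v):
    """Cosine of the angle between two sparse vectors (feature -> weight)."""
    dot = sum(w * v.get(f, 0.0) for f, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def chi_squared_2x2(n11, n12, n21, n22):
    """Pearson's chi-squared statistic for a 2x2 contingency table.

    n11: contexts containing both the word and the feature
    n12: contexts containing the word but not the feature
    n21: contexts containing the feature but not the word
    n22: contexts containing neither
    """
    n = n11 + n12 + n21 + n22
    denom = (n11 + n12) * (n21 + n22) * (n11 + n21) * (n12 + n22)
    if denom == 0:
        return 0.0
    return n * (n11 * n22 - n12 * n21) ** 2 / denom


def prune(vector, tables, critical=CRITICAL_VALUE):
    """Keep only features whose association passes the chi-squared test."""
    return {f: w for f, w in vector.items()
            if chi_squared_2x2(*tables[f]) >= critical}


# Hypothetical toy vectors and contingency tables, not data from the paper.
blood = {"cell": 8.0, "vessel": 6.0, "the": 5.0}
bleeding = {"vessel": 7.0, "wound": 4.0, "the": 5.0}
tables = {
    "cell": (30, 10, 10, 50),
    "vessel": (25, 15, 10, 50),
    "wound": (20, 20, 5, 55),
    "the": (50, 50, 50, 50),  # independent of the word: statistic is 0
}

raw_score = cosine(blood, bleeding)
pruned_score = cosine(prune(blood, tables), prune(bleeding, tables))
print(raw_score, pruned_score)
```

In this toy data the feature "the" co-occurs with everything equally, so the test removes it from both vectors and the shared, uninformative dimension no longer inflates the score.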


Notes

  1. http://wordnet.princeton.edu

  2. http://www.nlm.nih.gov/research/umls

  3. http://www.nlm.nih.gov/research/umls

  4. http://mbr.nlm.nih.gov/index.shtml

  5. http://rxinformatics.umn.edu/data/UMNSRS_similarity.csv


Author information


Corresponding author

Correspondence to Ahmad Pesaranghader.


Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Pesaranghader, A., Rezaei, A., Pesaranghader, A. (2014). Adapting Gloss Vector Semantic Relatedness Measure for Semantic Similarity Estimation: An Evaluation in the Biomedical Domain. In: Kim, W., Ding, Y., Kim, HG. (eds) Semantic Technology. JIST 2013. Lecture Notes in Computer Science, vol 8388. Springer, Cham. https://doi.org/10.1007/978-3-319-06826-8_11


  • DOI: https://doi.org/10.1007/978-3-319-06826-8_11


  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-06825-1

  • Online ISBN: 978-3-319-06826-8

  • eBook Packages: Computer Science, Computer Science (R0)
