Methodology for Connecting Nouns to Their Modifying Adjectives

Ofek, Nir; Rokach, Lior; Mitra, Prasenjit

doi:10.1007/978-3-642-54906-9_22

Nir Ofek¹⁷,
Lior Rokach¹⁷ &
Prasenjit Mitra¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8403))

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

2063 Accesses
9 Citations

Abstract

Adjectives are words that describe or modify other elements in a sentence. As such, they are frequently used to convey facts and opinions about the nouns they modify. Connecting nouns to the corresponding adjectives becomes vital for intelligent tasks such as aspect-level sentiment analysis or interpretation of complex queries (e.g., "small hotel with large rooms") for fine-grained information retrieval. To respond to the need, we propose a methodology that identifies dependencies of nouns and adjectives by looking at syntactic clues related to part-of-speech sequences that help recognize such relationships. These sequences are generalized into patterns that are used to train a binary classifier using machine learning methods. The capabilities of the new method are demonstrated in two, syntactically different languages: English, the leading language of international discourse, and Hebrew, whose rich morphology poses additional challenges for parsing. In each language we compare our method with a designated, state-of-the-art parser and show that it performs similarly in terms of accuracy while: (a) our method uses a simple and relatively small training set; (b) it does not require a language specific adaptation, and (c) it is robust across a variety of writing styles.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Adler, M.: Hebrew Morphological Disambiguation: An Unsupervised Stochastic Word-based Approach. Ph.D. thesis, Ben-Gurion University of the Negev, Beer-Sheva, Israel (2007)
Google Scholar
Adler, M., DahanNetzer, Y., Goldberg, Y., Gabay, D., Elhadad, M.: Tagging a Hebrew Corpus: The Case of Participles. In: LREC (2008)
Google Scholar
Adler, M., Elhadad, M.: An Unsupervised Morpheme-Based HMM for Hebrew Morphological Disambiguation. In: Proceedings of COLING-ACL 2006 (2006)
Google Scholar
Agichtein, E., Gravano, L.: Snowball: Extracting relations from large plain-text collections. In: Proceedings of the Fifth ACM Conference on Digital Libraries. ACM (2000)
Google Scholar
Alfonseca, E., Filippova, K., Delort, J.-Y., Garrido, G.: Pattern learning for relation extraction with a hierarchical topic model. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers, vol. 2, pp. 54–59. Association for Computational Linguistics (2012)
Google Scholar
Banko, M., Etzioni, O., Center, T.: The Tradeoffs Between Open and Traditional Relation Extraction. In: ACL, vol. 8, pp. 28–36 (2008)
Google Scholar
Barker, K., Szpakowicz, S.: Semi-automatic recognition of noun modifier relationships. In: Proceedings of the 17th International Conference on Computational Linguistics, vol. 1. Association for Computational Linguistics (1998)
Google Scholar
Blair-Goldensohn, S., Hannan, K., McDonald, R., Neylon, T., Reis, G.A., Reynar, J.: Building a sentiment summarizer for local service reviews. In: WWW Workshop on NLP in the Information Explosion Era (NLPIX), New York, NY, USA. ACM (2008)
Google Scholar
Buchholz, S., Marsi, E.: CoNLL-X Shared Task on Multilingual Dependency Parsing. In: Proceedings of the Tenth Conference on Computational Natural Language Learning (CoNLL-X), New York, NY (2006)
Google Scholar
De Marneffe, M.-C., MacCartney, B., Manning, C.D.: Generating typed dependency parses from phrase structure parses. In: Proceedings of LREC, vol. 6 (2006)
Google Scholar
Etzioni, O., Banko, M., Soderland, S., Weld, D.S.: Open information extraction from the web. Communications of the ACM 51(12), 68–74 (2008)
Article Google Scholar
Foster, J., Çetinoglu, Ö., Wagner, J., Le Roux, J., Hogan, S., Nivre, J., Hogan, D., Van Genabith, J.: # hard-to-parse: POS Tagging and Parsing the Twitterverse. In: Proceedings of the Workshop on Analyzing Microtext (AAAI 2011), pp. 20–25 (2011)
Google Scholar
Goldberg, Y., Adler, M., Elhadad, M.: EM Can Find Pretty Good HMM POS-Taggers (When Given a Good Start). ACL (2008)
Google Scholar
Yoav, G., Elhadad, M.: Hebrew Dependency Parsing: Initial Results. In: Proceedings of IWPT (2009)
Google Scholar
Goldberg, Y., Elhadad, M.: Easy first dependency parsing of modern Hebrew. In: Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages. Association for Computational Linguistics (2010)
Google Scholar
Guthmann, N., Krymolowski, Y., Milea, A., Winter, Y.: Automatic annotation of morpho-syntactic dependencies in a modern Hebrew treebank. In: Proceedings of TLT (2009)
Google Scholar
Hatzivassiloglou, V., Wiebe, J.M.: Effects of adjective orientation and gradability on sentence subjectivity. In: Proceedings of the 18th Conference on Computational Linguistics, vol. 1. Association for Computational Linguistics (2000)
Google Scholar
Kim, S.N., Nakov, P.: Large-scale noun compound interpretation using bootstrapping and the Web as a corpus. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics (2011)
Google Scholar
Kübler, S.: The PaGe 2008 shared task on parsing German. In: Proceedings of the Workshop on Parsing German, pp. 55–63. Association for Computational Linguistics (2008)
Google Scholar
Lin, D.: MINIPAR: a minimalist parser. Maryland linguistics colloquium (1999)
Google Scholar
Marcus, M.P., Marcinkiewicz, M.A., Santorini, B.: Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics 19(2), 313–330 (1993)
Google Scholar
Minack, E., Demartini, G., Nejdl, W.: Current approaches to search result di-versification. In: First International Workshop on Living Web: Making Web Diversity a True Asset, Washington DC (2009)
Google Scholar
Oommen, T., Baise, L.G., Vogel, R.M.: Sampling bias and class imbalance in maximum-likelihood logistic regression. Mathematical Geosciences 43(1), 99–120 (2011)
Article MATH Google Scholar
Quinlan, J.R.: Induction of decision trees. Machine Learning 1, 81–106 (1986)
Article Google Scholar
Rigoutsos, I., Floratos, A.: Combinatorial pattern discovery in biological sequences: The TEIRESIAS algorithm. Bioinformatics 14(2), 229 (1998)
Google Scholar
Rokach, Romano, Maimon: Negation recognition in medical narrative reports. Information Retrieval 11(6), 499–538 (2008)
Article Google Scholar
Swanson, B.: Exploring Syntactic Representations for Native Language Identification. In: NAACL/HLT 2013, p. 146 (2013)
Google Scholar
Tsarfaty, R., Seddah, D., Goldberg, Y., Kübler, S., Candito, M., Foster, J., Versley, Y., Rehbein, I., Tounsi, L.: Statistical parsing of morphologically rich languages (SPMRL): what, how and whither. In: Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages, pp. 1–12. Association for Computational Linguistics (2010)
Google Scholar
Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann (2005)
Google Scholar
Xu, Y., Kim, M.-Y., Quinn, K., Goebel, R., Barbosa, D.: Open Information Extraction with Tree Kernels. In: Proceedings of NAACL-HLT, pp. 868–877 (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

Ben-Gurion University of the Negev, Israel
Nir Ofek & Lior Rokach
The Pennsylvania State University, USA
Prasenjit Mitra

Authors

Nir Ofek
View author publications
You can also search for this author in PubMed Google Scholar
Lior Rokach
View author publications
You can also search for this author in PubMed Google Scholar
Prasenjit Mitra
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Computing Research, National Polytechnic Institute, Av. Juan Dios Bátiz, Col. Nueva Industrial Vallejo, 07738, Mexico D.F., Mexico
Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ofek, N., Rokach, L., Mitra, P. (2014). Methodology for Connecting Nouns to Their Modifying Adjectives . In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2014. Lecture Notes in Computer Science, vol 8403. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54906-9_22

Download citation

DOI: https://doi.org/10.1007/978-3-642-54906-9_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54905-2
Online ISBN: 978-3-642-54906-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics