Abstract
Adjectives are words that describe or modify other elements in a sentence. As such, they are frequently used to convey facts and opinions about the nouns they modify. Connecting nouns to their corresponding adjectives is vital for intelligent tasks such as aspect-level sentiment analysis or the interpretation of complex queries (e.g., "small hotel with large rooms") for fine-grained information retrieval. To address this need, we propose a methodology that identifies dependencies between nouns and adjectives by examining syntactic clues in part-of-speech sequences that help recognize such relationships. These sequences are generalized into patterns that are used to train a binary classifier using machine learning methods. The capabilities of the new method are demonstrated in two syntactically different languages: English, the leading language of international discourse, and Hebrew, whose rich morphology poses additional challenges for parsing. In each language we compare our method with a designated state-of-the-art parser and show that it performs similarly in terms of accuracy while (a) using a simple and relatively small training set, (b) requiring no language-specific adaptation, and (c) remaining robust across a variety of writing styles.
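To make the idea of part-of-speech sequences concrete, the following is a minimal sketch of how candidate noun-adjective pairs and the tag sequence spanning them might be collected from a tagged sentence. It is not the authors' implementation: the function name `candidate_pairs`, the `max_gap` window, and the Penn-Treebank-style tags assigned to the example query are all assumptions made for illustration, and the generalization of sequences into patterns and the classifier training are omitted.

```python
from typing import List, Tuple

# Assumed Penn-Treebank-style tag prefixes; the tag set used in the paper may differ.
NOUN_PREFIX = "NN"
ADJ_PREFIX = "JJ"


def candidate_pairs(
    tagged: List[Tuple[str, str]], max_gap: int = 4
) -> List[Tuple[str, str, str]]:
    """Return (noun, adjective, pos_sequence) for every noun-adjective pair.

    pos_sequence is the run of POS tags from the leftmost to the rightmost
    word of the pair, inclusive. A binary classifier could later decide,
    from such sequences, whether the adjective actually modifies the noun.
    max_gap is a hypothetical limit on the distance between the two words.
    """
    pairs = []
    for i, (noun, noun_tag) in enumerate(tagged):
        if not noun_tag.startswith(NOUN_PREFIX):
            continue
        for j, (adj, adj_tag) in enumerate(tagged):
            if not adj_tag.startswith(ADJ_PREFIX) or abs(i - j) > max_gap:
                continue
            lo, hi = min(i, j), max(i, j)
            pos_sequence = " ".join(tag for _, tag in tagged[lo:hi + 1])
            pairs.append((noun, adj, pos_sequence))
    return pairs


# The example query from the abstract, with hand-assigned (assumed) tags.
sentence = [
    ("small", "JJ"),
    ("hotel", "NN"),
    ("with", "IN"),
    ("large", "JJ"),
    ("rooms", "NNS"),
]
pairs = candidate_pairs(sentence)
for noun, adj, seq in pairs:
    print(f"{noun} <- {adj}: {seq}")
```

In this sketch every noun-adjective pair within the window is a candidate, including incorrect ones such as "rooms" with "small"; separating true from false pairs is the job of the classifier described in the abstract.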
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ofek, N., Rokach, L., Mitra, P. (2014). Methodology for Connecting Nouns to Their Modifying Adjectives. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2014. Lecture Notes in Computer Science, vol 8403. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54906-9_22
DOI: https://doi.org/10.1007/978-3-642-54906-9_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54905-2
Online ISBN: 978-3-642-54906-9
eBook Packages: Computer Science, Computer Science (R0)