Skip to main content

On Heads and Coordination in Valence Acquisition

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2007)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4394))

Abstract

The aim of this paper is to present the design of a partial syntactic annotation of the IPI PAN Corpus of Polish [22] and the corresponding extension of the corpus search engine Poliqarp [25,12] developed at the Institue of Computer Science PAS and currently employed in Polish and Portuguese corpora projects. In particular, we will argue for the need to distinguish between, and represent both, syntactic and semantic heads, and we will sketch the representation of coordination, the area traditionally controversial both in theoretical and in computational linguistics. The annotation is designed in a way intended to maximise the usefulness of the resulting corpus for the task of automatic valence acquisition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Barreto, F., Branco, A., Ferreira, E., Mendes, A., Nascimento, M.F., Nunes, F., Silva, J.: Open resources and tools for the shallow processing of Portuguese: The TagShare project. In: Proceedings of LREC (2006)

    Google ScholarĀ 

  2. Beavers, J., Sag, I.A.: Coordinate ellipsis and apparent non-constitutent coordination. In: MĆ¼ller, S. (ed.) Proceedings of the HPSG04 Conference, pp. 48ā€“69. CSLI Publications, Stanford (2004)

    Google ScholarĀ 

  3. Bloomfield, L.: Language. Holt, New York (1933)

    Google ScholarĀ 

  4. BƶhmovĆ”, A., Hajič, J., HajičovĆ”, E., HladkĆ”, B.: The Prague Dependency Treebank: Three-level annotation scenario. In: AbeillĆ©, A. (ed.) Treebanks: Building and Using Parsed Corpora, pp. 103ā€“127. Kluwer, Dordrecht (2003)

    ChapterĀ  Google ScholarĀ 

  5. Christ, O.: A modular and flexible architecture for an integrated corpus query system. In: COMPLEXā€™94, Budapest (1994)

    Google ScholarĀ 

  6. Covington, M.A.: A 700-year-old argument for a syntactic transformation. http://www.ai.uga.edu/mc/trans700.html

  7. Fast, J., PrzepiĆ³rkowski, A.: Automatic extraction of Polish verb subcategorization: An evaluation of common statistics. In: Vetulani, Z. (ed.) Proceedings of the 2nd Language & Technology Conference, PoznaƱ, Poland, pp. 191ā€“195 (2005)

    Google ScholarĀ 

  8. Fillmore, C.J., Baker, C.F., Sato, H.: Seeing arguments through transparent structures. In: Proceedings of LREC 2002, Las Palmas, Canary Islands, Spain, pp. 787ā€“791. ELRA (2002)

    Google ScholarĀ 

  9. Fillmore, C.J., Johnson, C.R., Petruck, M.R.L.: Background to FrameNet. International Journal of LexicographyĀ 16(3), 235ā€“250 (2003)

    ArticleĀ  Google ScholarĀ 

  10. Huang, C.-R., Keh-Jiann, C., Feng-Yi, C., Keh-Jiann, C., Zhao-Ming, G., Kuang-Yu, C.: Sinica treebank: Design criteria, annotation guidelines, and on-line interface. In: Proceedings of 2nd Chinese Language Processing Workshop (Held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics, ACL-2000), Hong Kong, pp. 29ā€“37 (2000)

    Google ScholarĀ 

  11. Ide, N., Bonhomme, P., Romary, L.: XCES: An XML-based standard for linguistic corpora. In: Proceedings of the Linguistic Resources and Evaluation Conference, Athens, Greece, pp. 825ā€“830 (2000)

    Google ScholarĀ 

  12. Janus, D., PrzepiĆ³rkowski, A.: Poliqarp 1.0: Some technical aspects of a linguistic search engine for large corpora. In: WaliƱski, J., Kredens, K., GoÅŗdÅŗ-Roszkowski, S. (eds.) The proceedings of Practical Applications of Linguistic Corpora 2005, Peter Lang, Frankfurt am Main (2006)

    Google ScholarĀ 

  13. Kallas, K.: Składnia wspĆ³Å‚czesnych polskich konstrukcji wspĆ³Å‚rzČ©dnych. Wydawnictwo Uniwersytetu Mikołaja Kopernika, Toruń (1993)

    Google ScholarĀ 

  14. Kosek, I.: Przyczasownikowe frazy przyimkowo-nominalne wĀ zdaniach wspĆ³Å‚czesnego jČ©zyka polskiego. Wydawnictwo Uniwersytetu Warmińsko-Mazurskiego, Olsztyn (1999)

    Google ScholarĀ 

  15. Lezius, W.: TIGERSearch ā€” ein Suchwerkzeug fĆ¼r Baumbanken. In: Busemann, S. (ed.) Proceedings der 6.Ā Konferenz zur Verarbeitung natĆ¼rlicher Sprache (KONVENS 2002), SaarbrĆ¼cken (2002)

    Google ScholarĀ 

  16. Melā€™Äuk, I.A.: Levels of dependency in linguistic description: concepts and problems. In: ƀgel, V., Eichinger, L., Eroms, H.-W., Hellwig, P., Heringer, H.-J., Lobin, H. (eds.) Dependenz und Valenz: Ein Internationales Handbuch Der Zeitgenƶsischen Forschung, pp. 188ā€“229. De Gruyter, Berlin (2003)

    Google ScholarĀ 

  17. Monz, C., de Rijke, M.: Tequesta: The University of Amsterdamā€™s texual question answering system. In: Proceedings of Tenth Text Retrieval Conference (TREC-10), pp. 513ā€“522 (2001)

    Google ScholarĀ 

  18. Nivre, J.: Theory-supporting treebanks. In: Nivre, J., Hinrichs, E. (eds.) Proceedings of the Second Workshop on Treebanks and Linguistic Theories (TLT2003), VƤxjƶ, Norway, pp. 117ā€“128 (2003)

    Google ScholarĀ 

  19. Pollard, C., Sag, I.A.: Information-Based Syntax and Semantics, vol. 1: Fundamentals. CSLI Lecture Notes, vol.Ā 13. CSLI Publications, Stanford (1987)

    Google ScholarĀ 

  20. Pollard, C., Sag, I.A.: Head-driven Phrase Structure Grammar. Chicago University Press, Chicago (1994)

    Google ScholarĀ 

  21. PrzepiĆ³rkowski, A.: Case Assignment and the Complement-Adjunct Dichotomy: A Non-Configurational Constraint-Based Approach. Ph. D. dissertation, UniversitƤt TĆ¼bingen (1999)

    Google ScholarĀ 

  22. Adam PrzepiĆ³rkowski. The IPI PAN Corpus: Preliminary version. Institute of Computer Science, Polish Academy of Sciences, Warsaw (2004)

    Google ScholarĀ 

  23. PrzepiĆ³rkowski, A.: On heads and coordination in a partial treebank. In: Hajič, J., Nivre, J. (eds.) Proceedings of the TLT 2006, Prague, pp. 163ā€“174 (2006)

    Google ScholarĀ 

  24. PrzepiĆ³rkowski, A., Fast, J.: Baseline experiments in the extraction of Polish valence frames. In: Kłopotek, M.A., Wierzchoń, S.T., Trojanowski, K. (eds.) Intelligent Information Processing and Web Mining, Advances in Soft Computing, pp. 511ā€“520. Springer, Berlin (2005)

    ChapterĀ  Google ScholarĀ 

  25. Przepiā€™orkowski, A., Krynicki, Z., DĆŖbowski, Ł., Woliński, M., Janus, D., Bański, P.: A search tool for corpora with positional tagsets and ambiguities. In: Proceedings of LREC 2004, Lisbon, pp. 1235ā€“1238. ELRA (2004)

    Google ScholarĀ 

  26. PrzepiĆ³rkowski, A., Woliński, M.: AĀ flexemic tagset for Polish. In: Proceedings of Morphological Processing of Slavic Languages, EACLĀ 2003, Budapest, pp. 33ā€“40 (2003)

    Google ScholarĀ 

  27. PrzepiĆ³kowski, A., Woliński, M.: The unbearable lightness of tagging: A case study in morphosyntactic tagging of Polish. In: Proceedings of the LINC-03, EACLĀ 2003, pp. 109ā€“116 (2003)

    Google ScholarĀ 

  28. Sag, I.A., Gazdar, G., Wasow, T., Weisler, S.: Coordination and how to distinguish categories. Natural Language and Linguistic TheoryĀ 3, 117ā€“171 (1985)

    ArticleĀ  Google ScholarĀ 

  29. Saloni, Z., Świdziński, M.: Składnia wspĆ³Å‚czesnego jČ©zyka polskiego, 4th (changed) edn. Wydawnictwo Naukowe PWN, Warsaw (1998)

    Google ScholarĀ 

  30. Sgall, P., HajičovĆ”, E., PanevovĆ”, J.: The Meaning of the Sentence in Its Semantic and Pragmatic Aspects. Reidel, Dordrecht (1986)

    Google ScholarĀ 

  31. Silberztein, M.: Finite-state description of the French determiner system. French Language StudiesĀ 13, 221ā€“246 (2003)

    ArticleĀ  Google ScholarĀ 

  32. M. Świdziński. Realizacje zdaniowe podmiotu-mianownika, czyli o strukturalnych ograniczeniach selekcyjnych. In: A. Markowski (ed.) Opisać słowa, pp. 188ā€“201. Dom Wydawniczy Elipsa (1992)

    Google ScholarĀ 

  33. TesniĆØre, L.: ƉlĆ©ments de Syntaxe Structurale. Klincksieck, Paris (1959)

    Google ScholarĀ 

  34. Watson, R., Carroll, J., Briscoe, T.: Efficient extraction of grammatical relations. In: Proceedings of the Ninth International Workshop on Parsing Technology, Vancouver, British Columbia, pp. 160ā€“170. Association for Computational Linguistics (2005)

    Google ScholarĀ 

  35. Wright, A., Kathol, A.: When a head is not a head: A constructional approach to exocentricity in English. In: Kim, J.-B., Wechsler, S. (eds.) Proceedings of the 9th International Conference on Head-Driven Phrase Structure Grammar, pp. 373ā€“389. CSLI Publications, Stanford (2003)

    Google ScholarĀ 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

PrzepiĆ³rkowski, A. (2007). On Heads and Coordination in Valence Acquisition. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2007. Lecture Notes in Computer Science, vol 4394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70939-8_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-70939-8_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-70938-1

  • Online ISBN: 978-3-540-70939-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics