Skip to main content

The ORD Speech Corpus of Russian Everyday Communication “One Speaker’s Day”: Creation Principles and Annotation

  • Conference paper
Text, Speech and Dialogue (TSD 2009)

Abstract

The main aim of the ORD speech corpus is to fix Russian spontaneous speech in natural communicative situations. The corpus presents the unique linguistic material, allowing to perform fundamental research in many scientific aspects and to solve different practical tasks, especially in speech technologies. The paper concerns methodology and description of the ORD corpus creating and presents the system of annotations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Asinovsky, A.S., Arkhipova, E.A., Bogdanova, N.V., Rusakova, M.V., Ryko, A.I., Stepanova, S.B., Sherstinova, T.Y.: Polevaya lingvisticheskaya praktika. Uchebno-metodicheskij kompleks slozhnoj struktury. Chast’ 1. Teoreticheskie osnovy i metodika sbora lingvisticheskikh dannykh dl’a predstavlenia ikh v linguisticheskom korpuse russkogo yazyka. St. Petersburg (2007)

    Google Scholar 

  2. Asinovsky, A.S., Bogdanova, N.V., Rusakova, M.V., Stepanova, S.B., Sherstinova, T.Y.: Zvukovoj korpus russkogo yazyka povsednevnogo obschenia “Odin rechevoj den”: koncepcia i sosytoyanie formirovania. In: Kompjuternaya lingvistika i intellektualnye tekhnologii. Vypusk, Moscow. Po materialam mezhd. konferencii “Dialog”, vol. 7 (14), pp. 488–494 (2008)

    Google Scholar 

  3. Asinovsky, A.S., Koroleva, I.V., Rusakova, M.V., Ryko, A.I., Philippova, N.S., Stepanova, S.B.: On Integral Multilevel Annotation of a Spoken Russian Corpus. In: Proc. the XIIth International Conference “Speech and Computer” SPECOM 2007, Moscow (2007)

    Google Scholar 

  4. Bogdanova, N.V.: Allegrovye formy russkoj rechi: ot proiznositel’noj redukcii k pis’mennoj fiksacii i leksikalizacii v yazyke. Mat-ly XXXVII mezhd. filologicheskoj konferencii. Vypusk 18. “Fonetika”. St. Petersburg (2008)

    Google Scholar 

  5. ELAN - Linguistic Annotator. Version 3.6, http://www.mpi.nl/corpus/manuals/manual-elan.pdf

  6. Koroleva, I.V.: Individual’nye sostoyania i svoistva yazykovoj lichnosti: vliyanie na lingvisticheskuju strukturu vyskazyvanij. Mat-ly XXXVII mezhd. filologicheskoj konferencii. Vypusk 21. St. Petersburg. pp. 36–45 (2008)

    Google Scholar 

  7. Markasova, E.V.: Ritoricheskaya enantiosemia v korpuse russkogo yazyka povsednevnogo obschenia “Odin rechevoj den”. In: Kompjuternaya lingvistika i intellektualnye tekhnologii. Vypusk “Dialog”, Moscow, vol. 7(14), pp. 352–356 (2008)

    Google Scholar 

  8. Praat: Doing Phonetics by computer, http://www.praat.org

  9. Ryko, A.I., Stepanova, S.B.: Mnogourovnevaya lingvisticheskaya razmetka zvukovogo korpusa russkogo yazyka. In: Kompjuternaya lingvistika i intellektualnye tekhnologii. Vypusk. Po materialam mezhd. konferencii “Dialog”, Moscow, vol. 7 (14), pp. 460–465 (2008)

    Google Scholar 

  10. Ryko, A.I., Stepanova, S.B.: Problemy vychlenenia jedinic analiza spontannogo ustnogo teksta. In: Mat-ly XXXVII mezhd. filologicheskoj konferencii. Vypusk, St. Petersburg, vol. 21, pp. 71–80 (2008)

    Google Scholar 

  11. Sherstinova, T.Y.: “Odin rechevoj den” na vremennoj shkale: o perspektivakh issledovania dinamicheskikh processov na materiale zvukovogo korpusa. In: Vestnik Sankt-Peterburgskogo universiteta, Seria 9: Filologia, Vostokovedenie, Zhurnalistika, Chast’ 2, St. Petersburg, vol. 4, pp. 227–235 (2008)

    Google Scholar 

  12. The British National Corpus http://www.natcorp.ox.ac.uk/

  13. Zobnina, E.A.: Social’nye characteristiki govoriaschego: objektivnye dannye i ekspertnaya ocenka rechi (po materialam zvukovogo korpusa “Odin rechevoj den”. In: Mat-ly XXXVII mezhd. filologicheskoj konferencii. Vypusk, St. Petersburg, vol. 21, pp. 17–24 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Asinovsky, A., Bogdanova, N., Rusakova, M., Ryko, A., Stepanova, S., Sherstinova, T. (2009). The ORD Speech Corpus of Russian Everyday Communication “One Speaker’s Day”: Creation Principles and Annotation. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2009. Lecture Notes in Computer Science(), vol 5729. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04208-9_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-04208-9_36

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04207-2

  • Online ISBN: 978-3-642-04208-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics