Abstract
The paper describes the corpus of Russian spontaneous monologues and the results of its expert manual annotation. The corpus is balanced with respect to speakers’ social characteristics and a text genre. The analysis of manual labelling of transcriptions reveals experts’ disagreement in sentence boundary detection. The paper demonstrates that labelled boundaries may have different status. We also show that speakers’ social characteristics (gender and speech usage) and a text genre influence inter-labeller agreement.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Vannikov, Y., Abdalyan, I.: Eksperimental’noe issledovanie chleneniya razgovornoj rechi na diskretnye intonacionno-smyslovye edinicy (frazy). In: Sirotinina, O.B., Barannikova, L.I., Serdobintsev, L.Y. (eds.) Russkaya Razgovornaya Rech, Saratov, pp. 40–46 (1973) (in Russian)
Kibrik, A.A.: Est’ li predlozhenie v russkoj rechi? In: Arkhipov, A.V., Zakharov, L.M., Kibrik, A.A., et al. (eds.) Phonetics and Non-phonetics: For the 70th Birthday of Sandro V. Kodzasov, pp. 104–115. Jazyki slavjanskih kul’tur, Moscow (2008) (in Russian)
Chistikov, P., Khomitsevich, O.: Online Automatic Sentence Boundary Detection in a Russian ASR System. In: Potapova, R.K. (ed.) SPECOM 2011. The 14th International Conference “Speech and Computer”, Kazan, Russia, September 27-30, pp. 112–117 (2011)
Skrebnev, Y.M.: Vvedenie v kollokvialistiku. Izdatel’stvo Saratovskogo universiteta, Saratov (1985) (in Russian)
Kibrik, A.A., Podlesskaya, V.I. (eds.): Night Dream Stories: A Corpus Study of Spoken Russian Discourse. Jazyki slavjanskih kul’tur, Moscow (2009) (in Russian)
Nasukawa, T., Punjani, D., Roy, S., Subramaniam, L.V., Takeuchi, H.: Adding Sentence Boundaries to Conversational Speech Transcriptions using Noisily Labelled Examples. In: AND 2007, pp. 71–78 (2007)
Gotoh, Y., Renals, S.: Sentence Boundary Detection in Broadcast Speech Transcripts. In: Proceedings of the International Speech Communication Association (ISCA) Workshop: Automatic Speech Recognition: Challenges for the New Millenium (ASR 2000), Paris, France, September 18-20, pp. 228–235 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Stepikhov, A. (2013). Analysis of Expert Manual Annotation of the Russian Spontaneous Monologue: Evidence from Sentence Boundary Detection. In: Železný, M., Habernal, I., Ronzhin, A. (eds) Speech and Computer. SPECOM 2013. Lecture Notes in Computer Science(), vol 8113. Springer, Cham. https://doi.org/10.1007/978-3-319-01931-4_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-01931-4_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01930-7
Online ISBN: 978-3-319-01931-4
eBook Packages: Computer ScienceComputer Science (R0)