Towards Slovak Broadcast News Automatic Recording and Transcribing Service

  • Conference paper
Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5042)

Abstract

Information is one of the most valuable commodities today, and retrieving information from broadcast news recordings is becoming one of the services most requested by end users. The planned Slovak automatic broadcast news (BN) processing service provides automatic transcription and metadata extraction, enabling users to obtain information from the processed recordings through a web interface and a search engine. The resulting information is then presented through a multimodal interface, which allows users not only to view the recorded audio-visual material, but also to see all automatically extracted metadata (verbal and nonverbal) and to select data that was automatically identified incorrectly. The architecture of the present system is linear, which means that each module starts only after the previous one has finished processing the data.
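The linear architecture described in the abstract can be illustrated with a minimal sketch. The stage names (`record`, `transcribe`, `extract_metadata`), the dictionary passed between them, and the `run_linear` helper are all hypothetical placeholders chosen for illustration; the abstract does not name the actual modules or their interfaces. The sketch shows only the ordering property: a stage begins after the previous one has returned.

```python
from typing import Callable, Dict, List, Tuple

# A stage receives everything produced so far and returns an updated copy.
Stage = Callable[[Dict[str, object]], Dict[str, object]]


def record(data: Dict[str, object]) -> Dict[str, object]:
    # Hypothetical stand-in for capturing a broadcast.
    return {**data, "audio": "<recorded audio>"}


def transcribe(data: Dict[str, object]) -> Dict[str, object]:
    # Hypothetical stand-in; it needs the finished recording.
    return {**data, "transcript": f"transcript of {data['audio']}"}


def extract_metadata(data: Dict[str, object]) -> Dict[str, object]:
    # Hypothetical stand-in; it needs the finished transcript.
    return {**data, "metadata": {"has_transcript": "transcript" in data}}


def run_linear(
    stages: List[Tuple[str, Stage]], data: Dict[str, object]
) -> Tuple[Dict[str, object], List[str]]:
    """Run stages strictly one after another and log the order of events."""
    log: List[str] = []
    for name, stage in stages:
        log.append(f"start {name}")
        data = stage(data)  # blocks until this stage has finished
        log.append(f"finish {name}")
    return data, log


# Assumed ordering of the placeholder stages.
PIPELINE: List[Tuple[str, Stage]] = [
    ("record", record),
    ("transcribe", transcribe),
    ("extract_metadata", extract_metadata),
]

if __name__ == "__main__":
    result, events = run_linear(PIPELINE, {"source": "evening news"})
    print(events)
    print(result["metadata"])
```

A design of this kind is simple to reason about, since each stage sees the complete output of its predecessor, at the cost of total latency being the sum of all stage times.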

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pleva, M., Čižmár, A., Juhár, J., Ondáš, S., Mirilovič, M. (2008). Towards Slovak Broadcast News Automatic Recording and Transcribing Service. In: Esposito, A., Bourbakis, N.G., Avouris, N., Hatzilygeroudis, I. (eds) Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction. Lecture Notes in Computer Science, vol 5042. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70872-8_12

  • DOI: https://doi.org/10.1007/978-3-540-70872-8_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-70871-1

  • Online ISBN: 978-3-540-70872-8

  • eBook Packages: Computer Science, Computer Science (R0)
