Information Retrieval from Spoken Documents

Fapšo, Michal; Smrž, Pavel; Schwarz, Petr; Szöke, Igor; Schwarz, Milan; Černocký, Jan; Karafiát, Martin; Burget, Lukáš

doi:10.1007/11671299_43

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 3878)

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

Abstract

This paper describes a designed and implemented system for efficient storage, indexing and search in collections of spoken documents that takes advantage of automatic speech recognition. Because current speech recognizers are not accurate enough for many applications, it is necessary to index the ambiguous output of recognition, i.e. the acyclic graphs of word hypotheses (recognition lattices). As a result, the standard methods known from text-based systems cannot be applied directly. The paper discusses an optimized indexing system, developed by our group, for efficient search in this large and complex data structure. The search engine works as a server. The meeting browser JFerret, developed within the European AMI project, is used as a client to browse search results.
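To make the idea of indexing lattice output concrete, the following is a minimal sketch, not the authors' implementation. It assumes each lattice arc has already been flattened into a record with a document identifier, a word, start and end times, and a confidence score; the names `Hypothesis`, `build_index` and `search` are hypothetical. Unlike a text index, every competing word hypothesis is stored, so a query can still find a word that the single best transcript would have missed.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Hypothesis:
    """One word arc of a recognition lattice (hypothetical record layout)."""
    doc_id: str
    word: str
    start: float       # start time in seconds
    end: float         # end time in seconds
    confidence: float  # assumed score in [0, 1]; higher is more reliable


def build_index(hypotheses):
    """Build an inverted index mapping each word to its lattice hypotheses."""
    index = defaultdict(list)
    for h in hypotheses:
        index[h.word.lower()].append(h)
    # Keep the most confident hypotheses first in every posting list.
    for postings in index.values():
        postings.sort(key=lambda h: h.confidence, reverse=True)
    return index


def search(index, word, min_confidence=0.0):
    """Return hypotheses for `word` whose confidence reaches the threshold."""
    return [h for h in index.get(word.lower(), [])
            if h.confidence >= min_confidence]


# Toy data: two competing hypotheses for the same time span in one document.
lattice = [
    Hypothesis("meeting1", "speech", 1.0, 1.4, 0.9),
    Hypothesis("meeting1", "beach", 1.0, 1.4, 0.1),
    Hypothesis("meeting2", "speech", 7.2, 7.7, 0.6),
]
index = build_index(lattice)
hits = search(index, "speech", min_confidence=0.5)
```

The confidence threshold is one simple way to trade recall for precision. A real system working on full lattices would also need to handle multi-word queries and the time overlap between arcs, which this sketch leaves out.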

This work was partly supported by European project AMI (Augmented Multi-party Interaction, FP6-506811) and Grant Agency of Czech Republic under project No. 102/05/0278. Pavel Smrž was supported by MŠMT Research Plan MSM 6383917201. The hardware used in this work was partially provided by CESNET under project No. 119/2004.

Author information

Authors and Affiliations

Faculty of Information Technology, Brno University of Technology, Božetěchova 2, 612 66, Brno, Czech Republic
Michal Fapšo, Pavel Smrž, Petr Schwarz, Igor Szöke, Milan Schwarz, Jan Černocký, Martin Karafiát & Lukáš Burget

Editor information

Editors and Affiliations

National Polytechnic Institute, Center for Computing Research, 07738, Mexico City, México
Alexander Gelbukh

About this paper

Cite this paper

Fapšo, M. et al. (2006). Information Retrieval from Spoken Documents. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2006. Lecture Notes in Computer Science, vol 3878. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11671299_43

DOI: https://doi.org/10.1007/11671299_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32205-4
Online ISBN: 978-3-540-32206-1
eBook Packages: Computer Science, Computer Science (R0)
