Festival Framework for Synthesis of Punjabi Voice

Gill, Sukhpreet Kaur; Singh, Parminder

doi:10.1007/978-981-13-1217-5_41

Sukhpreet Kaur Gill⁵ &
Parminder Singh⁶

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 46))

1651 Accesses

Abstract

Speech generation is process of converting the text to speech called TTS system. This paper presents the generation of Punjabi waveform by using Festival framework. Building of new voice is done through support of Festvox, Festival and Speech Tools. Festival acts as an Engine for waveform synthesis and statistical parametric synthesis method is implemented for building Punjabi voice. The required data for building the voice is recorded in the noiseless environment and maximum Punjabi valid phonemes are covered in the corpus. Text Processing is done to collect the nice prompts to build the accurate voice Model. The accuracy factor is calculated through Mel-cepstral distortion (MCD) parameters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

B. Kumar, B. Chettri, Currents trends, frameworks and techniques used in speech synthesis-a survey. Int. J. Soft Comput. Eng. 2, 2231–2307 (2012)
Google Scholar
P. Taylor, A.W. Black, R. Caley, The architecture of the festival speech synthesis system, in Proceedings of ESCA workshop in Speech synthesis (1998), pp. 147–151
Google Scholar
K. Prahallad, N. Kumar, S. Rajendran, V. Keri, The-IIT-H indic speech databases, in Proceedings of Interspeech (Portland, Oregon, USA 2012)
Google Scholar
S. Roy, A technical guide to concatenative speech synthesis for hindi using festival. Int. J. Comput. Appl. 86(8), 30–34 (2014)
Google Scholar
A.W. Black, K. Lenzo, Building voices in the festival speech synthesis system (Cambridge, 1999), pp. 23–93 (for Festival version 1.3.1)
Google Scholar
K. Richmond, S. King, Multisyn: open-domain unit selection for the festival speech synthesis system. Sci. Direct 49(4), 317–330 (2007)
Google Scholar
N.P. Narendra, S.K. Rao, K. Ghosh, R.R. Vempada, S. Maity, Development of syllable-based text to speech synthesis system in Bengali. Int. J. Speech Technol. 14, 167–181 (2011)
Article Google Scholar
B.K. Rajan, V. Rijoy, D.P. Gopinath, N. George, Duration modeling for text to speech synthesis system using festival speech engine developed for malayalam language, in Proceedings of International Conference Circuit, Power and Computing Technologies (2015), pp. 1–5
Google Scholar
A.W. Black, CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling, in Proceedings of Interspeech (Pittsburgh, A, USA, 2006), pp. 1762–1765
Google Scholar
A.W. Black, C.L. Bennett, B.C. Blanchard, J. Kominek, B. Langner, K. Prahallad, A. Toth, CMU blizzard 2007: A hybrid acoustic unit selection system from statistically predicted parameters, in Proceedings of Blizzard challenge Workshop (Pittsburgh, PA, 2007)
Google Scholar
P. Singh, L. Singh, text to speech synthesis system for Punjabi language, in Proceedings of COLING 2012 (Mumbai, India 2012)
Google Scholar
S. Luthra, P. Singh, Punjabi speech generation system based on phonemes. Int. J. Comput. Appl. 49(13), 40–44 (2012)
Google Scholar
V. Goyal, G.S. Lehal, Evaluation of Hindi to Punjabi machine translation system. Int. J. Comput. Sci. Issues 4(1), 36–39 (2009)
Google Scholar
A.W. Black, K. Lenzo, Multilingual text–to-speech synthesis, in Proceedings of the ICASSP (Montreal, Canada, 2004)
Google Scholar
A.W. Black, K. Lenzo, Building synthetic voices, 7th edn. (Cambridge, 2014), pp. 147–148 (for FestVox 2)
Google Scholar
G.K. Sukhpreet, Generation of Punjabi speech using festival framework. M.Tech, Thesis submitted in Guru Nanak Dev Engineering College, Ludhiana, Punjab, India, 2016
Google Scholar

Download references

Author information

Authors and Affiliations

GNA University, Phagwara, India
Sukhpreet Kaur Gill
Guru Nanak Dev Engineering College, Ludhiana, India
Parminder Singh

Authors

Sukhpreet Kaur Gill
View author publications
You can also search for this author in PubMed Google Scholar
Parminder Singh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sukhpreet Kaur Gill .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, National Institute of Technical Teachers Training and Research, Chandigarh, India
C. Rama Krishna
Department of Educational Television Centre, National Institute of Technical Teachers Training and Research, Chandigarh, India
Maitreyee Dutta
Department of Computer Science and Engineering, National Institute of Technical Teachers Training and Research, Chandigarh, India
Rakesh Kumar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gill, S.K., Singh, P. (2019). Festival Framework for Synthesis of Punjabi Voice. In: Krishna, C., Dutta, M., Kumar, R. (eds) Proceedings of 2nd International Conference on Communication, Computing and Networking. Lecture Notes in Networks and Systems, vol 46. Springer, Singapore. https://doi.org/10.1007/978-981-13-1217-5_41

Download citation

DOI: https://doi.org/10.1007/978-981-13-1217-5_41
Published: 08 September 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1216-8
Online ISBN: 978-981-13-1217-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics