Abstract
Speech generation is process of converting the text to speech called TTS system. This paper presents the generation of Punjabi waveform by using Festival framework. Building of new voice is done through support of Festvox, Festival and Speech Tools. Festival acts as an Engine for waveform synthesis and statistical parametric synthesis method is implemented for building Punjabi voice. The required data for building the voice is recorded in the noiseless environment and maximum Punjabi valid phonemes are covered in the corpus. Text Processing is done to collect the nice prompts to build the accurate voice Model. The accuracy factor is calculated through Mel-cepstral distortion (MCD) parameters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
B. Kumar, B. Chettri, Currents trends, frameworks and techniques used in speech synthesis-a survey. Int. J. Soft Comput. Eng. 2, 2231–2307 (2012)
P. Taylor, A.W. Black, R. Caley, The architecture of the festival speech synthesis system, in Proceedings of ESCA workshop in Speech synthesis (1998), pp. 147–151
K. Prahallad, N. Kumar, S. Rajendran, V. Keri, The-IIT-H indic speech databases, in Proceedings of Interspeech (Portland, Oregon, USA 2012)
S. Roy, A technical guide to concatenative speech synthesis for hindi using festival. Int. J. Comput. Appl. 86(8), 30–34 (2014)
A.W. Black, K. Lenzo, Building voices in the festival speech synthesis system (Cambridge, 1999), pp. 23–93 (for Festival version 1.3.1)
K. Richmond, S. King, Multisyn: open-domain unit selection for the festival speech synthesis system. Sci. Direct 49(4), 317–330 (2007)
N.P. Narendra, S.K. Rao, K. Ghosh, R.R. Vempada, S. Maity, Development of syllable-based text to speech synthesis system in Bengali. Int. J. Speech Technol. 14, 167–181 (2011)
B.K. Rajan, V. Rijoy, D.P. Gopinath, N. George, Duration modeling for text to speech synthesis system using festival speech engine developed for malayalam language, in Proceedings of International Conference Circuit, Power and Computing Technologies (2015), pp. 1–5
A.W. Black, CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling, in Proceedings of Interspeech (Pittsburgh, A, USA, 2006), pp. 1762–1765
A.W. Black, C.L. Bennett, B.C. Blanchard, J. Kominek, B. Langner, K. Prahallad, A. Toth, CMU blizzard 2007: A hybrid acoustic unit selection system from statistically predicted parameters, in Proceedings of Blizzard challenge Workshop (Pittsburgh, PA, 2007)
P. Singh, L. Singh, text to speech synthesis system for Punjabi language, in Proceedings of COLING 2012 (Mumbai, India 2012)
S. Luthra, P. Singh, Punjabi speech generation system based on phonemes. Int. J. Comput. Appl. 49(13), 40–44 (2012)
V. Goyal, G.S. Lehal, Evaluation of Hindi to Punjabi machine translation system. Int. J. Comput. Sci. Issues 4(1), 36–39 (2009)
A.W. Black, K. Lenzo, Multilingual text–to-speech synthesis, in Proceedings of the ICASSP (Montreal, Canada, 2004)
A.W. Black, K. Lenzo, Building synthetic voices, 7th edn. (Cambridge, 2014), pp. 147–148 (for FestVox 2)
G.K. Sukhpreet, Generation of Punjabi speech using festival framework. M.Tech, Thesis submitted in Guru Nanak Dev Engineering College, Ludhiana, Punjab, India, 2016
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Gill, S.K., Singh, P. (2019). Festival Framework for Synthesis of Punjabi Voice. In: Krishna, C., Dutta, M., Kumar, R. (eds) Proceedings of 2nd International Conference on Communication, Computing and Networking. Lecture Notes in Networks and Systems, vol 46. Springer, Singapore. https://doi.org/10.1007/978-981-13-1217-5_41
Download citation
DOI: https://doi.org/10.1007/978-981-13-1217-5_41
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1216-8
Online ISBN: 978-981-13-1217-5
eBook Packages: EngineeringEngineering (R0)