Prediction of Major Phrase Boundary Location and Pause Insertion Using a Stochastic Context-free Grammar

  • Chapter
Computing Prosody

Abstract

In this paper, we present models for predicting major phrase boundary locations and pause insertion from an input part-of-speech (POS) sequence using a stochastic context-free grammar (SCFG). The two models share a common design because major phrase boundaries and pauses have similar characteristics. In both models, word attributes and left/right-branching probability parameters, which represent stochastic phrasing characteristics, serve as inputs to a feed-forward neural network that makes the prediction. To obtain the probabilities, major phrase characteristics and pause characteristics are first learned by training the SCFG with the inside-outside algorithm. The probability of each bracketing structure is then computed using the SCFG. Experiments were carried out to confirm the effectiveness of these stochastic models for predicting major phrase boundary locations and pause locations. In a test on unseen data, 92.9% of the major phrase boundaries were correctly predicted with a 16.9% false insertion rate. For pause prediction on unseen data, 85.2% of the pause boundaries were correctly predicted with a 9.1% false insertion rate.
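
To make the idea of computing bracketing probabilities from an SCFG over a POS sequence more concrete, the following is a minimal sketch of the inside pass of the inside-outside algorithm for a grammar in Chomsky normal form. It is an illustration only, not the chapter's implementation: the toy grammar, its rule probabilities, the tag names, and the function names `inside_probabilities` and `top_split_shares` are all assumptions introduced here. In particular, the "split share" computed below is a simple stand-in for a branching preference and is not claimed to match the left/right-branching parameters defined in the chapter.

```python
from collections import defaultdict


def inside_probabilities(pos_tags, lexical, binary):
    """Inside pass for an SCFG in Chomsky normal form.

    lexical: {(nonterminal, pos_tag): prob} for rules A -> tag
    binary:  {(A, B, C): prob} for rules A -> B C
    Returns chart with chart[(i, j)][A] = P(A derives pos_tags[i..j]).
    """
    n = len(pos_tags)
    chart = defaultdict(lambda: defaultdict(float))
    # Width-1 spans: apply the lexical rules to each POS tag.
    for i, tag in enumerate(pos_tags):
        for (nt, terminal), p in lexical.items():
            if terminal == tag:
                chart[(i, i)][nt] += p
    # Wider spans: sum over every split point and every binary rule.
    for width in range(2, n + 1):
        for i in range(n - width + 1):
            j = i + width - 1
            for k in range(i, j):
                left, right = chart[(i, k)], chart[(k + 1, j)]
                for (a, b, c), p in binary.items():
                    if b in left and c in right:
                        chart[(i, j)][a] += p * left[b] * right[c]
    return chart


def top_split_shares(chart, binary, n, start="S"):
    """Share of the sentence probability contributed by each top-level split.

    shares[k] is the fraction of P(sentence) in which the start symbol
    divides the sentence between positions k and k + 1 (hypothetical
    illustration of a bracketing preference).
    """
    total = chart[(0, n - 1)][start]
    shares = {}
    for k in range(n - 1):
        mass = 0.0
        for (a, b, c), p in binary.items():
            if a == start:
                mass += p * chart[(0, k)].get(b, 0.0) * chart[(k + 1, n - 1)].get(c, 0.0)
        shares[k] = mass / total if total > 0 else 0.0
    return shares


# Toy grammar (assumed values): one nonterminal S whose rule
# probabilities sum to 1.
toy_lexical = {("S", "noun"): 0.3, ("S", "particle"): 0.2, ("S", "verb"): 0.1}
toy_binary = {("S", "S", "S"): 0.4}
toy_tags = ["noun", "particle", "verb"]

toy_chart = inside_probabilities(toy_tags, toy_lexical, toy_binary)
toy_shares = top_split_shares(toy_chart, toy_binary, len(toy_tags))
print(toy_chart[(0, 2)]["S"], toy_shares)
```

In a pipeline like the one the abstract describes, quantities of this kind would be computed per word boundary and passed, together with word attributes, to a feed-forward network. The rule probabilities themselves would come from inside-outside training rather than being set by hand as in this toy example.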

Copyright information

© 1997 Springer-Verlag New York, Inc.

About this chapter

Cite this chapter

Fujio, S., Sagisaka, Y., Higuchi, N. (1997). Prediction of Major Phrase Boundary Location and Pause Insertion Using a Stochastic Context-free Grammar. In: Sagisaka, Y., Campbell, N., Higuchi, N. (eds) Computing Prosody. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-2258-3_17

  • DOI: https://doi.org/10.1007/978-1-4612-2258-3_17

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4612-7476-6

  • Online ISBN: 978-1-4612-2258-3

  • eBook Packages: Springer Book Archive
