Skip to main content

How a Full Account of Segmental Perception Depends on Prosody and Vice Versa

  • Conference paper
Structure and Process in Speech Perception

Part of the book series: Communication and Cybernetics ((COMMUNICATION,volume 11))

Abstract

The synthesis time-base of a synthetic precursor phrase is one factor determining the position of the phoneme boundary on a continuum of synthetic CV target syllables, which vary in the durational cue Voice Onset Times (VOT) and are introduced by the precursor. Reducing the time-base, thereby increasing the rate of speech in the precursor, increases the probability of the consonant in the target being perceived as voiceless. The results of such perceptual experiments are compared to those of a production study in which six adult male speakers of British English produced examples of CV syllables composed of the consonants /b,p,g,k/ and the vowels /i,a/ in a sentence frame. VOTs in productions of /p/ and /k/ were 20 milliseconds shorter at fast than at slow rates of speech, warranting a normalisation of VOT duration in perception like that actually obtained.

VOTs were longer in /ki/ than in /ka/. It has been suggested that voicing onset is retarded before high vowels because the oral constriction is reduced more slowly than before lower vowels, thus delaying the attainment of the transglottal pressure drop required for voicing. However, VOTs in /pi/ were shorter than those in /pa/. In the case of voiceless bilabials the aerodynamic factor appears to be outweighed by a mechanical influence of anticipatory co-articulation of tongue position which leads to larynx elevation and less vocal cord abduction before high vowels. The production result for velars, but not that for bilabials, is paralleled in perception where longer values of VOT are required for stops to be perceived as voiceless before /i/ compared to /a/ at all places of production. This failure of the perceptual process to follow exactly the constraints in production provides an illustration of a heuristic rather than an algorithmic perceptual strategy presumably designed to allow fast decisions while tolerating some loss of accuracy in exceptional cases.

The ‘precursor target’ paradigm used in these experiments could be extended to examine prosodie influences on other segmental distinctions, but also, to determine the perceptual substrates of prosodie variables such as “rate of speech” and to measure the precision of perceptual expectations for vowel and consonant durations in different sentential and syntactic environments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • KIM, C.W. (1970). A theory of aspiration. Phonetica, 21, 107–116.

    Article  Google Scholar 

  • KLATT, D.H. (1973). Voice onset time, frication and aspiration in word-initial consonant clusters. M.I.T. Research Laboratory of Electronics, Quarterly Progress Report No. 109, 124–136.

    Google Scholar 

  • KLATT, D.H. and COOPER, W.E. (1975). Perception of vowel duration in sentence contexts. Paper presented to the 89th meeting of the Acoustical Society of America, Austin, Texas.

    Google Scholar 

  • KOZHEVNIKOV, V.A. and CHISTOVICH, L.A. Speech Articulation and Perception. JPRS 30, 543. Washington: U.S. Department of Commerce, 1965.

    Google Scholar 

  • LACKNER, J.R. and LEVINE, B.K. (1975). Speech production: evidence for syntactically and phonologically determined units. Perception and Psycho-physics, 17, 107–113.

    Article  Google Scholar 

  • LINDBLOM, B. (1968). Temporal organisation of syllable production. Speech Transmission Laboratory Quarterly Progress Report, 2/3, 1–5. (Royal Institute of Technology, Stockholm.)

    Google Scholar 

  • LISKER, L. (1974). Is it VOT or a first formant transition detector? Paper presented to the autumn meeting of the American Association of Phonetic Sciences.

    Google Scholar 

  • LISKER, L. and ABRAMSON, A.S. (1967). Some effects of context on Voice Onset Time in English stops. Language and Speech, 10, 1–28.

    Google Scholar 

  • LISKER, L. and ABRAMSON, A.S. (1970). The voicing dimension: some experiments in comparative phonetics. Proceedings of the 6th International Congress of Phonetic Sciences. Prague, 1967. Prague: Academia.

    Google Scholar 

  • OLLER, D.K. (1973). The effect of position in utterance of speech segment duration in English. Journal of the Acoustical Society of America, 54, 1217–1247.

    Article  Google Scholar 

  • PETERSON, G.E. and BARNEY, H.L. (1952). Control methods used in a study of the vowels. Journal of the Acoustical Society of America, 24, 175–184.

    Article  ADS  Google Scholar 

  • SUMMERFIELD, A.Q. (1971). Some tests of an information-flow model of speech perception. Unpublished Bachelor’s Thesis, University of Cambridge.

    Google Scholar 

  • SUMMERFIELD, A.Q. (1974). Towards a detailed model for the perception of voicing constrasts Speech Perception No. 3, 1–26. (Progress Report, Department of Psychology, The Queen’s University of Belfast.)

    Google Scholar 

  • SUMMERFIELD, A.Q. (1975a). Cues, contexts and complications in the perception of voicing contrasts. Speech Perception No. 4, (in press). (Progress Report, Department of Psychology, The Queen’s University of Belfast.)

    Google Scholar 

  • SUMMERFIELD, A.Q. (1975b).Aerodynamics versus mechanics in the control of voicing onset in consonant-vowel syllables. Speech Perception No. 4, (in press). (Progress Report, Department of Psychology, The Queen’s University of Belfast.)

    Google Scholar 

  • SUMMERFIELD, A.Q. and HAGGARD, M.P. (1974). Perceptual processing of multiple cues,and contexts: effects of following vowel on stop consonant voicing. Journal of Phonetics, 2, 279–295.

    Google Scholar 

  • TAYLOR, M.M. and CREELMAN, D.D. (1967). PEST: Efficient estimates on probability functions. Journal of the Acoustical Society of America, 41, 782–787.

    Article  ADS  Google Scholar 

Download references

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1975 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Summerfield, Q. (1975). How a Full Account of Segmental Perception Depends on Prosody and Vice Versa. In: Cohen, A., Nooteboom, S.G. (eds) Structure and Process in Speech Perception. Communication and Cybernetics, vol 11. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-81000-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-81000-8_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-81002-2

  • Online ISBN: 978-3-642-81000-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics