Skip to main content

Measuring Periodicity Perturbations in Pathological Voice: General-Purpose Software vs. Custom-Tailored Methods

  • Conference paper
  • First Online:
VIII Latin American Conference on Biomedical Engineering and XLII National Conference on Biomedical Engineering (CLAIB 2019)

Abstract

The acoustic measurement of the severity of the symptoms present in pathological voice is an active research area, for being inexpensive and non invasive. Periodicity perturbations are among the most frequently used, requiring the previous extraction of the individual glottal pulse boundaries. In this paper we explore the performances of methods for detecting glottal pulse boundaries as implemented by freely available software (Praat, intended for phonetic studies) vs. a research-grade pulse cycle detector (reported as a super-resolution method). We compare the sequences of pulse markers as obtained by two of Praat’s internal implementations and the super-resolution method against the hand-marked reference sequence in a dataset of pathological sustained vowels from a well-known database. A group of performance measures is extracted from this comparison, using a Dynamic-Time Warping alignment procedure. The measures obtained show the pros and cons of each alternative. Researchers and clinicians must be aware of the benefits of selecting either approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Baken, R.J., Orlikoff, R.F.: Clinical Measurement of Speech and Voice, 2nd edn. Cengage Learning (2000)

    Google Scholar 

  2. Kreiman, J., Gerratt, B.R.: Measuring vocal quality. In: Kent, R.D., Ball, M.J. (eds.) Voice Quality Measurement, pp. 73–112. Singular, San Diego (2000)

    Google Scholar 

  3. Dejonckere, P.H., et al.: A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques: Guideline elaborated by the Committee on Phoniatrics of the European Laryngolo. Eur. Arch. Oto-Rhino-Laryngology 258(2), 77–82 (2001)

    Article  Google Scholar 

  4. Dejonckere, P.H.: Assessment of voice and respiratory function. In: Remacle, M., Eckel, E.H. (eds.) Surgery of Larynx and Trachea, pp. 11–26. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  5. Buder, E.H.: Acoustic analysis of voice quality: a tabulation of algorithms 1902–1990. In: Kent, R.D., Ball, M.J. (eds.) Voice Quality Measurement, pp. 119–244. Singular, San Diego (2000)

    Google Scholar 

  6. Ng, M.L., Wong, J.: Voice onset time characteristics of esophageal, tracheoesophageal, and laryngeal speech of cantonese. J. Speech Lang. Hear. Res. 52, 780–789 (2009)

    Article  Google Scholar 

  7. Karlsson, F., Olofsson, K., Blomstedt, P., Linder, J., Nordh, E., van Doorn, J.: Articulatory closure proficiency in patients with Parkinson’s disease following deep brain stimulation of the subthalamic nucleus and caudal zona incerta. J. Speech. Lang. Hear. Res. 57, 1178–1190 (2014)

    Article  Google Scholar 

  8. Pouplier, M., Marin, S., Waltl, S.: Voice onset time in consonant cluster errors: can phonetic accommodation differentiate cognitive from motor errors? J. Speech Lang. Hear. Res. 57, 1577–1588 (2014)

    Article  Google Scholar 

  9. Borrie, S.A., Liss, J.M.: Rhythm as a coordinating device: entrainment with disordered speech. J. Speech Lang. Hear. Res. 57, 815–824 (2014)

    Article  Google Scholar 

  10. Skodda, S., Grönheit, W., Schlegel, U.: Intonation and speech rate in Parkinson’s disease: general and dynamic aspects and responsiveness to levodopa admission, J. Voice 25(4) (2011)

    Article  Google Scholar 

  11. Ní Chasaide, A., Yanushevskaya, I., Gobl, C.: Voice source dynamics in intonation. In: Proceedings of 17th International Congress of Phonetic Sciences (ICPhS 2011), pp. 1470–1473 (2011)

    Google Scholar 

  12. Hess, W.J.: Pitch Determination of Speech Signals. Springer, Berlin Heidelberg (1983)

    Book  Google Scholar 

  13. Hess, W.J.: Pitch and voicing determination of speech with an extension toward music signals. In: Benesty, J., Sondhi, M.M., Huang, Y.A. (eds.) Springer Handbook of Speech Processing. Springer. Heidelberg, pp. 181–212 (2008)

    Chapter  Google Scholar 

  14. Boersma, P.: PRAAT, a system for doing phonetics by computer. Glot Int. 5(9/10), 5 (2002)

    Google Scholar 

  15. Barsties, B., Latoszek, V., Maryn, Y., Gerrits, E., De Bodt, M.S.: “The acoustic breathiness index (ABI): a multivariate acoustic model for breathiness, J. Voice (2017, in press)

    Google Scholar 

  16. Uloza, V., Latoszek, B.B.V., Ulozaite-Staniene, N., Petrauskas, T., Maryn, Y.: A comparison of Dysphonia severity index and acoustic voice quality index measures in differentiating normal and dysphonic voices. Eur. Arch. Oto-Rhino-Laryngology 275(4), 949–958 (2018)

    Article  Google Scholar 

  17. Latoszek, B.B.V., Ulozaitė-Stanienė, N., Petrauskas, T., Uloza, V., Maryn, Y.: Diagnostic accuracy of dysphonia classification of DSI and AVQI. Laryngoscope (2018)

    Google Scholar 

  18. Latoszek, B.B.V., De Bodt, M., Gerrits, E., Maryn, Y.: The exploration of an objective model for roughness with several acoustic markers. J. Voice 32(2), 149–161 (2018)

    Article  Google Scholar 

  19. Boersma, P.: Should jitter be measured by peak picking or by waveform matching? Folia Phoniatr. Logop. 61(5), 305–308 (2009)

    Article  Google Scholar 

  20. Medan, Y., Yair, E., Chazan, D.: Super resolution pitch determination of speech signals. IEEE Trans. Signal Process. 39(1), 40–48 (1991)

    Article  Google Scholar 

  21. Bagshaw, P.C., Miller, S.M., Jack, M.A.: Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching. In: 3rd European Conference on Speech Communication and Technology EUROSPEECH 1993, pp. 1003–1006 (1993)

    Google Scholar 

  22. Parsa, V., Jamieson, D.G.: A comparison of high precision F0 extraction algorithms for sustained vowels. J. Speech. Lang. Hear. Res. 42(1), 112–126 (1999)

    Article  Google Scholar 

  23. Ferrer, C., Torres, D., Hernández-Díaz, M.E.: Using dynamic time warping of T0 contours in the evaluation of cycle-to-cycle pitch detection algorithms. Pattern Recognit. Lett. 31(6), 517–522 (2010)

    Article  Google Scholar 

  24. Disordered Voice Database v1.03. Kay Elemetrics Corp (1994)

    Google Scholar 

  25. Hosokawa, K., Ogawa, M., Hashimoto, M., Inohara, H.: Statistical analysis of the reliability of acoustic and electroglottographic perturbation parameters for the detection of vocal roughness. J. Voice 28(2), 263.e9–263.e16 (2014)

    Article  Google Scholar 

  26. Manfredi, C., Giordano, A., Schoentgen, J., Fraj, S., Bocchi, L., Dejonckere, P.H.: Validity of jitter measures in non-quasi-periodic voices. part II: the effect of noise. Logop. Phoniatr. Vocology 36(2), 78–89 (2011)

    Article  Google Scholar 

Download references

Acknowledgements

This work was partially supported by an Alexander von Humboldt Foundation Fellowship granted to one of the authors (Ref 3.2-1164728-CUB-GF-E).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Carlos A. Ferrer-Riesgo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rodríguez-Guillén, R., Ferrer-Riesgo, C.A. (2020). Measuring Periodicity Perturbations in Pathological Voice: General-Purpose Software vs. Custom-Tailored Methods. In: González Díaz, C., et al. VIII Latin American Conference on Biomedical Engineering and XLII National Conference on Biomedical Engineering. CLAIB 2019. IFMBE Proceedings, vol 75. Springer, Cham. https://doi.org/10.1007/978-3-030-30648-9_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-30648-9_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-30647-2

  • Online ISBN: 978-3-030-30648-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics