Skip to main content
Log in

Spectral analysis of infant cries and adult speech

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

In this paper, spectrographic analysis of the infant cries is reported. For the spectrographic analysis of the infant cries ten different cry modes are used to analyze differences in different pathological cries. A comparison of spectrograms of the adult speech signal and infant cry signals is given. Based on differences in the distribution of energy in the spectrograms, energy-based features are calculated from the short-time Fourier transform (STFT) of the adult speech and infant cry signals. The classification performance of these features is obtained using support vector machine (SVM) classifier and it is observed that the energy distribution in 0–1 kHz range is promising feature in the classification of adult speech and infant cries and the classification accuracy achieved with this feature is 98.22 %. On the contrary, it was observed that it is very difficult to classify adult speech and infant cries using the energy distribution in 1–3 kHz.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22

Similar content being viewed by others

References

  • Asthma. (2016). Retrieved April 2016, from WebMD: http://www.webmd.com/asthma/guide/what-is-asthma.

  • Birth Asphyxia. (2016). Retrieved April 2016, from Seattle Children’s Hospital Research Foundation: http://www.seattlechildrens.org/medical-conditions/airway/birthasphyxia-symptoms/.

  • Black, J. S. (1997). An eclectic cry research tool for the estimation of an infant’s level of distress. B.Sc. Thesis, University of British Columbia, Dept. of Elect. Eng.

  • Buddha, N., & Patil, H. A. (2007). Corpora for analysis of infant cry. In Int. conf. on speech databases and assessments, oriental COCOSDA, (pp. 43–48). Hanoi, Vietnam.

  • Chittora, A., & Patil, H. A. (2013). Data collection and corpus design for analysis of normal and pathological infant cry. In Oriental COCOSDA held jointly with International Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) (Vol. 2, pp. 1–6). New Delhi.

  • Chittora, A., Patil, H. A., & Sailor, H. (2015). Spectro-temporal analysis of HIE and asthma infant cries using auditory spectrogram. In International conference on biosignal analysis, processing and system (ICBAPS 2015) (pp. 145–150). Kuala Lumpur, Malaysia.

  • Congenital heart defects. (2016). Retrieved April 2016, from Medline Plus: https://www.nlm.nih.gov/medlineplus/congenitalheartdefects.html.

  • Down syndrome. (2016). Retrieved April 2016, from healthline: http://www.healthline.com/health/down-syndrome#Overview1.

  • Gabor, D. (1946). Theory of communication. Part 1: The analysis of information. Electrical Engineers-Part III: Radio and Communication Engineering, Journal of the Institution of, 93(26), 429–441.

    Google Scholar 

  • Garcia, J., & Garcia, C. A. (July, 2003). Mel-frequency cepstrum coefficients extraction from infant cry for classification of normal and pathological cry with feed-forward neural networks. In Proceedings of the international joint conference on neural networks, (pp. 3140–3145).

  • Haitao, Y., Wang, Y., Zhanlin, X., & Wei, L. (2010). Feature extraction and classification based on bispectrum for underwater targets. In: Inter. Conf. on Intelligent System Design and Engineering Application (ISDEA) (Vol. 1, pp. 742–745). Changsha .

  • Hearing loss—infants. (2016). Retrieved April 2016, from Medline Plus: https://www.nlm.nih.gov/medlineplus/ency/article/007322.htm.

  • Hydrocephalus Fact Sheet. (2016). Retrieved April 2016, from NIH: National school of neurological disorders and stroke: http://www.ninds.nih.gov/disorders/hydrocephalus/detail_hydrocephalus.htm.

  • Hypoxy ischemic encephalopathy. (2016). Retrieved April 2016, from Birth Injury Guide: http://www.birthinjuryguide.org/birth-injury/types/hypoxic-ischemic-encephalopathy-hie/.

  • Intraventricular hemorrhage of the newborn. (2016). Retrieved April 2016, from Pennstate Hershey: http://pennstatehershey.adam.com/content.aspx?productId=117&pid=1&gid=007301.

  • Kheddache, Y., & Tadj, C. (2013). Acoustic measures of the cry characteristics of healthy newborns and newborns with pathologies. Journal of Biomedical Science and Engineering, 6, 796–804.

    Article  Google Scholar 

  • Laryngomalacia details. (2016). Retrieved April 2016, from Medscape: http://emedicine.medscape.com/article/1002527-overview.

  • Lester, B. M. (1985). Inroduction- There’s more to crying than meets the ear. In B. M. Lester & Z. C. Boukydis (Eds.), Infant crying—Theoritical and research perspective (pp. 1–27). New York and London: Plenum Press.

    Google Scholar 

  • Meningitis. (2016). Retrieved April 2016, from MedBroadcast: http://www.medbroadcast.com/Condition/GetCondition/Meningitis.

  • Messaoud, A., & Tadj, C. (2010). A cry based infant identification system. In 4th Int. conf. on image and signal process (pp. 192–199).

  • Michelson, K., Sirvio, P., & Wasz-Hokert, O. (1977). Sound spectrographic cry analysis of infants with bacterial meningitis. Developmental Medicine and Child Neurology, 19, 309–315.

    Article  Google Scholar 

  • Neonatal respiratory distress syndrome. (2016). Retrieved April 2016, from Healthline: http://www.healthline.com/health/neonatal-respiratory-distress-syndrome#Overview1.

  • Patil, H. A. (2009). Infant identification from their cry. In IEEE 7th international conf. on advances in pattern recognition (ICAPR) (pp. 107–110). Kolkata.

  • Patil, H. A. (2010). Cry Baby: Using spectrographic analysis to assess neonatal health from an infant’s cry. In A. Neustein (Ed.), Advances in speech recognition, mobile environments, call centres and clinics (pp. 323–348). Springer-Verlag.

  • Reyes Galaviz, O. F., Cano-Ortiz, S. D., & Rayes-Garcia, C. A. (2008). Evolutionary-neural system to classify infant cry units for pathologies identification in recently born babies. In IEEE 7th Mexican Int. Conf. on Art. Intell. (pp. 330–335).

  • Reyes-Galaviz, O., Cano-Ortiz, S., & Reyes-Garcia, C. (2008). Validation of the cry unit as primary element for cry analysis using an evolutionary-neural approach. In Mexican international conference on computer science, 2008. ENC ‘08 (pp. 261–267).

  • Saraswathy, J., Hariharan, M., Vijean, V., Yaacob, S., & Khairunizam, W. (2012). Performance comparison of Daubechies wavelet family in infant cry classification. In IEEE 8th international colloquium on signal processing and its applications (pp. 451–455).

  • Wasz-Hockert, O., Michelsson, K., & Lind, J. (1985). Twenty five years of scandinavian cry research. In B. M. Lester & Z. Boukydis (Eds.), Infant crying—Theoritical and research perspective (pp. 83–101). New York: Plenum Publishing Corporation.

    Google Scholar 

  • Water related Diseases. (2016). Retrieved April 2016, from World Health Organization: http://www.who.int/water_sanitation_health/diseases/malnutrition/en/.

  • Xie, Q. (1993). Automatic infant cry analysis and recognition. Electrical engineering. University of British Columbia.

  • Xie, Q., Ward, R. K., & Laszlo, C. A. (1996). Automatic assessment of infants’ level of distress from the cry signals. IEEE Transactions on Speech and Audio Process, 4(4), 253–265.

  • Xie, Q., Ward, R. K., & Laszlo, C. A. (1996). Automatic detection of infant’s level of distress from the cry signals. IEEE Transactions on Speech and Audio Process, 4(4), 253–265.

  • Xie, Q., Ward, R. K., & Laszlo, C. A. (2009). Determining normal infant’s level of distress from cry sounds. In IEEE (Ed.), IEEE Canadian Conf. on Elect. and Comp. Eng. (Vol. 2, pp. 1094–1097). Vancouver, BC.

Download references

Acknowledgments

Authors would like to thank authorities of DA-IICT, Gandhinagar, Department of Electronics and Information Technology (DeitY), New Delhi and Department of Science and Technology (DST), New Delhi for providing necessary resources to carry out this research work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anshu Chittora.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chittora, A., Patil, H.A. Spectral analysis of infant cries and adult speech. Int J Speech Technol 19, 841–856 (2016). https://doi.org/10.1007/s10772-016-9375-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-016-9375-z

Keywords

Navigation