Local Feature Extractors Accelerating HNNP for Phoneme Recognition

Janning, Ruth; Schatten, Carlotta; Schmidt-Thieme, Lars

doi:10.1007/978-3-319-11206-0_23

Ruth Janning²¹,
Carlotta Schatten²¹ &
Lars Schmidt-Thieme²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8736))

Included in the following conference series:

Joint German/Austrian Conference on Artificial Intelligence (Künstliche Intelligenz)

1131 Accesses
2 Citations

Abstract

Artificial neural networks are fast in the application phase but very slow in the training phase. On the other hand there are state-of-the-art approaches using neural networks, which are very efficient in image classification tasks, like the hybrid neural network plait (HNNP) approach for images from signal data stemming for instance from phonemes. We propose to accelerate HNNP for phoneme recognition by substituting the neural network with the highest computation costs, the convolutional neural network, within the HNNP by a preceding local feature extractor and a simpler and faster neural network. Hence, in this paper we propose appropriate feature extractors for this problem and investigate and compare the resulting computation costs as well as the classification performance. The results of our experiments show that HNNP with the best one of our proposed feature extractors in combination with a smaller neural network is more than two times faster than HNNP with the more complex convolutional neural network and delivers still a good classification performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Abdel-Hamid, O., Mohamed, A., Jiang, H., Penn, G.: Applying Convolutional Neural Networks concepts to hybrid NN-HMM model for speech recognition. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4277–4280 (2012)
Google Scholar
Janning, R., Horváth, T., Busche, A., Schmidt-Thieme, L.: GamRec: A Clustering Method Using Geometrical Background Knowledge for GPR Data Preprocessing. In: Iliadis, L., Maglogiannis, I., Papadopoulos, H. (eds.) AIAI 2012. IFIP AICT, vol. 381, pp. 347–356. Springer, Heidelberg (2012)
Chapter Google Scholar
Janning, R., Busche, A., Horváth, T., Schmidt-Thieme, L.: Buried Pipe Localization Using an Iterative Geometric Clustering on GPR Data. Artificial Intelligence Review (2013), doi:10.1007/s10462-013-9410-2
Google Scholar
Janning, R., Schatten, C., Schmidt-Thieme, L.: HNNP – A Hybrid Neural Network Plait for Improving Image Classification with Additional Side Information. In: Proceedings of the IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2013), Washington DC, USA, pp. 24–29 (2013)
Google Scholar
Janning, R., Schatten, C., Schmidt-Thieme, L.: Automatic Subclasses Estimation for a Better Classification with HNNP. In: Andreasen, T., Christiansen, H., Cubero, J.-C., Raś, Z.W. (eds.) ISMIS 2014. LNCS, vol. 8502, pp. 93–102. Springer, Heidelberg (2014)
Chapter Google Scholar
Kälviäinen, H., Hirvonen, P., Xu, L., Oja, E.: Probabilistic and non-probabilistic Hough transforms: overview and comparisons. Image and Vision Computing 13(4), 239–252 (1995)
Article Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEE 86(11), 2278–2324 (1998)
Article Google Scholar
Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
Matsugu, M., Mori, K., Mitari, Y., Kaneda, Y.: Subject independent facial expression recognition with robust face detection using a convolutional neural network. Neural Networks 16, 555–559 (2003)
Article Google Scholar
Pettengill, G.H., Ford, P.G., Johnson, W.T.K., Raney, R.K., Soderblom, L.A.: Magellan: Radar Performance and Data Products. Science 252, 260–265 (1991)
Article Google Scholar
Senthilkumaran, N., Rajesh, R.: Edge Detection Techniques for Image Segmentation – A Survey of Soft Computing Approaches. International Journal of Recent Trends in Engineering 1(2), 250–254 (2009)
Google Scholar
Simard, P.Y., Steinkraus, D., Platt, J.: Best Practices for Convolutional Neural Networks Applied to Visual Document Analysis. In: External Link International Conference on Document Analysis and Recognition (ICDAR), pp. 958–962. IEEE Computer Society, Los Alamitos (2003)
Google Scholar
TIMIT Acoustic-Phonetic Continuous Speech Corpus, http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC93S1
Tivive, F.H.C., Bouzerdoum, A.: A Shunting Inhibitory Convolutional Neural Network for Gender Classification. In: 18th International Conference on Pattern Recognition 2006 (ICPR 2006), pp. 421–424. IEEE (2006)
Google Scholar
Ziou, D., Tabbone, S.: Edge Detection Techniques - An Overview. International Journal of Pattern Recognition and Image Analysis 8, 537–559 (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Marienburger Platz 22, 31141, Hildesheim, Germany
Ruth Janning, Carlotta Schatten & Lars Schmidt-Thieme

Authors

Ruth Janning
View author publications
You can also search for this author in PubMed Google Scholar
Carlotta Schatten
View author publications
You can also search for this author in PubMed Google Scholar
Lars Schmidt-Thieme
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Universität Bremen, Germany
Carsten Lutz
University of New South Wales, 2052, Sydney, NSW, Australia
Michael Thielscher

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Janning, R., Schatten, C., Schmidt-Thieme, L. (2014). Local Feature Extractors Accelerating HNNP for Phoneme Recognition. In: Lutz, C., Thielscher, M. (eds) KI 2014: Advances in Artificial Intelligence. KI 2014. Lecture Notes in Computer Science(), vol 8736. Springer, Cham. https://doi.org/10.1007/978-3-319-11206-0_23

Download citation

DOI: https://doi.org/10.1007/978-3-319-11206-0_23
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11205-3
Online ISBN: 978-3-319-11206-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics