Skip to main content

A Novel Approach for Noisy Signal Classification Through the Use of Multiple Wavelets and Ensembles of Classifiers

  • Conference paper
  • First Online:
Advanced Data Mining and Applications (ADMA 2019)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11888))

Included in the following conference series:

Abstract

Classification of time series signals can be crucial for many practical applications. While the existing classifiers may accurately classify pure signals, the existence of noise can significantly disturb the classification accuracy of these classifiers. We propose a novel classification approach that uses multiple wavelets together with an ensemble of classifiers to return high classification accuracy even for noisy signals.

The proposed technique has two main steps. In Step 1, We convert raw signals into a useful dataset by applying multiple wavelet transforms, each from a different wavelet family or all from the same family with differing filter lengths. In Step 2, We apply the dataset processed in Step 1 to an ensemble of classifiers. We test on 500 noisy signals from five different classes. Our experimental results demonstrate the effectiveness of the proposed technique, on noisy signals, compared to the approaches that use either raw signals or a single wavelet transform.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The DWT in principle provides more information than the time series raw data points in classification because the DWT locates where the signal energies are concentrated in the frequency domain [3].

  2. 2.

    In this new vector many data points are almost zero, a sparse representation.

  3. 3.

    note: \(\sum ^J_{j=1} \frac{n}{2^j} = \frac{n}{2} + \frac{n}{4} + \dots + 2 + 1 = 2^J -1 = n-1\).

  4. 4.

    If we decompose \(X_t\) to level \(I \le J\) then \( \mathbf V _I\) has \(n/2^{I}\) elements.

  5. 5.

    Initially we applied the MDWT data to a single Decision tree classifier as a baseline and then to ensemble classifiers, which are outlined in Sect. 4.1.

  6. 6.

    With these generated signals we attached class labels and such labels were used as the attribute for Class within the classification methods.

  7. 7.

    Here Accuracy is defined as the number of correctly Classified Instances (with respect to our initial Class labels).

  8. 8.

    Result tables not shown for this exercise.

  9. 9.

    We maintain 500 noise enhanced signals at each step.

  10. 10.

    For evaluation here, we use 80% data to form the training set and 20% of data for testing with 100 trees used for each classifier.

  11. 11.

    The MDWT comprising of four wavelets from different families, we have labelled as “MDWT Diff”, similarly for the MDWT with four wavelets from the same family, as “MDWT Same”.

References

  1. Clark, P., Niblett, T.: Induction in noisy domains. In: Progress in Machine Learning (from the Proceedings of the 2nd European Working Session on Learning), pp. 11–30. Sigma Press, Bled (1987)

    Google Scholar 

  2. Saez, J.A., Galar, M., Luengo, J., Herrera, F.: Tackling the problem of classification with noisy data using multiple classifier systems: analysis of the performance and robustness. Inf. Sci. 247(2013), 1–20 (2013). https://doi.org/10.1016/j.ins.2013.06.002

    Article  Google Scholar 

  3. Fong, S.: Using hierarchical time series clustering algorithm and wavelet classifier for biometric voice classification. J. Biomed. Biotechnol. 2012, Article ID 215019 (2012)

    Google Scholar 

  4. Li, D., Bissyand’e, T.F., Klein, J., Traon, Y.L.: Time series classification with discrete wavelet transformed data: insights from an empirical study. In: The 28th International Conference on Software Engineering and Knowledge Engineering (2016)

    Google Scholar 

  5. Moazami-Goudarzi, M., Moradi, M.H., Abbasabadi, S.: High performance method for electrocardiogram compression using two dimensional multiwavelet transform. In: IEEE 7th Workshop on Multimedia Signal Processing (2006)

    Google Scholar 

  6. Graps, A.: An introduction to wavelets. IEEE Comput. Sci. Eng. 2(2), 50–61 (1995)

    Article  Google Scholar 

  7. Percival, D.B., Walden, A.T.: Wavelet Methods for Time Series Analysis. Cambridge University Press, Cambridge (2000)

    Book  Google Scholar 

  8. Nason, G.P.: Wavelet Methods in Statistics with R. Springer, New York (2008). https://doi.org/10.1007/978-0-387-75961-6

    Book  MATH  Google Scholar 

  9. Adnan, M.N., Islam, M.Z.: A comprehensive method for attribute space extension for random forest. In: 17th International Conference on Computer and Information Technology, ICCIT 2014 (2014). https://doi.org/10.1109/ICCITechn.2014.7073129

  10. R Core Team: R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2013). http://www.R-project.org/

  11. Constantine, W., Percival, D.: wmtsa: Wavelet Methods for Time Series Analysis. R package version 2.0-3 (2017). http://CRAN.R-project.org/package=wmtsa

  12. Frank, E., Hall, M.A., Witten, I.H.: The WEKA Workbench. Online Appendix for “Data Mining: Practical Machine Learning Tools and Techniques”, 4th edn. Morgan Kaufmann (2016)

    Google Scholar 

  13. Quinlan, R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Mateo (1993)

    Google Scholar 

  14. Adnan, M.N., Islam, M.Z.: Forest PA: constructing a decision forest by penalizing attributes used in previous trees. Expert Syst. Appl. 89, 389–403 (2017)

    Article  Google Scholar 

  15. Islam, M.Z., Giggins, H.: Knowledge discovery through SysFor: a systematically developed forest of multiple decision trees. In: Proceedings of the Ninth Australasian Data Mining Conference (AusDM 2011) (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Paul Grant .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Grant, P., Islam, M.Z. (2019). A Novel Approach for Noisy Signal Classification Through the Use of Multiple Wavelets and Ensembles of Classifiers. In: Li, J., Wang, S., Qin, S., Li, X., Wang, S. (eds) Advanced Data Mining and Applications. ADMA 2019. Lecture Notes in Computer Science(), vol 11888. Springer, Cham. https://doi.org/10.1007/978-3-030-35231-8_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-35231-8_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-35230-1

  • Online ISBN: 978-3-030-35231-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics