Abstract
This paper presents a comparative study between a method for fuzzy clustering which retrieves pure individual types from data, the fuzzy clustering with proportional membership (FCPM), and an archetypal analysis algorithm based on Furthest-Sum approach (FS-AA). A simulation study comprising 82 data sets is conducted with a proper data generator, FCPM-DG, whose goal is twofold: first, to analyse the ability of archetypal clustering algorithm to recover Archetypes from data of distinct dimensionality; second, to analyse robustness of FCPM and FS-AA algorithms to outliers. The effectiveness of these algorithms are yet compared on clustering 12 diverse benchmark data sets from machine learning. The evaluation conducted with five primer unsupervised validation indices shows the good quality of the clustering solutions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bertsekas, D.: Nonlinear Programming. Athena Scientific, Belmont (1995)
Bezdek, J.: Pattern Recognition with Fuzzy Objective Function Algorithms, 1st edn. Springer, Heidelberg (1981). https://doi.org/10.1007/978-1-4757-0450-1
Campello, R., Hruschka, E.: A fuzzy extension of the silhouette width criterion for cluster analysis. Fuzzy Sets Syst. 157(21), 2858–2875 (2006)
Chen, Y., Mairal, J., Harchaoui, Z.: Fast and robust archetypal analysis for representation learning. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1478–1485 (2014)
Cutler, A., Breiman, L.: Archetypal analysis. Technometrics 36(4), 338–347 (1994)
Eisenack, K., Lüdeke, M., Kropp, J.: Construction of archetypes as a formal method to analyze socialecological systems. In: Proceedings of the Institutional Dimensions of Global Environmental Change Synthesis Conference, Bali, Indonesia, 6–9 December 2006
Eugster, M., Leisch, F.: From spider-man to hero - archetypal analysis in R. J. Stat. Softw. 30(8), 1–23 (2009)
Eugster, M.: Performance profiles based on archetypal athletes. Int. J. Perform. Anal. Sport 12(1), 166–187 (2012)
Ferraro, M., Giordani, P.: A toolbox for fuzzy clustering using the R programming language. Fuzzy Sets Syst. 279(8), 1–16 (2015)
Geng, L., Hamilton, H.: Interestingness measures for data mining: a survey. ACM Comput. Surv. 38(3), 9 (2006)
Liu, Y., Li, Z., Xiong, H., Gao, X., Wu, J., Wu, S.: Understanding and enhancement of internal clustering validation measures. IEEE Trans. Cybern. 43(3), 982–994 (2013)
Mirkin, B.: Mathematical Classification and Clustering. Nonconvex Optimization and Its Applications. Kluwer Academic Publishers, Dordrecht (1996)
Mirkin, B.: Clustering: A Data Recovery Approach, 2nd edn. Chapman & Hall/CRC Press, London/Boca Raton (2012)
Mørup, M., Hansen, L.: Archetypal analysis for machine learning and data mining. Neurocomputing 80, 54–63 (2012)
Nascimento, S., Mirkin, B., Moura-Pires, F.: Modeling proportional membership in fuzzy clustering. IEEE Trans. Fuzzy Syst. 11(2), 173–186 (2003)
Nascimento, S.: Fuzzy Clustering via Proportional Membership Model. IOS Press, Amsterdam (2005). 178 p
Nascimento, S., Franco, P.: Segmentation of upwelling regions in sea surface temperature images via unsupervised fuzzy clustering. In: Corchado, E., Yin, H. (eds.) IDEAL 2009. LNCS, vol. 5788, pp. 543–553. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04394-9_66
Nascimento, S.: Applying the gradient projection method to a model of proportional membership for fuzzy cluster analysis. In: Goldengorin, B. (ed.) Optimization and Its Applications in Control and Data Sciences. SOIA, vol. 115, pp. 353–380. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-42056-1_13
Porzio, C., Ragozini, G., Vistocco, D.: On the use of archetypes as benchmarks. Appl. Stoch. Models Bus. Ind. 54(5), 419–437 (2008)
Tallón-Ballesteros, A.J., Correia, L., Xue, B.: Featuring the Attributes in Supervised Machine Learning. In: de Cos Juez, F., et al. (eds.) HAIS 2018. LNCS, vol. 10870. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-92639-1_29
UCI Machine Learning Repository. http://archive.ics.uci.edu/ml. Accessed 27 July 2018
Acknowledgments
This research is supported by FCT/MCTES, NOVA LINCS (UID/CEC/04516/2013). The authors are thankful to the anonymous reviewers for their insightful and constructive comments that allowed to improve the paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Mendes, G.S., Nascimento, S. (2018). A Study of Fuzzy Clustering to Archetypal Analysis. In: Yin, H., Camacho, D., Novais, P., Tallón-Ballesteros, A. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2018. IDEAL 2018. Lecture Notes in Computer Science(), vol 11315. Springer, Cham. https://doi.org/10.1007/978-3-030-03496-2_28
Download citation
DOI: https://doi.org/10.1007/978-3-030-03496-2_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03495-5
Online ISBN: 978-3-030-03496-2
eBook Packages: Computer ScienceComputer Science (R0)