Skip to main content

Density Normalization in Density Peak Based Clustering

  • Conference paper
  • First Online:
Graph-Based Representations in Pattern Recognition (GbRPR 2017)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10310))

Abstract

As a promising clustering approach, the density peak (DP) based algorithm utilizes the data density and carefully designed distance to identify cluster centers and cluster members. The key to this approach is the density calculation, which has a significant impact on the clustering results. However, the original DP algorithm applies the local density to identify cluster centers directly, and fails to take into account the density difference among clusters. As a result, large-density clusters may be partitioned into multiple parts and small-density clusters are likely to be merged with other clusters. In this paper we introduce a density normalization step to deal with this problem, and show that the normalized density can be used to characterize cluster centers more accurately than the original one. In experiments on various datasets, our method is shown to improve the performance of different density kernels evidently.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  1. Ankerst, M., Breunig, M.M., Kriegel, H.P., Sander, J.: Optics: ordering points to identify the clustering structure. In: ACM SIGMOD International Conference on Management of Data, pp. 49–60 (1999)

    Google Scholar 

  2. Brendan, J.F., Delbert, D.: Clustering by passing messages between data points. Science 315, 972–976 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  3. Chang, H., Yeung, D.Y.: Robust path-based spectral clustering. Pattern Recogn. 41(1), 191–203 (2008)

    Article  MATH  Google Scholar 

  4. Daszykowski, M., Walczak, B., Massart, D.L.: Looking for natural patterns in data: Part 1. density-based approach. Chemometr. Intell. Lab. Syst. 56(2), 83–92 (2001)

    Article  Google Scholar 

  5. Ester, M., Kriegel, H.P., Sander, J., Xu, X.W.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: International Conference on Knowledge Discovery and Data Mining, pp. 226–231 (1996)

    Google Scholar 

  6. Fu, L., Medico, E.: Flame, a novel fuzzy clustering method for the analysis of dna microarray data. BMC Bioinform. 8(1), 1–17 (2007)

    Article  Google Scholar 

  7. Gionis, A., Mannila, H., Tsaparas, P.: Clustering aggregation. ACM Trans. Knowl. Discov. Data 1(1), 1–30 (2007)

    Article  Google Scholar 

  8. Hou, J., Gao, H., Li, X.: Dsets-dbscan: a parameter-free clustering algorithm. IEEE Trans. Image Process. 25(7), 3182–3193 (2016)

    Article  MathSciNet  MATH  Google Scholar 

  9. Hou, J., Liu, W., Xu, E., Cui, H.: Towards parameter-independent data clustering and image segmentation. Pattern Recogn. 60, 25–36 (2016)

    Article  Google Scholar 

  10. Hou, J., Pelillo, M.: A new density kernel in density peak based clustering. In: International Conference on Pattern Recognition, pp. 463–468 (2016)

    Google Scholar 

  11. Jain, A.K., Law, M.H.C.: Data clustering: a user’s dilemma. In: International Conference on Pattern Recognition and Machine Intelligence, pp. 1–10 (2005)

    Google Scholar 

  12. von Luxburg, U.: A tutorial on spectral clustering. Stat. Comput. 17(4), 395–416 (2007)

    Article  MathSciNet  Google Scholar 

  13. Ng, A., Jordan, M., Weiss, Y.: On spectral clustering: analysis and an algorithm. In: Advances in Neural Information Processing Systems, pp. 849–856 (2002)

    Google Scholar 

  14. Pavan, M., Pelillo, M.: A graph-theoretic approach to clustering and segmentation. In: IEEE International Conference on Computer Vision and Pattern Recognition, pp. 145–152 (2003)

    Google Scholar 

  15. Pavan, M., Pelillo, M.: Efficient out-of-sample extension of dominant-set clusters. In: Advances in Neural Information Processing Systems, pp. 1057–1064 (2005)

    Google Scholar 

  16. Pavan, M., Pelillo, M.: Dominant sets and pairwise clustering. IEEE Trans. Pattern Anal. Mach. Intell. 29(1), 167–172 (2007)

    Article  Google Scholar 

  17. Rodriguez, A., Laio, A.: Clustering by fast search and find of density peaks. Science 344, 1492–1496 (2014)

    Article  Google Scholar 

  18. Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 22(8), 167–172 (2000)

    Google Scholar 

  19. Torsello, A., Bulo, S.R., Pelillo, M.: Grouping with asymmetric affinities: a game-theoretic perspective. In: IEEE International Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 292–299 (2006)

    Google Scholar 

  20. Veenman, C.J., Reinders, M., Backer, E.: A maximum variance cluster algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 24(9), 1273–1280 (2002)

    Article  Google Scholar 

  21. Zhu, X., Loy, C.C., Gong, S.: Constructing robust affinity graphs for spectral clustering. In: IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1450–1457 (2014)

    Google Scholar 

Download references

Acknowledgement

This work is supported in part by National Natural Science Foundation of China under Grant No. 61473045 and No. 41371425, and in part by China Scholarship Council.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jian Hou .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Hou, J., Cui, H. (2017). Density Normalization in Density Peak Based Clustering. In: Foggia, P., Liu, CL., Vento, M. (eds) Graph-Based Representations in Pattern Recognition. GbRPR 2017. Lecture Notes in Computer Science(), vol 10310. Springer, Cham. https://doi.org/10.1007/978-3-319-58961-9_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-58961-9_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-58960-2

  • Online ISBN: 978-3-319-58961-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics