Skip to main content

Efficient Computation of the Weighted Clustering Coefficient

  • Conference paper
  • First Online:
Algorithms and Models for the Web Graph (WAW 2014)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8882))

Included in the following conference series:

  • 812 Accesses

Abstract

The clustering coefficient of an unweighted network has been extensively used to quantify how tightly connected is the neighbor around a node and it has been widely adopted for assessing the quality of nodes in a social network. The computation of the clustering coefficient is challenging since it requires to count the number of triangles in the graph. Several recent works proposed efficient sampling, streaming and MapReduce algorithms that allow to overcome this computational bottleneck. As a matter of fact, the intensity of the interaction between nodes, that is usually represented with weights on the edges of the graph, is also an important measure of the statistical cohesiveness of a network. Recently various notions of weighted clustering coefficient have been proposed but all those techniques are hard to implement on large-scale graphs.

In this work we show how standard sampling techniques can be used to obtain efficient estimators for the most commonly used measures of weighted clustering coefficient. Furthermore we also propose a novel graph-theoretic notion of clustering coefficient in weighted networks.

Work partially done while visiting scientist at Google Research NY. Partially supported from Google Focused Award “Algorithms for Large-scale Data Analysis”, EU FET project MULTIPLEX 317532, EU ERC project PAAI 259515.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 34.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Applying social network analysis to the information in cvs repositories. In: 1st International Workshop on Mining Software Repositories (MSR)

    Google Scholar 

  2. Barrat, A., Barthlemy, M., Pastor-Satorras, R., Vespignani, A.: The architecture of complex weighted networks. Proceedings of the National Academy of Sciences of the United States of America

    Google Scholar 

  3. Becchetti, L., Boldi, P., Castillo, C., Gionis, A.: Efficient semi-streaming algorithms for local triangle counting in massive graphs. In: KDD 2008 (2008)

    Google Scholar 

  4. Bollobs, B.: Mathematical results on scale-free random graphs. In: Handbook of Graphs and Networks

    Google Scholar 

  5. Budak, C., Agrawal, D., El Abbadi, A.: Structural trend analysis for online social networks. In: VLDB 2011 (2011)

    Google Scholar 

  6. Buriol, L., Frahling, G., Leonardi, S., Marchetti-Spaccamela, A., Sohler, C.: Counting triangles in data streams. In: PODS 2006 (2006)

    Google Scholar 

  7. Castillo, C., Donato, D., Becchetti, L., Boldi, P., Leonardi, S., Santini, M., Vigna, S.: A reference collection for web spam. SIGIR 2006 (2006)

    Google Scholar 

  8. Dean, J., Ghemawat, S.: Mapreduce: Simplified data processing on large clusters. In: OSDI 2004 (2004)

    Google Scholar 

  9. Fagiolo, G.: Clustering in complex directed networks. Phys. Rev. E.

    Google Scholar 

  10. Hardiman, S.J., Katzir, L.: Estimating clustering coefficients and size of social networks via random walk. In: WWW 2013 (2013)

    Google Scholar 

  11. Hintsanen, P., Toivonen, H.: Finding reliable subgraphs from large probabilistic graphs. Data Min. Knowl. Discov.

    Google Scholar 

  12. Hintsanen, P., Toivonen, H.: Finding reliable subgraphs from large probabilistic graphs. Data Min. Knowl. Discov. (2008)

    Google Scholar 

  13. Jha, M., Seshadhri, C., Pinar, A.: A space efficient streaming algorithm for triangle counting using the birthday paradox. In: KDD 2013 (2013)

    Google Scholar 

  14. Kalna, G., Higham, D.J.: Clustering coefficients for weighted networks. In: Symposium on Network Analysis in Natural Sciences and Engineering

    Google Scholar 

  15. Kwak, H., Lee, C., Park, H., Moon, S.: What is twitter, a social network or a news media?. In: WWW 2010 (2010)

    Google Scholar 

  16. Latapy, M.: Main-memory triangle computations for very large (sparse(power-law)) graphs. Theoretical Computer Science

    Google Scholar 

  17. Leskovec, J., Horvitz, E.: Planetary-scale views on a large instant-messaging network. In: WWW 2008 (2008)

    Google Scholar 

  18. Liberty, E.: Simple and deterministic matrix sketches. In: KDD 2014 (2014)

    Google Scholar 

  19. Newman, M.E.J.: Analysis of weighted networks. Phys. Rev. E 70, 056131 (2004)

    Article  Google Scholar 

  20. Newman, M.E.J., Watts, D.J., Strogatz, S.H.: Random graph models of social networks. Proc. Natl. Acad. Sci. USA 99, 2566–2572 (2002)

    Article  MATH  Google Scholar 

  21. Onnela, J.-P., Saramäki, J., Kertész, J., Kaski, K.: Intensity and coherence of motifs in weighted complex networks. Physical Review E

    Google Scholar 

  22. Opsahl, T., Panzarasa, P.: Clustering in weighted networks. Social Networks

    Google Scholar 

  23. Pagh, R., Tsourakakis, C.E.: Colorful triangle counting and a mapreduce implementation

    Google Scholar 

  24. Saramäki, J., Kivelä, M., Onnela, J.-P., Kaski, K., Kertesz, J.: Generalizations of the clustering coefficient to weighted complex networks. Physical Review E

    Google Scholar 

  25. Schank, T., Wagner, D.: Approximating clustering coefficient and transitivity. Journal of Graph Algorithms and Applications

    Google Scholar 

  26. Schank, Thomas, Wagner, Dorothea: Finding, Counting and Listing All Triangles in Large Graphs, an Experimental Study. In: Nikoletseas, Sotiris E. (ed.) WEA 2005. LNCS, vol. 3503, pp. 606–609. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  27. Suri, S., Vassilvitskii, S.: Counting triangles and the curse of the last reducer. In: WWW 2011 (2011)

    Google Scholar 

  28. Tsourakakis, C.E., Kang, U., Miller, G.L., Faloutsos, C.: Doulion: counting triangles in massive graphs with a coin. In: KDD 2009 (2009)

    Google Scholar 

  29. Tsourakakis, C.E., Kolountzakis, M.N., Miller, G.L.: Triangle sparsifiers. J. Graph Algorithms Appl.

    Google Scholar 

  30. Watts, D.J., Strogatz, S.H.: Collective dynamics of small-world networks. Nature

    Google Scholar 

  31. Zhang, B., Horvath, S., et al.: A general framework for weighted gene co-expression network analysis. Statistical Applications in Genetics and Molecular Biology

    Google Scholar 

  32. Zhang, Y., Zhang, Z., Guan, J., Zhou, S.: Analytic solution to clustering coefficients on weighted networks. arXiv preprint arXiv:0911.0476 (2009)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Stefano Leonardi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Lattanzi, S., Leonardi, S. (2014). Efficient Computation of the Weighted Clustering Coefficient. In: Bonato, A., Graham, F., Prałat, P. (eds) Algorithms and Models for the Web Graph. WAW 2014. Lecture Notes in Computer Science(), vol 8882. Springer, Cham. https://doi.org/10.1007/978-3-319-13123-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13123-8_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13122-1

  • Online ISBN: 978-3-319-13123-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics