Skip to main content
Log in

Generating Graphs by Creating Associative and Random Links Between Existing Nodes

  • Published:
Journal of Statistical Physics Aims and scope Submit manuscript

Abstract

The study and analysis of real-world social, communication, information and citation networks for understanding their structure and identifying interesting patterns have cultivated the need for designing generative models for such networks. A generative model generates an artificial but a realistic-looking network with the same characteristics as that of a real network under study. In this paper, we propose a new generative model for generating realistic networks. Our proposed model is a blend of three key ideas namely preferential attachment, associativity of social links and randomness in real networks. We present a framework that first tests these ideas separately and then blends them into a mixed model based on the idea that a real-world graph could be formed by a mixture of these concepts. Our model can be used for generating static as well as time evolving graphs and this feature distinguishes it from previous approaches. We compare our model with previous methods for generating graphs and show that it outperforms in several aspects. We compare our graphs with real-world graphs across many metrics such as degree, clustering coefficient and path length distributions, assortativity, eigenvector centrality and modularity. In addition, we give both qualitative and quantitative results for clarity.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  1. Ahn, YY., Han, S., Kwak, H., Moon, S., Jeong, H.: Analysis of topological characteristics of huge online social networking services. In: Proceedings of the 16th International Conference on World Wide Web, pp. 835–844 (2007)

  2. Albert, R., Barabási, A.L.: Statistical mechanics of complex networks. Rev. Mod. Phys. 74(1), 47–97 (2002)

    Article  ADS  MathSciNet  Google Scholar 

  3. Amaral, L.A.N., Scala, A., Barthélémy, M., Stanley, H.E.: Classes of small-world networks. Proc. Natl. Acad. Sci. 97(21), 11149–11152 (2000)

    Article  ADS  Google Scholar 

  4. Blondel, V.D., Guillaume, J.L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech. 2008, P10008 (2008)

    Article  Google Scholar 

  5. Bonato, A., Hadi, N., Horn, P., Pralat, P., Wang, C.: A dynamic model for on-line social networks. In: Proceedings of the 6th International Workshop on Algorithms and Models for the Web-Graph, WAW ’09. Springer, Berlin, pp 127–142 (2009)

  6. Chung, F., Lu, L.: Connected components in random graphs with given expected degree sequences. Ann. Comb. 6(2), 125–145 (2002)

    Article  MathSciNet  Google Scholar 

  7. Collevecchio, A., Cotar, C., LiCalzi, M.: On a preferential attachment and generalized pÓlya’s urn model. Ann. Appl. Probab. 23(3), 1219–1253 (2013)

    Article  MathSciNet  Google Scholar 

  8. Golder, S.A., Wilkinson, D.M., Huberman, B.A.: Rhythms of social interaction: messaging within a massive online network. In: Communities and Technologies 2007: Proceedings of the Third Communities and Technologies Conference, Michigan State University 2007. Springer, London, pp 41–66 (2007)

  9. Hu, H., Wang, X.: Evolution of a large online social network. Phys. Lett. A 373(12), 1105–1110 (2009)

    Article  ADS  Google Scholar 

  10. Huang, H., Tang, J., Wu, S., Liu, L., fu, X.: Mining triadic closure patterns in social networks. In: Proceedings of the 23rd International Conference on World Wide Web, WWW ’14 Companion, pp. 499–504 (2014)

  11. KON The Koblenz Network Collection—KONECT. (2017) http://konect.uni-koblenz.de/networks/

  12. Kossinets, G., Watts, D.: Empirical analysis of an evolving social network. Science 311, 88–90 (2006)

    Article  ADS  MathSciNet  Google Scholar 

  13. Kumar, R., Novak, J., Tomkins, A.: Structure and evolution of online social networks. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’06, pp. 611–617 (2006)

  14. Lansky, P., Polito, F., Sacerdote, L.: The role of detachment of in-links in scale-free networks. J. Phys. A 47(34), 345002 (2014)

    Article  Google Scholar 

  15. Lansky, P., Polito, F., Sacerdote, L.: Generalized nonlinear yule models. J. Stat. Phys. 165(3), 661–679 (2016)

    Article  ADS  MathSciNet  Google Scholar 

  16. Leskovec, J., Kleinberg, J., Faloutsos, C.: Graphs over time: densification laws, shrinking diameters and possible explanations. In: Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, KDD ’05, pp 177–187 (2005)

  17. Leskovec, J., Chakrabarti, D., Kleinberg, J., Faloutsos, C., Ghahramani, Z.: Kronecker graphs: an approach to modeling networks. J. Mach. Learn. Res. 11, 985–1042 (2010)

    MathSciNet  MATH  Google Scholar 

  18. Lofdahl, C., Stickgold, E., Skarin, B., Stewart, I.: Extending generative models of large scale networks. Procedia Manuf. 3, 3868–3875 (2015)

    Article  Google Scholar 

  19. Ma, F., Su, J.F., Yao, B.: One small-world scale-free network model having tuned parameters. In: 2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), pp. 99–103 (2017)

  20. Magner, A., Janson, S., Kollias, G., Szpankowski, W.: On symmetry of uniform and preferential attachment graphs. Electron. J. Comb. 21, 3–32 (2014)

    MathSciNet  MATH  Google Scholar 

  21. Melancon, G.: Just how dense are dense graphs in the real world? A methodological note. In: Proceedings of the 2006 AVI Workshop on Beyond Time and Errors: Novel Evaluation Methods for Information Visualization, BELIV ’06, pp. 1–7 (2006)

  22. Mislove, A., Marcon, M., Gummadi, K.P., Druschel, P., Bhattacharjee, B.: Measurement and analysis of online social networks. In: Proceedings of the 7th ACM SIGCOMM Conference on Internet Measurement, IMC ’07, pp. 29–42 (2007)

  23. Newman, M.E.: Modularity and community structure in networks. Proc. Natl. Acad. Sci. USA 103, 8577–8582 (2006)

    Article  ADS  Google Scholar 

  24. Newman, M.E.J.: The structure of scientific collaboration networks. Proc. Natl. Acad. Sci. USA 98(2), 404–409 (2001)

    Article  ADS  MathSciNet  Google Scholar 

  25. Newman, M.E.J.: Assortative mixing in networks. Phys. Rev. Lett. 89, 208701 (2002)

    Article  ADS  Google Scholar 

  26. Ostroumova, L., Ryabchenko, A., Samosvat, E.: Generalized preferential attachment: tunable power-law degree distribution and clustering coefficient. Algorithms and Models for the Web Graph, pp. 185–202. Springer, Cham (2013)

    Chapter  Google Scholar 

  27. Pachón, A., Polito, F., Sacerdote, L.: Random graphs associated to some discrete and continuous time preferential attachment models. J. Stat. Phys. 162, 1608–1638 (2016). https://doi.org/10.1007/s10955-016-1462-7

    Article  ADS  MathSciNet  MATH  Google Scholar 

  28. Pachón, A., Sacerdote, L., Yang, S.: Scale-free behavior of networks with the copresence of preferential and uniform attachment rules. Physica D (2017)

  29. Romero, D., Kleinberg, J.: The directed closure process in hybrid social-information networks, with an analysis of link formation on twitter. In: Proc. 4th International AAAI Conference on Weblogs and Social Media (2010)

  30. Sallaberry, A., Zaidi, F., Melançon, G.: Model for generating artificial social networks having community structures with small-world and scale-free properties. Soc. Netw. Anal. Min. 3, 597–609 (2013)

    Article  Google Scholar 

  31. Valafar, M., Rejaie, R., Willinger, W.: Beyond friendship graphs: a study of user interactions in flickr. In: Proceedings of the 2nd ACM Workshop on Online Social Networks, pp. 25–30 (2009)

  32. Vazquez, A., Pastor-Satorras, R., Vespignani, A.: Large-scale topological and dynamical properties of the internet. Phys. Rev. E 65, 066130 (2002)

    Article  ADS  Google Scholar 

  33. Wilson, C., Sala, A., Puttaswamy, K., Zhao, B.: Beyond social graphs: user interactions in online social networks and their implications. ACM Trans. Web (TWEB) 6, 1–31 (2012)

    Article  Google Scholar 

  34. Yang, Y., Jin, S., Zuo, Y., Xu, J.: Online social network model based on local preferential attachment. In: Web Technologies and Applications, pp. 35–46 (2014)

Download references

Acknowledgements

This work was supported by the Technology Innovation Program (20006489, “Development of the embedded robot equipment control system equipped with an AI vision module for industrial environment”) funded by the Ministry of Trade, Industry & Energy (MOTIE, Korea), and by Korea Institute of Science and Technology (KIST) under the project “HERO Part 1: Development of core technology of ambient intelligence for proactive service in digital in-home care”.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Suhyun Kim.

Additional information

Communicated by Irene Giardina.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

A Generating Time Evolving Graphs

A Generating Time Evolving Graphs

We calculate the cosine distance between the original and generated graphs at \(t = \{1,2, \ldots , 10\}\) and present the results in Fig. 13. We see that, on average, the Mixed Model is more consistent and generates graphs with less variation in eigenvector centrality score over time. For the assortativity coefficient metric, we calculate the point distance between the original and generated graphs and present the results in Fig. 14. In some datasets, e.g., EnronEmail and Digg, Mixed Model generates smaller distances while in some datasets, e.g., Skitter and YouTube, Chung–Lu performs better than other models. Forest Fire outperforms in LinuxEmail whereas Barabasi–Albert outperforms in the Brightkite dataset.

Fig. 13
figure 13

Cosine distance of eigenvector centrality of 10 snapshots of the original and generated graphs

Fig. 14
figure 14

Point distance of assortativity coefficient of 10 snapshots of the original and generated graphs

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yousuf, M.I., Kim, S. Generating Graphs by Creating Associative and Random Links Between Existing Nodes. J Stat Phys 179, 1–32 (2020). https://doi.org/10.1007/s10955-020-02517-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10955-020-02517-z

Keywords

Navigation