Skip to main content
Log in

Joint inference of user community and interest patterns in social interaction networks

  • Original Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

Online social media have become an integral part of our social beings. Analyzing conversations in social media platforms can lead to complex probabilistic models to understand social interaction networks. In this paper, we present a modeling approach for characterizing social interaction networks by jointly inferring user communities and interests based on social media interactions. We present several pattern inference models: (1) interest pattern model (IPM) captures population level interaction topics, (2) user interest pattern model (UIPM) captures user specific interaction topics, and (3) community interest pattern model (CIPM) captures both community structures and user interests. We test our methods on Twitter data collected from Purdue University community. From our model results, we observe the interaction topics and communities related to two big events within Purdue University community, namely Purdue Day of Giving and Senator Bernie Sanders’ visit to Purdue University as part of Indiana Primary Election 2016. Constructing social interaction networks based on user interactions accounts for the similarity of users’ interactions on various topics of interest and indicates their community belonging further beyond connectivity. We observed that the degree-distributions of such networks follow power-law that is indicative of the existence of fewer nodes in the network with higher levels of interactions, and many other nodes with less interaction. We also discuss the application of such networks as a useful tool to effectively disseminate specific information to the target audience towards planning any large-scale events and demonstrate how to single out specific nodes in a given community by running network algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  • Abbasi A, Rashidi TH, Maghrebi M, Travis Waller S (2015) Utilising Location Based Social Media in Travel Survey Methods: bringing Twitter data into the play. In: Proceedings of the 8th ACM SIGSPATIAL international workshop on location-based social networks

  • Albert R, Barabási A-L (2002) Statistical mechanics of complex networks. Rev Mod Phys 74(1):47

    Article  MathSciNet  MATH  Google Scholar 

  • Albert R, Jeong H, Barabási A-L (2000) Error and attack tolerance of complex networks. Nature 406 (6794):378–382

    Article  Google Scholar 

  • Anderson RM, May RM, Anderson B (1992) Infectious diseases of humans: dynamics and control, vol 28. Wiley Online Library, Boca Raton

    Google Scholar 

  • Balthrop J, Forrest S, Newman MEJ, Matthew MW (2004) Technological networks the spread of computer viruses. Science 304(5670):527–529

    Article  Google Scholar 

  • Barabási A-L, Albert R (1999) Emergence of scaling in random networks. Science 286 (5439):509–512

    Article  MathSciNet  MATH  Google Scholar 

  • Blei DM, Ng AY, Michael IJ (2003) Latent dirichlet allocation. J Mach Learn Res 3(Jan):993–1022

    MATH  Google Scholar 

  • Boccaletti S, Latora V, Moreno Y, Chavez M, Hwang D-U (2006) Complex networks: structure and dynamics. Phys Rep 424(4):175–308

    Article  MathSciNet  MATH  Google Scholar 

  • Brandes U (2001) A faster algorithm for betweenness centrality. J Math Sociol 25(2):163–177

    Article  MATH  Google Scholar 

  • Brandes U (2008) On variants of shortest-path betweenness centrality and their generic computation. Soc Netw 30(2):136–145

    Article  Google Scholar 

  • Brandes U, Pich C (2007) Centrality estimation in large networks. Int J Bifurcation Chaos 17(07):2303–2318

    Article  MathSciNet  MATH  Google Scholar 

  • Broido AD, Clauset A (2018) Scale-free networks are rare. arXiv preprint arXiv:1801.03400

  • Caragea C, McNeese N, Jaiswal A, Traylor G, Kim H-W, Mitra P, Wu D, Tapia AH, Giles L, Bernard JJ (2011) Classifying text messages for the haiti earthquake. In: Proceedings of the 8th international conference on information systems for crisis response and management (ISCRAM2011)

  • Cebelak MK (2013) Location-based social networking data: doubly-constrained gravity model origin-destination estimation of the urban travel demand for Austin, TX

  • Chen Y, Mahmassani HS (2016) Exploring activity and destination choice behavior in two metropolitan areas using social networking data. In: Transportation research board 95th annual meeting

  • Clauset A, Shalizi CR, Newman MEJ (2009) Power-law distributions in empirical data. SIAM Rev 51(4):661–703

    Article  MathSciNet  MATH  Google Scholar 

  • Coleman JS, Katz E, Menzel H (1966) Medical innovation: a diffusion study. Bobbs-Merrill Co, Indianapolis

    Google Scholar 

  • Collins C, Hasan S, Ukkusuri SV (2013) A novel transit rider satisfaction metric: rider sentiments measured from online social media data. J Public Transp 16(2):2

    Article  Google Scholar 

  • Cutter SL, Finch C (2008) Temporal and spatial changes in social vulnerability to natural hazards. Proc Natl Acad Sci 105(7):2301–2306

    Article  Google Scholar 

  • Earle PS, Bowden DC, Guy M (2012) Twitter earthquake detection: earthquake monitoring in a social world. Ann Geophys 54(6):708–715. https://doi.org/10.4401/ag-5364

    Article  Google Scholar 

  • Ferrara E (2012) Community structure discovery in facebook. Int J Soc Netw Min 1(1):67–90

    Article  Google Scholar 

  • Fortunato S (2010) Community detection in graphs. Physics reports 486(3):75–174

    Article  MathSciNet  Google Scholar 

  • Freeman LC (1978) Centrality in social networks conceptual clarification. Soc Netw 1(3):215–239

    Article  Google Scholar 

  • Gao H, Barbier G, Goolsby R, Zeng D (2011) Harnessing the crowdsourcing power of social media for disaster relief. DTIC Document

  • Griffiths TL, Steyvers M (2004) Finding scientific topics. Proc Natl Acad Sci 101(suppl 1):5228–5235

    Article  Google Scholar 

  • Guy M, Earle P, Ostrum C, Gruchalla K, Horvath S (2010) Integration and dissemination of citizen reported and seismically derived earthquake information via social network technologies. In: International symposium on intelligent data analysis

  • Hasan S, Ukkusuri SV (2011) A threshold model of social contagion process for evacuation decision making. Transp Res Part B Methodol 45(10):1590–1605

    Article  Google Scholar 

  • Hasan S, Ukkusuri SV (2014) Urban activity pattern classification using topic models from online geo-location data. Transp Res Part C Emerg Technol 44:363–381

    Article  Google Scholar 

  • Hasan S, Ukkusuri SV (2015) Location contexts of user check-ins to model urban geo life-style patterns. PLoS One 10(5):e0124819

    Article  Google Scholar 

  • Hasan S, Zhan X, Ukkusuri SV (2013) Understanding urban human activity and mobility patterns using large-scale location-based data from online social media. In: Proceedings of the 2nd ACM SIGKDD international workshop on urban computing

  • Helbing D (2013) Globally networked risks and how to respond. Nature 497(7447):51–59

    Article  Google Scholar 

  • Hong L, Davison BD (2010) Empirical study of topic modeling in twitter. In: Proceedings of the first workshop on social media analytics

  • Hughes AL, St Denis LAA, Palen L, Anderson KM (2014) Online public communications by police & fire services during the 2012 Hurricane Sandy. In: Proceedings of the 32nd annual ACM conference on human factors in computing systems

  • Imran M, Elbassuoni SM, Castillo C, Diaz F, Meier P (2013) Extracting information nuggets from disaster-related messages in social media. In: Proceeding of ISCRAM, Baden-Baden, Germany

  • Jin P, Cebelak M, Yang F, Zhang J, Walton C, Ran B (2014) Location-based social networking data: exploration into use of doubly constrained gravity model for origin-destination estimation. Transp Res Rec J Transp Res Board 2430:72–82

    Article  Google Scholar 

  • Kim Y, Shim K (2014) TWILITE: a recommendation system for Twitter using a probabilistic model based on latent Dirichlet allocation. Inf Syst 42:59–77

    Article  Google Scholar 

  • Kinney R, Crucitti P, Albert R, Latora V (2005) Modeling cascading failures in the North American power grid. Eur Phys J B Condens Matter Complex Syst 46(1):101–107

    Article  Google Scholar 

  • Klarreich E (2018) Scant evidence of power laws found in real-world networks. Quanta Mag 20180215

  • Kogan M, Palen L, Anderson KM (2015) Think local, retweet global: retweeting by the geographically-vulnerable during Hurricane Sandy. In: Proceedings of the 18th ACM conference on computer supported cooperative work & social computing

  • Korolov R, Peabody J, Lavoie A, Das S, Magdon-Ismail M, Wallace W (2015) Actions are louder than words in social media. In: Proceedings of the 2015 IEEE/ACM international conference on advances in social networks analysis and mining 2015

  • Krikorian R (2013) New tweets per second record, and how. Twitter Eng Blog 16

  • Kryvasheyeu Y, Chen H, Moro E, Van Hentenryck P, Cebrian M (2015) Performance of social network sensors during Hurricane Sandy. PLoS One 10(2):e0117288

    Article  Google Scholar 

  • Kryvasheyeu Y, Chen H, Obradovich N, Moro E, Hentenryck PV, Fowler J, Cebrian M (2016) Rapid assessment of disaster damage using social media activity. Sci Adv 2(3):e1500779

    Article  Google Scholar 

  • Kumar S, Hu X, Liu H (2014) A behavior analytics approach to identifying tweets from crisis regions. In: Proceedings of the 25th ACM conference on hypertext and social media

  • Latoski SP, Dunn Jr WM, Wagenblast B, Randall J, Walker MD (2003) Managing travel for planned special events (No. FHWA-OP-04-010)

  • Lazer D, Pentland AS, Adamic L, Aral S, Barabasi AL, Brewer D, Christakis N, Contractor N, Fowler J, Gutmann M (2009) Life in the network: the coming age of computational social science. Science 323(5915):721

    Article  Google Scholar 

  • Lee JH, Gao S, Goulias KG (2016a) Comparing the origin-destination matrices from travel demand model and social media data. In: Transportation research board 95th annual meeting

  • Lee JH, Davis AW, Goulias KG (2016b) Activity space estimation with longitudinal observations of social media data. In: Paper submitted for presentation at the 95th annual meeting of the transportation research board. Washington, DC

  • Li J, Rao HR (2010) Twitter as a rapid response news service: an exploration in the context of the 2008 China earthquake. Electron J Inf Syst Dev Ctries 42:1–22

    Article  Google Scholar 

  • Liang Y, Zheng X, Zeng DD, Zhou X, Leischow SJ, Chung W (2015) Characterizing social interaction in tobacco-oriented social networks: an empirical analysis. Sci Rep 5:10060

    Article  Google Scholar 

  • Maghrebi M, Abbasi A, Rashidi TH, Travis Waller S (2015) Complementing travel diary surveys with twitter data: application of text mining techniques on activity location, type and time. In: 2015 IEEE 18th international conference on intelligent transportation systems

  • Murray JD (2002) Mathematical biology I: an introduction, vol 17 of interdisciplinary applied mathematics. Springer, New York

    Book  Google Scholar 

  • Myers SA, Sharma A, Gupta P, Lin J (2014) Information network or social network?: the structure of the twitter follow graph. In: Proceedings of the 23rd international conference on World Wide Web

  • Newman MEJ (2003) The structure and function of complex networks. SIAM Rev 45(2):167–256

    Article  MathSciNet  MATH  Google Scholar 

  • Newman MEJ (2004) Detecting community structure in networks. Eur Phys J B Condens Matter Complex Syst 38(2):321–330

    Article  Google Scholar 

  • Newman M (2010) Networks: an introduction. Oxford University Press, Oxford

    Book  MATH  Google Scholar 

  • Newman MEJ, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E 69(2):026113

    Article  Google Scholar 

  • Newman MEJ, Forrest S, Balthrop J (2002) Email networks and the spread of computer viruses. Phys Rev E 66(3):035101

    Article  Google Scholar 

  • Power R, Robinson B, Colton J, Cameron M (2014) Emergency situation awareness: twitter case studies. In: International conference on information systems for crisis response and management in mediterranean countries

  • Rezende PHR, Sadri AM, Ukkusuri SV (2016) Social network influence on mode choice and carpooling during special events: the case of Purdue game day. In: Transportation research board 95th annual meeting

  • Robert C (2007) The Bayesian choice: from decision-theoretic foundations to computational implementation. Springer, Berlin

    MATH  Google Scholar 

  • Rosen-Zvi M, Griffiths T, Steyvers M, Smyth P (2004) The author-topic model for authors and documents. In: Proceedings of the 20th conference on uncertainty in artificial intelligence

  • Saberi M, Mahmassani HS, Brockmann D, Hosseini A (2017) A complex network perspective for characterizing urban travel demand patterns: graph theoretical analysis of large-scale origin–destination demand networks. Transportation 44(6):1383–1402

    Article  Google Scholar 

  • Sachtjen ML, Carreras BA, Lynch VE (2000) Disturbances in a power transmission system. Phys Rev E 61(5):4877

    Article  Google Scholar 

  • Sadri AM (2016) Social network influence on ridesharing, disaster communication and community interaction. Ph.D. dissertation, Ph.D. thesis, Purdue University (20874)

  • Sadri AM, Lee S, Ukkusuri SV (2015) Modeling social network influence on joint trip frequency for regular activity travel decisions. Transp Res Rec 2495:83–93. https://doi.org/10.3141/2495-09

    Article  Google Scholar 

  • Sadri AM, Hasan S, Ukkusuri SV, Cebrian M (2017a) Understanding information spreading in social media during Hurricane Sandy: user activity and network properties. arXiv preprint arXiv:1706.03019

  • Sadri AM, Ukkusuri SV, Gladwin H (2017b) Modeling joint evacuation decisions in social networks: the case of Hurricane Sandy. J Choice Model 25:50–60

    Article  Google Scholar 

  • Sadri AM, Ukkusuri SV, Lee S, Clawson R, Aldrich D, Nelson MS, Seipel J, Kelly D (2017c) The role of social capital, personal networks, and emergency responders in post-disaster recovery and resilience: a study of rural communities in Indiana. Nat Hazard 90:1–30

    Google Scholar 

  • Sadri AM, Ukkusuri SV, Gladwin H (2017d) The role of social networks and information sources on hurricane evacuation decision making. Nat Hazard Rev. https://doi.org/10.1061/(ASCE)NH.1527-6996.0000244

    Article  Google Scholar 

  • Sadri AM, Hasan S, Ukkusuri SV, Cebrian M (2018a) Crisis communication patterns in social media during Hurricane Sandy. Transp Res Rec 2672(1):125–137

    Article  Google Scholar 

  • Sadri AM, Hasan S, Ukkusuri SV, Lopez JES (2018b) Analysis of social interaction network properties and growth on Twitter. Soc Netw Anal Min 8(1):56

    Article  Google Scholar 

  • Sakaki T, Okazaki M, Matsuo Y (2010) Earthquake shakes Twitter users: real-time event detection by social sensors. In: Proceedings of the 19th international conference on World wide web

  • Scott J (2012) Social network analysis. Sage, Thousand Oaks

    Google Scholar 

  • Sornette D (2009) Why stock markets crash: critical events in complex financial systems. Princeton University Press, Princeton

    Book  MATH  Google Scholar 

  • St Denis LA, Palen L, Anderson KM (2014) Mastering social media: an analysis of Jefferson county’s communications during the 2013 Colorado floods. In: 11th International ISCRAM conference

  • Starbird K, Palen L (2010) Pass it on?: retweeting in mass emergency. In: International community on information systems for crisis response and management

  • Steyvers M, Griffiths T (2007) Probabilistic topic models. Handb Latent Semant Anal 427(7):424–440

    Google Scholar 

  • Ukkusuri S, Zhan X, Sadri AM, Ye Q (2014) Use of social media data to explore crisis informatics: study of 2013 Oklahoma tornado. Transp Res Rec J Transp Res Board 2459:110–118

    Article  Google Scholar 

  • Ukkusuri SV, Mesa-Arango R, Narayanan B, Sadri AM, Qian X (2016) Evolution of the commonwealth trade network. International trade working paper 2016/07, Commonwealth Secretariat, London

  • Vespignani A (2009) Predicting the behavior of techno-social systems. Science 325(5939):425–428

    Article  MathSciNet  MATH  Google Scholar 

  • Vieweg S, Hughes AL, Starbird K, Palen L (2010) Microblogging during two natural hazards events: what twitter may contribute to situational awareness. In: Proceedings of the SIGCHI conference on human factors in computing systems

  • Voitalov I, van der Hoorn P, van der Hofstad R, Krioukov D (2018) Scale-free networks well done. arXiv preprint arXiv:1811.02071

  • Wang Q, Taylor JE (2014) Quantifying human mobility perturbation and resilience in Hurricane Sandy. PLoS One 9(11):e112608

    Article  Google Scholar 

  • Wang Q, Taylor JE (2015) Resilience of human mobility under the influence of typhoons. Procedia Eng 118:942–949

    Article  Google Scholar 

  • Weng L, Menczer F, Ahn Y-Y (2013) Virality prediction and community structure in social networks. Sci Rep 3:2522

    Article  Google Scholar 

  • Xiao C, Zhang Y, Zeng X, Wu Y (2013) Predicting user influence in social media. JNW 8(11):2649–2655

    Article  Google Scholar 

  • Yadav A, Wilder B, Rice E, Petering R, Craddock J, Yoshioka-Maxwell A, Hemler M, Onasch-Vera L, Tambe M, Woo D (2017) Influence maximization in the field: the arduous journey from emerging to deployed application. In: Proceedings of the 16th conference on autonomous agents and multiagent systems

  • Yang F, Jin PJ, Wan X, Li R, Ran B (2014) Dynamic origin-destination travel demand estimation using location based social networking data. In: Transportation research board 93rd annual meeting

  • Zhao S, Zhang K (2016) Observing individual dynamic choices of activity chains from location-based crowdsourced data. In: Transportation research board 95th annual meeting

  • Zhao Y, Levina E, Zhu J (2011) Community extraction for social networks. Proc Natl Acad Sci 108(18):7321–7326

    Article  Google Scholar 

Download references

Acknowledgements

The authors are grateful to National Science Foundation for the grant CMMI-1131503 and CMMI-1520338 to support the research presented in this paper. However, the authors are solely responsible for the findings presented in this study.

Author information

Authors and Affiliations

Authors

Contributions

All the authors have contributed to the design of the study, conduct of the research, and writing the manuscript.

Corresponding author

Correspondence to Arif Mohaimin Sadri.

Ethics declarations

Conflict of interest

Authors declare no competing financial interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOCX 243 KB)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sadri, A.M., Hasan, S. & Ukkusuri, S.V. Joint inference of user community and interest patterns in social interaction networks. Soc. Netw. Anal. Min. 9, 11 (2019). https://doi.org/10.1007/s13278-019-0551-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-019-0551-4

Navigation