Skip to main content

A Hybrid Feature Selection Method for Predicting User Influence on Twitter

  • Conference paper
  • First Online:
Web Information Systems Engineering – WISE 2015 (WISE 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9418))

Included in the following conference series:

  • 1527 Accesses

Abstract

This paper proposes a hybrid feature selection method for predicting user influence on Twitter. A set of candidate features from Twitter is identified based on the five attributes of influencers defined in sociology. Firstly, less relevant features are filtered out with a feature-weighting algorithm. Then the Sequential Backward Floating Selection is utilized as the search strategy. A Back Propagation Neural Network is employed to evaluate the feature subset at each step of searching. Finally, an optimal feature set is obtained for predicting user influence with a high degree of accuracy. Experimental results are provided based on a real world Twitter dataset including seven million tweets associated with 200 popular users. The proposed method can provide a set of features that could be used as a solid foundation for studying complicated user influence evaluation and prediction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.merriam-webster.com/.

  2. 2.

    https://klout.com/.

  3. 3.

    http://kred.com/.

  4. 4.

    http://www.peerindex.net/.

  5. 5.

    https://followerwonk.com/.

  6. 6.

    http://nlp.stanford.edu/software/tmt/tmt-0.4/.

  7. 7.

    http://www.cs.waikato.ac.nz/ml/weka/.

  8. 8.

    http://www.mathworks.com/products/neural-network/.

  9. 9.

    https://dev.twitter.com/rest/public.

  10. 10.

    https://dev.twitter.com/streaming/overview.

References

  1. Raven, B.H.: Social influence and power. Technical report, DTIC Document (1964)

    Google Scholar 

  2. Cha, M., Haddadi, H., Benevenuto, F., Gummadi, P.K.: Measuring user influence in twitter: the million follower fallacy. ICWSM 10, 30 (2010)

    Google Scholar 

  3. Leavitt, A., Burchard, E., Fisher, D., Gilbert, S.: The influentials: new approaches for analyzing influence on twitter. Web Ecol. Proj. 4, 1–18 (2009)

    Google Scholar 

  4. Rosenman, E.T.: Retweetsbut not just retweets: quantifying and predicting influence on twitter. Ph.D. thesis, Bachelors thesis, applied mathematics. Harvard College, Cambridge (2012)

    Google Scholar 

  5. Weng, J., Lim, E.P., Jiang, J., He, Q.: Twitterrank: finding topic-sensitive influential twitterers. In: Proceedings of the third ACM International Conference on Web Search and Data Mining, pp. 261–270. ACM (2010)

    Google Scholar 

  6. Tunkelang, D.: A twitter analog to pagerank. The Noisy Channel (2009)

    Google Scholar 

  7. Yu, A., Hu, C.V., Kilzer, A.: Khyrank: Using retweets and mentions to predict influential users (2011)

    Google Scholar 

  8. Chen, W., Cheng, S., He, X., Jiang, F.: Influencerank: an efficient social influence measurement for millions of users in microblog. In: 2012 Second International Conference on Cloud and Green Computing (CGC), pp. 563–570. IEEE (2012)

    Google Scholar 

  9. Cappelletti, R., Sastry, N.: Iarank: Ranking users on twitter in near real-time, based on their information amplification potential. In: 2012 International Conference on Social Informatics (SocialInformatics), pp. 70–77. IEEE (2012)

    Google Scholar 

  10. Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: Bringing order to the web (1999)

    Google Scholar 

  11. Kwak, H., Lee, C., Park, H., Moon, S.: What is twitter, a social network or a news media? In: Proceedings of the 19th international conference on World wide web, pp. 591–600. ACM (2010)

    Google Scholar 

  12. John, G.H., Kohavi, R., Pfleger, K., et al.: Irrelevant features and the subset selection problem. In: Machine Learning: Proceedings of the Eleventh International Conference, pp. 121–129 (1994)

    Google Scholar 

  13. Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artif. intell. 97, 273–324 (1997)

    Article  MATH  Google Scholar 

  14. Keller, E., Berry, J.: The influentials: one American in ten tells the other nine how to vote, where to eat, and what to buy. Simon and Schuster, New York (2003)

    Google Scholar 

  15. Naaman, M., Boase, J., Lai, C.H.: Is it really about me? Message content in social awareness streams. In: Proceedings of the 2010 ACM Conference on Computer Supported Cooperative Work, pp. 189–192. ACM (2010)

    Google Scholar 

  16. Robnik-Šikonja, M., Kononenko, I.: Theoretical and empirical analysis of relieff and rrelieff. Mach. Learn. 53, 23–69 (2003)

    Article  MATH  Google Scholar 

  17. Robnik-Šikonja, M., Kononenko, I.: An adaptation of relief for attribute estimation in regression. In: Machine Learning: Proceedings of the Fourteenth International Conference (ICML 1997), pp. 296–304 (1997)

    Google Scholar 

  18. Jain, A., Zongker, D.: Feature selection: evaluation, application, and small sample performance. IEEE Trans. Pattern Anal. Mach. Intell 19, 153–158 (1997)

    Article  Google Scholar 

  19. Goh, A.: Back-propagation neural networks for modeling complex systems. Artif. Intell. Eng. 9, 143–151 (1995)

    Article  Google Scholar 

Download references

Acknowledgments

The work presented in this paper was partially supported by Macquarie University Research Excellence Scholarship (Allocation No.2013115), an Australian Research Council Linkage Project (LP120200231) and the China Scholarship Council. We also thank anonymous reviewers for their valuable comments.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yan Mei .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Mei, Y., Zhang, Z., Zhao, W., Yang, J., Nugroho, R. (2015). A Hybrid Feature Selection Method for Predicting User Influence on Twitter. In: Wang, J., et al. Web Information Systems Engineering – WISE 2015. WISE 2015. Lecture Notes in Computer Science(), vol 9418. Springer, Cham. https://doi.org/10.1007/978-3-319-26190-4_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-26190-4_32

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26189-8

  • Online ISBN: 978-3-319-26190-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics