Skip to main content

Challenges of Mining Twitter Data for Analyzing Service Performance: A Case Study of Transportation Service in Malaysia

  • Conference paper
  • First Online:
Business Information Systems Workshops (BIS 2021)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 444))

Included in the following conference series:

Abstract

Literature has shown the prominence of extracting social media data to understand public opinion. However, there are little works on how these opportunities can be realized and the challenges in exploiting the opportunities in the transportation industry. Further, data quality and availability using social media may vary according to different demographics due to population size and languages used. Additionally, most of the related prior studies that show the opportunities of using social media data were conducted in North and South America. With this proposition, we seek to investigate the challenges of using Twitter data with text mining techniques for understanding users’ opinions and sentiment through a case study of using the data to assess public transportation service performance specifically in a Malaysian context. Our findings indicate that social media data can only be useful in generating reasonable insights if users could input informative words for forming discussed topics to derive opinion, and incline towards a certain sentiment with adjectives. The findings also identified the need for a more proficient dictionary to classify multilingual tweets. Our research provides original evidence proving the potential of using social media data to assess public transportation services performance which may vary depending on the demographics of the social media users.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 74.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Windasari, I.P., Uzzi, F.N., Satoto, K.I.: Sentiment analysis on Twitter posts: an analysis of positive or negative opinion on GoJek. In: 4th International Conference on Information Technology, Computer, And Electrical Engineering. IEEE 2017, October 2017

    Google Scholar 

  2. Almselati, A., Rahmat, R., Jaafar, O.: An overview of urban transport in Malaysia. Soc. Sci. 6(1), 24–33 (2011)

    Article  Google Scholar 

  3. Borhan, M., Syamsunur, D., Mohd Akhir, N., Mat Yazid, M., Ismail, A., Rahmat, R.: Predicting the use of public transportation: a case study from Putrajaya, Malaysia. Sci. World J. 2014, 1–9 (2014)

    Google Scholar 

  4. Hovy, D., Johannsen, A., Søgaard, A.: User review sites as a resource for large-scale sociolinguistic studies. In: Proceedings of the 24th International Conference on World Wide Web, pp. 452–461, May 2015

    Google Scholar 

  5. Cozza, V., Petrocchi, M., Spognardi, A.: Mining implicit data association from Tripadvisor Hotel Reviews. In: EDBT/ICDT Workshops, pp. 56–61, March 2018

    Google Scholar 

  6. Collins, C., Hasan, S., Ukkusuri, S.: A novel transit rider satisfaction metric: rider sentiments measured from online social media data. J. Public Transp. 16(2), 21–45 (2013)

    Article  Google Scholar 

  7. Schweitzer, L.: Planning and social media: a case study of public transit and stigma on Twitter. J. Am. Plann. Assoc. 80(3), 218–238 (2014)

    Article  Google Scholar 

  8. Haghighi, N.N., Liu, X.C., Wei, R., Li, W., Shao, H.: Using Twitter data for transit performance assessment: a framework for evaluating transit riders’ opinions about quality of service. Public Transport 10(2), 363–377 (2018). https://doi.org/10.1007/s12469-018-0184-4

    Article  Google Scholar 

  9. Mendez, J., Lobel, H., Parra, D., Herrera, J.: Using Twitter to infer user satisfaction with public transport: the case of Santiago, Chile. IEEE Access 7, 60255–60263 (2019)

    Article  Google Scholar 

  10. Hoffer, J.A., Venkataraman, R., Topi, H.: Modern database management (Global Edition). ISBN-13: 978–1292263359. Pearson. 2019 (2019)

    Google Scholar 

  11. Manning, C.D., Schütze, H., Raghavan, P.: Introduction to Information Retrieval. Cambridge University Press (2008)

    Book  Google Scholar 

  12. Casas, I., Delmelle, E.: Tweeting about public transit — Gleaning public perceptions from a socialmedia microblog. Case Stud. TransportPolicy 5(4), 634–642 (2017)

    Article  Google Scholar 

  13. Statista: Twitter: number of monthly active users 2010–2019 (2019). https://www.statista.com/statistics/282087/number-of-monthly-active-twitter-users/. Accessed 5 Jan 2020

  14. Datareportal. Digital 2021: Malaysia. https://datareportal.com/reports/digital-2021-malaysia. Accessed 24 May 2021

  15. ‘Public transport in the Klang Valley (2019). https://en.wikipedia.org/wiki/Public_transport_in_the_Klang_Valley. Accessed 7 Jan 2020

  16. Brabham, D.: motivations for participation in a crowdsourcing application to improve public engagement in transit planning. J. Appl. Commun. Res. 40(3), 307–328 (2012)

    Article  Google Scholar 

  17. Evans-Cowley, J., Griffin, G.: Microparticipation with social media for community engagement in transportation planning. Transp. Res. Record J. Transp. Res. Board 2307(1), 90–98 (2012)

    Article  Google Scholar 

  18. Grant-Muller, S., Gal-Tzur, A., Kuflik, T., Minkov, E., Shoor, I., Nocera, S.: Enhancing transport data collection through social media sources: methods, challenges and opportunities for textual data. IET Intel. Transport Syst. 9(4), 407–417 (2015)

    Article  Google Scholar 

  19. Blei, D., Ng, A., Jordan, M.: Latent Dirichlet Allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    Google Scholar 

  20. Weng, J., Lim, E.P., Jiang, J., He, Q.: Twitterrank: finding topic-sensitive influential twitterers. In: Proceedings of the Third ACM International Conference on Web Search and Data Mining, pp. 261–270, February 2010

    Google Scholar 

  21. Yang, M., Rim, H.: Identifying interesting Twitter contents using topical analysis. Expert Syst. Appl. 41(9), 4330–4336 (2014)

    Article  Google Scholar 

  22. Hong, L., Davison, B.D.: Empirical study of topic modeling in Twitter. In Proceedings of the First Workshop on Social Media Analytics, pp. 80–88, July 2010

    Google Scholar 

  23. ‘WordNet | A Lexical Database for English’, https://wordnet.princeton.edu/. Accessed 7 Jan 2020

  24. Ren, Y., Wang, R., Ji, D.: A topic-enhanced word embedding for Twitter sentiment classification. Inf. Sci. 369, 188–198 (2016)

    Article  Google Scholar 

  25. Ali, F., et al.: Transportation sentiment analysis using word embedding and ontology-based topic modeling. Knowl.-Based Syst. 174, 27–42 (2019)

    Article  Google Scholar 

  26. Qi, B., Costin, A., Jia, M.: A framework with efficient extraction and analysis of Twitter data for evaluating public opinions on transportation services. Travel Behav. Soc. 21, 10–23 (2020)

    Article  Google Scholar 

  27. Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment strength detection in short informal text. J. Am. Soc. Inf. Sci. Technol. 61(12), 2544–2558 (2010). https://doi.org/10.1002/asi.21416

    Article  Google Scholar 

  28. Abu Bakar, M., Mohd Ariff, N., Hui, E.: Exploratory data analysis of Twitter’s rhythm in Malaysia. In: AIP Conference Proceedings 2019 (2019)

    Google Scholar 

  29. Octoparse. https://www.octoparse.com/. Accessed 25 May 2021

  30. Huberman, B.A., Romero, D.M., Wu, F.: Social networks that matter: Twitter under the microscope. arXiv preprint arXiv:0812.1045 (2008)

    Google Scholar 

Download references

Acknowledgments

This paper was supported by Partnership Grant CR-UM-SST-DCIS-2018–01 and RK004–2017 between Sunway University and the University of Malaya.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hui Na Chua .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chua, H.N., Liao, A.W.Q., Low, Y.C., Lee, A.S.H., Ismail, M.A. (2022). Challenges of Mining Twitter Data for Analyzing Service Performance: A Case Study of Transportation Service in Malaysia. In: Abramowicz, W., Auer, S., Stróżyna, M. (eds) Business Information Systems Workshops. BIS 2021. Lecture Notes in Business Information Processing, vol 444. Springer, Cham. https://doi.org/10.1007/978-3-031-04216-4_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-04216-4_21

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-04215-7

  • Online ISBN: 978-3-031-04216-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics