Abstract
This paper extends a previous work done by the same authors [1] having the aim of improving the predictions coming from a matrix factorization based on latent factor models through an ensemble with the predictions obtained by an Opinion Mining methodology based on a linguistic approach. The experimental analysis was carried out on the Yelp business dataset, limited to the Restaurant category. An hypothesis of influence of the restaurant average rating on the number of stars given by the users is tested. An analysis of the meaning of some of the latent factors is shown.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Angioni, M., Clemente, M.L., Tuveri, F.: Combining opinion mining with collaborative filtering. In: Proceedings of WEBIST 2015, 11th International Conference on Web Information Systems and Technologies, Lisbon (2015)
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2(1–2), 1–135 (2008). doi:10.1561/1500000011
Ghose, A., Ipeirotis, P.G.: Designing novel review ranking systems: predicting usefulness and impact of reviews. In: International Conference on Electronic Commerce (ICEC) (2007)
Koren, Y., Bell, R., Volinsky, C.: Matrix factorization techniques for recommender systems. Computer 42(8), 42–49 (2009). IEEE Computer Society
Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Item-based collaborative filtering recommendation algorithms. In: Proceedings of IEEE Internet Computing, 10th International World Wide Web Conference (2001)
Hinton, G.E.: A practical guide to training restricted boltzmann machines. In: Montavon, G., Orr, G.B., Müller, K.-R. (eds.) Neural Networks: Tricks of the Trade. LNCS, vol. 7700, 2nd edn, pp. 599–619. Springer, Heidelberg (2012)
Linden, G., Smith, B., York, J.: Amazon.com recommendations. In: IEEE Internet Computing, vol. 07, no. 1, pp. 76–80 (2003)
Clemente, M.L.: Experimental results on item-based algorithms for independent domain collaborative filtering. In: Proceedings of AXMEDIS 2008, pp. 87–92. IEEE Computer Society (2008)
Tosher, A., Jahrer, M., Bell, R.M.: The BigChaos solution to the Netflix grand prize, Netflix Prize Documentation (2009)
Jahrer, M., Töscher, A., Legenstein, R.: Combining Predictions for Accurate Recommender Systems. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 693–702. ACM (2010)
Koukourikos, A., Stoisis, G., Karampiperis, P.: Sentiment analysis: a tool for rating attribution to content in recommender systems. In: 2nd Workshop on Recommender Systems for Technology Enhances Learning (RecSysTEL 2012). Saarbrucken, Germany (2012)
Quadrana, M.: E-tourism recommender systems (2013). http://hdl.handle.net/10589/84901
Levi, A., Mokryn, O., Diot, C., Taft, N.: Finding a needle in a haystack of reviews: cold start context-based hotel recommender system. In: Proceedings of the Sixth ACM Conference on Recommender Systems, pp. 115–122. ACM (2012)
Wu, Y., Ester, M.: FLAME: a probabilistic model combining aspect based opinion mining and collaborative filtering. In: WSDM 2015, Shanghai, China (2015)
Singh, V.K., Mukherjee, M., Mehta, G.K.: Combining collaborative filtering and sentiment classification for improved movie recommendations. In: Sombattheera, C., Agarwal, A., Udgata, S.K., Lavangnananda, K. (eds.) MIWAI 2011. LNCS, vol. 7080, pp. 38–50. Springer, Heidelberg (2011)
Huang, J., Rogers, S., Joo, E.: Improving restaurants by extracting subtopics from yelp reviews. SOCIAL MEDIA EXPO (2014). https://www.ideals.illinois.edu/bitstream/handle/2142/48832/Huang-iConference2014-SocialMediaExpo.pdf
Burke, R.: Hybrid recommender systems: survey and experiments. User Model. User-Adap. Inter. 12(3), 331–370 (2002)
Ganu, G., Kakodkar, Y., Marian, A.: Improving the quality of predictions using textual information in online user reviews. Inf. Syst. (2012). doi:10.1016/j.is.2012.03.001
Ganu, G., Elhadad, N., Marian, A.: Beyond the stars: improving rating predictions using review text content. In: Twelfth International Workshop on the Web and Databases (WebDB 2009), Providence, Rhode Island, USA (2009)
Trevisiol, M., Chiarandini, L., Baeza-Yates, R.: Buon Appetito - Recommending Personalized menus (2014)
Govindarajan, M.: Sentiment analysis of restaurant reviews using hybrid classification method. Int. J. Soft Comput. Artif. Intell. 2, 17–23 (2014)
Schmid, H.: Probabilistic part-of-speech tagging using decision trees. In: Proceedings of the International Conference on New Methods in Language Processing, pp. 44–49 (1994)
Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: LREC 2010, 7th International Conference on Language Resources and Evaluation, Malta, pp. 2200–2204 (2010)
Miller, G.: WordNet: an Electronic Lexical Database. Bradford Books, Cambridge (1998)
Clark, P.: Yelp’s Newest Weapon Against Fake Reviews: Lawsuits (2013). http://www.businessweek.com/articles/2013-09-09/yelps-newest-weapon-against-fake-reviews-lawsuits
Jong, J.: Predicting Rating with Sentiment Analysis (2011). http://cs229.stanford.edu/proj2011/Jong-%20PredictingRatingwithSentimentAnalysis.pdf
Mingming, F., Khademi, M.: Predicting a Business Star in Yelp from Its Reviews Text Alone. ArXiv e-prints: arXiv:1401.0864 (2014)
Benamara, F., Cesarano, C., Picariello, A., Reforgiato, D., Subrahmanian, V.S.: Sentiment analysis: adjectives and adverbs are better than adjectives alone. In: Proceedings of ICWSM 2007, International Conference on Weblogs and Social Media, pp. 203–206 (2007)
Ding, X., Liu, B., Yu, P.S.: A holistic lexicon-based approach to opinion mining. In: International Conference on Web Search and Web Data Mining, ACM, NY, USA (2008)
Agerri, R., Garcia-Serrano, A.: Q-WordNet: extracting polarity from WordNet senses. In: 7th International Conference on Language Resources and Evaluation (LREC2010), Malta (2010)
Tuveri, F., Angioni, M.: A linguistic approach to feature extraction based on a lexical database of the properties of adjectives and adverbs. In: Global WordNet Conference (GWC 2012), Matsue, Japan (2012)
Owen, S., Anil, R., Dunning, T., Friedman, E.: Mahout in Action. Manning Publications Co., Shelter Island (2011). ISBN 9781935182689
Shelter, S., Owen, S.: Collaborative Filtering with Apache Mahout. In: RecSys Challenge (2012)
Paterek, A.: Improving regularized singular value decomposition for collaborative filtering. In: Proceedings of KDDCup and Workshop, pp. 39–42. ACM Press (2007)
Acknowledgements
This study is part of a POR FESR 2007-2013 project co-funded by the Autonomous Region of Sardinia: Comunimatica (P.I.A. n. 205 co-funded according to the DGR 39/3 of 10/11/2012).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Angioni, M., Clemente, M.L., Tuveri, F. (2016). Improving Predictions with an Ensemble of Linguistic Approach and Matrix Factorization. In: Monfort, V., Krempels, KH., Majchrzak, T.A., Turk, Ž. (eds) Web Information Systems and Technologies. WEBIST 2015. Lecture Notes in Business Information Processing, vol 246. Springer, Cham. https://doi.org/10.1007/978-3-319-30996-5_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-30996-5_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-30995-8
Online ISBN: 978-3-319-30996-5
eBook Packages: Computer ScienceComputer Science (R0)