Abstract
We will present the information extraction algorithms for a semantic personalised tourist recommender system Sightsplanner. The main challenges: information is spread across various information sources, it is usually stored in proprietary formats and is available in different languages in varying degrees of accuracy. We will address the mentioned challenges and describe our realization and ideas how to deal with each of them: scraping and extracting keywords from different web portals with different languages, dealing with missing multilingual data and identifying the same objects from different sources.
Preview
Unable to display preview. Download preview PDF.
References
Alves, A., Pereira, F., Biderman, A. & Ratti, C. (2009). Place Enrichment by Mining the Web. In M. Tscheligi, B. de Ruyter et al. (Eds.), Ambient Intelligence, 5859: 66–77. Berlin: Springer.
Bleiholder, J. & Naumann, F. (2008). Data Fusion. ACM Computer Surveys 41(1): 1–41.
Luberg, A., Tammet, T. & Järv, P. (2011). Smart City: A Rule-based Tourist Recommendation. In Information and Communication Technologies in Tourism 2011. New York: Springer.
Tré, G. D. & Bronselaer, A. (2010). Consistently handling geographical user data: Merging of coreferent POIs. Fuzzy Information Processing Society (NAFIPS), 2010 Annual Meeting of the North American 1(1): 117–122. New York: IEEE.
Zheng, Y., Fen, X., Xie, X. et al. (2010). Detecting nearly duplicated records in location datasets. Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems 1(1): 135–143. New York: ACM.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag/Wien
About this paper
Cite this paper
Luberg, A., Järv, P., Tammet, T. (2012). Information Extraction for a Tourist Recommender System. In: Fuchs, M., Ricci, F., Cantoni, L. (eds) Information and Communication Technologies in Tourism 2012. Springer, Vienna. https://doi.org/10.1007/978-3-7091-1142-0_29
Download citation
DOI: https://doi.org/10.1007/978-3-7091-1142-0_29
Publisher Name: Springer, Vienna
Print ISBN: 978-3-7091-1141-3
Online ISBN: 978-3-7091-1142-0