Abstract
Social tagging systems provide methods for users to categorise resources using their own choice of keywords (or “tags”) without being bound to a restrictive set of predefined terms. Such systems typically provide simple tag recommendations to increase the number of tags assigned to resources. In this paper we extend the latent Dirichlet allocation topic model to include user data and use the estimated probability distributions in order to provide personalised tag suggestions to users. We describe the resulting tripartite topic model in detail and show how it can be utilised to make personalised tag suggestions. Then, using data from a large-scale, real life tagging system, test our system against several baseline methods. Our experiments show a statistically significant increase in performance of our model over all key metrics, indicating that the model could be successfully used to provide further social tagging tools such as resource suggestion and collaborative filtering.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Blei, D., Ng, A., Jordan, M.: Latent dirichlet allocation. Journal of Machine Learning Research (3), 993–1022 (2003)
Garg, N., Weber, I.: Personalized tag suggestion for flickr. In: WWW (2008)
Griffiths, T., Steyvers, M.: Finding scientific topics. PNAS (2004)
Heinrich, G.: Parameter estimation for text analysis. Technical report, Fraunhofer IGD (2008)
Hofmann, T.: Unsupervised learning by probabilistic latent semantic analysis. Machine Learning 42(1/2), 177–196 (2001)
Hooper, R.S.: Indexer consistency tests—origin, measurements, results and utilization. Technical report, IBM, Bethesda (1965)
Hotho, A., Jaschke, R., Schmitz, C., Stumme, G.: Information retrieval in folksonomies: Search and ranking. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 411–426. Springer, Heidelberg (2006)
Plangprasopchok, A., Lerman, K.: Exploiting social annotation for automatic resource discovery. In: AAAI 2007 (2007)
Schmitz, P.: Inducing ontology from flickr tags. In: WWW (2006)
Sigurbjörnsson, B., van Zwol, R.: Flickr tag recommendation based on collective knowledge. In: WWW (2008)
Smith, A.F.M., Roberts, G.O.: Bayesian computation via the gibbs sampler and related markov chain monte-carlo methods (with discussion). Journal of the Royal Statistical Society 55, 3–23 (1993)
Teh, Y.W., Jordan, M.I., Beal, M.J., Blei, D.M.: Hierarchical dirichlet processes. JASA 101(476), 1566–1581 (2006)
Wu, X., Zhang, L., Yu, Y.: Exploring social annotations of the semantic web. In: WWW (2006)
Zunde, P., Dexter, M.E.: Indexing consistency and quality. American Documentation 20(3), 259–267 (1969)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Harvey, M., Baillie, M., Ruthven, I., Carman, M.J. (2010). Tripartite Hidden Topic Models for Personalised Tag Suggestion. In: Gurrin, C., et al. Advances in Information Retrieval. ECIR 2010. Lecture Notes in Computer Science, vol 5993. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12275-0_38
Download citation
DOI: https://doi.org/10.1007/978-3-642-12275-0_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12274-3
Online ISBN: 978-3-642-12275-0
eBook Packages: Computer ScienceComputer Science (R0)