
On Universal Transfer Learning

  • Conference paper
Algorithmic Learning Theory (ALT 2007)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4754)


Abstract

In transfer learning, the aim is to solve new learning tasks with fewer examples by exploiting information gained from solving related tasks. Existing transfer learning methods have been used successfully in practice, and PAC analyses of these methods have been developed. But the key notion of relatedness between tasks has not yet been defined clearly, which makes it difficult to understand, let alone answer, questions that naturally arise in transfer learning, such as how much information to transfer, when to transfer it, and how to transfer it across tasks. In this paper we look at transfer learning from the perspective of Algorithmic Information Theory and formally solve these problems in the same sense that Solomonoff Induction solves the problem of inductive inference. We define universal measures of relatedness between tasks, and use these measures to develop universally optimal Bayesian transfer learning methods.
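
The relatedness measures defined in the paper are based on conditional Kolmogorov complexity, which is uncomputable; in applied work such quantities are typically approximated with real compressors. The sketch below illustrates that general idea only, and is not the paper's own method: it uses zlib as a stand-in compressor and the normalized compression distance (NCD) of Li et al. as a computable proxy for the information distance between two tasks, here represented as hypothetical byte strings.

    import zlib

    def compressed_len(x: bytes) -> int:
        """Compressed length of x: a computable stand-in for Kolmogorov complexity K(x)."""
        return len(zlib.compress(x, 9))

    def ncd(p: bytes, q: bytes) -> float:
        """Normalized compression distance between two task descriptions.

        Approximates the (uncomputable) normalized information distance
        max(K(p|q), K(q|p)) / max(K(p), K(q)) by replacing K with a real
        compressor. Values near 0 suggest closely related tasks; values
        near 1 suggest unrelated ones.
        """
        cp, cq, cpq = compressed_len(p), compressed_len(q), compressed_len(p + q)
        return (cpq - min(cp, cq)) / max(cp, cq)

    # Hypothetical task descriptions: training data serialized as bytes.
    task_a = b"0101010101010101" * 32
    task_b = b"0101010101010111" * 32   # shares most regularities with task_a
    task_c = bytes(range(256)) * 2      # little structure in common with task_a

    print(ncd(task_a, task_b))  # small: the tasks are closely related
    print(ncd(task_a, task_c))  # larger: little shared structure

Under the paper's Bayesian scheme, a measure of this kind would be used to weight prior belief in hypotheses for a new task according to how related the new task is to previously solved ones.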






Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mahmud, M.M.H. (2007). On Universal Transfer Learning. In: Hutter, M., Servedio, R.A., Takimoto, E. (eds) Algorithmic Learning Theory. ALT 2007. Lecture Notes in Computer Science (LNAI), vol 4754. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75225-7_14


  • DOI: https://doi.org/10.1007/978-3-540-75225-7_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-75224-0

  • Online ISBN: 978-3-540-75225-7

  • eBook Packages: Computer Science, Computer Science (R0)
