Information Preserving Dimensionality Reduction

  • Conference paper

Included in the conference series: Algorithmic Learning Theory (ALT 2015)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9355)

Abstract

Dimensionality reduction is a common preprocessing step in many machine learning tasks. The goal is to design data representations that, on the one hand, reduce the dimension of the data (and therefore allow faster processing) and, on the other hand, retain as much task-relevant information as possible. We look at generic dimensionality reduction approaches that do not rely on much task-specific prior knowledge. However, we focus on scenarios in which unlabeled samples are available and can be used to evaluate the usefulness of candidate data representations.

We aim to provide theoretical principles that help explain the success of certain dimensionality reduction techniques in classification tasks, and that guide the choice of a dimensionality reduction tool and its parameters. Our analysis is based on formalizing the often implicit assumption that “similar instances are likely to have similar labels”. Experimental results support our theoretical analysis.
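To make the setup concrete, here is a minimal illustrative sketch in Python with NumPy. It is not the construction analyzed in the paper: it reduces dimension with a generic, task-agnostic random Gaussian projection (in the spirit of Johnson–Lindenstrauss), and scores a candidate representation by how often a point's nearest neighbor shares its label, a crude empirical proxy for the “similar instances are likely to have similar labels” assumption. The function names random_projection and neighbor_label_agreement, and the toy data, are hypothetical and introduced here purely for illustration.

```python
import numpy as np

def random_projection(X, k, seed=0):
    # Generic, task-agnostic reduction: multiply by a random Gaussian
    # matrix scaled by 1/sqrt(k) (Johnson-Lindenstrauss style sketch).
    rng = np.random.default_rng(seed)
    R = rng.normal(size=(X.shape[1], k)) / np.sqrt(k)
    return X @ R

def neighbor_label_agreement(X, y):
    # Fraction of points whose nearest neighbor (excluding themselves)
    # carries the same label: a crude proxy for "similar instances are
    # likely to have similar labels" under the representation X.
    sq = np.sum(X ** 2, axis=1)
    D = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)  # squared distances
    np.fill_diagonal(D, np.inf)                      # ignore self-matches
    nearest = np.argmin(D, axis=1)
    return float(np.mean(y[nearest] == y))

# Toy data: two 50-dimensional Gaussian classes, reduced to 5 dimensions.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, (100, 50)),
               rng.normal(1.5, 1.0, (100, 50))])
y = np.array([0] * 100 + [1] * 100)
Z = random_projection(X, k=5)
print("agreement, original:", neighbor_label_agreement(X, y))
print("agreement, reduced: ", neighbor_label_agreement(Z, y))
```

On data like this, where the classes are reasonably separated, the agreement score stays high after projection, illustrating the kind of property the paper studies: a generic reduction can preserve the label-relevant neighborhood structure that makes similarity-based prediction work.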

Author information

Corresponding author

Correspondence to Shrinu Kushagra.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Kushagra, S., Ben-David, S. (2015). Information Preserving Dimensionality Reduction. In: Chaudhuri, K., Gentile, C., Zilles, S. (eds) Algorithmic Learning Theory. ALT 2015. Lecture Notes in Computer Science, vol 9355. Springer, Cham. https://doi.org/10.1007/978-3-319-24486-0_16

  • DOI: https://doi.org/10.1007/978-3-319-24486-0_16

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24485-3

  • Online ISBN: 978-3-319-24486-0

  • eBook Packages: Computer Science, Computer Science (R0)
