Learning Languages with Rational Kernels

Cortes, Corinna; Kontorovich, Leonid; Mohri, Mehryar

doi:10.1007/978-3-540-72927-3_26

Corinna Cortes¹, Leonid Kontorovich² & Mehryar Mohri³,¹

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4539)

Included in the following conference series:

International Conference on Computational Learning Theory

Abstract

We present a general study of learning and linear separability with rational kernels, the sequence kernels commonly used in computational biology and natural language processing. We give a characterization of the class of all languages linearly separable with rational kernels and prove several properties of the class of languages linearly separable with a fixed rational kernel. In particular, we show that for kernels with transducer values in a finite set, these languages are necessarily finite Boolean combinations of preimages by a transducer of a single sequence. We also analyze the margin properties of linear separation with rational kernels and show that kernels with transducer values in a finite set guarantee a positive margin and lead to better learning guarantees. Creating a rational kernel with values in a finite set is often non-trivial even for relatively simple cases. However, we present a novel and general algorithm, double-tape disambiguation, that takes as input a transducer mapping sequences to sequence features, and yields an associated transducer that defines a finite range rational kernel. We describe the algorithm in detail and show its application to several cases of interest.
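
To make the distinction between an unbounded kernel and a finite-range one concrete, the following minimal Python sketch compares two bigram kernels of the form K(x, y) = Σ_z T(x, z) T(y, z). It is an illustration only and is not taken from the chapter: the function names are hypothetical, the features are computed directly on strings rather than with weighted transducers, and the sketch does not implement double-tape disambiguation. In the first kernel, T(x, z) is assumed to be the number of occurrences of z in x, which can grow without bound. In the second, T(x, z) is 0 or 1, so the transducer values lie in a finite set.

```python
from collections import Counter


def ngram_counts(x: str, n: int) -> Counter:
    """Count every contiguous n-gram of x (empty if len(x) < n)."""
    return Counter(x[i:i + n] for i in range(len(x) - n + 1))


def count_kernel(x: str, y: str, n: int = 2) -> int:
    """K(x, y) = sum_z T(x, z) * T(y, z), with T(x, z) the count of z in x.

    Assumption for illustration: feature values are occurrence counts,
    so they are not confined to a finite set.
    """
    cx, cy = ngram_counts(x, n), ngram_counts(y, n)
    return sum(cx[z] * cy[z] for z in cx.keys() & cy.keys())


def presence_kernel(x: str, y: str, n: int = 2) -> int:
    """Same form of kernel, but T(x, z) is 1 if z occurs in x and 0 otherwise.

    Feature values lie in the finite set {0, 1}.
    """
    return len(set(ngram_counts(x, n)) & set(ngram_counts(y, n)))


if __name__ == "__main__":
    print(count_kernel("abab", "ab"))     # counts: "ab" occurs twice in "abab"
    print(presence_kernel("abab", "ab"))  # presence: "ab" is shared once
```

The two functions differ only in how repeated n-grams are weighted, which is the property the abstract singles out when it contrasts general rational kernels with those whose transducer values lie in a finite set.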

Author information

Authors and Affiliations

Google Research, 76 Ninth Avenue, New York, NY 10011,
Corinna Cortes & Mehryar Mohri
Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213,
Leonid Kontorovich
Courant Institute of Mathematical Sciences, 251 Mercer Street, New York, NY 10012,
Mehryar Mohri

Editor information

Nader H. Bshouty, Claudio Gentile

About this paper

Cite this paper

Cortes, C., Kontorovich, L., Mohri, M. (2007). Learning Languages with Rational Kernels. In: Bshouty, N.H., Gentile, C. (eds) Learning Theory. COLT 2007. Lecture Notes in Computer Science, vol 4539. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72927-3_26

DOI: https://doi.org/10.1007/978-3-540-72927-3_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72925-9
Online ISBN: 978-3-540-72927-3
eBook Packages: Computer Science, Computer Science (R0)
