Skip to main content
Log in

Approximation and Exact Algorithms for RNA Secondary Structure Prediction and Recognition of Stochastic Context-free Languages

  • Published:
Journal of Combinatorial Optimization Aims and scope Submit manuscript

Abstract

For a basic version (i.e., maximizing the number of base-pairs) of the RNA secondary structure prediction problem and the construction of a parse tree for a stochastic context-free language, O(n3) time algorithms were known. For both problems, this paper shows slightly improved O(n3(log log n)1/2/(log n)1/2) time exact algorithms, which are obtained by combining Valiant's algorithm for context-free recognition with fast funny matrix multiplication. Moreover, this paper shows an O(n2.776 + (1/∈)O(1)) time approximation algorithm for the former problem and an O(n2.976 log n + (1/∈)O(1)) time approximation algorithm for the latter problem, each of which has a guaranteed approximation ratio 1 − ∈ for any positive constant ∈, where the absolute value of the logarithm of the probability is considered as an objective value in the latter problem. The former algorithm is obtained from a non-trivial modification of the well-known O(n3) time dynamic programming algorithm, and the latter algorithm is obtained by combining Valiant's algorithm with approximate funny matrix multiplication. Several related results are shown too.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • J.P. Abrahams, M. Berg, E. Batenburg, and C. Pleij, “Prediction of RNA secondary structure, including pseudoknotting by computer simulation,” Nucleic Acids Research, vol. 18, pp. 3035-3044, 1990.

    Google Scholar 

  • T. Akutsu, “DP algorithms for RNA secondary structure prediction with pseudoknots,” Genome Informatics 1997, Universal Academy Press: Tokyo, 1997, pp. 173-179.

    Google Scholar 

  • N. Alon, Z. Galil, and O. Margalit, “On the exponent of the all pairs shortest path problem,” in Proc. 32nd IEEE Symp. Foundations of Computer Science, IEEE, 1991, pp. 569-575.

  • D. Coppersmith and S. Winograd, “Matrix multiplication via arithmetic progression,” J. Symbolic Computation, vol. 9, pp. 251-280, 1990.

    Google Scholar 

  • D. Eppstein, Z. Galil, and R. Giancarlo, “Speeding up dynamic programming,” in Proc. 29th IEEE Symp. Foundations of Computer Science, IEEE, 1988, pp. 488-496.

  • D. Eppstein, Z. Galil, R. Giancarlo, and G.F. Italiano, “Sparse dynamic programming II: Convex and concave cost functions,” J. ACM, vol. 39, pp. 546-567, 1992.

    Google Scholar 

  • M.L. Fredman, “New bounds on the complexity of the shortest path problem,” SIAM Journal on Computing, vol. 5, pp. 83-89, 1976.

    Google Scholar 

  • Z. Galil and K. Park, “Dynamic programming with convexity, concavity and sparsity,” Theoretical Computer Science, vol. 92, pp. 49-76, 1992.

    Google Scholar 

  • M. Kanehisa and W.B. Goad, “Pattern recognition in nucleic acid sequences II: An efficient method for finding locally stable secondary structures,” Nucleic Acids Research, vol. 10, pp. 265-277, 1982.

    Google Scholar 

  • L.L. Larmore and B. Schieber, “On-line dynamic programming with applications to the prediction of RNA secondary structure,” Journal of Algorithms, vol. 12, pp. 490-515, 1991.

    Google Scholar 

  • Y. Sakakibara, M. Brown, E. Hughey, I.S. Mian, K. Sjlölander, R.C. Underwood, and D. Haussler, “Stochastic context-free grammars for tRNA modeling,” Nucleic Acids Research, vol. 22, pp. 5112-5120, 1994.

    Google Scholar 

  • J. Setubal and J. Meidanis, Introduction to Computational Molecular Biology, PWS Pub. Co.: Boston, 1997.

    Google Scholar 

  • T. Takaoka, “A new upper bound on the complexity of all pairs shortest path problem,” Information Processing Letters, vol. 43, pp. 195-199, 1992.

    Google Scholar 

  • H. Tamaki and T. Tokuyama, “Algorithms for maximum subarray problem based on matrix multiplication,” in Proc. 9th ACM-SIAM Symp. Discrete Algorithms, ACM, 1998, pp. 446-452.

  • D.H. Turner, N. Sugimoto, and S.M. Freier, “RNA structure prediction,” Ann. Rev. Biophys. Chem., vol. 17, pp. 167-192, 1988.

    Google Scholar 

  • Y. Uemura, A. Hasegawa, S. Kobayashi, and T. Yokomori, “Grammatically modeling and predicting RNA secondary structures,” in Proc. Genome Informatics Workshop VI, Universal Academy Press: Tokyo, 1995, pp. 67-76.

    Google Scholar 

  • L.G.Valiant, “General context-free recognition in less than cubic time,” Journal of Computer and System Sciences, vol. 10, pp. 308-315, 1975.

    Google Scholar 

  • M.S.Waterman and T.F. Smith, “RNAsecondary structure:Acomplete mathematical analysis,” Math. Biosciences, vol. 41, pp. 257-266, 1978.

    Google Scholar 

  • M.S.Waterman and T.F. Smith, “Rapid dynamic programming algorithms forRNAsecondary structure,” Advances in Applied Mathematics, vol. 7, pp. 455-464, 1986.

    Google Scholar 

  • M.S. Waterman, Introduction to Computational Biology, Chapman & Hall: London, 1995.

    Google Scholar 

  • M. Zuker and P. Stiegler, “Optimal computer folding for largeRNAsequences using thermodynamics and auxiliary information,” Nucleic Acids Research, vol. 9, pp. 133-148, 1981.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Akutsu, T. Approximation and Exact Algorithms for RNA Secondary Structure Prediction and Recognition of Stochastic Context-free Languages. Journal of Combinatorial Optimization 3, 321–336 (1999). https://doi.org/10.1023/A:1009898029639

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1009898029639

Navigation