Abstract
For a basic version (i.e., maximizing the number of base-pairs) of the RNA secondary structure prediction problem and the construction of a parse tree for a stochastic context-free language, O(n3) time algorithms were known. For both problems, this paper shows slightly improved O(n3(log log n)1/2/(log n)1/2) time exact algorithms, which are obtained by combining Valiant's algorithm for context-free recognition with fast funny matrix multiplication. Moreover, this paper shows an O(n2.776 + (1/∈)O(1)) time approximation algorithm for the former problem and an O(n2.976 log n + (1/∈)O(1)) time approximation algorithm for the latter problem, each of which has a guaranteed approximation ratio 1 − ∈ for any positive constant ∈, where the absolute value of the logarithm of the probability is considered as an objective value in the latter problem. The former algorithm is obtained from a non-trivial modification of the well-known O(n3) time dynamic programming algorithm, and the latter algorithm is obtained by combining Valiant's algorithm with approximate funny matrix multiplication. Several related results are shown too.
Similar content being viewed by others
References
J.P. Abrahams, M. Berg, E. Batenburg, and C. Pleij, “Prediction of RNA secondary structure, including pseudoknotting by computer simulation,” Nucleic Acids Research, vol. 18, pp. 3035-3044, 1990.
T. Akutsu, “DP algorithms for RNA secondary structure prediction with pseudoknots,” Genome Informatics 1997, Universal Academy Press: Tokyo, 1997, pp. 173-179.
N. Alon, Z. Galil, and O. Margalit, “On the exponent of the all pairs shortest path problem,” in Proc. 32nd IEEE Symp. Foundations of Computer Science, IEEE, 1991, pp. 569-575.
D. Coppersmith and S. Winograd, “Matrix multiplication via arithmetic progression,” J. Symbolic Computation, vol. 9, pp. 251-280, 1990.
D. Eppstein, Z. Galil, and R. Giancarlo, “Speeding up dynamic programming,” in Proc. 29th IEEE Symp. Foundations of Computer Science, IEEE, 1988, pp. 488-496.
D. Eppstein, Z. Galil, R. Giancarlo, and G.F. Italiano, “Sparse dynamic programming II: Convex and concave cost functions,” J. ACM, vol. 39, pp. 546-567, 1992.
M.L. Fredman, “New bounds on the complexity of the shortest path problem,” SIAM Journal on Computing, vol. 5, pp. 83-89, 1976.
Z. Galil and K. Park, “Dynamic programming with convexity, concavity and sparsity,” Theoretical Computer Science, vol. 92, pp. 49-76, 1992.
M. Kanehisa and W.B. Goad, “Pattern recognition in nucleic acid sequences II: An efficient method for finding locally stable secondary structures,” Nucleic Acids Research, vol. 10, pp. 265-277, 1982.
L.L. Larmore and B. Schieber, “On-line dynamic programming with applications to the prediction of RNA secondary structure,” Journal of Algorithms, vol. 12, pp. 490-515, 1991.
Y. Sakakibara, M. Brown, E. Hughey, I.S. Mian, K. Sjlölander, R.C. Underwood, and D. Haussler, “Stochastic context-free grammars for tRNA modeling,” Nucleic Acids Research, vol. 22, pp. 5112-5120, 1994.
J. Setubal and J. Meidanis, Introduction to Computational Molecular Biology, PWS Pub. Co.: Boston, 1997.
T. Takaoka, “A new upper bound on the complexity of all pairs shortest path problem,” Information Processing Letters, vol. 43, pp. 195-199, 1992.
H. Tamaki and T. Tokuyama, “Algorithms for maximum subarray problem based on matrix multiplication,” in Proc. 9th ACM-SIAM Symp. Discrete Algorithms, ACM, 1998, pp. 446-452.
D.H. Turner, N. Sugimoto, and S.M. Freier, “RNA structure prediction,” Ann. Rev. Biophys. Chem., vol. 17, pp. 167-192, 1988.
Y. Uemura, A. Hasegawa, S. Kobayashi, and T. Yokomori, “Grammatically modeling and predicting RNA secondary structures,” in Proc. Genome Informatics Workshop VI, Universal Academy Press: Tokyo, 1995, pp. 67-76.
L.G.Valiant, “General context-free recognition in less than cubic time,” Journal of Computer and System Sciences, vol. 10, pp. 308-315, 1975.
M.S.Waterman and T.F. Smith, “RNAsecondary structure:Acomplete mathematical analysis,” Math. Biosciences, vol. 41, pp. 257-266, 1978.
M.S.Waterman and T.F. Smith, “Rapid dynamic programming algorithms forRNAsecondary structure,” Advances in Applied Mathematics, vol. 7, pp. 455-464, 1986.
M.S. Waterman, Introduction to Computational Biology, Chapman & Hall: London, 1995.
M. Zuker and P. Stiegler, “Optimal computer folding for largeRNAsequences using thermodynamics and auxiliary information,” Nucleic Acids Research, vol. 9, pp. 133-148, 1981.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Akutsu, T. Approximation and Exact Algorithms for RNA Secondary Structure Prediction and Recognition of Stochastic Context-free Languages. Journal of Combinatorial Optimization 3, 321–336 (1999). https://doi.org/10.1023/A:1009898029639
Issue Date:
DOI: https://doi.org/10.1023/A:1009898029639