Skip to main content

Splice Junction Prediction in DNA Sequence Using Multilayered RNN Model

  • Conference paper
  • First Online:
Advances in Decision Sciences, Image Processing, Security and Computer Vision (ICETE 2019)

Part of the book series: Learning and Analytics in Intelligent Systems ((LAIS,volume 3))

Included in the following conference series:

Abstract

Genes are parts of a DNA sequence responsible for protein synthesis. Splicing more specifically refers to a post-transcriptional modification that is responsible for multiple protein synthesis from a single gene. The classification of the splice junction has remained quite a challenging task in the field of bioinformatics and is equally important as the synthesized proteins are responsible for the unique characteristics observed in different living organisms. In this study, we propose a state of the art algorithm in splice junction prediction from DNA sequence using a multilayered stacked RNN model, which achieves an overall accuracy of 99.95% and an AUROC score of 1.0 for exon-intron, intron-exon as well as no-junction classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. PREMIER Biosoft: gene splicing overview & techniques. www.premierbiosoft.com/tech_notes/gene-splicing.html. Accessed 2019

  2. Medical Xpress: predicting how splicing errors impact disease risk. https://medicalxpress.com/news/2018-08-splicing-errors-impact-disease.html. Accessed 2019

  3. Murray RK, Bender DA, Botham KM, Kennelly PJ, Rodwell VW, Weil PA. Harpers illustrated biochemistry, pp 352–354. Accessed 2019

    Google Scholar 

  4. Salzberg S (1995) Locating protein coding regions in human DNA using a decision tree algorithm. J Comput Biol: J Comput Mol Cell Biol 2:473–485

    Article  Google Scholar 

  5. Ngoc Giang N, Anh Tran V, Luu Ngo D, Phan D, Lumbanraja F, Faisal MR, Abapihi B, Kubo M, Satou K (2016) DNA sequence classification by convolutional neural network. J Biomed Sci Eng 9:280–286

    Article  Google Scholar 

  6. Damasevicius R (2008) Splice site recognition in DNA sequences using k-mer frequency based mapping for support vector machine with power series kernel. In: International conference on complex, intelligent and software intensive systems, pp 687–692, March 2008

    Google Scholar 

  7. Cervantes J, Li X, Yu W (2009) Splice site detection in DNA sequences using a fast classification algorithm. In: SMC’09 Proceedings of the 2009 IEEE international conference on systems, man and cybernetics, pp 2683–2688, October 2009

    Google Scholar 

  8. Kerdprasop N, Kerdprasop K (2010) A high recall DNA splice site prediction based on association analysis. In: International conference on applied computer science proceedings

    Google Scholar 

  9. Mandal DI (2015) A novel approach for predicting DNA splice junctions using hybrid machine learning algorithms. Soft Comput 19:3431–3444

    Article  Google Scholar 

  10. Lee B, Lee T, Na B, Yoon S (2015) DNA-level splice junction prediction using deep recurrent neural networks. CoRR abs/1512.05135

    Google Scholar 

  11. Zhang Y, Liu X, MacLeod JN, Liu J (2016) Deepsplice: deep classification of novel splice junctions revealed by RNA-seq. In: 2016 IEEE international conference on bioinformatics and biomedicine (BIBM), pp 330–333. IEEE, December 2016

    Google Scholar 

  12. NCBI: Genbank. ftp://ftp.ncbi.nlm.nih.gov/genbank. Accessed 2019

    Google Scholar 

  13. Revolvy: Dna codon table. https://www.revolvy.com/page/DNA-codon-table. Accessed 2019

  14. Pham TH, Tran DH, Ho TB, Satou K, Valiente G. Qualitatively predicting acetylation & methylation areas in DNA sequences. http://www.jaist.ac.jp/~tran/nucleosome/index.htm. Accessed 2019

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rahul Sarkar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sarkar, R., Chatterjee, C.C., Das, S., Mondal, D. (2020). Splice Junction Prediction in DNA Sequence Using Multilayered RNN Model. In: Satapathy, S.C., Raju, K.S., Shyamala, K., Krishna, D.R., Favorskaya, M.N. (eds) Advances in Decision Sciences, Image Processing, Security and Computer Vision. ICETE 2019. Learning and Analytics in Intelligent Systems, vol 3. Springer, Cham. https://doi.org/10.1007/978-3-030-24322-7_6

Download citation

Publish with us

Policies and ethics