CNNPSP: Pseudouridine Sites Prediction Based on Deep Learning

Fan, Yongxian; Li, Yongzhen; Yang, Huihua; Pan, Xiaoyong

doi:10.1007/978-3-030-33607-3_32

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 11871)

Included in the following conference series: International Conference on Intelligent Data Engineering and Automated Learning

Abstract

Pseudouridine (ψ) is an RNA modification formed at specific sites in an RNA sequence through the catalytic action of pseudouridine synthase during gene transcription. It is the most prevalent RNA modification found so far and plays a vital role in normal biological functions. Several computational methods have been proposed to predict pseudouridine sites, but they still do not achieve high accuracy. Deep learning has become a popular area of machine learning, and the convolutional neural network (CNN) is one of its most widely used algorithms. Because a CNN can automatically learn hidden features from data and make accurate predictions, a new CNN-based algorithm was proposed to extract features from RNA sequences with and without ψ sites. On this basis, a predictor called CNNPSP was developed to predict ψ sites in RNAs across three species (H. sapiens, S. cerevisiae and M. musculus). Both the rigorous jackknife test and an independent test indicated that the new predictor outperformed existing methods on this task.
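The abstract does not specify how sequences are encoded or how the network is configured. As a rough illustration of how a convolutional filter can pick up sequence features, the sketch below one-hot encodes an RNA window and slides a single filter over it, followed by ReLU and global max pooling. The function names, the example window, and the "GUU" filter are all hypothetical and are not taken from the paper; a trained CNN would learn many such filters from data rather than use a hand-set one.

```python
# Illustrative sketch only: the encoding, window, and filter are assumptions,
# not the CNNPSP architecture.

NUCLEOTIDES = "ACGU"


def one_hot_encode(seq):
    """Map an RNA string to a list of 4-element one-hot rows (A, C, G, U)."""
    rows = []
    for base in seq.upper():
        row = [0.0] * len(NUCLEOTIDES)
        idx = NUCLEOTIDES.find(base)
        if idx >= 0:  # unknown symbols (e.g. N) stay all-zero
            row[idx] = 1.0
        rows.append(row)
    return rows


def conv1d_relu(encoded, kernel):
    """Slide one filter (width x 4) along the sequence: no padding, then ReLU."""
    width = len(kernel)
    activations = []
    for start in range(len(encoded) - width + 1):
        total = 0.0
        for k in range(width):
            for c in range(len(NUCLEOTIDES)):
                total += encoded[start + k][c] * kernel[k][c]
        activations.append(max(0.0, total))
    return activations


def global_max_pool(activations):
    """Keep the strongest filter response anywhere in the window."""
    return max(activations) if activations else 0.0


# Hypothetical filter whose weights are the one-hot pattern of "GUU",
# so its response counts matching positions at each offset.
motif_filter = one_hot_encode("GUU")

window = "ACGUUCA"  # hypothetical sequence window around a candidate uridine
encoded = one_hot_encode(window)
activations = conv1d_relu(encoded, motif_filter)
score = global_max_pool(activations)
```

In a full model, the pooled responses of many learned filters would feed a classifier that outputs the probability of a ψ site; that part is omitted here.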

This work was supported in part by the National Natural Science Foundation of China (Nos. 61462018, 61762026), the Guangxi Natural Science Foundation (No. 2017GXNSFAA198278), the Guangxi Key Laboratory of Trusted Software (No. kx201403), and the Guangxi Colleges and Universities Key Laboratory of Intelligent Processing of Computer Images and Graphics (No. GIIP201502).

Author information

Authors and Affiliations

School of Computer and Information Security, Guilin University of Electronic Technology, Guilin, 541004, China
Yongxian Fan & Huihua Yang
School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin, 541004, China
Yongzhen Li
Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai, 200240, China
Xiaoyong Pan
School of Automation, Beijing University of Posts and Telecommunications, Beijing, 100876, China
Huihua Yang

Corresponding authors

Correspondence to Yongxian Fan or Xiaoyong Pan.

Editor information

Editors and Affiliations

University of Manchester, Manchester, UK
Hujun Yin
Technical University of Madrid, Madrid, Spain
David Camacho
University of Birmingham, Birmingham, UK
Peter Tino
University of Huelva, Huelva, Spain
Antonio J. Tallón-Ballesteros
University of Exeter, Exeter, UK
Ronaldo Menezes
University of Manchester, Manchester, UK
Richard Allmendinger

About this paper

Cite this paper

Fan, Y., Li, Y., Yang, H., Pan, X. (2019). CNNPSP: Pseudouridine Sites Prediction Based on Deep Learning. In: Yin, H., Camacho, D., Tino, P., Tallón-Ballesteros, A., Menezes, R., Allmendinger, R. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2019. Lecture Notes in Computer Science, vol 11871. Springer, Cham. https://doi.org/10.1007/978-3-030-33607-3_32

DOI: https://doi.org/10.1007/978-3-030-33607-3_32
Published: 18 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33606-6
Online ISBN: 978-3-030-33607-3
eBook Packages: Computer Science, Computer Science (R0)
