Abstract
Pseudouridine (ψ) is a kind of RNA modification, which is formed at specific site of RNA sequence due to the catalytic action of Pseudouridine synthase in the process of gene transcription. It is the most prevalent RNA modification found so far, and plays a vital role in normal biological functions. Several computational methods have been proposed to predict Pseudouridine sites, but these methods still do not achieve high accuracy. At present, deep learning has become a popular field of machine learning, and convolutional neural network (CNN) is one widely used algorithm. CNN can automatically dig into the hidden features of data and make accurate predictions, so a new algorithm based on CNN was proposed for extracting the features from RNA sequences with and without ψ sites. And a predictor called CNNPSP was developed to predict ψ sites in RNAs across three species (H. sapiens, S. cerevisiae and M. musculus). Both the rigorous jackknife test and independent test indicated that the new predictor outperformed the existing methods in this task.
This work was supported in part by the National Natural Science Foundation of China (No. 61462018, 61762026), Guangxi Natural Science Foundation (No. 2017GXNSFAA198278), Guangxi Key Laboratory of Trusted Software (No. kx201403), Guangxi Colleges and Universities Key Laboratory of Intelligent Processing of Computer Images and Graphics (No. GIIP201502).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Czerwoniec, A., et al.: MODOMICS: a database of RNA modification pathways. 2008 update. Nucleic Acids Res. 37, D118–D121 (2009)
Carlile, T.M., Rojas-Duran, M.F., Zinshteyn, B., Shin, H., Bartoli, K.M., Gilbert, W.V.: Pseudouridine profiling reveals regulated mRNA pseudouridylation in yeast and human cells. Nature 515, 143 (2014)
Chen, W., Tang, H., Ye, J., Lin, H., Chou, K.-C.: iRNA-PseU: identifying RNA pseudouridine sites. Mol. Ther-Nucleic Acids 5, e332 (2016)
Li, Y.-H., Zhang, G., Cui, Q.: PPUS: a web server to predict PUS-specific pseudouridine sites. Bioinformatics 31, 3362–3364 (2015)
Pan, X., Rijnbeek, P., Yan, J., Shen, H.-B.: Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks. BMC Genomics. 19, 511 (2018)
Alipanahi, B., Delong, A., Weirauch, M.T., Frey, B.J.: Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat. Biotech. 33, 831 (2015)
Pan, X., Shen, H.-B.: RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach. BMC Bioinf. 18, 136 (2017)
Sun, W.-J., et al.: RMBase: a resource for decoding the landscape of RNA modifications from high-throughput sequencing data. Nucleic Acids Res. 44, D259–D265 (2016)
Speir, M.L., et al.: The UCSC genome browser database: 2016 update. Nucleic Acids Res. 44, D717–D725 (2016)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Fan, Y., Li, Y., Yang, H., Pan, X. (2019). CNNPSP: Pseudouridine Sites Prediction Based on Deep Learning. In: Yin, H., Camacho, D., Tino, P., Tallón-Ballesteros, A., Menezes, R., Allmendinger, R. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2019. IDEAL 2019. Lecture Notes in Computer Science(), vol 11871. Springer, Cham. https://doi.org/10.1007/978-3-030-33607-3_32
Download citation
DOI: https://doi.org/10.1007/978-3-030-33607-3_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33606-6
Online ISBN: 978-3-030-33607-3
eBook Packages: Computer ScienceComputer Science (R0)