Abstract
Motivated by a general interest to understand sequence based signaling in the cell, and in particular how (even distal) genomic strings communicate each other in the transcriptional process, we present a bioinformatic investigation on genomic repeats which occur in multiple genes. Unconventional graph based methods to abstractly represent genomes, gene networks, and genomic languages are provided. In particular, the distribution of long repeats along genomic sequences from three specific organisms (genome of N. equitans, of E. coli, and chromosome IV of S. cerevisiae) is computed, and efficiently visualized along the entire sequences, with the unexpected result to have most of them occurring inside genes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bartel, D.P.: MicroRNAs: Target recognition and regulatory functions. Cell 136(2), 215–233 (2009), doi:10.1016/j.cell.2009.01.002
Bilbao, C., et al.: The relationship between microsatellite instability and PTEN gene mutations in endometrial cancer. International Journal Cancer 119(3), 563–570 (2006)
Burden, C.J., Jing, J., Wilson, S.R.: Alignment-free sequence comparison for biologically realistic sequences of moderate length. Statistical Applications in Genetics and Molecular Biology 11(1), Article 3 (2012)
Castellini, A., et al.: Genome classification by dictionary based indexes. Poster. Presented at the Int. Conf. on Pattern Recognition in Bioinformatics, PRIB (2011)
Castellini, A., Franco, G., Manca, V.: A dictionary based informational genome analysis. BMC Genomics 13(1), 485 (2012), doi:10.1186/1471-2164-13-485
Castellini, A., Franco, G., Milanese, A.: A genome analysis based on repeat sharing gene networks (in preparation)
Chor, B., et al.: Genomic DNA k-mer spectra: models and modalities. Genome Biology 10, R108 (2009)
Dunham, I., et al.: (The ENCODE Project Consortium): An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012)
Franco, G.: Perspectives in computational genome analysis, book chapter in Discrete and Topological Models in Molecular Biology. Springer (to appear, 2013)
Friedman, R.C., Farh, K.K., Burge, C.B., Bartel, D.P.: Most mammalian mRNAs are conserved targets of microRNAs. Genome Res. 19(1), 92–105 (2009)
Gottesman, S.: The small RNA regulators of Escherichia coli: roles and mechanisms. Annu. Rev. Microbiol. 58, 303–328 (2004)
International Human Genome Sequencing Consortium, Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001)
Liran, I., et al.: Cell lineage analysis of acute leukemia relapse uncovers the role of replication-rate heterogeneity and microsatellite instability. Blood 120, 603–612 (2012)
Manca, V.: Infobiotics. Springer (2013)
Song, K., Ren, J., Zhai, Z., Liu, X., Deng, M., Sun, F.: Alignment-free sequence comparison based on next-generation sequencing reads. Journal of Computational Biology 20 (2), 64–79 (2013)
Pevzner, P.A.: DNA physical mapping and alternating eulerian cycles in colored graphs. Algorithmica, 77–105 (1995)
Sanyal, A., et al.: The long-range interaction landscape of gene promoters. Nature 489, 109–113 (2012)
Searls, D.B.: The language of genes. Nature 420, 211–217 (2002)
Sharma, C.M., Vogel, J.: Experimental approaches for the discovery and characterization of regulatory small RNA. Curr. Opin. Microbiol. 12, 536–546 (2009)
Tay, Y., et al.: Coding-independent regulation of the tumor suppressor PTEN by competing endogenous mRNAs. Cell 147(2), 344–357 (2011)
Ukkonen, E.: Approximate string matching with q-grams and maximal matches. Theoretical Computer Science 92(1), 191–211 (1992)
Wagner, E.G.H., Simon, R.W.: Antisense RNA control in bacteria, phages, and plasmids. Annual Review of Microbiology 48, 713–742 (1994)
Zhou, F., Olman, V., Xu, Y.: Barcodes for genomes and applications. BMC Bioinformatics 9, 546 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Franco, G., Milanese, A. (2013). An Investigation on Genomic Repeats. In: Bonizzoni, P., Brattka, V., Löwe, B. (eds) The Nature of Computation. Logic, Algorithms, Applications. CiE 2013. Lecture Notes in Computer Science, vol 7921. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39053-1_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-39053-1_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39052-4
Online ISBN: 978-3-642-39053-1
eBook Packages: Computer ScienceComputer Science (R0)