Automatic simulation of RNA editing in plants for the identification of novel putative Open Reading Frames

DIMES, Università della Calabria, Cosenza, Italy
DMI, University of Palermo, Palermo, Palermo, Italy
LMCB, University College London, University of London, London, United Kingdom
DOI
10.7287/peerj.preprints.3362v1
Subject Areas
Bioinformatics
Keywords
editing, editing plant mitocondria, protein prediction
Copyright
© 2017 Fassetti et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
Cite this article
Fassetti F, Giallombardo C, Leone O, Palopoli L, Rombo SE, Ruffolo P, Saiardi A. 2017. Automatic simulation of RNA editing in plants for the identification of novel putative Open Reading Frames. PeerJ Preprints 5:e3362v1

Abstract

In plant mitochondria an essential mechanism for gene expression is RNA editing, often influencing the synthesis of functional proteins. RNA editing alters the linearity of genetic information transfer, intro- ducing differences between RNAs and their coding DNA sequences that hind both experimental and computational research of genes. Thus common software tools for gene search, successfully exploited to find canonic genes, often can fail in discovering genes encrypted in the genome of plants. In this work we propose a novel strategy useful to intercept candidate coding sequences resulting from some possible editing substitutions on the start and stop codons of a given input organism DNA. Our method is based on the simulation of the RNA editing mechanism, in order to generate candidate Open Reading Frame (ORF) sequences that could code for some, yet unknown, proteins. Results obtained on the mtDNA of Oryza sativa are promising, since we identified ORF sequences trascripted in Oriza, that do not cor- respond to already known proteins in this organism. Part of the corresponding amino acid sequences present high homologies with proteins already discovered in other organisms, the remaining ones could represent novel proteins not yet discovered in Oryza.

Author Comment

This manuscript is intended for NETTAB 2017 Workshop Collection.