spliceR: An R package for classification of alternative splicing and prediction of coding potential from RNA-seq data

The Bioinformatics Centre, University of Copenhagen, Copenhagen, Denmark
Biotech Research and Innovation Centre (BRIC), University of Copenhagen, Copenhagen, Denmark
The Danish Stem Cell Centre (DanStem) Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark
The Finsen Laboratory, Rigshospitalet, University of Copenhagen, Copenhagen, Denmark
DOI
10.7287/peerj.preprints.80v2
Subject Areas
Bioinformatics, Cell Biology, Computational Biology, Genomics, Molecular Biology
Keywords
alternative splicing, splicing, RNA-seq, bioconductor, gene expression, R Software, nonsense mediated decay
Copyright
© 2013 Vitting-Seerup et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Cite this article
Vitting-Seerup K, Porse BT, Sandelin A, Waage JE. 2013. spliceR: An R package for classification of alternative splicing and prediction of coding potential from RNA-seq data. PeerJ PrePrints 1:e80v2

Abstract

Background: With the increasing depth and decreasing costs of RNA-sequencing researchers are now able to profile the transcriptome with unprecedented detail. These advances not only allow for precise approximation of gene expression levels, but also for the characterization of alternative transcript usage/switching between conditions. Recent software improvements in full-length transcript deconvolution prompted us to develop spliceR, an R package for classification of alternative splicing and prediction of coding potential.

Results: spliceR uses the full-length transcripts output from RNA-seq assemblers, to detect single- and multiple exon skipping, alternative donor and acceptor sites, intron retention, alternative first or last exon usage, and mutually exclusive exon events. For each of these events spliceR also annotates the genomic coordinates of the differentially spliced elements facilitating downstream sequence analysis. Furthermore, isoform fraction values are calculated for effective post-filtering, i.e. identification of transcript switching between conditions. Lastly spliceR predicts the coding potential, as well as the potential nonsense mediated decay (NMD) sensitivity of each transcript.

Conclusions: spliceR is a easy-to-use tool that allows detection of alternative splicing, transcript switching and NMD sensitivity from RNA-seq data, extending the usability of RNA-seq and assembly technologies. spliceR is implemented as an R package and is freely available from the Bioconductor repository (http://www.bioconductor.org/packages/2.13/bioc/html/spliceR.html).

Author Comment

As a result of comments from the scientific community, we have revised the spliceR manuscript to be clearer, and more concise. In addition, in this version we include the analysis of a pre-existing public dataset to demonstrate the ease of use and features of spliceR.