cd167693e9315e22.tex
1: \begin{abstract} % abstract
2: \noindent
3: \textbf{Motivation:} Assigning RNA-seq reads to their transcript of origin is a fundamental task in
4: transcript expression estimation. Where ambiguities in assignments exist due to
5: transcripts sharing sequence, e.g. alternative isoforms or alleles, the problem
6: can be solved through probabilistic inference. Bayesian methods have been shown
7: to provide accurate transcript abundance estimates compared to competing
8: methods. However, exact Bayesian inference  is intractable and approximate
9: methods such as Markov chain Monte Carlo (MCMC) and Variational Bayes (VB) are
10: typically used. While providing a high degree of accuracy and modelling
11: flexibility, standard implementations can be prohibitively slow for large
12: datasets and complex transcriptome annotations.\\
13: \textbf{Results:} We propose a novel approximate inference scheme based on VB and apply it to an existing model of transcript expression inference from RNA-seq data. Recent advances in VB algorithmics are used to improve the convergence of the
14: algorithm beyond the standard Variational Bayes Expectation Maximisation (VBEM)
15: algorithm. We apply our algorithm to simulated and biological datasets,
16: demonstrating a significant increase in speed with only very small loss
17: in accuracy of expression level estimation. We carry out a comparative study
18: against seven popular alternative methods and demonstrate that our new algorithm
19: provides excellent accuracy and inter-replicate consistency while 
20: remaining competitive in computation time.\\
21: \textbf{Availability:} The methods were implemented in R and C++, and are available as part of the
22: BitSeq project at \url{github.com/BitSeq}.\\ The method is also available
23: through the BitSeq Bioconductor package. The source code to reproduce all simulation results can be accessed via \url{github.com/BitSeq/BitSeqVB\_benchmarking}\\
24: \textbf{Keywords:} RNA-Seq, Transcript expression estimation, Bayesian inference, Variational Bayes.
25: \end{abstract}
26: