How well do RNA-Seq differential gene expression tools perform in a eukaryote with a complex transcriptome?. Froussios, K., Schurch, N. J., Mackinnon, K., Gierlinski, M., Duc, C., Simpson, G. G., & Barton, G. J. bioRxiv, March, 2017.
How well do RNA-Seq differential gene expression tools perform in a eukaryote with a complex transcriptome? [link]Paper  doi  abstract   bibtex   
RNA-seq experiments are usually carried out in three or fewer replicates. In order to work well with so few samples, Differential Gene Expression (DGE) tools typically assume the form of the underlying distribution of gene expression. A recent highly replicated study revealed that RNA-seq gene expression measurements in yeast are best represented as being drawn from an underlying negative binomial distribution. In this paper, the statistical properties of gene expression in the higher eukaryote Arabidopsis thaliana are shown to be essentially identical to those from yeast despite the large increase in the size and complexity of the transcriptome: Gene expression measurements from this model plant species are consistent with being drawn from an underlying negative binomial or log-normal distribution and the false positive rate performance of nine widely used DGE tools is not strongly affected by the additional size and complexity of the A. thaliana transcriptome. For RNA-seq data, we therefore recommend the use of DGE tools that are based on the negative binomial distribution.
@article{froussios_how_2017,
	title = {How well do {RNA}-{Seq} differential gene expression tools perform in a eukaryote with a complex transcriptome?},
	copyright = {© 2017, Posted by Cold Spring Harbor Laboratory. This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at http://creativecommons.org/licenses/by/4.0/},
	url = {http://biorxiv.org/content/early/2017/03/13/090753},
	doi = {10.1101/090753},
	abstract = {RNA-seq experiments are usually carried out in three or fewer replicates. In order to work well with so few samples, Differential Gene Expression (DGE) tools typically assume the form of the underlying distribution of gene expression. A recent highly replicated study revealed that RNA-seq gene expression measurements in yeast are best represented as being drawn from an underlying negative binomial distribution. In this paper, the statistical properties of gene expression in the higher eukaryote Arabidopsis thaliana are shown to be essentially identical to those from yeast despite the large increase in the size and complexity of the transcriptome: Gene expression measurements from this model plant species are consistent with being drawn from an underlying negative binomial or log-normal distribution and the false positive rate performance of nine widely used DGE tools is not strongly affected by the additional size and complexity of the A. thaliana transcriptome. For RNA-seq data, we therefore recommend the use of DGE tools that are based on the negative binomial distribution.},
	language = {en},
	urldate = {2017-03-30},
	journal = {bioRxiv},
	author = {Froussios, Kimon and Schurch, Nicholas J. and Mackinnon, Katarzyna and Gierlinski, Marek and Duc, Céline and Simpson, Gordon G. and Barton, Geoffrey J.},
	month = mar,
	year = {2017},
	keywords = {DAG},
	pages = {090753}
}

Downloads: 0