Scotty, We Need More Power! Power, Sample Size, and Coverage Estimation for RNA-Seq
Two of the most common questions at the beginning of an RNA-seq experiment are "how many reads do I need?" and "how many replicates do I need?". This paper describes a web application for designing RNA-seq applications that calculates an appropriate sample size and read depth to satisfy user-defined criteria such as cost, maximum number of reads or replicates attainable, etc. The power and sample size estimations are based on a t-test, which the authors claim performs no worse than the negative binomial models implemented by popular RNA-seq methods such as DESeq when three or more replicates are present. Empirical distributions are taken from either (1) pilot data that the user can upload, or (2) built-in, publicly available data. The authors find that there is substantial heterogeneity between experiments (technical variation is larger than biological variation in many cases), and that power and sample size estimation will be more accurate when the user provides their own pilot data.
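To make the idea of t-test-based power concrete, here is a rough sketch of how power and replicate number relate for a single gene. It is not Scotty's actual code: it uses a normal approximation to a two-sided, two-sample t-test on log2 expression, assumes equal variance and equal group sizes, and ignores read depth and multiple testing entirely. The function names (`approx_power`, `replicates_needed`) and all parameter values are made up for illustration.

```python
from math import sqrt
from statistics import NormalDist


def approx_power(log2_fold_change, sd, n_per_group, alpha=0.05):
    """Approximate power of a two-sided, two-sample test for one gene.

    Assumption: normal approximation to the t-test, equal variance and
    equal group sizes. `sd` is the per-group standard deviation of log2
    expression (biological plus technical variation combined).
    """
    z = NormalDist()
    z_crit = z.inv_cdf(1 - alpha / 2)
    # Standard error of the difference between the two group means.
    se = sd * sqrt(2 / n_per_group)
    shift = abs(log2_fold_change) / se
    # Probability that the test statistic lands in either rejection region.
    return z.cdf(shift - z_crit) + z.cdf(-shift - z_crit)


def replicates_needed(log2_fold_change, sd, target_power=0.8,
                      alpha=0.05, max_n=100):
    """Smallest number of replicates per group reaching the target power.

    Returns None if the target is not reached within `max_n` replicates.
    """
    for n in range(2, max_n + 1):
        if approx_power(log2_fold_change, sd, n, alpha) >= target_power:
            return n
    return None


if __name__ == "__main__":
    # Hypothetical inputs: a 2-fold change (log2 = 1) and an SD of 0.5.
    for n in (2, 3, 5, 10):
        print(n, round(approx_power(1.0, 0.5, n), 3))
    print("replicates needed:", replicates_needed(1.0, 0.5))
```

Because the normal approximation overstates power at very small sample sizes, treat the numbers as optimistic. In practice the standard deviation would come from pilot data, which is exactly why the authors recommend uploading your own.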
My only complaint, for all the reasons expressed in my previous blog post about why you shouldn't host things like this exclusively on your lab website, is that the code to run this analysis doesn't appear to be available to save, study, modify, maintain, or archive. When lead author Michele Busby leaves Gabor Marth's lab, hopefully the app won't fall into the graveyard of computational biology web apps. Update 2/7/13: Michele Busby created a public GitHub repository for the Scotty code: https://github.com/mbusby/Scotty
Source code: https://github.com/mbusby/Scotty
Additional details
Identifiers
- UUID: ad8d8126-1a36-4bd5-a4c2-a259ea849cd5
- GUID: tag:blogger.com,1999:blog-6232819486261696035.post-8205877176050413452
- URL: https://gettinggeneticsdone.blogspot.com/2013/01/scotty-power-sample-size-coverage-rna-seq.html
Dates
- Issued: 2013-01-28T14:09:00
- Updated: 2013-02-08T02:17:41