Your browser doesn't support javascript.
loading
A comparison of methods for multiple degree of freedom testing in repeated measures RNA-sequencing experiments.
Wynn, Elizabeth A; Vestal, Brian E; Fingerlin, Tasha E; Moore, Camille M.
Afiliación
  • Wynn EA; Department of Biostatistics and Informatics, University of Colorado, Anschutz Medical Campus, Aurora, CO, USA.
  • Vestal BE; Center for Genes, Environment and Health, National Jewish Health, 1400 Jackson St, Denver, 80206, CO, USA.
  • Fingerlin TE; Center for Genes, Environment and Health, National Jewish Health, 1400 Jackson St, Denver, 80206, CO, USA.
  • Moore CM; Center for Genes, Environment and Health, National Jewish Health, 1400 Jackson St, Denver, 80206, CO, USA. mooreca@njhealth.org.
BMC Med Res Methodol ; 22(1): 153, 2022 05 28.
Article en En | MEDLINE | ID: mdl-35643435
ABSTRACT

BACKGROUND:

As the cost of RNA-sequencing decreases, complex study designs, including paired, longitudinal, and other correlated designs, become increasingly feasible. These studies often include multiple hypotheses and thus multiple degree of freedom tests, or tests that evaluate multiple hypotheses jointly, are often useful for filtering the gene list to a set of interesting features for further exploration while controlling the false discovery rate. Though there are several methods which have been proposed for analyzing correlated RNA-sequencing data, there has been little research evaluating and comparing the performance of multiple degree of freedom tests across methods.

METHODS:

We evaluated 11 different methods for modelling correlated RNA-sequencing data by performing a simulation study to compare the false discovery rate, power, and model convergence rate across several hypothesis tests and sample size scenarios. We also applied each method to a real longitudinal RNA-sequencing dataset.

RESULTS:

Linear mixed modelling using transformed data had the best false discovery rate control while maintaining relatively high power. However, this method had high model non-convergence, particularly at small sample sizes. No method had high power at the lowest sample size. We found a mix of conservative and anti-conservative behavior across the other methods, which was influenced by the sample size and the hypothesis being evaluated. The patterns observed in the simulation study were largely replicated in the analysis of a longitudinal study including data from intensive care unit patients experiencing cardiogenic or septic shock.

CONCLUSIONS:

Multiple degree of freedom testing is a valuable tool in longitudinal and other correlated RNA-sequencing experiments. Of the methods that we investigated, linear mixed modelling had the best overall combination of power and false discovery rate control. Other methods may also be appropriate in some scenarios.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Proyectos de Investigación / ARN Tipo de estudio: Observational_studies / Prognostic_studies Límite: Humans Idioma: En Revista: BMC Med Res Methodol Asunto de la revista: MEDICINA Año: 2022 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Proyectos de Investigación / ARN Tipo de estudio: Observational_studies / Prognostic_studies Límite: Humans Idioma: En Revista: BMC Med Res Methodol Asunto de la revista: MEDICINA Año: 2022 Tipo del documento: Article País de afiliación: Estados Unidos