Your browser doesn't support javascript.
loading
bootRanges: flexible generation of null sets of genomic ranges for hypothesis testing.
Mu, Wancen; Davis, Eric S; Lee, Stuart; Dozmorov, Mikhail G; Phanstiel, Douglas H; Love, Michael I.
Affiliation
  • Mu W; Department of Biostatistics, University of North Carolina, Chapel Hill 27514, United States.
  • Davis ES; Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill 27514, United States.
  • Lee S; Genentech, South San Francisco, Western California 94080, United States.
  • Dozmorov MG; Department of Biostatistics, Virginia Commonwealth University, Richmond, VA 23284, United States.
  • Phanstiel DH; Department of Pathology, Virginia Commonwealth University, Richmond, VA 23284, United States.
  • Love MI; Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill 27514, United States.
Bioinformatics ; 39(5)2023 05 04.
Article in En | MEDLINE | ID: mdl-37042725
ABSTRACT
MOTIVATION Enrichment analysis is a widely utilized technique in genomic analysis that aims to determine if there is a statistically significant association between two sets of genomic features. To conduct this type of hypothesis testing, an appropriate null model is typically required. However, the null distribution that is commonly used can be overly simplistic and may result in inaccurate conclusions.

RESULTS:

bootRanges provides fast functions for generation of block bootstrapped genomic ranges representing the null hypothesis in enrichment analysis. As part of a modular workflow, bootRanges offers greater flexibility for computing various test statistics leveraging other Bioconductor packages. We show that shuffling or permutation schemes may result in overly narrow test statistic null distributions and over-estimation of statistical significance, while creating new range sets with a block bootstrap preserves local genomic correlation structure and generates more reliable null distributions. It can also be used in more complex analyses, such as accessing correlations between cis-regulatory elements (CREs) and genes across cell types or providing optimized thresholds, e.g. log fold change (logFC) from differential analysis. AVAILABILITY AND IMPLEMENTATION bootRanges is freely available in the R/Bioconductor package nullranges hosted at https//bioconductor.org/packages/nullranges.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Genome / Genomics Language: En Journal: Bioinformatics Journal subject: INFORMATICA MEDICA Year: 2023 Document type: Article Affiliation country:

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Genome / Genomics Language: En Journal: Bioinformatics Journal subject: INFORMATICA MEDICA Year: 2023 Document type: Article Affiliation country: