Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
J Am Stat Assoc ; 118(541): 165-176, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-37346227

RESUMEN

Scientific hypotheses in a variety of applications have domain-specific structures, such as the tree structure of the International Classification of Diseases (ICD), the directed acyclic graph structure of the Gene Ontology (GO), or the spatial structure in genome-wide association studies. In the context of multiple testing, the resulting relationships among hypotheses can create redundancies among rejections that hinder interpretability. This leads to the practice of filtering rejection sets obtained from multiple testing procedures, which may in turn invalidate their inferential guarantees. We propose Focused BH, a simple, flexible, and principled methodology to adjust for the application of any pre-specified filter. We prove that Focused BH controls the false discovery rate under various conditions, including when the filter satisfies an intuitive monotonicity property and the p-values are positively dependent. We demonstrate in simulations that Focused BH performs well across a variety of settings, and illustrate this method's practical utility via analyses of real datasets based on ICD and GO.

2.
Biometrika ; 108(3): 575-590, 2021 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-36825068

RESUMEN

We introduce a multiple testing procedure that controls global error rates at multiple levels of resolution. Conceptually, we frame this problem as the selection of hypotheses that are organized hierarchically in a tree structure. We describe a fast algorithm and prove that it controls relevant error rates given certain assumptions on the dependence between the p-values. Through simulations, we demonstrate that the proposed procedure provides the desired guarantees under a range of dependency structures and that it has the potential to gain power over alternative methods. Finally, we apply the method to studies on the genetic regulation of gene expression across multiple tissues and on the relation between the gut microbiome and colorectal cancer.

3.
Nat Genet ; 49(12): 1714-1721, 2017 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-29083405

RESUMEN

By analyzing multitissue gene expression and genome-wide genetic variation data in samples from a vervet monkey pedigree, we generated a transcriptome resource and produced the first catalog of expression quantitative trait loci (eQTLs) in a nonhuman primate model. This catalog contains more genome-wide significant eQTLs per sample than comparable human resources and identifies sex- and age-related expression patterns. Findings include a master regulatory locus that likely has a role in immune function and a locus regulating hippocampal long noncoding RNAs (lncRNAs), whose expression correlates with hippocampal volume. This resource will facilitate genetic investigation of quantitative traits, including brain and behavioral phenotypes relevant to neuropsychiatric disorders.


Asunto(s)
Chlorocebus aethiops/genética , Perfilación de la Expresión Génica , Variación Genética , Sitios de Carácter Cuantitativo/genética , Animales , Encéfalo/crecimiento & desarrollo , Encéfalo/metabolismo , Chlorocebus aethiops/crecimiento & desarrollo , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Fenotipo , Polimorfismo de Nucleótido Simple
4.
Genet Epidemiol ; 41(3): 251-258, 2017 04.
Artículo en Inglés | MEDLINE | ID: mdl-28090672

RESUMEN

In genome-wide association studies (GWAS), "generalization" is the replication of genotype-phenotype association in a population with different ancestry than the population in which it was first identified. Current practices for declaring generalizations rely on testing associations while controlling the family-wise error rate (FWER) in the discovery study, then separately controlling error measures in the follow-up study. This approach does not guarantee control over the FWER or false discovery rate (FDR) of the generalization null hypotheses. It also fails to leverage the two-stage design to increase power for detecting generalized associations. We provide a formal statistical framework for quantifying the evidence of generalization that accounts for the (in)consistency between the directions of associations in the discovery and follow-up studies. We develop the directional generalization FWER (FWERg ) and FDR (FDRg ) controlling r-values, which are used to declare associations as generalized. This framework extends to generalization testing when applied to a published list of Single Nucleotide Polymorphism-(SNP)-trait associations. Our methods control FWERg or FDRg under various SNP selection rules based on P-values in the discovery study. We find that it is often beneficial to use a more lenient P-value threshold than the genome-wide significance threshold. In a GWAS of total cholesterol in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL), when testing all SNPs with P-values <5×10-8 (15 genomic regions) for generalization in a large GWAS of whites, we generalized SNPs from 15 regions. But when testing all SNPs with P-values <6.6×10-5 (89 regions), we generalized SNPs from 27 regions.


Asunto(s)
Genoma Humano , Estudio de Asociación del Genoma Completo/métodos , Hispánicos o Latinos/genética , Modelos Estadísticos , Polimorfismo de Nucleótido Simple/genética , Algoritmos , Simulación por Computador , Estudios de Seguimiento , Genómica , Humanos , Desequilibrio de Ligamiento , Fenotipo
5.
PLoS Genet ; 12(5): e1006046, 2016 05.
Artículo en Inglés | MEDLINE | ID: mdl-27176483

RESUMEN

The observation that variants regulating gene expression (expression quantitative trait loci, eQTL) are at a high frequency among SNPs associated with complex traits has made the genome-wide characterization of gene expression an important tool in genetic mapping studies of such traits. As part of a study to identify genetic loci contributing to bipolar disorder and other quantitative traits in members of 26 pedigrees from Costa Rica and Colombia, we measured gene expression in lymphoblastoid cell lines derived from 786 pedigree members. The study design enabled us to comprehensively reconstruct the genetic regulatory network in these families, provide estimates of heritability, identify eQTL, evaluate missing heritability for the eQTL, and quantify the number of different alleles contributing to any given locus. In the eQTL analysis, we utilize a recently proposed hierarchical multiple testing strategy which controls error rates regarding the discovery of functional variants. Our results elucidate the heritability and regulation of gene expression in this unique Latin American study population and identify a set of regulatory SNPs which may be relevant in future investigations of complex disease in this population. Since our subjects belong to extended families, we are able to compare traditional kinship-based estimates with those from more recent methods that depend only on genotype information.


Asunto(s)
Trastorno Bipolar/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo/genética , Alelos , Trastorno Bipolar/patología , Mapeo Cromosómico , Colombia , Costa Rica , Femenino , Expresión Génica , Redes Reguladoras de Genes , Humanos , Masculino , Polimorfismo de Nucleótido Simple/genética
6.
Genet Epidemiol ; 40(1): 45-56, 2016 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-26626037

RESUMEN

The genetic basis of multiple phenotypes such as gene expression, metabolite levels, or imaging features is often investigated by testing a large collection of hypotheses, probing the existence of association between each of the traits and hundreds of thousands of genotyped variants. Appropriate multiplicity adjustment is crucial to guarantee replicability of findings, and the false discovery rate (FDR) is frequently adopted as a measure of global error. In the interest of interpretability, results are often summarized so that reporting focuses on variants discovered to be associated to some phenotypes. We show that applying FDR-controlling procedures on the entire collection of hypotheses fails to control the rate of false discovery of associated variants as well as the expected value of the average proportion of false discovery of phenotypes influenced by such variants. We propose a simple hierarchical testing procedure that allows control of both these error rates and provides a more reliable basis for the identification of variants with functional effects. We demonstrate the utility of this approach through simulation studies comparing various error rates and measures of power for genetic association studies of multiple traits. Finally, we apply the proposed method to identify genetic variants that impact flowering phenotypes in Arabidopsis thaliana, expanding the set of discoveries.


Asunto(s)
Estudios de Asociación Genética , Arabidopsis/genética , Arabidopsis/fisiología , Simulación por Computador , Flores/fisiología , Estudio de Asociación del Genoma Completo , Modelos Genéticos , Fenotipo , Polimorfismo de Nucleótido Simple , Reproducibilidad de los Resultados
7.
Proc Natl Acad Sci U S A ; 111(46): 16262-7, 2014 Nov 18.
Artículo en Inglés | MEDLINE | ID: mdl-25368172

RESUMEN

We propose a formal method to declare that findings from a primary study have been replicated in a follow-up study. Our proposal is appropriate for primary studies that involve large-scale searches for rare true positives (i.e., needles in a haystack). Our proposal assigns an r value to each finding; this is the lowest false discovery rate at which the finding can be called replicated. Examples are given and software is available.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...