Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
2.
Nature ; 550(7675): 244-248, 2017 10 11.
Artículo en Inglés | MEDLINE | ID: mdl-29022598

RESUMEN

X chromosome inactivation (XCI) silences transcription from one of the two X chromosomes in female mammalian cells to balance expression dosage between XX females and XY males. XCI is, however, incomplete in humans: up to one-third of X-chromosomal genes are expressed from both the active and inactive X chromosomes (Xa and Xi, respectively) in female cells, with the degree of 'escape' from inactivation varying between genes and individuals. The extent to which XCI is shared between cells and tissues remains poorly characterized, as does the degree to which incomplete XCI manifests as detectable sex differences in gene expression and phenotypic traits. Here we describe a systematic survey of XCI, integrating over 5,500 transcriptomes from 449 individuals spanning 29 tissues from GTEx (v6p release) and 940 single-cell transcriptomes, combined with genomic sequence data. We show that XCI at 683 X-chromosomal genes is generally uniform across human tissues, but identify examples of heterogeneity between tissues, individuals and cells. We show that incomplete XCI affects at least 23% of X-chromosomal genes, identify seven genes that escape XCI with support from multiple lines of evidence and demonstrate that escape from XCI results in sex biases in gene expression, establishing incomplete XCI as a mechanism that is likely to introduce phenotypic diversity. Overall, this updated catalogue of XCI across human tissues helps to increase our understanding of the extent and impact of the incompleteness in the maintenance of XCI.


Asunto(s)
Especificidad de Órganos/genética , Análisis de la Célula Individual , Inactivación del Cromosoma X/genética , Cromosomas Humanos X/genética , Femenino , Genes Ligados a X/genética , Genoma Humano/genética , Genómica , Humanos , Masculino , Fenotipo , Análisis de Secuencia de ARN , Transcriptoma/genética
3.
Am J Hum Genet ; 102(6): 1204-1211, 2018 06 07.
Artículo en Inglés | MEDLINE | ID: mdl-29861106

RESUMEN

There is a limited understanding about the impact of rare protein-truncating variants across multiple phenotypes. We explore the impact of this class of variants on 13 quantitative traits and 10 diseases using whole-exome sequencing data from 100,296 individuals. Protein-truncating variants in genes intolerant to this class of mutations increased risk of autism, schizophrenia, bipolar disorder, intellectual disability, and ADHD. In individuals without these disorders, there was an association with shorter height, lower education, increased hospitalization, and reduced age at enrollment. Gene sets implicated from GWASs did not show a significant protein-truncating variants burden beyond what was captured by established Mendelian genes. In conclusion, we provide a thorough investigation of the impact of rare deleterious coding variants on complex traits, suggesting widespread pleiotropic risk.


Asunto(s)
Mutación/genética , Sistemas de Lectura Abierta/genética , Bases de Datos Genéticas , Etnicidad/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Fenotipo , Proteínas/genética
4.
J Med Genet ; 54(9): 598-606, 2017 09.
Artículo en Inglés | MEDLINE | ID: mdl-28756411

RESUMEN

BACKGROUND: Microdeletions are known to confer risk to epilepsy, particularly at genomic rearrangement 'hotspot' loci. However, microdeletion burden not overlapping these regions or within different epilepsy subtypes has not been ascertained. OBJECTIVE: To decipher the role of microdeletions outside hotspots loci and risk assessment by epilepsy subtype. METHODS: We assessed the burden, frequency and genomic content of rare, large microdeletions found in a previously published cohort of 1366 patients with genetic generalised epilepsy (GGE) in addition to two sets of additional unpublished genome-wide microdeletions found in 281 patients with rolandic epilepsy (RE) and 807 patients with adult focal epilepsy (AFE), totalling 2454 cases. Microdeletions were assessed in a combined and subtype-specific approaches against 6746 controls. RESULTS: When hotspots are considered, we detected an enrichment of microdeletions in the combined epilepsy analysis (adjusted p=1.06×10-6,OR 1.89, 95% CI 1.51 to 2.35). Epilepsy subtype-specific analyses showed that hotspot microdeletions in the GGE subgroup contribute most of the overall signal (adjusted p=9.79×10-12, OR 7.45, 95% CI 4.20-13.5). Outside hotspots , microdeletions were enriched in the GGE cohort for neurodevelopmental genes (adjusted p=9.13×10-3,OR 2.85, 95% CI 1.62-4.94). No additional signal was observed for RE and AFE. Still, gene-content analysis identified known (NRXN1, RBFOX1 and PCDH7) and novel (LOC102723362) candidate genes across epilepsy subtypes that were not deleted in controls. CONCLUSIONS: Our results show a heterogeneous effect of recurrent and non-recurrent microdeletions as part of the genetic architecture of GGE and a minor contribution in the aetiology of RE and AFE.


Asunto(s)
Deleción Cromosómica , Epilepsias Parciales/genética , Epilepsia Generalizada/genética , Epilepsia Rolándica/genética , Estudios de Casos y Controles , Estudios de Cohortes , Variaciones en el Número de Copia de ADN , Expresión Génica , Estudios de Asociación Genética , Humanos
5.
Bioinformatics ; 31(18): 2955-62, 2015 Sep 15.
Artículo en Inglés | MEDLINE | ID: mdl-25979475

RESUMEN

UNLABELLED: In next generation sequencing (NGS)-based genetic studies, researchers typically perform genotype calling first and then apply standard genotype-based methods for association testing. However, such a two-step approach ignores genotype calling uncertainty in the association testing step and may incur power loss and/or inflated type-I error. In the recent literature, a few robust and efficient likelihood based methods including both likelihood ratio test (LRT) and score test have been proposed to carry out association testing without intermediate genotype calling. These methods take genotype calling uncertainty into account by directly incorporating genotype likelihood function (GLF) of NGS data into association analysis. However, existing LRT methods are computationally demanding or do not allow covariate adjustment; while existing score tests are not applicable to markers with low minor allele frequency (MAF). We provide an LRT allowing flexible covariate adjustment, develop a statistically more powerful score test and propose a combination strategy (UNC combo) to leverage the advantages of both tests. We have carried out extensive simulations to evaluate the performance of our proposed LRT and score test. Simulations and real data analysis demonstrate the advantages of our proposed combination strategy: it offers a satisfactory trade-off in terms of computational efficiency, applicability (accommodating both common variants and variants with low MAF) and statistical power, particularly for the analysis of quantitative trait where the power gain can be up to ∼60% when the causal variant is of low frequency (MAF < 0.01). AVAILABILITY AND IMPLEMENTATION: UNC combo and the associated R files, including documentation, examples, are available at http://www.unc.edu/∼yunmli/UNCcombo/ CONTACT: yunli@med.unc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Estudios de Asociación Genética , Variación Genética/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Funciones de Verosimilitud , Sitios de Carácter Cuantitativo , Análisis de Secuencia de ADN/métodos , Simulación por Computador , Frecuencia de los Genes , Marcadores Genéticos , Genotipo , Humanos , Fenotipo
6.
Genet Epidemiol ; 37(7): 666-74, 2013 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-23836599

RESUMEN

In the past few years, a plethora of methods for rare variant association with phenotype have been proposed. These methods aggregate information from multiple rare variants across genomic region(s), but there is little consensus as to which method is most effective. The weighting scheme adopted when aggregating information across variants is one of the primary determinants of effectiveness. Here we present a systematic evaluation of multiple weighting schemes through a series of simulations intended to mimic large sequencing studies of a quantitative trait. We evaluate existing phenotype-independent and phenotype-dependent methods, as well as weights estimated by penalized regression approaches including Lasso, Elastic Net, and SCAD. We find that the difference in power between phenotype-dependent schemes is negligible when high-quality functional annotations are available. When functional annotations are unavailable or incomplete, all methods suffer from power loss; however, the variable selection methods outperform the others at the cost of increased computational time. Therefore, in the absence of good annotation, we recommend variable selection methods (which can be viewed as "statistical annotation") on top of regions implicated by a phenotype-independent weighting scheme. Further, once a region is implicated, variable selection can help to identify potential causal single nucleotide polymorphisms for biological validation. These findings are supported by an analysis of a high coverage targeted sequencing study of 1,898 individuals.


Asunto(s)
Biología Computacional , Variación Genética/genética , Anotación de Secuencia Molecular , Sitios de Carácter Cuantitativo , Estudio de Asociación del Genoma Completo , Genómica , Genotipo , Humanos , Modelos Genéticos , Fenotipo , Polimorfismo de Nucleótido Simple/genética
7.
Am J Hum Genet ; 87(5): 728-35, 2010 Nov 12.
Artículo en Inglés | MEDLINE | ID: mdl-21055717

RESUMEN

Empirical evidences suggest that both common and rare variants contribute to complex disease etiology. Although the effects of common variants have been thoroughly assessed in recent genome-wide association studies (GWAS), our knowledge of the impact of rare variants on complex diseases remains limited. A number of methods have been proposed to test for rare variant association in sequencing-based studies, a study design that is becoming popular but is still not economically feasible. On the contrary, few (if any) methods exist to detect rare variants in GWAS data, the data we have collected on thousands of individuals. Here we propose two methods, a weighted haplotype-based approach and an imputation-based approach, to test for the effect of rare variants with GWAS data. Both methods can incorporate external sequencing data when available. We evaluated our methods and compared them with methods proposed in the sequencing setting through extensive simulations. Our methods clearly show enhanced statistical power over existing methods for a wide range of population-attributable risk, percentage of disease-contributing rare variants, and proportion of rare alleles working in different directions. We also applied our methods to the IFIH1 region for the type 1 diabetes GWAS data collected by the Wellcome Trust Case-Control Consortium. Our methods yield p values in the order of 10⁻³, whereas the most significant p value from the existing methods is greater than 0.17. We thus demonstrate that the evaluation of rare variants with GWAS data is possible, particularly when public sequencing data are incorporated.


Asunto(s)
Técnicas Genéticas , Variación Genética , Estudio de Asociación del Genoma Completo , Haplotipos , Diabetes Mellitus Tipo 1/genética , Predisposición Genética a la Enfermedad , Humanos , Modelos Estadísticos , Análisis de Secuencia de ADN
9.
Nat Neurosci ; 22(12): 1966-1974, 2019 12.
Artículo en Inglés | MEDLINE | ID: mdl-31768050

RESUMEN

To discover novel genes underlying amyotrophic lateral sclerosis (ALS), we aggregated exomes from 3,864 cases and 7,839 ancestry-matched controls. We observed a significant excess of rare protein-truncating variants among ALS cases, and these variants were concentrated in constrained genes. Through gene level analyses, we replicated known ALS genes including SOD1, NEK1 and FUS. We also observed multiple distinct protein-truncating variants in a highly constrained gene, DNAJC7. The signal in DNAJC7 exceeded genome-wide significance, and immunoblotting assays showed depletion of DNAJC7 protein in fibroblasts in a patient with ALS carrying the p.Arg156Ter variant. DNAJC7 encodes a member of the heat-shock protein family, HSP40, which, along with HSP70 proteins, facilitates protein homeostasis, including folding of newly synthesized polypeptides and clearance of degraded proteins. When these processes are not regulated, misfolding and accumulation of aberrant proteins can occur and lead to protein aggregation, which is a pathological hallmark of neurodegeneration. Our results highlight DNAJC7 as a novel gene for ALS.


Asunto(s)
Esclerosis Amiotrófica Lateral/genética , Exoma/genética , Predisposición Genética a la Enfermedad/genética , Proteínas de Choque Térmico/genética , Chaperonas Moleculares/genética , Estudios de Casos y Controles , Femenino , Variación Genética/genética , Humanos , Masculino
10.
Neuron ; 103(2): 217-234.e4, 2019 07 17.
Artículo en Inglés | MEDLINE | ID: mdl-31171447

RESUMEN

Synapses are fundamental information-processing units of the brain, and synaptic dysregulation is central to many brain disorders ("synaptopathies"). However, systematic annotation of synaptic genes and ontology of synaptic processes are currently lacking. We established SynGO, an interactive knowledge base that accumulates available research about synapse biology using Gene Ontology (GO) annotations to novel ontology terms: 87 synaptic locations and 179 synaptic processes. SynGO annotations are exclusively based on published, expert-curated evidence. Using 2,922 annotations for 1,112 genes, we show that synaptic genes are exceptionally well conserved and less tolerant to mutations than other genes. Many SynGO terms are significantly overrepresented among gene variations associated with intelligence, educational attainment, ADHD, autism, and bipolar disorder and among de novo variants associated with neurodevelopmental disorders, including schizophrenia. SynGO is a public, universal reference for synapse research and an online analysis platform for interpretation of large-scale -omics data (https://syngoportal.org and http://geneontology.org).


Asunto(s)
Encéfalo/citología , Ontología de Genes , Proteómica , Programas Informáticos , Sinapsis/fisiología , Animales , Encéfalo/fisiología , Bases de Datos Genéticas , Humanos , Bases del Conocimiento , Potenciales Sinápticos/fisiología , Sinaptosomas
11.
Nat Genet ; 50(4): 621-629, 2018 04.
Artículo en Inglés | MEDLINE | ID: mdl-29632380

RESUMEN

We introduce an approach to identify disease-relevant tissues and cell types by analyzing gene expression data together with genome-wide association study (GWAS) summary statistics. Our approach uses stratified linkage disequilibrium (LD) score regression to test whether disease heritability is enriched in regions surrounding genes with the highest specific expression in a given tissue. We applied our approach to gene expression data from several sources together with GWAS summary statistics for 48 diseases and traits (average N = 169,331) and found significant tissue-specific enrichments (false discovery rate (FDR) < 5%) for 34 traits. In our analysis of multiple tissues, we detected a broad range of enrichments that recapitulated known biology. In our brain-specific analysis, significant enrichments included an enrichment of inhibitory over excitatory neurons for bipolar disorder, and excitatory over inhibitory neurons for schizophrenia and body mass index. Our results demonstrate that our polygenic approach is a powerful way to leverage gene expression data for interpreting GWAS signals.


Asunto(s)
Expresión Génica , Predisposición Genética a la Enfermedad , Trastorno Bipolar/genética , Índice de Masa Corporal , Encéfalo/metabolismo , Cromatina/genética , Epigénesis Genética , Perfilación de la Expresión Génica/estadística & datos numéricos , Estudio de Asociación del Genoma Completo/estadística & datos numéricos , Humanos , Enfermedades del Sistema Inmune/genética , Desequilibrio de Ligamiento , Modelos Genéticos , Herencia Multifactorial , Neuronas/metabolismo , Esquizofrenia/genética , Distribución Tisular/genética
12.
Nat Neurosci ; 19(12): 1563-1565, 2016 12.
Artículo en Inglés | MEDLINE | ID: mdl-27694993

RESUMEN

Disruptive, damaging ultra-rare variants in highly constrained genes are enriched in individuals with neurodevelopmental disorders. In the general population, this class of variants was associated with a decrease in years of education (YOE). This effect was stronger among highly brain-expressed genes and explained more YOE variance than pathogenic copy number variation but less than common variants. Disruptive, damaging ultra-rare variants in highly constrained genes influence the determinants of YOE in the general population.


Asunto(s)
Variaciones en el Número de Copia de ADN/genética , Predisposición Genética a la Enfermedad , Mutación/genética , Trastornos del Neurodesarrollo/genética , Educación , Humanos , Análisis y Desempeño de Tareas
13.
PLoS One ; 4(6): e5805, 2009 Jun 05.
Artículo en Inglés | MEDLINE | ID: mdl-19503787

RESUMEN

BACKGROUND: Chronic fatiguing illness remains a poorly understood syndrome of unknown pathogenesis. We attempted to identify biomarkers for chronic fatiguing illness using microarrays to query the transcriptome in peripheral blood leukocytes. METHODS: Cases were 44 individuals who were clinically evaluated and found to meet standard international criteria for chronic fatigue syndrome or idiopathic chronic fatigue, and controls were their monozygotic co-twins who were clinically evaluated and never had even one month of impairing fatigue. Biological sampling conditions were standardized and RNA stabilizing media were used. These methodological features provide rigorous control for bias resulting from case-control mismatched ancestry and experimental error. Individual gene expression profiles were assessed using Affymetrix Human Genome U133 Plus 2.0 arrays. FINDINGS: There were no significant differences in gene expression for any transcript. CONCLUSIONS: Contrary to our expectations, we were unable to identify a biomarker for chronic fatiguing illness in the transcriptome of peripheral blood leukocytes suggesting that positive findings in prior studies may have resulted from experimental bias.


Asunto(s)
Biomarcadores/metabolismo , Enfermedades en Gemelos/genética , Síndrome de Fatiga Crónica/diagnóstico , Síndrome de Fatiga Crónica/genética , Regulación de la Expresión Génica , Gemelos Monocigóticos , Adulto , Femenino , Perfilación de la Expresión Génica , Humanos , Masculino , Persona de Mediana Edad , Análisis de Secuencia por Matrices de Oligonucleótidos , ARN/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA