Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 74
Filtrar
1.
Nat Med ; 2024 Apr 16.
Artigo em Inglês | MEDLINE | ID: mdl-38627562

RESUMO

Reduced insulin sensitivity (insulin resistance) is a hallmark of normal physiology in late pregnancy and also underlies gestational diabetes mellitus (GDM). We conducted transcriptomic profiling of 434 human placentas and identified a positive association between insulin-like growth factor binding protein 1 gene (IGFBP1) expression in the placenta and insulin sensitivity at ~26 weeks gestation. Circulating IGFBP1 protein levels rose over the course of pregnancy and declined postpartum, which, together with high gene expression levels in our placenta samples, suggests a placental or decidual source. Higher circulating IGFBP1 levels were associated with greater insulin sensitivity (lesser insulin resistance) at ~26 weeks gestation in the same cohort and in two additional pregnancy cohorts. In addition, low circulating IGFBP1 levels in early pregnancy predicted subsequent GDM diagnosis in two cohorts of pregnant women. These results implicate IGFBP1 in the glycemic physiology of pregnancy and suggest a role for placental IGFBP1 deficiency in GDM pathogenesis.

2.
Nat Rev Genet ; 2024 Mar 28.
Artigo em Inglês | MEDLINE | ID: mdl-38548833

RESUMO

Germline variation and somatic mutation are intricately connected and together shape human traits and disease risks. Germline variants are present from conception, but they vary between individuals and accumulate over generations. By contrast, somatic mutations accumulate throughout life in a mosaic manner within an individual due to intrinsic and extrinsic sources of mutations and selection pressures acting on cells. Recent advancements, such as improved detection methods and increased resources for association studies, have drastically expanded our ability to investigate germline and somatic genetic variation and compare underlying mutational processes. A better understanding of the similarities and differences in the types, rates and patterns of germline and somatic variants, as well as their interplay, will help elucidate the mechanisms underlying their distinct yet interlinked roles in human health and biology.

3.
Am J Hum Genet ; 111(1): 133-149, 2024 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-38181730

RESUMO

Bulk-tissue molecular quantitative trait loci (QTLs) have been the starting point for interpreting disease-associated variants, and context-specific QTLs show particular relevance for disease. Here, we present the results of mapping interaction QTLs (iQTLs) for cell type, age, and other phenotypic variables in multi-omic, longitudinal data from the blood of individuals of diverse ancestries. By modeling the interaction between genotype and estimated cell-type proportions, we demonstrate that cell-type iQTLs could be considered as proxies for cell-type-specific QTL effects, particularly for the most abundant cell type in the tissue. The interpretation of age iQTLs, however, warrants caution because the moderation effect of age on the genotype and molecular phenotype association could be mediated by changes in cell-type composition. Finally, we show that cell-type iQTLs contribute to cell-type-specific enrichment of diseases that, in combination with additional functional data, could guide future functional studies. Overall, this study highlights the use of iQTLs to gain insights into the context specificity of regulatory effects.


Assuntos
Regulação da Expressão Gênica , Locos de Características Quantitativas , Humanos , Locos de Características Quantitativas/genética , Genótipo , Fenótipo
4.
J Clin Endocrinol Metab ; 109(3): e1159-e1166, 2024 Feb 20.
Artigo em Inglês | MEDLINE | ID: mdl-37864851

RESUMO

CONTEXT: Elevated body mass index (BMI) in pregnancy is associated with adverse maternal and fetal outcomes. The placental transcriptome may elucidate molecular mechanisms underlying these associations. OBJECTIVE: We examined the association of first-trimester maternal BMI with the placental transcriptome in the Gen3G prospective cohort. METHODS: We enrolled participants at 5 to 16 weeks of gestation and measured height and weight. We collected placenta samples at delivery. We performed whole-genome RNA sequencing using Illumina HiSeq 4000 and aligned RNA sequences based on the GTEx v8 pipeline. We conducted differential gene expression analysis of over 15 000 genes from 450 placental samples and reported the change in normalized gene expression per 1-unit increase in log2 BMI (kg/m2) as a continuous variable using Limma Voom. We adjusted models for maternal age, fetal sex, gestational age at delivery, gravidity, and surrogate variables accounting for technical variability. We compared participants with BMI of 18.5 to 24.9 mg/kg2 (N = 257) vs those with obesity (BMI ≥30 kg/m2, N = 82) in secondary analyses. RESULTS: Participants' mean ± SD age was 28.2 ± 4.4 years and BMI was 25.4 ± 5.5 kg/m2 in early pregnancy. Higher maternal BMI was associated with lower placental expression of EPYC (slope = -1.94, false discovery rate [FDR]-adjusted P = 7.3 × 10-6 for continuous BMI; log2 fold change = -1.35, FDR-adjusted P = 3.4 × 10-3 for BMI ≥30 vs BMI 18.5-24.9 kg/m2) and with higher placental expression of IGFBP6, CHRDL1, and CXCL13 after adjustment for covariates and accounting for multiple testing (FDR < 0.05). CONCLUSION: Our genome-wide transcriptomic study revealed novel genes potentially implicated in placental biologic response to higher maternal BMI in early pregnancy.


Assuntos
Placenta , Transcriptoma , Gravidez , Humanos , Feminino , Adulto Jovem , Adulto , Índice de Massa Corporal , Placenta/metabolismo , Estudos Prospectivos , Perfilação da Expressão Gênica
5.
bioRxiv ; 2023 Aug 21.
Artigo em Inglês | MEDLINE | ID: mdl-37662416

RESUMO

Blood lipid traits are treatable and heritable risk factors for heart disease, a leading cause of mortality worldwide. Although genome-wide association studies (GWAS) have discovered hundreds of variants associated with lipids in humans, most of the causal mechanisms of lipids remain unknown. To better understand the biological processes underlying lipid metabolism, we investigated the associations of plasma protein levels with total cholesterol (TC), triglycerides (TG), high-density lipoprotein cholesterol (HDL), and low-density lipoprotein cholesterol (LDL) in blood. We trained protein prediction models based on samples in the Multi-Ethnic Study of Atherosclerosis (MESA) and applied them to conduct proteome-wide association studies (PWAS) for lipids using the Global Lipids Genetics Consortium (GLGC) data. Of the 749 proteins tested, 42 were significantly associated with at least one lipid trait. Furthermore, we performed transcriptome-wide association studies (TWAS) for lipids using 9,714 gene expression prediction models trained on samples from peripheral blood mononuclear cells (PBMCs) in MESA and 49 tissues in the Genotype-Tissue Expression (GTEx) project. We found that although PWAS and TWAS can show different directions of associations in an individual gene, 40 out of 49 tissues showed a positive correlation between PWAS and TWAS signed p-values across all the genes, which suggests a high-level consistency between proteome-lipid associations and transcriptome-lipid associations.

6.
Nat Genet ; 55(10): 1665-1676, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37770633

RESUMO

Genetic variants associated with complex traits are primarily noncoding, and their effects on gene-regulatory activity remain largely uncharacterized. To address this, we profile epigenomic variation of histone mark H3K27ac across 387 brain, heart, muscle and lung samples from Genotype-Tissue Expression (GTEx). We annotate 282 k active regulatory elements (AREs) with tissue-specific activity patterns. We identify 2,436 sex-biased AREs and 5,397 genetically influenced AREs associated with 130 k genetic variants (haQTLs) across tissues. We integrate genetic and epigenomic variation to provide mechanistic insights for disease-associated loci from 55 genome-wide association studies (GWAS), by revealing candidate tissues of action, driver SNPs and impacted AREs. Lastly, we build ARE-gene linking scores based on genetics (gLink scores) and demonstrate their unique ability to prioritize SNP-ARE-gene circuits. Overall, our epigenomic datasets, computational integration and mechanistic predictions provide valuable resources and important insights for understanding the molecular basis of human diseases/traits such as schizophrenia.


Assuntos
Epigenômica , Estudo de Associação Genômica Ampla , Humanos , Locos de Características Quantitativas/genética , Genótipo , Redes Reguladoras de Genes , Polimorfismo de Nucleotídeo Único/genética , Predisposição Genética para Doença
7.
Cell Genom ; 3(8): 100359, 2023 Aug 09.
Artigo em Inglês | MEDLINE | ID: mdl-37601969

RESUMO

Multi-omics datasets are becoming more common, necessitating better integration methods to realize their revolutionary potential. Here, we introduce multi-set correlation and factor analysis (MCFA), an unsupervised integration method tailored to the unique challenges of high-dimensional genomics data that enables fast inference of shared and private factors. We used MCFA to integrate methylation markers, protein expression, RNA expression, and metabolite levels in 614 diverse samples from the Trans-Omics for Precision Medicine/Multi-Ethnic Study of Atherosclerosis multi-omics pilot. Samples cluster strongly by ancestry in the shared space, even in the absence of genetic information, while private spaces frequently capture dataset-specific technical variation. Finally, we integrated genetic data by conducting a genome-wide association study (GWAS) of our inferred factors, observing that several factors are enriched for GWAS hits and trans-expression quantitative trait loci. Two of these factors appear to be related to metabolic disease. Our study provides a foundation and framework for further integrative analysis of ever larger multi-modal genomic datasets.

8.
J Clin Invest ; 133(18)2023 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-37498674

RESUMO

Clonal hematopoiesis of indeterminate potential (CHIP) is associated with an increased risk of cardiovascular diseases (CVDs), putatively via inflammasome activation. We pursued an inflammatory gene modifier scan for CHIP-associated CVD risk among 424,651 UK Biobank participants. We identified CHIP using whole-exome sequencing data of blood DNA and modeled as a composite, considering all driver genes together, as well as separately for common drivers (DNMT3A, TET2, ASXL1, and JAK2). We developed predicted gene expression scores for 26 inflammasome-related genes and assessed how they modify CHIP-associated CVD risk. We identified IL1RAP as a potential key molecule for CHIP-associated CVD risk across genes and increased AIM2 gene expression leading to heightened JAK2- and ASXL1-associated CVD risk. We show that CRISPR-induced Asxl1-mutated murine macrophages had a particularly heightened inflammatory response to AIM2 agonism, associated with an increased DNA damage response, as well as increased IL-10 secretion, mirroring a CVD-protective effect of IL10 expression in ASXL1 CHIP. Our study supports the role of inflammasomes in CHIP-associated CVD and provides evidence to support gene-specific strategies to address CHIP-associated CVD risk.


Assuntos
Doenças Cardiovasculares , Humanos , Animais , Camundongos , Doenças Cardiovasculares/genética , Hematopoiese Clonal/genética , Fatores de Risco , Inflamassomos/genética , Hematopoese/genética , Inflamação/genética , Inflamação/complicações , Fatores de Risco de Doenças Cardíacas , Mutação
9.
bioRxiv ; 2023 Jun 29.
Artigo em Inglês | MEDLINE | ID: mdl-37425716

RESUMO

Bulk tissue molecular quantitative trait loci (QTLs) have been the starting point for interpreting disease-associated variants, while context-specific QTLs show particular relevance for disease. Here, we present the results of mapping interaction QTLs (iQTLs) for cell type, age, and other phenotypic variables in multi-omic, longitudinal data from blood of individuals of diverse ancestries. By modeling the interaction between genotype and estimated cell type proportions, we demonstrate that cell type iQTLs could be considered as proxies for cell type-specific QTL effects. The interpretation of age iQTLs, however, warrants caution as the moderation effect of age on the genotype and molecular phenotype association may be mediated by changes in cell type composition. Finally, we show that cell type iQTLs contribute to cell type-specific enrichment of diseases that, in combination with additional functional data, may guide future functional studies. Overall, this study highlights iQTLs to gain insights into the context-specificity of regulatory effects.

10.
Nat Genet ; 55(8): 1267-1276, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37443254

RESUMO

Genome-wide association studies (GWASs) are a valuable tool for understanding the biology of complex human traits and diseases, but associated variants rarely point directly to causal genes. In the present study, we introduce a new method, polygenic priority score (PoPS), that learns trait-relevant gene features, such as cell-type-specific expression, to prioritize genes at GWAS loci. Using a large evaluation set of genes with fine-mapped coding variants, we show that PoPS and the closest gene individually outperform other gene prioritization methods, but observe the best overall performance by combining PoPS with orthogonal methods. Using this combined approach, we prioritize 10,642 unique gene-trait pairs across 113 complex traits and diseases with high precision, finding not only well-established gene-trait relationships but nominating new genes at unresolved loci, such as LGR4 for estimated glomerular filtration rate and CCR7 for deep vein thrombosis. Overall, we demonstrate that PoPS provides a powerful addition to the gene prioritization toolbox.


Assuntos
Herança Multifatorial , Locos de Características Quantitativas , Humanos , Herança Multifatorial/genética , Locos de Características Quantitativas/genética , Estudo de Associação Genômica Ampla/métodos , Predisposição Genética para Doença/genética , Fenótipo , Polimorfismo de Nucleotídeo Único/genética
11.
Science ; 380(6641): eabn7113, 2023 04 14.
Artigo em Inglês | MEDLINE | ID: mdl-37053313

RESUMO

Postzygotic mutations (PZMs) begin to accrue in the human genome immediately after fertilization, but how and when PZMs affect development and lifetime health remain unclear. To study the origins and functional consequences of PZMs, we generated a multitissue atlas of PZMs spanning 54 tissue and cell types from 948 donors. Nearly half the variation in mutation burden among tissue samples can be explained by measured technical and biological effects, and 9% can be attributed to donor-specific effects. Through phylogenetic reconstruction of PZMs, we found that their type and predicted functional impact vary during prenatal development, across tissues, and through the germ cell life cycle. Thus, methods for interpreting effects across the body and the life span are needed to fully understand the consequences of genetic variants.


Assuntos
Análise Mutacional de DNA , Longevidade , Zigoto , Feminino , Humanos , Longevidade/genética , Mutação , Filogenia , RNA-Seq
12.
Cell Genom ; 3(1): 100244, 2023 Jan 11.
Artigo em Inglês | MEDLINE | ID: mdl-36777183

RESUMO

Understanding the consequences of individual transcriptome variation is fundamental to deciphering human biology and disease. We implement a statistical framework to quantify the contributions of 21 individual traits as drivers of gene expression and alternative splicing variation across 46 human tissues and 781 individuals from the Genotype-Tissue Expression project. We demonstrate that ancestry, sex, age, and BMI make additive and tissue-specific contributions to expression variability, whereas interactions are rare. Variation in splicing is dominated by ancestry and is under genetic control in most tissues, with ribosomal proteins showing a strong enrichment of tissue-shared splicing events. Our analyses reveal a systemic contribution of types 1 and 2 diabetes to tissue transcriptome variation with the strongest signal in the nerve, where histopathology image analysis identifies novel genes related to diabetic neuropathy. Our multi-tissue and multi-trait approach provides an extensive characterization of the main drivers of human transcriptome variation in health and disease.

13.
Respir Res ; 24(1): 30, 2023 Jan 25.
Artigo em Inglês | MEDLINE | ID: mdl-36698131

RESUMO

BACKGROUND: Chronic obstructive pulmonary disease (COPD) varies significantly in symptomatic and physiologic presentation. Identifying disease subtypes from molecular data, collected from easily accessible blood samples, can help stratify patients and guide disease management and treatment. METHODS: Blood gene expression measured by RNA-sequencing in the COPDGene Study was analyzed using a network perturbation analysis method. Each COPD sample was compared against a learned reference gene network to determine the part that is deregulated. Gene deregulation values were used to cluster the disease samples. RESULTS: The discovery set included 617 former smokers from COPDGene. Four distinct gene network subtypes are identified with significant differences in symptoms, exercise capacity and mortality. These clusters do not necessarily correspond with the levels of lung function impairment and are independently validated in two external cohorts: 769 former smokers from COPDGene and 431 former smokers in the Multi-Ethnic Study of Atherosclerosis (MESA). Additionally, we identify several genes that are significantly deregulated across these subtypes, including DSP and GSTM1, which have been previously associated with COPD through genome-wide association study (GWAS). CONCLUSIONS: The identified subtypes differ in mortality and in their clinical and functional characteristics, underlining the need for multi-dimensional assessment potentially supplemented by selected markers of gene expression. The subtypes were consistent across cohorts and could be used for new patient stratification and disease prognosis.


Assuntos
Redes Reguladoras de Genes , Doença Pulmonar Obstrutiva Crônica , Humanos , Redes Reguladoras de Genes/genética , Fumantes , Estudo de Associação Genômica Ampla/métodos , Doença Pulmonar Obstrutiva Crônica/diagnóstico , Doença Pulmonar Obstrutiva Crônica/genética , Prognóstico
14.
Am J Hum Genet ; 109(7): 1286-1297, 2022 07 07.
Artigo em Inglês | MEDLINE | ID: mdl-35716666

RESUMO

Despite the growing number of genome-wide association studies (GWASs), it remains unclear to what extent gene-by-gene and gene-by-environment interactions influence complex traits in humans. The magnitude of genetic interactions in complex traits has been difficult to quantify because GWASs are generally underpowered to detect individual interactions of small effect. Here, we develop a method to test for genetic interactions that aggregates information across all trait-associated loci. Specifically, we test whether SNPs in regions of European ancestry shared between European American and admixed African American individuals have the same causal effect sizes. We hypothesize that in African Americans, the presence of genetic interactions will drive the causal effect sizes of SNPs in regions of European ancestry to be more similar to those of SNPs in regions of African ancestry. We apply our method to two traits: gene expression in 296 African Americans and 482 European Americans in the Multi-Ethnic Study of Atherosclerosis (MESA) and low-density lipoprotein cholesterol (LDL-C) in 74K African Americans and 296K European Americans in the Million Veteran Program (MVP). We find significant evidence for genetic interactions in our analysis of gene expression; for LDL-C, we observe a similar point estimate, although this is not significant, most likely due to lower statistical power. These results suggest that gene-by-gene or gene-by-environment interactions modify the effect sizes of causal variants in human complex traits.


Assuntos
Estudo de Associação Genômica Ampla , Herança Multifatorial , LDL-Colesterol , Expressão Gênica , Humanos , Herança Multifatorial/genética , Polimorfismo de Nucleotídeo Único/genética , População Branca/genética
15.
Science ; 376(6594): eabl4290, 2022 05 13.
Artigo em Inglês | MEDLINE | ID: mdl-35549429

RESUMO

Understanding gene function and regulation in homeostasis and disease requires knowledge of the cellular and tissue contexts in which genes are expressed. Here, we applied four single-nucleus RNA sequencing methods to eight diverse, archived, frozen tissue types from 16 donors and 25 samples, generating a cross-tissue atlas of 209,126 nuclei profiles, which we integrated across tissues, donors, and laboratory methods with a conditional variational autoencoder. Using the resulting cross-tissue atlas, we highlight shared and tissue-specific features of tissue-resident cell populations; identify cell types that might contribute to neuromuscular, metabolic, and immune components of monogenic diseases and the biological processes involved in their pathology; and determine cell types and gene modules that might underlie disease mechanisms for complex traits analyzed by genome-wide association studies.


Assuntos
Núcleo Celular , Doença , RNA-Seq , Biomarcadores , Núcleo Celular/genética , Doença/genética , Estudo de Associação Genômica Ampla , Humanos , Especificidade de Órgãos , Fenótipo , RNA-Seq/métodos
16.
PLoS Genet ; 18(1): e1009719, 2022 01.
Artigo em Inglês | MEDLINE | ID: mdl-35100260

RESUMO

Tens of thousands of genetic variants associated with gene expression (cis-eQTLs) have been discovered in the human population. These eQTLs are active in various tissues and contexts, but the molecular mechanisms of eQTL variability are poorly understood, hindering our understanding of genetic regulation across biological contexts. Since many eQTLs are believed to act by altering transcription factor (TF) binding affinity, we hypothesized that analyzing eQTL effect size as a function of TF level may allow discovery of mechanisms of eQTL variability. Using GTEx Consortium eQTL data from 49 tissues, we analyzed the interaction between eQTL effect size and TF level across tissues and across individuals within specific tissues and generated a list of 10,098 TF-eQTL interactions across 2,136 genes that are supported by at least two lines of evidence. These TF-eQTLs were enriched for various TF binding measures, supporting with orthogonal evidence that these eQTLs are regulated by the implicated TFs. We also found that our TF-eQTLs tend to overlap genes with gene-by-environment regulatory effects and to colocalize with GWAS loci, implying that our approach can help to elucidate mechanisms of context-specificity and trait associations. Finally, we highlight an interesting example of IKZF1 TF regulation of an APBB1IP gene eQTL that colocalizes with a GWAS signal for blood cell traits. Together, our findings provide candidate TF mechanisms for a large number of eQTLs and offer a generalizable approach for researchers to discover TF regulators of genetic variant effects in additional QTL datasets.


Assuntos
Locos de Características Quantitativas , Fatores de Transcrição/fisiologia , Alelos , Sítios de Ligação , Técnicas de Silenciamento de Genes , Interação Gene-Ambiente , Estudo de Associação Genômica Ampla , Humanos , Fator Regulador 1 de Interferon/genética , Modelos Genéticos , Fenótipo , Fatores de Transcrição/metabolismo
17.
Nat Commun ; 12(1): 3394, 2021 06 07.
Artigo em Inglês | MEDLINE | ID: mdl-34099641

RESUMO

The large majority of variants identified by GWAS are non-coding, motivating detailed characterization of the function of non-coding variants. Experimental methods to assess variants' effect on gene expressions in native chromatin context via direct perturbation are low-throughput. Existing high-throughput computational predictors thus have lacked large gold standard sets of regulatory variants for training and validation. Here, we leverage a set of 14,807 putative causal eQTLs in humans obtained through statistical fine-mapping, and we use 6121 features to directly train a predictor of whether a variant modifies nearby gene expression. We call the resulting prediction the expression modifier score (EMS). We validate EMS by comparing its ability to prioritize functional variants with other major scores. We then use EMS as a prior for statistical fine-mapping of eQTLs to identify an additional 20,913 putatively causal eQTLs, and we incorporate EMS into co-localization analysis to identify 310 additional candidate genes across UK Biobank phenotypes.


Assuntos
Mapeamento Cromossômico/métodos , Biologia Computacional/métodos , Locos de Características Quantitativas , Aprendizado de Máquina Supervisionado , Adulto , Estudos de Coortes , Conjuntos de Dados como Assunto , Perfilação da Expressão Gênica , Humanos , Polimorfismo de Nucleotídeo Único
18.
Cell ; 184(10): 2633-2648.e19, 2021 05 13.
Artigo em Inglês | MEDLINE | ID: mdl-33864768

RESUMO

Long non-coding RNA (lncRNA) genes have well-established and important impacts on molecular and cellular functions. However, among the thousands of lncRNA genes, it is still a major challenge to identify the subset with disease or trait relevance. To systematically characterize these lncRNA genes, we used Genotype Tissue Expression (GTEx) project v8 genetic and multi-tissue transcriptomic data to profile the expression, genetic regulation, cellular contexts, and trait associations of 14,100 lncRNA genes across 49 tissues for 101 distinct complex genetic traits. Using these approaches, we identified 1,432 lncRNA gene-trait associations, 800 of which were not explained by stronger effects of neighboring protein-coding genes. This included associations between lncRNA quantitative trait loci and inflammatory bowel disease, type 1 and type 2 diabetes, and coronary artery disease, as well as rare variant associations to body mass index.


Assuntos
Doença/genética , Herança Multifatorial/genética , População/genética , RNA Longo não Codificante/genética , Transcriptoma , Doença da Artéria Coronariana/genética , Diabetes Mellitus Tipo 1/genética , Diabetes Mellitus Tipo 2/genética , Perfilação da Expressão Gênica , Variação Genética , Humanos , Doenças Inflamatórias Intestinais/genética , Especificidade de Órgãos/genética , Locos de Características Quantitativas
19.
Bioinformatics ; 37(18): 3048-3050, 2021 09 29.
Artigo em Inglês | MEDLINE | ID: mdl-33677499

RESUMO

SUMMARY: Post-sequencing quality control is a crucial component of RNA sequencing (RNA-seq) data generation and analysis, as sample quality can be affected by sample storage, extraction and sequencing protocols. RNA-seq is increasingly applied to cohorts ranging from hundreds to tens of thousands of samples in size, but existing tools do not readily scale to these sizes, and were not designed for a wide range of sample types and qualities. Here, we describe RNA-SeQC 2, an efficient reimplementation of RNA-SeQC (DeLuca et al., 2012) that adds multiple metrics designed to characterize sample quality across a wide range of RNA-seq protocols. AVAILABILITY AND IMPLEMENTATION: The command-line tool, documentation and C++ source code are available at the GitHub repository https://github.com/getzlab/rnaseqc. Code and data for reproducing the figures in this paper are available at https://github.com/getzlab/rnaseqc2-paper. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
RNA , Software , Humanos , RNA-Seq , Análise de Sequência de RNA/métodos , Controle de Qualidade
20.
Nature ; 590(7845): 290-299, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-33568819

RESUMO

The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.


Assuntos
Variação Genética/genética , Genoma Humano/genética , Genômica , National Heart, Lung, and Blood Institute (U.S.) , Medicina de Precisão , Citocromo P-450 CYP2D6/genética , Haplótipos/genética , Heterozigoto , Humanos , Mutação INDEL , Mutação com Perda de Função , Mutagênese , Fenótipo , Polimorfismo de Nucleotídeo Único , Densidade Demográfica , Medicina de Precisão/normas , Controle de Qualidade , Tamanho da Amostra , Estados Unidos , Sequenciamento Completo do Genoma/normas
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...