1.
Article in English | MEDLINE | ID: mdl-39135439

ABSTRACT

OBJECTIVES: The All of Us Research Program is a precision medicine initiative aimed at establishing a vast, diverse biomedical database accessible through a cloud-based data analysis platform, the Researcher Workbench (RW). Our goal was to empower the research community by co-designing the implementation of SAS in the RW alongside researchers to enable broader use of All of Us data. MATERIALS AND METHODS: Researchers from various fields and with different SAS experience levels participated in co-designing the SAS implementation through user experience interviews. RESULTS: Feedback and lessons learned from user testing informed the final design of the SAS application. DISCUSSION: The co-design approach is critical for reducing technical barriers, broadening All of Us data use, and enhancing the user experience for data analysis on the RW. CONCLUSION: Our co-design approach successfully tailored the implementation of the SAS application to researchers' needs. This approach may inform future software implementations on the RW.

2.
JCO Precis Oncol ; 8: e2300368, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38237100

ABSTRACT

PURPOSE: Somatic chromosomal alterations, particularly monosomy 3 and 8q gains, have been associated with metastatic risk in uveal melanoma (UM). Whole genome-scale evaluation of detectable alterations in cell-free DNA (cfDNA) in UM could provide valuable prognostic information. Our pilot study evaluates the correlation between genomic information using ultra-low-pass whole-genome sequencing (ULP-WGS) of cfDNA in UM and associated clinical outcomes. MATERIALS AND METHODS: ULP-WGS of cfDNA was performed on 29 plasma samples from 16 patients, 14 with metastatic UM (mUM) and two with non-metastatic UM, including pre- and post-treatment mUM samples from 10 patients treated with immunotherapy and one treated with liver-directed therapy. We estimated tumor fraction (TFx) and detected copy-number alterations (CNAs) using ichorCNA. Presence of 8q amplification was further analyzed using the likelihood ratio test (LRT). RESULTS: Eleven of the 14 patients with mUM (17 samples) had detectable circulating tumor DNA (ctDNA). 8q gain was detected in all 17 samples, whereas monosomy 3 was detectable in 10 of 17 samples. TFx generally correlated with disease status, showing an increase at the time of disease progression (PD). 8q gain detection sensitivity appeared greater with the LRT than with ichorCNA at lower TFx values. The only patient with mUM with partial response on treatment had a high pretreatment TFx and undetectable on-treatment ctDNA, correlating with her profound response and durable survival. CONCLUSION: ctDNA can be detected in mUM using ULP-WGS, and the TFx correlates with disease status. 8q gain was consistently detectable in mUM, in line with previous studies indicating 8q gains early in primary UM and higher amplification with PD. Our work suggests that detection of CNAs by ULP-WGS, particularly focusing on 8q gain, could be a valuable blood biomarker to monitor PD in UM.


Subject(s)
Circulating Tumor DNA, Melanoma, Uveal Neoplasms, Female, Humans, Pilot Projects, Melanoma/genetics, Melanoma/diagnosis, Monosomy, Circulating Tumor DNA/genetics
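
The abstract above reports that 8q gain is harder to detect at low tumor fraction and that a likelihood ratio test helped. The abstract does not describe the test itself, so the following is only a minimal sketch under stated assumptions: a diploid background, a single-copy gain in the tumor, and independent Gaussian noise with a known standard deviation on per-bin log2 copy ratios. The function names are hypothetical and this is not the ichorCNA model.

```python
import math


def expected_log2_ratio(tumor_fraction, tumor_copies=3, normal_copies=2):
    """Expected log2 copy ratio of a gained region in a tumor/normal mixture.

    Assumes a diploid background; the defaults model a single-copy gain.
    """
    mixed = (1 - tumor_fraction) * normal_copies + tumor_fraction * tumor_copies
    return math.log2(mixed / normal_copies)


def lrt_mean_shift(log2_ratios, sigma):
    """Likelihood ratio test of H0: mean = 0 against H1: mean unconstrained.

    Assumes independent Gaussian bins with known standard deviation sigma.
    Returns the statistic 2 * (loglik_H1 - loglik_H0) and its p-value from
    a chi-square distribution with one degree of freedom.
    """
    n = len(log2_ratios)
    mean = sum(log2_ratios) / n
    stat = n * mean ** 2 / sigma ** 2
    # Survival function of chi-square with 1 df: erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value
```

Under these assumptions the expected shift shrinks with tumor fraction (for example, `expected_log2_ratio(0.05)` is much smaller than `expected_log2_ratio(0.5)`), which is why pooling many bins across the arm into one test can recover signal that per-segment calling misses.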
5.
Nature ; 625(7993): 92-100, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38057664

ABSTRACT

The depletion of disruptive variation caused by purifying natural selection (constraint) has been widely used to investigate protein-coding genes underlying human disorders [1-4], but attempts to assess constraint for non-protein-coding regions have proved more difficult. Here we aggregate, process and release a dataset of 76,156 human genomes from the Genome Aggregation Database (gnomAD), the largest public open-access human genome allele frequency reference dataset, and use it to build a genomic constraint map for the whole genome (genomic non-coding constraint of haploinsufficient variation (Gnocchi)). We present a refined mutational model that incorporates local sequence context and regional genomic features to detect depletions of variation. As expected, the average constraint for protein-coding sequences is stronger than that for non-coding regions. Within the non-coding genome, constrained regions are enriched for known regulatory elements and variants that are implicated in complex human diseases and traits, facilitating the triangulation of biological annotation, disease association and natural selection to non-coding DNA analysis. More constrained regulatory elements tend to regulate more constrained protein-coding genes, which in turn suggests that non-coding constraint can aid the identification of constrained genes that are as yet unrecognized by current gene constraint metrics. We demonstrate that this genome-wide constraint map improves the identification and interpretation of functional human genetic variation.


Subject(s)
Human Genome, Genomics, Genetic Models, Mutation, Humans, Access to Information, Genetic Databases, Datasets as Topic, Gene Frequency, Human Genome/genetics, Mutation/genetics, Genetic Selection
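
The constraint idea in the abstract above, fewer variants observed in a region than a mutational model predicts, can be illustrated with a generic depletion score. The abstract does not give the Gnocchi formula, so this sketch assumes a simple Poisson-style approximation in which the variance equals the expected count. The function name and the example counts are hypothetical.

```python
import math


def depletion_z_score(observed, expected):
    """Signed depletion score for one genomic window.

    Positive values mean fewer variants were observed than expected
    (consistent with constraint); negative values mean more. Assumes the
    variance of the count is approximately the expected count.
    """
    if expected <= 0:
        raise ValueError("expected count must be positive")
    return (expected - observed) / math.sqrt(expected)
```

For a window where the model expects 100 variants and 64 are seen, the score is positive; a window matching its expectation scores zero.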
6.
Nat Biotechnol ; 42(4): 582-586, 2024 Apr.
Article in English | MEDLINE | ID: mdl-37291427

ABSTRACT

Full-length RNA-sequencing methods using long-read technologies can capture complete transcript isoforms, but their throughput is limited. We introduce multiplexed arrays isoform sequencing (MAS-ISO-seq), a technique for programmably concatenating complementary DNAs (cDNAs) into molecules optimal for long-read sequencing, increasing the throughput >15-fold to nearly 40 million cDNA reads per run on the Sequel IIe sequencer. When applied to single-cell RNA sequencing of tumor-infiltrating T cells, MAS-ISO-seq demonstrated a 12- to 32-fold increase in the discovery of differentially spliced genes.


Subject(s)
High-Throughput Nucleotide Sequencing, RNA Isoforms, Complementary DNA/genetics, RNA Isoforms/genetics, High-Throughput Nucleotide Sequencing/methods, Protein Isoforms/genetics, RNA Sequence Analysis/methods, Transcriptome, Gene Expression Profiling/methods, RNA/genetics
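
The throughput gain described above comes from reading several cDNAs in one long molecule and then splitting that molecule back into its component cDNAs. The sketch below shows only the splitting step. It assumes exact-match boundary adapters with made-up sequences; the real adapter design is not given in the abstract, and a practical tool would also need to tolerate sequencing errors.

```python
import re


def split_array_read(read, adapters):
    """Split one concatenated read into its cDNA segments.

    `adapters` is a list of boundary sequences (hypothetical here).
    Matching is exact, and empty segments, such as the one before a
    leading adapter, are dropped.
    """
    pattern = "|".join(re.escape(adapter) for adapter in adapters)
    return [segment for segment in re.split(pattern, read) if segment]


# Toy read: three cDNA segments separated by two made-up adapters.
toy_read = "GGGG" + "ACAC" + "TTTT" + "CCAA" + "GGGG" + "AC"
segments = split_array_read(toy_read, ["GGGG", "TTTT"])
```

In this toy case one sequenced molecule yields three cDNA reads, which is the source of the per-run multiplier.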
7.
Annu Rev Biomed Data Sci ; 6: 443-464, 2023 Aug 10.
Article in English | MEDLINE | ID: mdl-37561600

ABSTRACT

The All of Us Research Program's Data and Research Center (DRC) was established to help acquire, curate, and provide access to one of the world's largest and most diverse datasets for precision medicine research. Already, over 500,000 participants are enrolled in All of Us, 80% of whom are underrepresented in biomedical research, and data are being analyzed by a community of over 2,300 researchers. The DRC created this thriving data ecosystem by collaborating with engaged participants, innovative program partners, and empowered researchers. In this review, we first describe how the DRC is organized to meet the needs of this broad group of stakeholders. We then outline guiding principles, common challenges, and innovative approaches used to build the All of Us data ecosystem. Finally, we share lessons learned to help others navigate important decisions and trade-offs in building a modern biomedical data platform.


Subject(s)
Biomedical Research, Population Health, Humans, Ecosystem, Precision Medicine
8.
Nat Genet ; 55(9): 1589-1597, 2023 Sep.
Article in English | MEDLINE | ID: mdl-37604963

ABSTRACT

Copy number variants (CNVs) are major contributors to genetic diversity and disease. While standardized methods, such as the Genome Analysis Toolkit (GATK), exist for detecting short variants, technical challenges have confounded uniform large-scale CNV analyses from whole-exome sequencing (WES) data. Given the profound impact of rare and de novo coding CNVs on genome organization and human disease, we developed GATK-gCNV, a flexible algorithm to discover rare CNVs from sequencing read-depth information, complete with open-source distribution via GATK. We benchmarked GATK-gCNV in 7,962 exomes from individuals in quartet families with matched genome sequencing and microarray data, finding up to 95% recall of rare coding CNVs at a resolution of more than two exons. We used GATK-gCNV to generate a reference catalog of rare coding CNVs in WES data from 197,306 individuals in the UK Biobank, and observed strong correlations between per-gene CNV rates and measures of mutational constraint, as well as rare CNV associations with multiple traits. In summary, GATK-gCNV is a tunable approach for sensitive and specific CNV discovery in WES data, with broad applications.


Subject(s)
DNA Copy Number Variations, Exome, Humans, Exome/genetics, Exome Sequencing, DNA Copy Number Variations/genetics, Chromosome Mapping, Exons
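
To illustrate what calling CNVs "from sequencing read-depth information" means at its simplest, the sketch below compares a sample's per-exon depth with a cohort median. It is a deliberately naive illustration, not the GATK-gCNV model, which the abstract describes only as a flexible, tunable algorithm. The function name, the diploid baseline, and the toy depths are all assumptions.

```python
from statistics import median


def naive_copy_number(sample_depths, cohort_depths):
    """Estimate per-exon copy number from read depth.

    Each sample is first scaled by its total depth, then the sample's
    scaled depth at each exon is compared with the cohort median at that
    exon, assuming the median individual carries two copies.
    """
    def scale(depths):
        total = sum(depths)
        return [depth / total for depth in depths]

    sample = scale(sample_depths)
    cohort = [scale(depths) for depths in cohort_depths]
    estimates = []
    for exon_index, value in enumerate(sample):
        reference = median(row[exon_index] for row in cohort)
        estimates.append(2 * value / reference)
    return estimates


# Toy data: the second exon has half the depth seen in the cohort.
cohort = [[100, 100, 100, 100]] * 3
estimates = naive_copy_number([100, 50, 100, 100], cohort)
```

Note that the deletion lowers the sample's total depth, which slightly inflates the other exons' estimates; this kind of bias is one reason model-based callers are preferred over a plain ratio.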
9.
Nat Methods ; 20(9): 1323-1335, 2023 Sep.
Article in English | MEDLINE | ID: mdl-37550580

ABSTRACT

Droplet-based single-cell assays, including single-cell RNA sequencing (scRNA-seq), single-nucleus RNA sequencing (snRNA-seq) and cellular indexing of transcriptomes and epitopes by sequencing (CITE-seq), generate considerable background noise counts, the hallmark of which is nonzero counts in cell-free droplets and off-target gene expression in unexpected cell types. Such systematic background noise can lead to batch effects and spurious differential gene expression results. Here we develop a deep generative model based on the phenomenology of noise generation in droplet-based assays. The proposed model accurately distinguishes cell-containing droplets from cell-free droplets, learns the background noise profile and provides noise-free quantification in an end-to-end fashion. We implement this approach in the scalable and robust open-source software package CellBender. Analysis of simulated data demonstrates that CellBender operates near the theoretically optimal denoising limit. Extensive evaluations using real datasets and experimental benchmarks highlight enhanced concordance between droplet-based single-cell data and established gene expression patterns, while the learned background noise profile provides evidence of degraded or uncaptured cell types.


Subject(s)
Small Nuclear RNA, Software, RNA Sequence Analysis/methods, Single-Cell Analysis/methods, Gene Expression Profiling/methods
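
The background-noise idea in the abstract above (counts in cell-free droplets reveal an ambient profile that also contaminates real cells) can be shown with a much simpler stand-in than a deep generative model. The sketch below is not CellBender and does not use its API. It assumes the ambient fraction per cell is already known and simply subtracts the expected ambient counts; the function names are hypothetical.

```python
def ambient_profile(empty_droplet_counts):
    """Estimate the per-gene ambient profile from cell-free droplets.

    `empty_droplet_counts` is a list of per-droplet count vectors with one
    entry per gene. Returns gene proportions that sum to 1.
    """
    gene_totals = [sum(column) for column in zip(*empty_droplet_counts)]
    grand_total = sum(gene_totals)
    return [total / grand_total for total in gene_totals]


def subtract_ambient(cell_counts, profile, ambient_fraction):
    """Remove the expected ambient contribution from one cell's counts.

    `ambient_fraction` is the assumed share of the cell's counts that is
    background. Results are clipped at zero and may be non-integer.
    """
    ambient_total = ambient_fraction * sum(cell_counts)
    return [
        max(0.0, count - ambient_total * share)
        for count, share in zip(cell_counts, profile)
    ]


profile = ambient_profile([[1, 0, 1], [1, 0, 1]])
cleaned = subtract_ambient([10, 20, 0], profile, ambient_fraction=0.1)
```

A model-based approach additionally learns which droplets contain cells and how much background each cell carries, rather than taking both as given.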
10.
Nat Genet ; 54(9): 1320-1331, 2022 Sep.
Article in English | MEDLINE | ID: mdl-35982160

ABSTRACT

Some individuals with autism spectrum disorder (ASD) carry functional mutations rarely observed in the general population. We explored the genes disrupted by these variants from joint analysis of protein-truncating variants (PTVs), missense variants and copy number variants (CNVs) in a cohort of 63,237 individuals. We discovered 72 genes associated with ASD at false discovery rate (FDR) ≤ 0.001 (185 at FDR ≤ 0.05). De novo PTVs, damaging missense variants and CNVs represented 57.5%, 21.1% and 8.44% of association evidence, while CNVs conferred greatest relative risk. Meta-analysis with cohorts ascertained for developmental delay (DD) (n = 91,605) yielded 373 genes associated with ASD/DD at FDR ≤ 0.001 (664 at FDR ≤ 0.05), some of which differed in relative frequency of mutation between ASD and DD cohorts. The DD-associated genes were enriched in transcriptomes of progenitor and immature neuronal cells, whereas genes showing stronger evidence in ASD were more enriched in maturing neurons and overlapped with schizophrenia-associated genes, emphasizing that these neuropsychiatric disorders may share common pathways to risk.


Subject(s)
Autism Spectrum Disorder, Autistic Disorder, Autism Spectrum Disorder/genetics, Autistic Disorder/genetics, DNA Copy Number Variations/genetics, Genetic Predisposition to Disease, Humans, Mutation
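
The gene counts in the abstract above are reported at false discovery rate thresholds. The abstract does not say how FDR was computed, so the sketch below uses the Benjamini-Hochberg procedure purely as a generic illustration of FDR control over per-gene p-values. It should not be read as the study's method, and the p-values are made up.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order.

    Each adjusted value is the smallest FDR level at which that test
    would be declared significant.
    """
    count = len(p_values)
    order = sorted(range(count), key=lambda index: p_values[index])
    adjusted = [0.0] * count
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(count, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, p_values[index] * count / rank)
        adjusted[index] = running_min
    return adjusted


toy_p_values = [0.01, 0.04, 0.03, 0.005]
q_values = benjamini_hochberg(toy_p_values)
significant = [q <= 0.05 for q in q_values]
```

Tightening the threshold from 0.05 to 0.001, as in the abstract, trades the number of genes reported for a lower expected share of false positives among them.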