Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Más filtros

Banco de datos
Tipo de estudio
Tipo del documento
País de afiliación
Intervalo de año de publicación
1.
Cell ; 186(7): 1493-1511.e40, 2023 03 30.
Artículo en Inglés | MEDLINE | ID: mdl-37001506

RESUMEN

Understanding how genetic variants impact molecular phenotypes is a key goal of functional genomics, currently hindered by reliance on a single haploid reference genome. Here, we present the EN-TEx resource of 1,635 open-access datasets from four donors (∼30 tissues × âˆ¼15 assays). The datasets are mapped to matched, diploid genomes with long-read phasing and structural variants, instantiating a catalog of >1 million allele-specific loci. These loci exhibit coordinated activity along haplotypes and are less conserved than corresponding, non-allele-specific ones. Surprisingly, a deep-learning transformer model can predict the allele-specific activity based only on local nucleotide-sequence context, highlighting the importance of transcription-factor-binding motifs particularly sensitive to variants. Furthermore, combining EN-TEx with existing genome annotations reveals strong associations between allele-specific and GWAS loci. It also enables models for transferring known eQTLs to difficult-to-profile tissues (e.g., from skin to heart). Overall, EN-TEx provides rich data and generalizable models for more accurate personal functional genomics.


Asunto(s)
Epigenoma , Sitios de Carácter Cuantitativo , Estudio de Asociación del Genoma Completo , Genómica , Fenotipo , Polimorfismo de Nucleótido Simple
2.
bioRxiv ; 2024 Jun 10.
Artículo en Inglés | MEDLINE | ID: mdl-38895432

RESUMEN

Understanding the function and fitness effects of diverse plant genomes requires transferable models. Language models (LMs) pre-trained on large-scale biological sequences can learn evolutionary conservation, thus expected to offer better cross-species prediction through fine-tuning on limited labeled data compared to supervised deep learning models. We introduce PlantCaduceus, a plant DNA LM based on the Caduceus and Mamba architectures, pre-trained on a carefully curated dataset consisting of 16 diverse Angiosperm genomes. Fine-tuning PlantCaduceus on limited labeled Arabidopsis data for four tasks involving transcription and translation modeling demonstrated high transferability to maize that diverged 160 million years ago, outperforming the best baseline model by 1.45-fold to 7.23-fold. PlantCaduceus also enables genome-wide deleterious mutation identification without multiple sequence alignment (MSA). PlantCaduceus demonstrated a threefold enrichment of rare alleles in prioritized deleterious mutations compared to MSA-based methods and matched state-of-the-art protein LMs. PlantCaduceus is a versatile pre-trained DNA LM expected to accelerate plant genomics and crop breeding applications.

3.
Gene Regul Syst Bio ; 13: 1177625019840282, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-31019365

RESUMEN

Pharmacological time-series data, from comparative dosing studies, are critical to characterizing drug effects. Reconciling the data from multiple studies is inevitably difficult; multiple in vivo high-throughput -omics studies are necessary to capture the global and temporal effects of the drug, but these experiments, though analogous, differ in (microarray or other) platforms, time-scales, and dosing regimens and thus cannot be directly combined or compared. This investigation addresses this reconciliation issue with a meta-analysis technique aimed at assessing the intrinsic activity at the pathway level. The purpose of this is to characterize the dosing effects of methylprednisolone (MPL), a widely used anti-inflammatory and immunosuppressive corticosteroid (CS), within the liver. A multivariate decomposition approach is applied to analyze acute and chronic MPL dosing in male adrenalectomized rats and characterize the dosing-dependent differences in the dynamic response of MPL-responsive signaling and metabolic pathways. We demonstrate how to deconstruct signaling and metabolic pathways into their constituent pathway activities, activities which are scored for intrinsic pathway activity. Dosing-induced changes in the dynamics of pathway activities are compared using a model-based assessment of pathway dynamics, extending the principles of pharmacokinetics/pharmacodynamics (PKPD) to describe pathway activities. The model-based approach enabled us to hypothesize on the likely emergence (or disappearance) of indirect dosing-dependent regulatory interactions, pointing to likely mechanistic implications of dosing of MPL transcriptional regulation. Both acute and chronic MPL administration induced a strong core of activity within pathway families including the following: lipid metabolism, amino acid metabolism, carbohydrate metabolism, metabolism of cofactors and vitamins, regulation of essential organelles, and xenobiotic metabolism pathway families. Pathway activities alter between acute and chronic dosing, indicating that MPL response is dosing dependent. Furthermore, because multiple pathway activities are dominant within a single pathway, we observe that pathways cannot be defined by a single response. Instead, pathways are defined by multiple, complex, and temporally related activities corresponding to different subgroups of genes within each pathway.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA