1.
Article in English | MEDLINE | ID: mdl-39345948

ABSTRACT

Purpose: The etiopathogenesis of coronal nonsyndromic craniosynostosis (cNCS), a congenital condition defined by premature fusion of 1 or both coronal sutures, remains largely unknown. Methods: We conducted the largest genome-wide association study of cNCS, followed by replication, fine mapping, and functional validation of the most significant region using a zebrafish animal model. Results: The genome-wide association study identified 6 independent genome-wide-significant risk alleles, 4 of them at the SEM1-DLX5-DLX6 locus on chromosome 7q21.3, and their combination conferred an over 7-fold increased risk of cNCS. The top variants were replicated in an independent cohort and showed pleiotropic effects on brain and facial morphology and bone mineral density. Fine mapping of 7q21.3 identified a craniofacial transcriptional enhancer (eDlx36) within the linkage region of the top variant (rs4727341; odds ratio [95% confidence interval], 0.48 [0.39-0.59]; P = 1.2E-12) that was located in a SEM1 intron and enriched in 4 rare risk variants. In zebrafish, the activity of the transfected human eDlx36 enhancer was observed in the frontonasal prominence and calvaria during skull development and was reduced when the 4 rare risk variants were introduced into the sequence. Conclusion: Our findings support a polygenic nature of cNCS risk and a functional role of craniofacial enhancers in cNCS susceptibility, with potential broader implications for bone health.
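
As a rough consistency check on the statistics quoted for rs4727341, the following minimal sketch back-calculates a standard error and a two-sided Wald p-value from the odds ratio and its 95% confidence interval. It assumes the interval is symmetric on the log scale and a normal approximation; the function name `p_from_odds_ratio_ci` is hypothetical, and the study itself may have used a different test, so only approximate agreement with the published P = 1.2E-12 should be expected.

```python
import math

def p_from_odds_ratio_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Approximate SE, z, and two-sided p-value from an odds ratio and its 95% CI."""
    # A Wald interval is log(OR) +/- z_crit * SE, so its log-width gives SE.
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    # Two-sided normal tail: 2 * (1 - Phi(|z|)) equals erfc(|z| / sqrt(2)).
    p = math.erfc(abs(z) / math.sqrt(2))
    return se, z, p

# Values taken from the abstract: OR 0.48, 95% CI 0.39-0.59.
se, z, p = p_from_odds_ratio_ci(0.48, 0.39, 0.59)
print(f"SE(log OR) = {se:.3f}, z = {z:.2f}, p = {p:.1e}")
```

Because the published interval is rounded to two decimals, the recovered p-value lands in the same order of magnitude as the reported one rather than matching it exactly.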

2.
Commun Biol ; 7(1): 482, 2024 Apr 20.
Article in English | MEDLINE | ID: mdl-38643247

ABSTRACT

Many biomedical research publications contain gene sets in their supporting tables, and these sets are currently not available for search and reuse. By crawling PubMed Central, the Rummagene server provides access to hundreds of thousands of such mammalian gene sets. So far, we scanned 5,448,589 articles to find 121,237 articles that contain 642,389 gene sets. These sets are served for enrichment analysis, free text, and table title search. Investigating statistical patterns within the Rummagene database, we demonstrate that Rummagene can be used for transcription factor and kinase enrichment analyses, and for gene function predictions. By combining gene set similarity with abstract similarity, Rummagene can find surprising relationships between biological processes, concepts, and named entities. Overall, Rummagene brings to surface the ability to search a massive collection of published biomedical datasets that are currently buried and inaccessible. The Rummagene web application is available at https://rummagene.com .
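
Enrichment analysis against a collection of gene sets is commonly done with an overlap test. The sketch below ranks gene sets by a one-sided hypergeometric p-value; it is an illustration under stated assumptions, not Rummagene's actual implementation. The function names, the table names `table_A` and `table_B`, the toy gene lists, and the background size of 20,000 genes are all hypothetical.

```python
from math import comb

def hypergeom_enrichment_p(query, gene_set, background_size):
    """Return (overlap, P[overlap >= observed]) under a hypergeometric null."""
    query, gene_set = set(query), set(gene_set)
    k = len(query & gene_set)          # observed overlap
    n, K, N = len(query), len(gene_set), background_size
    total = comb(N, n)                 # ways to draw the query from the background
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, min(n, K) + 1))
    return k, tail / total

def rank_gene_sets(query, library, background_size=20000):
    """Sort gene sets from most to least significant overlap with the query."""
    rows = []
    for name, genes in library.items():
        overlap, p = hypergeom_enrichment_p(query, genes, background_size)
        rows.append((name, overlap, p))
    return sorted(rows, key=lambda row: row[2])

# Toy library standing in for gene sets extracted from supporting tables.
library = {
    "table_A": {"DLX5", "DLX6", "SEM1", "MSX1", "MSX2"},
    "table_B": {"TP53", "EGFR", "MYC", "KRAS", "BRAF"},
}
query = {"DLX5", "DLX6", "SEM1", "TP53"}
ranked = rank_gene_sets(query, library)
for name, overlap, p in ranked:
    print(name, overlap, f"{p:.2e}")
```

In practice the p-values would also need multiple-testing correction, since a query is compared against hundreds of thousands of sets.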


Subject(s)
Biomedical Research , Data Mining , Animals , Software , Factual Databases , Gene Expression Regulation , Mammals
3.
Cell ; 187(5): 1255-1277.e27, 2024 Feb 29.
Article in English | MEDLINE | ID: mdl-38359819

ABSTRACT

Despite the successes of immunotherapy in cancer treatment over recent decades, fewer than 10%-20% of cancer cases have demonstrated durable responses to immune checkpoint blockade. To enhance the efficacy of immunotherapies, combination therapies suppressing multiple immune evasion mechanisms are increasingly contemplated. To better understand immune cell surveillance and diverse immune evasion responses in tumor tissues, we comprehensively characterized the immune landscape of more than 1,000 tumors across ten different cancers using CPTAC pan-cancer proteogenomic data. We identified seven distinct immune subtypes based on integrative learning of cell type compositions and pathway activities. We then thoroughly categorized unique genomic, epigenetic, transcriptomic, and proteomic changes associated with each subtype. Further leveraging the deep phosphoproteomic data, we studied kinase activities in different immune subtypes, which revealed potential subtype-specific therapeutic targets. Insights from this work will facilitate the development of future immunotherapy strategies and enhance precision targeting with existing agents.


Subject(s)
Neoplasms , Proteogenomics , Humans , Combined Modality Therapy , Genomics , Neoplasms/genetics , Neoplasms/immunology , Neoplasms/therapy , Proteomics , Tumor Escape
4.
Commun Med (Lond) ; 3(1): 98, 2023 Jul 17.
Article in English | MEDLINE | ID: mdl-37460679

ABSTRACT

BACKGROUND: Birth defects are functional and structural abnormalities that impact about 1 in 33 births in the United States. They have been attributed to genetic factors and to other factors such as exposure to drugs, cosmetics, food, and environmental pollutants during pregnancy, but for most birth defects there are no known causes. METHODS: To further characterize associations between small molecule compounds and their potential to induce specific birth abnormalities, we gathered knowledge from multiple sources to construct a reproductive toxicity Knowledge Graph (ReproTox-KG) with a focus on associations between birth defects, drugs, and genes. Specifically, we gathered drug/birth-defect associations from co-mentions in published abstracts, gene/birth-defect associations from genetic studies, drug- and preclinical-compound-induced gene expression changes in cell lines, known drug targets, genetic burden scores for human genes, and placental crossing scores for small molecules. RESULTS: Using ReproTox-KG and semi-supervised learning (SSL), we scored >30,000 preclinical small molecules for their potential to cross the placenta and induce birth defects, and identified >500 birth-defect/gene/drug cliques that can be used to explain molecular mechanisms for drug-induced birth defects. The ReproTox-KG can be accessed via a web-based user interface available at https://maayanlab.cloud/reprotox-kg . This site enables users to explore the associations between birth defects, approved and preclinical drugs, and all human genes. CONCLUSIONS: ReproTox-KG provides a resource for exploring knowledge about the molecular mechanisms of birth defects with the potential of predicting the likelihood of genes and preclinical small molecules to induce birth defects.


While birth defects are common, for most birth defects there are no known causes. During pregnancy, developing babies are exposed to drugs, cosmetics, food, and environmental pollutants that may cause birth defects. However, exactly how these environmental factors are involved in producing birth defects is difficult to discern. Also, birth defects can be a consequence of the genes inherited from the parents. We combined general data about human genes and drugs with specific data previously implicating genes and drugs in inducing birth defects to create a knowledge graph representation that connects genes, drugs, and birth defects. This knowledge graph can be used to explore new links that may explain why birth defects occur, particularly those that result from a combination of inherited and environmental influences.
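
The birth-defect/gene/drug cliques mentioned in the abstract can be pictured as triangles in a graph with three node types. The following minimal sketch, assuming each association type is stored as a set of pairs, lists every triple in which all three pairwise links are present. The function name `find_triads` and all node names (`drug_A`, `gene_X`, `defect_1`, and so on) are hypothetical placeholders, and the actual ReproTox-KG analysis may define and score cliques differently.

```python
from itertools import product

def find_triads(drug_defect, gene_defect, drug_gene):
    """Return (defect, gene, drug) triples in which all three pairs are linked."""
    # Only defects linked to at least one drug and one gene can close a triangle.
    defects = {d for _, d in drug_defect} & {d for _, d in gene_defect}
    triads = []
    for defect in sorted(defects):
        drugs = sorted(drug for drug, d in drug_defect if d == defect)
        genes = sorted(gene for gene, d in gene_defect if d == defect)
        for gene, drug in product(genes, drugs):
            if (drug, gene) in drug_gene:   # third edge closes the triangle
                triads.append((defect, gene, drug))
    return triads

# Hypothetical toy associations, one set of pairs per edge type.
drug_defect = {("drug_A", "defect_1"), ("drug_B", "defect_1"), ("drug_B", "defect_2")}
gene_defect = {("gene_X", "defect_1"), ("gene_Y", "defect_2")}
drug_gene = {("drug_A", "gene_X"), ("drug_B", "gene_Z")}

triads = find_triads(drug_defect, gene_defect, drug_gene)
print(triads)
```

Here only `defect_1`, `gene_X`, and `drug_A` are mutually connected; `drug_B` is linked to both defects but to neither of their genes, so it closes no triangle.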

5.
Nucleic Acids Res ; 51(W1): W168-W179, 2023 Jul 05.
Article in English | MEDLINE | ID: mdl-37166973

ABSTRACT

Gene and protein set enrichment analysis is a critical step in the analysis of data collected from omics experiments. Enrichr is a popular gene set enrichment analysis web-server search engine that contains hundreds of thousands of annotated gene sets. While Enrichr has been useful in providing enrichment analysis with many gene set libraries from different categories, integrating enrichment results across libraries and domains of knowledge can further hypothesis generation. To this end, Enrichr-KG is a knowledge graph database and a web-server application that combines selected gene set libraries from Enrichr for integrative enrichment analysis and visualization. The enrichment results are presented as subgraphs made of nodes and links that connect genes to their enriched terms. In addition, users of Enrichr-KG can add gene-gene links, as well as predicted genes to the subgraphs. This graphical representation of cross-library results with enriched and predicted genes can illuminate hidden associations between genes and annotated enriched terms from across datasets and resources. Enrichr-KG currently serves 26 gene set libraries from different categories that include transcription, pathways, ontologies, diseases/drugs, and cell types. To demonstrate the utility of Enrichr-KG we provide several case studies. Enrichr-KG is freely available at: https://maayanlab.cloud/enrichr-kg.
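
To make the idea of enrichment results as subgraphs concrete, here is a minimal sketch that turns per-library results into gene nodes, term nodes, and gene-to-term links, then reports genes that connect terms from more than one library. The input layout, function names, and the library, term, and gene names are all hypothetical; this is not the Enrichr-KG data model or API.

```python
def build_enrichment_subgraph(enriched_terms):
    """Build (nodes, links) from {library: {term: overlapping_genes}}."""
    nodes, links = set(), set()
    for library, terms in enriched_terms.items():
        for term, genes in terms.items():
            term_node = ("term", f"{library}:{term}")
            nodes.add(term_node)
            for gene in genes:
                gene_node = ("gene", gene)
                nodes.add(gene_node)
                links.add((gene_node, term_node))   # gene -> enriched term
    return nodes, links

def cross_library_genes(links):
    """Genes linked to enriched terms from more than one library."""
    libraries_by_gene = {}
    for (_, gene), (_, term_id) in links:
        library = term_id.split(":", 1)[0]
        libraries_by_gene.setdefault(gene, set()).add(library)
    return sorted(g for g, libs in libraries_by_gene.items() if len(libs) > 1)

# Hypothetical enrichment results from two libraries.
results = {
    "pathways": {"pathway_1": {"GENE1", "GENE2"}},
    "cell_types": {"cell_type_1": {"GENE2", "GENE3"}},
}
nodes, links = build_enrichment_subgraph(results)
bridging = cross_library_genes(links)
print(len(nodes), len(links), bridging)
```

Genes such as `GENE2` in this toy input are the kind of cross-library connectors that a combined graph view makes visible.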


Subject(s)
Gene Library , Proteins , Software , Factual Databases , Search Engine , Internet
6.
Commun Biol ; 5(1): 1066, 2022 Oct 07.
Article in English | MEDLINE | ID: mdl-36207580

ABSTRACT

The phenotype of a cell and its underlying molecular state are strongly influenced by extracellular signals, including growth factors, hormones, and extracellular matrix proteins. While these signals are normally tightly controlled, their dysregulation leads to phenotypic and molecular states associated with diverse diseases. To develop a detailed understanding of the linkage between molecular and phenotypic changes, we generated a comprehensive dataset that catalogs the transcriptional, proteomic, epigenomic and phenotypic responses of MCF10A mammary epithelial cells after exposure to the ligands EGF, HGF, OSM, IFNG, TGFB and BMP2. Systematic assessment of the molecular and cellular phenotypes induced by these ligands comprises the LINCS Microenvironment (ME) perturbation dataset, which has been curated and made publicly available for community-wide analysis and development of novel computational methods (synapse.org/LINCS_MCF10A). In illustrative analyses, we demonstrate how this dataset can be used to discover functionally related molecular features linked to specific cellular phenotypes. Beyond these analyses, this dataset will serve as a resource for the broader scientific community to mine for biological insights, to compare signals carried across distinct molecular modalities, and to develop new computational methods for integrative data analysis.


Subject(s)
Epidermal Growth Factor , Proteomics , Epidermal Growth Factor/pharmacology , Extracellular Matrix Proteins , Ligands , Phenotype
7.
BMC Bioinformatics ; 23(1): 374, 2022 Sep 13.
Article in English | MEDLINE | ID: mdl-36100892

ABSTRACT

The L1000 technology, a cost-effective high-throughput transcriptomics technology, has been applied to profile a collection of human cell lines for their gene expression response to >30,000 chemical and genetic perturbations. In total, there are currently over 3 million available L1000 profiles. Such a dataset is invaluable for the discovery of drug and target candidates and for inferring mechanisms of action for small molecules. The L1000 assay measures the mRNA expression of only 978 landmark genes, while 11,350 additional genes are reliably inferred computationally. The lack of full genome coverage limits knowledge discovery for half of the human protein-coding genes, as well as the potential for integration with other transcriptomics profiling data. Here we present a two-step Deep Learning model that transforms L1000 profiles into RNA-seq-like profiles. The input to the model is the measured expression of the 978 landmark genes, while the output is a vector of 23,614 RNA-seq-like gene expression values. The model first transforms the landmark genes into RNA-seq-like 978-gene profiles using a modified CycleGAN model applied to unpaired data. The transformed 978 RNA-seq-like landmark genes are then extrapolated into the full genome space with a fully connected neural network model. The two-step model achieves a Pearson's correlation coefficient of 0.914 and a root mean square error of 1.167 when tested on a published paired L1000/RNA-seq dataset produced by the LINCS and GTEx programs. The processed RNA-seq-like profiles are made available for download, signature search, and gene-centric reverse search with unique case studies.
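
The two evaluation metrics quoted in this abstract, Pearson's correlation coefficient and root mean square error, can be computed as in the minimal sketch below. The toy vectors are hypothetical, and the abstract does not state whether the reported scores were computed per sample, per gene, or averaged, so this only illustrates the formulas themselves.

```python
import math

def pearson(x, y):
    """Pearson's correlation coefficient between two equal-length vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)

def rmse(x, y):
    """Root mean square error between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))

# Hypothetical predicted vs. measured expression values for four genes.
predicted = [1.0, 2.0, 3.0, 4.0]
measured = [1.5, 2.5, 3.5, 4.5]
r = pearson(predicted, measured)
err = rmse(predicted, measured)
print(f"r = {r:.3f}, RMSE = {err:.3f}")
```

The toy example also shows why both metrics are reported together: a constant offset leaves the correlation at 1 while still producing a nonzero error.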


Subject(s)
Deep Learning , Gene Expression Profiling , Humans , RNA-Seq , Transcriptome
8.
Curr Protoc ; 2(7): e487, 2022 Jul.
Article in English | MEDLINE | ID: mdl-35876555

ABSTRACT

The Library of Integrated Network-based Cellular Signatures (LINCS) was an NIH Common Fund program that aimed to expand our knowledge about human cellular responses to chemical, genetic, and microenvironment perturbations. Responses to perturbations were measured by transcriptomics, proteomics, cellular imaging, and other high content assays. The second phase of the LINCS program, which lasted 7 years, involved the engagement of six data and signature generation centers (DSGCs) and one data coordination and integration center (DCIC). The DSGCs and the DCIC developed several digital resources, including tools, databases, and workflows that aim to facilitate the use of the LINCS data and integrate this data with other publicly available data. The digital resources developed by the DSGCs and the DCIC can be used to gain new biological and pharmacological insights that can lead to the development of novel therapeutics. This protocol provides step-by-step instructions for processing the LINCS data into signatures, and utilizing the digital resources developed by the LINCS consortia for hypothesis generation and knowledge discovery. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC.

Basic Protocol 1: Navigating L1000 tools and data in CLUE.io
Basic Protocol 2: Computing signatures from the L1000 data with the CD method
Basic Protocol 3: Analyzing lists of differentially expressed genes and querying them against the L1000 data with BioJupies and the Bulk RNA-seq Appyter
Basic Protocol 4: Utilizing the L1000FWD resource for drug discovery
Basic Protocol 5: KINOMEscan and the KINOMEscan Appyter
Basic Protocol 6: LINCS P100 and GCP Proteomics Assays
Basic Protocol 7: The LINCS Joint Project (LJP)
Basic Protocol 8: The LINCS Data Portals and SigCom LINCS
Basic Protocol 9: Creating and analyzing signatures with iLINCS


Subject(s)
Drug Discovery , Proteomics , Factual Databases , Drug Discovery/methods , Gene Library , Humans , Transcriptome
9.
Nucleic Acids Res ; 50(W1): W697-W709, 2022 Jul 05.
Article in English | MEDLINE | ID: mdl-35524556

ABSTRACT

Millions of transcriptome samples were generated by the Library of Integrated Network-based Cellular Signatures (LINCS) program. When these data are processed into searchable signatures along with signatures extracted from Genotype-Tissue Expression (GTEx) and Gene Expression Omnibus (GEO), connections between drugs, genes, pathways and diseases can be illuminated. SigCom LINCS is a webserver that serves over a million gene expression signatures processed, analyzed, and visualized from LINCS, GTEx, and GEO. SigCom LINCS is built with Signature Commons, a cloud-agnostic skeleton Data Commons with a focus on serving searchable signatures. SigCom LINCS provides a rapid signature similarity search for mimickers and reversers given sets of up and down genes, a gene set, a single gene, or any search term. Additionally, users of SigCom LINCS can perform a metadata search to find and analyze subsets of signatures and find information about genes and drugs. SigCom LINCS is findable, accessible, interoperable, and reusable (FAIR) with metadata linked to standard ontologies and vocabularies. In addition, all the data and signatures within SigCom LINCS are available via a well-documented API. In summary, SigCom LINCS, available at https://maayanlab.cloud/sigcom-lincs, is a rich webserver resource for accelerating drug and target discovery in systems pharmacology.
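
A search for mimickers and reversers given sets of up and down genes can be sketched with a simple signed overlap score, shown below. This is an illustrative stand-in, not the similarity measure or API that SigCom LINCS actually uses; the function names, signature names, and gene identifiers are all hypothetical.

```python
def direction_score(query_up, query_down, sig_up, sig_down):
    """Positive when a signature mimics the query, negative when it reverses it."""
    query_up, query_down = set(query_up), set(query_down)
    sig_up, sig_down = set(sig_up), set(sig_down)
    # Concordant: genes moving in the same direction in query and signature.
    concordant = len(query_up & sig_up) + len(query_down & sig_down)
    # Discordant: genes moving in opposite directions.
    discordant = len(query_up & sig_down) + len(query_down & sig_up)
    total = len(query_up) + len(query_down)
    return (concordant - discordant) / total if total else 0.0

def rank_signatures(query_up, query_down, signatures):
    """Sort signatures from strongest mimicker to strongest reverser."""
    scored = [
        (name, direction_score(query_up, query_down, up, down))
        for name, (up, down) in signatures.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)

# Hypothetical query and signature database: name -> (up genes, down genes).
query_up, query_down = {"G1", "G2"}, {"G3", "G4"}
signatures = {
    "sig_mimic": ({"G1", "G2"}, {"G3"}),
    "sig_reverse": ({"G3", "G4"}, {"G1"}),
    "sig_unrelated": ({"G9"}, {"G8"}),
}
ranked = rank_signatures(query_up, query_down, signatures)
print(ranked)
```

Reading the ranked list from the top gives candidate mimickers, and reading it from the bottom gives candidate reversers.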


Subject(s)
Metadata , Transcriptome , Transcriptome/genetics , Search Engine
10.
Bioinform Adv ; 2(1): vbac013, 2022.
Article in English | MEDLINE | ID: mdl-35368424

ABSTRACT

Motivation: Many biological and biomedical researchers commonly search for information about genes and drugs to gather knowledge from these resources. For the most part, such information is served as landing pages in disparate data repositories and web portals. Results: The Gene and Drug Landing Page Aggregator (GDLPA) provides users with access to 50 gene-centric and 19 drug-centric repositories, enabling them to retrieve landing pages corresponding to their gene and drug queries. Bringing these resources together into one dashboard that directs users to the landing pages across many resources can help centralize gene- and drug-centric knowledge, as well as raise awareness of available resources that may be missed when using standard search engines. To demonstrate the utility of GDLPA, case studies for the gene klotho and the drug remdesivir were developed. The first case study highlights the potential role of klotho as a drug target for aging and kidney disease, while the second study gathers knowledge regarding approval, usage, and safety for remdesivir, the first approved coronavirus disease 2019 therapeutic. Finally, based on our experience, we provide guidelines for developing effective landing pages for genes and drugs. Availability and implementation: GDLPA is open source and is available from: https://cfde-gene-pages.cloud/. Supplementary information: Supplementary data are available at Bioinformatics Advances online.
