Pesquisa | BVS Aleitamento Materno

pyPAGE: A framework for Addressing biases in gene-set enrichment analysis-A case study on Alzheimer's disease.

Bakulin, Artemy; Teyssier, Noam B; Kampmann, Martin; Khoroshkin, Matvei; Goodarzi, Hani.

PLoS Comput Biol ; 20(9): e1012346, 2024 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-39236079

RESUMO

Inferring the driving regulatory programs from comparative analysis of gene expression data is a cornerstone of systems biology. Many computational frameworks were developed to address this problem, including our iPAGE (information-theoretic Pathway Analysis of Gene Expression) toolset that uses information theory to detect non-random patterns of expression associated with given pathways or regulons. Our recent observations, however, indicate that existing approaches are susceptible to the technical biases that are inherent to most real world annotations. To address this, we have extended our information-theoretic framework to account for specific biases and artifacts in biological networks using the concept of conditional information. To showcase pyPAGE, we performed a comprehensive analysis of regulatory perturbations that underlie the molecular etiology of Alzheimer's disease (AD). pyPAGE successfully recapitulated several known AD-associated gene expression programs. We also discovered several additional regulons whose differential activity is significantly associated with AD. We further explored how these regulators relate to pathological processes in AD through cell-type specific analysis of single cell and spatial gene expression datasets. Our findings showcase the utility of pyPAGE as a precise and reliable biomarker discovery in complex diseases such as Alzheimer's disease.

Assuntos

Doença de Alzheimer , Perfilação da Expressão Gênica , Doença de Alzheimer/genética , Humanos , Perfilação da Expressão Gênica/métodos , Biologia Computacional/métodos , Redes Reguladoras de Genes/genética , Software , Bases de Dados Genéticas , Biologia de Sistemas/métodos

Ribonanza: deep learning of RNA structure through dual crowdsourcing.

He, Shujun; Huang, Rui; Townley, Jill; Kretsch, Rachael C; Karagianes, Thomas G; Cox, David B T; Blair, Hamish; Penzar, Dmitry; Vyaltsev, Valeriy; Aristova, Elizaveta; Zinkevich, Arsenii; Bakulin, Artemy; Sohn, Hoyeol; Krstevski, Daniel; Fukui, Takaaki; Tatematsu, Fumiya; Uchida, Yusuke; Jang, Donghoon; Lee, Jun Seong; Shieh, Roger; Ma, Tom; Martynov, Eduard; Shugaev, Maxim V; Bukhari, Habib S T; Fujikawa, Kazuki; Onodera, Kazuki; Henkel, Christof; Ron, Shlomo; Romano, Jonathan; Nicol, John J; Nye, Grace P; Wu, Yuan; Choe, Christian; Reade, Walter; Das, Rhiju.

bioRxiv ; 2024 Jun 11.

Artigo em Inglês | MEDLINE | ID: mdl-38464325

RESUMO

Prediction of RNA structure from sequence remains an unsolved problem, and progress has been slowed by a paucity of experimental data. Here, we present Ribonanza, a dataset of chemical mapping measurements on two million diverse RNA sequences collected through Eterna and other crowdsourced initiatives. Ribonanza measurements enabled solicitation, training, and prospective evaluation of diverse deep neural networks through a Kaggle challenge, followed by distillation into a single, self-contained model called RibonanzaNet. When fine tuned on auxiliary datasets, RibonanzaNet achieves state-of-the-art performance in modeling experimental sequence dropout, RNA hydrolytic degradation, and RNA secondary structure, with implications for modeling RNA tertiary structure.

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA