Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 2.133
Filtrar
1.
Cell ; 187(12): 3056-3071.e17, 2024 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-38848678

RESUMO

The currently accepted intestinal epithelial cell organization model proposes that Lgr5+ crypt-base columnar (CBC) cells represent the sole intestinal stem cell (ISC) compartment. However, previous studies have indicated that Lgr5+ cells are dispensable for intestinal regeneration, leading to two major hypotheses: one favoring the presence of a quiescent reserve ISC and the other calling for differentiated cell plasticity. To investigate these possibilities, we studied crypt epithelial cells in an unbiased fashion via high-resolution single-cell profiling. These studies, combined with in vivo lineage tracing, show that Lgr5 is not a specific ISC marker and that stemness potential exists beyond the crypt base and resides in the isthmus region, where undifferentiated cells participate in intestinal homeostasis and regeneration following irradiation (IR) injury. Our results provide an alternative model of intestinal epithelial cell organization, suggesting that stemness potential is not restricted to CBC cells, and neither de-differentiation nor reserve ISC are drivers of intestinal regeneration.


Assuntos
Homeostase , Mucosa Intestinal , Receptores Acoplados a Proteínas G , Regeneração , Células-Tronco , Animais , Células-Tronco/metabolismo , Células-Tronco/citologia , Camundongos , Mucosa Intestinal/metabolismo , Receptores Acoplados a Proteínas G/metabolismo , Intestinos/citologia , Diferenciação Celular , Camundongos Endogâmicos C57BL , Células Epiteliais/metabolismo , Análise de Célula Única , Masculino
2.
Annu Rev Immunol ; 33: 505-38, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25650177

RESUMO

Mammalian lymphoid immunity is mediated by fast and slow responders to pathogens. Fast innate lymphocytes are active within hours after infections in mucosal tissues. Slow adaptive lymphocytes are conventional T and B cells with clonal antigen receptors that function days after pathogen exposure. A transcription factor (TF) regulatory network guiding early T cell development is at the core of effector function diversification in all innate lymphocytes, and the kinetics of immune responses is set by developmental programming. Operational units within the innate lymphoid system are not classified by the types of pathogen-sensing machineries but rather by discrete effector functions programmed by regulatory TF networks. Based on the evolutionary history of TFs of the regulatory networks, fast effectors likely arose earlier in the evolution of animals to fortify body barriers, and in mammals they often develop in fetal ontogeny prior to the establishment of fully competent adaptive immunity.


Assuntos
Imunidade Inata/fisiologia , Linfócitos/imunologia , Linfócitos/metabolismo , Linfopoese , Fatores de Transcrição/metabolismo , Animais , Evolução Biológica , Humanos , Imunidade , Ligação Proteica/imunologia , Transdução de Sinais
3.
Annu Rev Cell Dev Biol ; 39: 91-121, 2023 Oct 16.
Artigo em Inglês | MEDLINE | ID: mdl-37418774

RESUMO

Intercellular signaling molecules, known as morphogens, act at a long range in developing tissues to provide spatial information and control properties such as cell fate and tissue growth. The production, transport, and removal of morphogens shape their concentration profiles in time and space. Downstream signaling cascades and gene regulatory networks within cells then convert the spatiotemporal morphogen profiles into distinct cellular responses. Current challenges are to understand the diverse molecular and cellular mechanisms underlying morphogen gradient formation, as well as the logic of downstream regulatory circuits involved in morphogen interpretation. This knowledge, combining experimental and theoretical results, is essential to understand emerging properties of morphogen-controlled systems, such as robustness and scaling.

4.
Cell ; 182(1): 245-261.e17, 2020 07 09.
Artigo em Inglês | MEDLINE | ID: mdl-32649877

RESUMO

Genomic studies of lung adenocarcinoma (LUAD) have advanced our understanding of the disease's biology and accelerated targeted therapy. However, the proteomic characteristics of LUAD remain poorly understood. We carried out a comprehensive proteomics analysis of 103 cases of LUAD in Chinese patients. Integrative analysis of proteome, phosphoproteome, transcriptome, and whole-exome sequencing data revealed cancer-associated characteristics, such as tumor-associated protein variants, distinct proteomics features, and clinical outcomes in patients at an early stage or with EGFR and TP53 mutations. Proteome-based stratification of LUAD revealed three subtypes (S-I, S-II, and S-III) related to different clinical and molecular features. Further, we nominated potential drug targets and validated the plasma protein level of HSP 90ß as a potential prognostic biomarker for LUAD in an independent cohort. Our integrative proteomics analysis enables a more comprehensive understanding of the molecular landscape of LUAD and offers an opportunity for more precise diagnosis and treatment.


Assuntos
Adenocarcinoma de Pulmão/metabolismo , Neoplasias Pulmonares/metabolismo , Proteômica , Adenocarcinoma de Pulmão/genética , Povo Asiático/genética , Biomarcadores Tumorais/genética , Biomarcadores Tumorais/metabolismo , Sistemas de Liberação de Medicamentos , Feminino , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Humanos , Neoplasias Pulmonares/genética , Masculino , Pessoa de Meia-Idade , Mutação/genética , Estadiamento de Neoplasias , Fosfoproteínas/metabolismo , Análise de Componente Principal , Prognóstico , Proteoma/metabolismo , Resultado do Tratamento , Proteína Supressora de Tumor p53/genética
5.
Cell ; 174(2): 350-362.e17, 2018 07 12.
Artigo em Inglês | MEDLINE | ID: mdl-29887379

RESUMO

Noncoding RNAs (ncRNAs) play increasingly appreciated gene-regulatory roles. Here, we describe a regulatory network centered on four ncRNAs-a long ncRNA, a circular RNA, and two microRNAs-using gene editing in mice to probe the molecular consequences of disrupting key components of this network. The long ncRNA Cyrano uses an extensively paired site to miR-7 to trigger destruction of this microRNA. Cyrano-directed miR-7 degradation is much more effective than previously described examples of target-directed microRNA degradation, which come primarily from studies of artificial and viral RNAs. By reducing miR-7 levels, Cyrano prevents repression of miR-7-targeted mRNAs and enables accumulation of Cdr1as, a circular RNA known to regulate neuronal activity. Without Cyrano, excess miR-7 causes cytoplasmic destruction of Cdr1as in neurons, in part through enhanced slicing of Cdr1as by a second miRNA, miR-671. Thus, several types of ncRNAs can collaborate to establish a sophisticated regulatory network.


Assuntos
Encéfalo/metabolismo , Redes Reguladoras de Genes , RNA não Traduzido/metabolismo , Animais , Citoplasma/metabolismo , Camundongos , Camundongos Endogâmicos C57BL , Camundongos Knockout , MicroRNAs/genética , MicroRNAs/metabolismo , Neurônios/metabolismo , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo
6.
Mol Cell ; 83(9): 1462-1473.e5, 2023 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-37116493

RESUMO

DNA binding domains (DBDs) of transcription factors (TFs) recognize DNA sequence motifs that are highly abundant in genomes. Within cells, TFs bind a subset of motif-containing sites as directed by either their DBDs or DBD-external (nonDBD) sequences. To define the relative roles of DBDs and nonDBDs in directing binding preferences, we compared the genome-wide binding of 48 (∼30%) budding yeast TFs with their DBD-only, nonDBD-truncated, and nonDBD-only mutants. With a few exceptions, binding locations differed between DBDs and TFs, resulting from the cumulative action of multiple determinants mapped mostly to disordered nonDBD regions. Furthermore, TFs' preferences for promoters of the fuzzy nucleosome architecture were lost in DBD-only mutants, whose binding spread across promoters, implicating nonDBDs' preferences in this hallmark of budding yeast regulatory design. We conclude that DBDs and nonDBDs employ complementary DNA-targeting strategies, whose balance defines TF binding specificity along genomes.


Assuntos
DNA , Fatores de Transcrição , Sítios de Ligação , Fatores de Transcrição/metabolismo , Ligação Proteica , DNA/genética
7.
Mol Cell ; 82(2): 248-259, 2022 01 20.
Artigo em Inglês | MEDLINE | ID: mdl-35063095

RESUMO

While measurements of RNA expression have dominated the world of single-cell analyses, new single-cell techniques increasingly allow collection of different data modalities, measuring different molecules, structural connections, and intermolecular interactions. Integrating the resulting multimodal single-cell datasets is a new bioinformatics challenge. Equally important, it is a new experimental design challenge for the bench scientist, who is not only choosing from a myriad of techniques for each data modality but also faces new challenges in experimental design. The ultimate goal is to design, execute, and analyze multimodal single-cell experiments that are more than just descriptive but enable the learning of new causal and mechanistic biology. This objective requires strict consideration of the goals behind the analysis, which might range from mapping the heterogeneity of a cellular population to assembling system-wide causal networks that can further our understanding of cellular functions and eventually lead to models of tissues and organs. We review steps and challenges toward this goal. Single-cell transcriptomics is now a mature technology, and methods to measure proteins, lipids, small-molecule metabolites, and other molecular phenotypes at the single-cell level are rapidly developing. Integrating these single-cell readouts so that each cell has measurements of multiple types of data, e.g., transcriptomes, proteomes, and metabolomes, is expected to allow identification of highly specific cellular subpopulations and to provide the basis for inferring causal biological mechanisms.


Assuntos
Biologia Computacional , Projetos de Pesquisa , Análise de Célula Única , Integração de Sistemas , Animais , Perfilação da Expressão Gênica , Humanos , Metabolômica , Proteômica
8.
Annu Rev Neurosci ; 43: 1-30, 2020 07 08.
Artigo em Inglês | MEDLINE | ID: mdl-31299170

RESUMO

Cortical interneurons display striking differences in shape, physiology, and other attributes, challenging us to appropriately classify them. We previously suggested that interneuron types should be defined by their role in cortical processing. Here, we revisit the question of how to codify their diversity based upon their division of labor and function as controllers of cortical information flow. We suggest that developmental trajectories provide a guide for appreciating interneuron diversity and argue that subtype identity is generated using a configurational (rather than combinatorial) code of transcription factors that produce attractor states in the underlying gene regulatory network. We present our updated three-stage model for interneuron specification: an initial cardinal step, allocating interneurons into a few major classes, followed by definitive refinement, creating subclasses upon settling within the cortex, and lastly, state determination, reflecting the incorporation of interneurons into functional circuit ensembles. We close by discussing findings indicating that major interneuron classes are both evolutionarily ancient and conserved. We propose that the complexity of cortical circuits is generated by phylogenetically old interneuron types, complemented by an evolutionary increase in principal neuron diversity. This suggests that a natural neurobiological definition of interneuron types might be derived from a match between their developmental origin and computational function.


Assuntos
Diferenciação Celular/fisiologia , Córtex Cerebral/fisiologia , Interneurônios/fisiologia , Neurogênese/fisiologia , Animais , Humanos , Neurônios/fisiologia , Fatores de Transcrição/metabolismo
9.
Annu Rev Cell Dev Biol ; 30: 647-75, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25288119

RESUMO

Mouse embryonic stem (ES) cells perpetuate in vitro the broad developmental potential of naïve founder cells in the preimplantation embryo. ES cells self-renew relentlessly in culture but can reenter embryonic development seamlessly, differentiating on schedule to form all elements of the fetus. Here we review the properties of these remarkable cells. Arising from the stability, homogeneity, and equipotency of ES cells, we consider the concept of a pluripotent ground state. We evaluate the authenticity of ES cells in relation to cells in the embryo and examine their utility for dissecting mechanisms that confer pluripotency and that execute fate choice. We summarize current knowledge of the transcription factor circuitry that governs the ES cell state and discuss the opportunity to expose molecular logic further through iterative computational modeling and experimentation. Finally, we present a perspective on unresolved questions, including the challenge of deriving ground state pluripotent stem cells from non-rodent species.


Assuntos
Células-Tronco Embrionárias/citologia , Animais , Divisão Celular Assimétrica , Blastocisto/citologia , Técnicas de Cultura de Células , Linhagem da Célula , Células Cultivadas , Reprogramação Celular , Técnicas de Cocultura , Meios de Cultura , Meios de Cultura Livres de Soro , Células-Tronco de Carcinoma Embrionário/citologia , Células-Tronco Embrionárias/fisiologia , Fibroblastos/citologia , Perfilação da Expressão Gênica , Regulação da Expressão Gênica no Desenvolvimento , Genes Reporter , Camadas Germinativas/citologia , Humanos , Peptídeos e Proteínas de Sinalização Intercelular/farmacologia , Peptídeos e Proteínas de Sinalização Intercelular/fisiologia , Fator Inibidor de Leucemia/fisiologia , Camundongos , Células-Tronco Pluripotentes/citologia , Células-Tronco Pluripotentes/fisiologia , Fatores de Transcrição/farmacologia , Fatores de Transcrição/fisiologia
10.
Mol Cell ; 78(5): 960-974.e11, 2020 06 04.
Artigo em Inglês | MEDLINE | ID: mdl-32330456

RESUMO

Dynamic cellular processes such as differentiation are driven by changes in the abundances of transcription factors (TFs). However, despite years of studies, our knowledge about the protein copy number of TFs in the nucleus is limited. Here, by determining the absolute abundances of 103 TFs and co-factors during the course of human erythropoiesis, we provide a dynamic and quantitative scale for TFs in the nucleus. Furthermore, we establish the first gene regulatory network of cell fate commitment that integrates temporal protein stoichiometry data with mRNA measurements. The model revealed quantitative imbalances in TFs' cross-antagonistic relationships that underlie lineage determination. Finally, we made the surprising discovery that, in the nucleus, co-repressors are dramatically more abundant than co-activators at the protein level, but not at the RNA level, with profound implications for understanding transcriptional regulation. These analyses provide a unique quantitative framework to understand transcriptional regulation of cell differentiation in a dynamic context.


Assuntos
Eritropoese/genética , Redes Reguladoras de Genes/genética , Fatores de Transcrição/genética , Bases de Dados Factuais , Regulação da Expressão Gênica/genética , Hematopoese/genética , Humanos , Proteômica/métodos , Fatores de Transcrição/análise , Fatores de Transcrição/metabolismo
11.
Genes Dev ; 34(13-14): 950-964, 2020 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-32499402

RESUMO

Hematopoietic stem cell (HSC) ontogeny is accompanied by dynamic changes in gene regulatory networks. We performed RNA-seq and histone mark ChIP-seq to define the transcriptomes and epigenomes of cells representing key developmental stages of HSC ontogeny in mice. The five populations analyzed were embryonic day 10.5 (E10.5) endothelium and hemogenic endothelium from the major arteries, an enriched population of prehematopoietic stem cells (pre-HSCs), fetal liver HSCs, and adult bone marrow HSCs. Using epigenetic signatures, we identified enhancers for each developmental stage. Only 12% of enhancers are primed, and 78% are active, suggesting the vast majority of enhancers are established de novo without prior priming in earlier stages. We constructed developmental stage-specific transcriptional regulatory networks by linking enhancers and predicted bound transcription factors to their target promoters using a novel computational algorithm, target inference via physical connection (TIPC). TIPC predicted known transcriptional regulators for the endothelial-to-hematopoietic transition, validating our overall approach, and identified putative novel transcription factors, including the broadly expressed transcription factors SP3 and MAZ. Finally, we validated a role for SP3 and MAZ in the formation of hemogenic endothelium. Our data and computational analyses provide a useful resource for uncovering regulators of HSC formation.


Assuntos
Regulação da Expressão Gênica no Desenvolvimento/genética , Redes Reguladoras de Genes/genética , Hematopoese/genética , Células-Tronco Hematopoéticas/citologia , Algoritmos , Animais , Proteínas de Ligação a DNA/metabolismo , Elementos Facilitadores Genéticos/genética , Epigênese Genética/genética , Edição de Genes , Camundongos , Fator de Transcrição Sp3/metabolismo , Fatores de Transcrição/metabolismo , Transcriptoma
12.
Development ; 151(13)2024 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-38958075

RESUMO

Development is regulated by coordinated changes in gene expression. Control of these changes in expression is largely governed by the binding of transcription factors to specific regulatory elements. However, the packaging of DNA into chromatin prevents the binding of many transcription factors. Pioneer factors overcome this barrier owing to unique properties that enable them to bind closed chromatin, promote accessibility and, in so doing, mediate binding of additional factors that activate gene expression. Because of these properties, pioneer factors act at the top of gene-regulatory networks and drive developmental transitions. Despite the ability to bind target motifs in closed chromatin, pioneer factors have cell type-specific chromatin occupancy and activity. Thus, developmental context clearly shapes pioneer-factor function. Here, we discuss this reciprocal interplay between pioneer factors and development: how pioneer factors control changes in cell fate and how cellular environment influences pioneer-factor binding and activity.


Assuntos
Cromatina , Regulação da Expressão Gênica no Desenvolvimento , Fatores de Transcrição , Animais , Fatores de Transcrição/metabolismo , Fatores de Transcrição/genética , Cromatina/metabolismo , Humanos , Redes Reguladoras de Genes , Ligação Proteica
13.
Proc Natl Acad Sci U S A ; 121(16): e2313440121, 2024 Apr 16.
Artigo em Inglês | MEDLINE | ID: mdl-38578985

RESUMO

Developmental phenotypic changes can evolve under selection imposed by age- and size-related ecological differences. Many of these changes occur through programmed alterations to gene expression patterns, but the molecular mechanisms and gene-regulatory networks underlying these adaptive changes remain poorly understood. Many venomous snakes, including the eastern diamondback rattlesnake (Crotalus adamanteus), undergo correlated changes in diet and venom expression as snakes grow larger with age, providing models for identifying mechanisms of timed expression changes that underlie adaptive life history traits. By combining a highly contiguous, chromosome-level genome assembly with measures of expression, chromatin accessibility, and histone modifications, we identified cis-regulatory elements and trans-regulatory factors controlling venom ontogeny in the venom glands of C. adamanteus. Ontogenetic expression changes were significantly correlated with epigenomic changes within genes, immediately adjacent to genes (e.g., promoters), and more distant from genes (e.g., enhancers). We identified 37 candidate transcription factors (TFs), with the vast majority being up-regulated in adults. The ontogenetic change is largely driven by an increase in the expression of TFs associated with growth signaling, transcriptional activation, and circadian rhythm/biological timing systems in adults with corresponding epigenomic changes near the differentially expressed venom genes. However, both expression activation and repression contributed to the composition of both adult and juvenile venoms, demonstrating the complexity and potential evolvability of gene regulation for this trait. Overall, given that age-based trait variation is common across the tree of life, we provide a framework for understanding gene-regulatory-network-driven life-history evolution more broadly.


Assuntos
Venenos de Crotalídeos , Serpentes Peçonhentas , Animais , Venenos de Crotalídeos/genética , Venenos de Crotalídeos/metabolismo , Epigenômica , Crotalus/genética , Crotalus/metabolismo
14.
Proc Natl Acad Sci U S A ; 121(18): e2322751121, 2024 Apr 30.
Artigo em Inglês | MEDLINE | ID: mdl-38652750

RESUMO

Organ-specific gene expression datasets that include hundreds to thousands of experiments allow the reconstruction of organ-level gene regulatory networks (GRNs). However, creating such datasets is greatly hampered by the requirements of extensive and tedious manual curation. Here, we trained a supervised classification model that can accurately classify the organ-of-origin for a plant transcriptome. This K-Nearest Neighbor-based multiclass classifier was used to create organ-specific gene expression datasets for the leaf, root, shoot, flower, and seed in Arabidopsis thaliana. A GRN inference approach was used to determine the: i. influential transcription factors (TFs) in each organ and, ii. most influential TFs for specific biological processes in that organ. These genome-wide, organ-delimited GRNs (OD-GRNs), recalled many known regulators of organ development and processes operating in those organs. Importantly, many previously unknown TF regulators were uncovered as potential regulators of these processes. As a proof-of-concept, we focused on experimentally validating the predicted TF regulators of lipid biosynthesis in seeds, an important food and biofuel trait. Of the top 20 predicted TFs, eight are known regulators of seed oil content, e.g., WRI1, LEC1, FUS3. Importantly, we validated our prediction of MybS2, TGA4, SPL12, AGL18, and DiV2 as regulators of seed lipid biosynthesis. We elucidated the molecular mechanism of MybS2 and show that it induces purple acid phosphatase family genes and lipid synthesis genes to enhance seed lipid content. This general approach has the potential to be extended to any species with sufficiently large gene expression datasets to find unique regulators of any trait-of-interest.


Assuntos
Arabidopsis , Regulação da Expressão Gênica de Plantas , Redes Reguladoras de Genes , Fatores de Transcrição , Arabidopsis/genética , Arabidopsis/metabolismo , Fatores de Transcrição/metabolismo , Fatores de Transcrição/genética , Proteínas de Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Especificidade de Órgãos/genética , Transcriptoma/genética , Sementes/genética , Sementes/metabolismo , Perfilação da Expressão Gênica/métodos
15.
Trends Genet ; 39(7): 528-530, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-37024335

RESUMO

Marine larvae have factored heavily in pursuits to understand the origin and evolution of animal life cycles. Recent comparisons of gene expression and chromatin state in different species of sea urchin and annelid show how evolutionary changes in embryonic gene regulation can lead to markedly different larval forms.


Assuntos
Estágios do Ciclo de Vida , Ouriços-do-Mar , Animais , Larva/genética , Estágios do Ciclo de Vida/genética , Ouriços-do-Mar/genética
16.
Development ; 150(22)2023 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-37902109

RESUMO

Multinucleated cells, or syncytia, are found in diverse taxa. Their biological function is often associated with the compartmentalization of biochemical or cellular activities within the syncytium. How such compartments are generated and maintained is poorly understood. The sea urchin embryonic skeleton is secreted by a syncytium, and local patterns of skeletal growth are associated with distinct sub-domains of gene expression within the syncytium. For such molecular compartments to be maintained and to control local patterns of skeletal growth: (1) the mobility of TFs must be restricted to produce stable differences in the transcriptional states of nuclei within the syncytium; and (2) the mobility of biomineralization proteins must also be restricted to produce regional differences in skeletal growth. To test these predictions, we expressed fluorescently tagged forms of transcription factors and biomineralization proteins in sub-domains of the skeletogenic syncytium. We found that both classes of proteins have restricted mobility within the syncytium and identified motifs that limit their mobility. Our findings have general implications for understanding the functional and molecular compartmentalization of syncytia.


Assuntos
Ouriços-do-Mar , Fatores de Transcrição , Animais , Fatores de Transcrição/metabolismo , Células Gigantes/metabolismo , Mesoderma/metabolismo , Regulação da Expressão Gênica no Desenvolvimento
17.
Brief Bioinform ; 25(5)2024 Jul 25.
Artigo em Inglês | MEDLINE | ID: mdl-39082651

RESUMO

Constructing accurate gene regulatory network s (GRNs), which reflect the dynamic governing process between genes, is critical to understanding the diverse cellular process and unveiling the complexities in biological systems. With the development of computer sciences, computational-based approaches have been applied to the GRNs inference task. However, current methodologies face challenges in effectively utilizing existing topological information and prior knowledge of gene regulatory relationships, hindering the comprehensive understanding and accurate reconstruction of GRNs. In response, we propose a novel graph neural network (GNN)-based Multi-Task Learning framework for GRN reconstruction, namely MTLGRN. Specifically, we first encode the gene promoter sequences and the gene biological features and concatenate the corresponding feature representations. Then, we construct a multi-task learning framework including GRN reconstruction, Gene knockout predict, and Gene expression matrix reconstruction. With joint training, MTLGRN can optimize the gene latent representations by integrating gene knockout information, promoter characteristics, and other biological attributes. Extensive experimental results demonstrate superior performance compared with state-of-the-art baselines on the GRN reconstruction task, efficiently leveraging biological knowledge and comprehensively understanding the gene regulatory relationships. MTLGRN also pioneered attempts to simulate gene knockouts on bulk data by incorporating gene knockout information.


Assuntos
Biologia Computacional , Redes Reguladoras de Genes , Biologia Computacional/métodos , Técnicas de Inativação de Genes , Redes Neurais de Computação , Humanos , Regiões Promotoras Genéticas , Algoritmos
18.
Brief Bioinform ; 25(4)2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38886006

RESUMO

Reconstructing the topology of gene regulatory network from gene expression data has been extensively studied. With the abundance functional transcriptomic data available, it is now feasible to systematically decipher regulatory interaction dynamics in a logic form such as a Boolean network (BN) framework, which qualitatively indicates how multiple regulators aggregated to affect a common target gene. However, inferring both the network topology and gene interaction dynamics simultaneously is still a challenging problem since gene expression data are typically noisy and data discretization is prone to information loss. We propose a new method for BN inference from time-series transcriptional profiles, called LogicGep. LogicGep formulates the identification of Boolean functions as a symbolic regression problem that learns the Boolean function expression and solve it efficiently through multi-objective optimization using an improved gene expression programming algorithm. To avoid overly emphasizing dynamic characteristics at the expense of topology structure ones, as traditional methods often do, a set of promising Boolean formulas for each target gene is evolved firstly, and a feed-forward neural network trained with continuous expression data is subsequently employed to pick out the final solution. We validated the efficacy of LogicGep using multiple datasets including both synthetic and real-world experimental data. The results elucidate that LogicGep adeptly infers accurate BN models, outperforming other representative BN inference algorithms in both network topology reconstruction and the identification of Boolean functions. Moreover, the execution of LogicGep is hundreds of times faster than other methods, especially in the case of large network inference.


Assuntos
Algoritmos , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Perfilação da Expressão Gênica/métodos , Humanos , Transcriptoma , Software , Biologia Computacional/métodos , Redes Neurais de Computação
19.
Brief Bioinform ; 25(4)2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38935070

RESUMO

Inferring gene regulatory network (GRN) is one of the important challenges in systems biology, and many outstanding computational methods have been proposed; however there remains some challenges especially in real datasets. In this study, we propose Directed Graph Convolutional neural network-based method for GRN inference (DGCGRN). To better understand and process the directed graph structure data of GRN, a directed graph convolutional neural network is conducted which retains the structural information of the directed graph while also making full use of neighbor node features. The local augmentation strategy is adopted in graph neural network to solve the problem of poor prediction accuracy caused by a large number of low-degree nodes in GRN. In addition, for real data such as E.coli, sequence features are obtained by extracting hidden features using Bi-GRU and calculating the statistical physicochemical characteristics of gene sequence. At the training stage, a dynamic update strategy is used to convert the obtained edge prediction scores into edge weights to guide the subsequent training process of the model. The results on synthetic benchmark datasets and real datasets show that the prediction performance of DGCGRN is significantly better than existing models. Furthermore, the case studies on bladder uroepithelial carcinoma and lung cancer cells also illustrate the performance of the proposed model.


Assuntos
Biologia Computacional , Redes Reguladoras de Genes , Redes Neurais de Computação , Humanos , Biologia Computacional/métodos , Algoritmos , Neoplasias da Bexiga Urinária/genética , Neoplasias da Bexiga Urinária/patologia , Escherichia coli/genética
20.
Brief Bioinform ; 25(3)2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38581416

RESUMO

The inference of gene regulatory networks (GRNs) from gene expression profiles has been a key issue in systems biology, prompting many researchers to develop diverse computational methods. However, most of these methods do not reconstruct directed GRNs with regulatory types because of the lack of benchmark datasets or defects in the computational methods. Here, we collect benchmark datasets and propose a deep learning-based model, DeepFGRN, for reconstructing fine gene regulatory networks (FGRNs) with both regulation types and directions. In addition, the GRNs of real species are always large graphs with direction and high sparsity, which impede the advancement of GRN inference. Therefore, DeepFGRN builds a node bidirectional representation module to capture the directed graph embedding representation of the GRN. Specifically, the source and target generators are designed to learn the low-dimensional dense embedding of the source and target neighbors of a gene, respectively. An adversarial learning strategy is applied to iteratively learn the real neighbors of each gene. In addition, because the expression profiles of genes with regulatory associations are correlative, a correlation analysis module is designed. Specifically, this module not only fully extracts gene expression features, but also captures the correlation between regulators and target genes. Experimental results show that DeepFGRN has a competitive capability for both GRN and FGRN inference. Potential biomarkers and therapeutic drugs for breast cancer, liver cancer, lung cancer and coronavirus disease 2019 are identified based on the candidate FGRNs, providing a possible opportunity to advance our knowledge of disease treatments.


Assuntos
Redes Reguladoras de Genes , Neoplasias Hepáticas , Humanos , Biologia de Sistemas/métodos , Transcriptoma , Algoritmos , Biologia Computacional/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA