RESUMO
The challenges in recapitulating in vivo human T cell development in laboratory models have posed a barrier to understanding human thymopoiesis. Here, we used single-cell RNA sequencing (sRNA-seq) to interrogate the rare CD34+ progenitor and the more differentiated CD34- fractions in the human postnatal thymus. CD34+ thymic progenitors were comprised of a spectrum of specification and commitment states characterized by multilineage priming followed by gradual T cell commitment. The earliest progenitors in the differentiation trajectory were CD7- and expressed a stem-cell-like transcriptional profile, but had also initiated T cell priming. Clustering analysis identified a CD34+ subpopulation primed for the plasmacytoid dendritic lineage, suggesting an intrathymic dendritic specification pathway. CD2 expression defined T cell commitment stages where loss of B cell potential preceded that of myeloid potential. These datasets delineate gene expression profiles spanning key differentiation events in human thymopoiesis and provide a resource for the further study of human T cell development.
Assuntos
Diferenciação Celular/genética , Linhagem da Célula/genética , Linfopoese/genética , Linfócitos T/metabolismo , Timócitos/metabolismo , Animais , Biomarcadores , Biologia Computacional , Perfilação da Expressão Gênica , Regulação da Expressão Gênica no Desenvolvimento , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Imunofenotipagem , Camundongos , Análise de Célula Única , Linfócitos T/citologia , Timócitos/citologia , TranscriptomaRESUMO
Cooperative interactions in protein-protein interfaces demonstrate the interdependency or the linked network-like behavior and their effect on the coupling of proteins. Cooperative interactions also could cause ripple or allosteric effects at a distance in protein-protein interfaces. Although they are critically important in protein-protein interfaces, it is challenging to determine which amino acid pair interactions are cooperative. In this work, we have used Bayesian network modeling, an interpretable machine learning method, combined with molecular dynamics trajectories to identify the residue pairs that show high cooperativity and their allosteric effect in the interface of G protein-coupled receptor (GPCR) complexes with Gα subunits. Our results reveal six GPCR:Gα contacts that are common to the different Gα subtypes and show strong cooperativity in the formation of interface. Both the C terminus helix5 and the core of the G protein are codependent entities and play an important role in GPCR coupling. We show that a promiscuous GPCR coupling to different Gα subtypes, makes all the GPCR:Gα contacts that are specific to each Gα subtype (Gαs, Gαi, and Gαq). This work underscores the potential of data-driven Bayesian network modeling in elucidating the intricate dependencies and selectivity determinants in GPCR:G protein complexes, offering valuable insights into the dynamic nature of these essential cellular signaling components.
Assuntos
Teorema de Bayes , Receptores Acoplados a Proteínas G , Receptores Acoplados a Proteínas G/metabolismo , Receptores Acoplados a Proteínas G/química , Humanos , Simulação de Dinâmica Molecular , Ligação Proteica , Subunidades alfa de Proteínas de Ligação ao GTP/metabolismo , Subunidades alfa de Proteínas de Ligação ao GTP/química , Subunidades alfa de Proteínas de Ligação ao GTP/genéticaRESUMO
MiR-126 and miR-155 are key microRNAs (miRNAs) that regulate, respectively, hematopoietic cell quiescence and proliferation. Herein we showed that in acute myeloid leukemia (AML), the biogenesis of these two miRNAs is interconnected through a network of regulatory loops driven by the FMS-like tyrosine kinase 3-internal tandem duplication (FLT3-ITD). In fact, FLT3-ITD induces the expression of miR-155 through a noncanonical mechanism of miRNA biogenesis that implicates cytoplasmic Drosha ribonuclease III (DROSHA). In turn, miR-155 down-regulates SH2-containing inositol phosphatase 1 (SHIP1), thereby increasing phosphor-protein kinase B (AKT) that in turn serine-phosphorylates, stabilizes, and activates Sprouty related EVH1 domain containing 1 (SPRED1). Activated SPRED1 inhibits the RAN/XPO5 complex and blocks the nucleus-to-cytoplasm transport of pre-miR-126, which cannot then complete the last steps of biogenesis. The net result is aberrantly low levels of mature miR-126 that allow quiescent leukemia blasts to be recruited into the cell cycle and proliferate. Thus, miR-126 down-regulation in proliferating AML blasts is downstream of FLT3-ITDdependent miR-155 expression that initiates a complex circuit of concatenated regulatory feedback (i.e., miR-126/SPRED1, miR-155/human dead-box protein 3 [DDX3X]) and feed-forward (i.e., miR-155/SHIP1/AKT/miR-126) regulatory loops that eventually converge into an output signal for leukemic growth.
Assuntos
Leucemia Mieloide Aguda , MicroRNAs , Tirosina Quinase 3 Semelhante a fms , RNA Helicases DEAD-box/metabolismo , Regulação para Baixo , Humanos , Leucemia Mieloide Aguda/genética , Leucemia Mieloide Aguda/metabolismo , MicroRNAs/metabolismo , Mutação , Tirosina Quinase 3 Semelhante a fms/genética , Tirosina Quinase 3 Semelhante a fms/metabolismoRESUMO
Cyclin-dependent kinase 12 (CDK12) is a critical regulatory protein involved in transcription and DNA repair processes. Dysregulation of CDK12 has been implicated in various diseases, including cancer. Understanding the CDK12 interactome is pivotal for elucidating its functional roles and potential therapeutic targets. Traditional methods for interactome prediction often rely on protein structure information, limiting applicability to CDK12 characterized by partly disordered terminal C region. In this study, we present a structure-independent machine-learning model that utilizes proteins' sequence and functional data to predict the CDK12 interactome. This approach is motivated by the disordered character of the CDK12 C-terminal region mitigating a structure-driven search for binding partners. Our approach incorporates multiple data sources, including protein-protein interaction networks, functional annotations, and sequence-based features, to construct a comprehensive CDK12 interactome prediction model. The ability to predict CDK12 interactions without relying on structural information is a significant advancement, as many potential interaction partners may lack crystallographic data. In conclusion, our structure-independent machine-learning model presents a powerful tool for predicting the CDK12 interactome and holds promise in advancing our understanding of CDK12 biology, identifying potential therapeutic targets, and facilitating precision-medicine approaches for CDK12-associated diseases.
Assuntos
Quinases Ciclina-Dependentes , Aprendizado de Máquina , Quinases Ciclina-Dependentes/metabolismo , Quinases Ciclina-Dependentes/química , Ligação Proteica , Humanos , Mapas de Interação de ProteínasRESUMO
Olfactory receptors (ORs) belong to class A G-protein coupled receptors (GPCRs) and are activated by a variety of odorants. To date, there is no three-dimensional structure of an OR available. One of the major bottlenecks in obtaining purified protein for structural studies of ORs is their poor expression in heterologous cells. To design mutants that enhance expression and thereby enable protein purification, we first identified computable physical properties that recapitulate OR and class A GPCR expression and further conducted an iterative computational prediction-experimental test cycle and generated human OR mutants that express as high as biogenic amine receptors for which structures have been solved. In the process of developing the computational method to recapitulate the expression of ORs in membranes, we identified properties, such as amino acid sequence coevolution, and the strength of the interactions between intracellular loop 1 (ICL1) and the helix 8 region of ORs, to enhance their heterologous expression. We identified mutations that are directly located in these regions as well as other mutations not located in these regions but allosterically strengthen the ICL1-helix 8 enhance expression. These mutants also showed functional responses to known odorants. This method to enhance heterologous expression of mammalian ORs will facilitate high-throughput "deorphanization" of ORs, and enable OR purification for biochemical and structural studies to understand odorant-OR interactions.
Assuntos
Receptores Odorantes , Sequência de Aminoácidos , Animais , Humanos , Mamíferos/metabolismo , Odorantes , Receptores Acoplados a Proteínas G , Receptores Odorantes/química , Receptores Odorantes/genética , Receptores Odorantes/metabolismoRESUMO
The role of plasticity and epigenetics in shaping cancer evolution and response to therapy has taken center stage with recent technological advances including single cell sequencing. This roadmap article is focused on state-of-the-art mathematical and experimental approaches to interrogate plasticity in cancer, and addresses the following themes and questions: is there a formal overarching framework that encompasses both non-genetic plasticity and mutation-driven somatic evolution? How do we measure and model the role of the microenvironment in influencing/controlling non-genetic plasticity? How can we experimentally study non-genetic plasticity? Which mathematical techniques are required or best suited? What are the clinical and practical applications and implications of these concepts?
Assuntos
Epigênese Genética , Neoplasias , Epigenômica , Humanos , Mutação , Neoplasias/tratamento farmacológico , Neoplasias/genética , Microambiente TumoralRESUMO
Cellular mosaicism due to monoallelic autosomal expression (MAE), with cell selection during development, is becoming increasingly recognized as prevalent in mammals, leading to interest in understanding its extent and mechanism(s). We report here use of clonal cell lines derived from the CNS of adult female [Formula: see text] hybrid (C57BL/6 X JF1) mice to characterize MAE as neural stem cells (nscs) differentiate to astrocyte-like cells (asls). We found that different subsets of genes show MAE in the two populations of cells; in each case, there is strong enrichment for genes specific to the respective developmental state. Genes that exhibit MAE are 22% of nsc-specific genes and 26% of asl-specific genes. Moreover, the promoters of genes with MAE have reduced CpG dinucleotides but increased CpG differences between the two parental mouse strains. Extending the study of variability to wild populations of mice, we found evidence for balancing selection as a contributing force in evolution of those genes showing developmental specificity (i.e., expressed in either nsc or asl), not just for genes showing MAE. Furthermore, we found that genes showing skewed allelic expression (SKE) were similarly enriched among cell type-specific genes and also showed a heightened probability of balancing selection. Thus, developmental stage-specific genes and genes with MAE or SKE seem to make up overlapping classes subject to selection for increased diversity. The implications of these results for development and evolution are discussed in the context of a model with stochastic epigenetic modifications taking place only during a relatively brief developmental window.
Assuntos
Sistema Nervoso Central/fisiologia , Regulação da Expressão Gênica no Desenvolvimento/genética , Genes Controladores do Desenvolvimento/genética , Seleção Genética/genética , Alelos , Animais , Astrócitos/fisiologia , Diferenciação Celular/genética , Feminino , Camundongos , Camundongos Endogâmicos C57BL , Células-Tronco Neurais/fisiologia , Regiões Promotoras Genéticas/genéticaRESUMO
Long-range intrachromosomal interactions play an important role in 3D chromosome structure and function, but our understanding of how various factors contribute to the strength of these interactions remains poor. In this study we used a recently developed analysis framework for Bayesian network (BN) modeling to analyze publicly available datasets for intrachromosomal interactions. We investigated how 106 variables affect the pairwise interactions of over 10 million 5-kb DNA segments in the B-lymphocyte cell line GB12878. Strictly data-driven BN modeling indicates that the strength of intrachromosomal interactions (hic_strength) is directly influenced by only four types of factors: distance between segments, Rad21 or SMC3 (cohesin components),transcription at transcription start sites (TSS), and the number of CCCTC-binding factor (CTCF)-cohesin complexes between the interacting DNA segments. Subsequent studies confirmed that most high-intensity interactions have a CTCF-cohesin complex in at least one of the interacting segments. However, 46% have CTCF on only one side, and 32% are without CTCF. As expected, high-intensity interactions are strongly dependent on the orientation of the ctcf motif, and, moreover, we find that the interaction between enhancers and promoters is similarly dependent on ctcf motif orientation. Dependency relationships between transcription factors were also revealed, including known lineage-determining B-cell transcription factors (e.g., Ebf1) as well as potential novel relationships. Thus, BN analysis of large intrachromosomal interaction datasets is a useful tool for gaining insight into DNA-DNA, protein-DNA, and protein-protein interactions.
Assuntos
Teorema de Bayes , Cromatina/metabolismo , DNA/metabolismo , Modelos Moleculares , Linfócitos B , Sítios de Ligação , Proteínas de Ciclo Celular/metabolismo , Linhagem Celular , Proteoglicanas de Sulfatos de Condroitina/metabolismo , Cromatina/química , Proteínas Cromossômicas não Histona/metabolismo , Biologia Computacional , DNA/química , Proteínas de Ligação a DNA/metabolismo , Conjuntos de Dados como Assunto , Humanos , Conformação Molecular , Proteínas Nucleares/metabolismo , Motivos de Nucleotídeos , Fosfoproteínas/metabolismo , Regiões Promotoras Genéticas , Mapeamento de Interação de Proteínas/métodos , Software , Fatores de Transcrição/metabolismo , Sítio de Iniciação de Transcrição , Transcrição GênicaRESUMO
Recent developments in sequencing and growth of bioinformatics resources provide us with vast depositories of protein network and single nucleotide polymorphism data. It allows us to re-examine, on a larger and more comprehensive scale, the relationship between protein-protein interactions and protein variability and evolutionary rates. This relationship has remained far from unambiguously resolved for quite a long time, reflecting shifting analysis approaches in the literature, and growing data availability. In this study, we utilized several public genomic databases to investigate this relationship in human, mouse, pig, chicken, and zebrafish. We observed strong non-linear relationship patterns (tending towards convex decreasing function shapes) between protein variability and the density of corresponding protein-protein interactions across all five species. To investigate further, we carried out stochastic simulations, modeling the interplay between protein connectivity and variability. Our results indicate that a simple negative linear correlation model, often suggested (or tacitly assumed) in the literature, as either a null or an alternative hypothesis, is not a good fit with the observed data. After considering different (but still relatively simple, and not overfitting) simulation models, we found that a convex decreasing protein variability-connectivity function (specifically, exponential decay) led to a much better fit with the real data. We conclude that simple correlation models might be inadequate for describing protein variability-connectivity interplay in vertebrates; they often tend towards false negatives (showing no more than marginal linear or rank correlation where there are in fact strong non-random patterns).
Assuntos
Evolução Molecular , Modelos Estatísticos , Processos Estocásticos , Vertebrados/genética , Animais , Biologia Computacional/métodos , Simulação por Computador , Bases de Dados de Proteínas , Humanos , Domínios e Motivos de Interação entre Proteínas/fisiologiaRESUMO
Evolution by gene duplication is generally accepted as one of the crucial driving forces for the gain of new complexity and functions, but the formation of pseudogenes remains a problem for this mechanism. Here we expand on earlier ideas that epigenetic modifications can drive neo- and subfunctionalization in evolution by gene duplication. We explore the effects of stochastic epigenetic modifications on the evolution (and thus development) of complex organisms in a constant environment. Modeling is done both using a modified genetic drift analytical treatment and computer simulations, which were found to agree. A transposon silencing model is also explored. Some key assumptions made include (i) stochastic, incomplete removal (or addition) of repressive epigenetic marks takes place during a window(s) of opportunity in the zygote and early embryo; (ii) there is no statistical variation of the marks after the window closes; and (iii) the genes affected are sensitive to dosage. Our genetic drift treatment takes into account that after gene duplication the prevailing case upon which selection operates is a duplicate/singlet heterozygote; to the best of our knowledge, this has not been considered in previous treatments. We conclude from our modeling that stochastic epigenetic modifications, with rates consistent with experimental observation, can both increase the rate of gene fixation and decrease pseudogenization, thus dramatically improving the efficacy of evolution by gene duplication. We also find that a transposon silencing model is advantageous for fixation of recessive genes in diploid organisms, especially with large effective population sizes.
Assuntos
Evolução Biológica , Embrião de Mamíferos/metabolismo , Epigênese Genética , Animais , Simulação por Computador , Elementos de DNA Transponíveis/genética , Difusão , Duplicação Gênica , Loci Gênicos , Genótipo , Camundongos , Modelos Genéticos , Fenótipo , Pseudogenes , Processos EstocásticosRESUMO
While there are currently over 40 replicated genes with mapped risk alleles for Late Onset Alzheimer's disease (LOAD), the Apolipoprotein E locus E4 haplotype is still the biggest driver of risk, with odds ratios for neuropathologically confirmed E44 carriers exceeding 30 (95% confidence interval 16.59-58.75). We sought to address whether the APOE E4 haplotype modifies expression globally through networks of expression to increase LOAD risk. We have used the Human Brainome data to build expression networks comparing APOE E4 carriers to non-carriers using scalable mixed-datatypes Bayesian network (BN) modeling. We have found that VGF had the greatest explanatory weight. High expression of VGF is a protective signal, even on the background of APOE E4 alleles. LOAD risk signals, considering an APOE background, include high levels of SPECC1L, HLA-DRA and RANBP3L. Our findings nominate several new transcripts, taking a combined approach to network building including known LOAD risk loci.
Assuntos
Doença de Alzheimer , Apolipoproteína E4 , Predisposição Genética para Doença , Idoso , Idoso de 80 Anos ou mais , Feminino , Humanos , Masculino , Proteínas Adaptadoras de Transdução de Sinal/genética , Proteínas Adaptadoras de Transdução de Sinal/metabolismo , Alelos , Doença de Alzheimer/genética , Doença de Alzheimer/metabolismo , Apolipoproteína E4/genética , Teorema de Bayes , Haplótipos , Cadeias alfa de HLA-DR/genética , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , Fatores de RiscoRESUMO
Acute myeloid leukemia (AML) is prevalent in both adult and pediatric patients. Despite advances in patient categorization, the heterogeneity of AML remains a challenge. Recent studies have explored the use of gene expression data to enhance AML diagnosis and prognosis, however, alternative approaches rooted in physics and chemistry may provide another level of insight into AML transformation. Utilizing publicly available databases, we analyze 884 human and mouse blood and bone marrow samples. We employ a personalized medicine strategy, combining state-transition theory and surprisal analysis, to assess the RNA transcriptome of individual patients. The transcriptome is transformed into physical parameters that represent each sample's steady state and the free energy change (FEC) from that steady state, which is the state with the lowest free energy.We found the transcriptome steady state was invariant across normal and AML samples. FEC, representing active molecular processes, varied significantly between samples and was used to create patient-specific barcodes to characterize the biology of the disease. We discovered that AML samples that were in a transition state had the highest FEC. This disease state may be characterized as the most unstable and hence the most therapeutically targetable since a change in free energy is a thermodynamic requirement for disease progression. We also found that distinct sets of ongoing processes may be at the root of otherwise similar clinical phenotypes, implying that our integrated analysis of transcriptome profiles may facilitate a personalized medicine approach to cure AML and restore a steady state in each patient.
Assuntos
Leucemia Mieloide Aguda , Transcriptoma , Adulto , Animais , Camundongos , Humanos , Criança , Transcriptoma/genética , Perfilação da Expressão Gênica , Leucemia Mieloide Aguda/genética , Biomarcadores Tumorais/genética , FenótipoRESUMO
Enhancers are fundamental to gene regulation. Post-translational modifications by the small ubiquitin-like modifiers (SUMO) modify chromatin regulation enzymes, including histone acetylases and deacetylases. However, it remains unclear whether SUMOylation regulates enhancer marks, acetylation at the 27th lysine residue of the histone H3 protein (H3K27Ac). To investigate whether SUMOylation regulates H3K27Ac, we performed genome-wide ChIP-seq analyses and discovered that knockdown (KD) of the SUMO activating enzyme catalytic subunit UBA2 reduced H3K27Ac at most enhancers. Bioinformatic analysis revealed that TFAP2C-binding sites are enriched in enhancers whose H3K27Ac was reduced by UBA2 KD. ChIP-seq analysis in combination with molecular biological methods showed that TFAP2C binding to enhancers increased upon UBA2 KD or inhibition of SUMOylation by a small molecule SUMOylation inhibitor. However, this is not due to the SUMOylation of TFAP2C itself. Proteomics analysis of TFAP2C interactome on the chromatin identified histone deacetylation (HDAC) and RNA splicing machineries that contain many SUMOylation targets. TFAP2C KD reduced HDAC1 binding to chromatin and increased H3K27Ac marks at enhancer regions, suggesting that TFAP2C is important in recruiting HDAC machinery. Taken together, our findings provide insights into the regulation of enhancer marks by SUMOylation and TFAP2C and suggest that SUMOylation of proteins in the HDAC machinery regulates their recruitments to enhancers.
RESUMO
Chronic myeloid leukemia (CML) is initiated and maintained by BCR::ABL which is clinically targeted using tyrosine kinase inhibitors (TKIs). TKIs can induce long-term remission but are also not curative. Thus, CML is an ideal system to test our hypothesis that transcriptome-based state-transition models accurately predict cancer evolution and treatment response. We collected time-sequential blood samples from tetracycline-off (Tet-Off) BCR::ABL-inducible transgenic mice and wild-type controls. From the transcriptome, we constructed a CML state-space and a three-well leukemogenic potential landscape. The potential's stable critical points defined observable disease states. Early states were characterized by anti-CML genes opposing leukemia; late states were characterized by pro-CML genes. Genes with expression patterns shaped similarly to the potential landscape were identified as drivers of disease transition. Re-introduction of tetracycline to silence the BCR::ABL gene returned diseased mice transcriptomes to a near healthy state, without reaching it, suggesting parts of the transition are irreversible. TKI only reverted the transcriptome to an intermediate disease state, without approaching a state of health; disease relapse occurred soon after treatment. Using only the earliest time-point as initial conditions, our state-transition models accurately predicted both disease progression and treatment response, supporting this as a potentially valuable approach to time clinical intervention, before phenotypic changes become detectable.
Assuntos
Leucemia Mielogênica Crônica BCR-ABL Positiva , Transcriptoma , Camundongos , Animais , Proteínas de Fusão bcr-abl/metabolismo , Inibidores de Proteínas Quinases/farmacologia , Inibidores de Proteínas Quinases/uso terapêutico , Leucemia Mielogênica Crônica BCR-ABL Positiva/tratamento farmacológico , Leucemia Mielogênica Crônica BCR-ABL Positiva/genética , Tetraciclinas/uso terapêutico , Resistencia a Medicamentos AntineoplásicosRESUMO
Cytokines operate in concert to maintain immune homeostasis and coordinate immune responses. In cases of ER+ breast cancer, peripheral immune cells exhibit altered responses to several cytokines, and these alterations are correlated strongly with patient outcomes. To develop a systems-level understanding of this dysregulation, we measured a panel of cytokine responses and receptor abundances in the peripheral blood of healthy controls and ER+ breast cancer patients across immune cell types. Using tensor factorization to model this multidimensional data, we found that breast cancer patients exhibited widespread alterations in response, including drastically reduced response to IL-10 and heightened basal levels of pSmad2/3 and pSTAT4. ER+ patients also featured upregulation of PD-L1, IL6Rα, and IL2Rα, among other receptors. Despite this, alterations in response to cytokines were not explained by changes in receptor abundances. Thus, tensor factorization helped to reveal a coordinated reprogramming of the immune system that was consistent across our cohort.
Assuntos
Neoplasias da Mama , Citocinas , Transdução de Sinais , Humanos , Neoplasias da Mama/imunologia , Feminino , Citocinas/sangue , Citocinas/metabolismo , Receptores de Estrogênio/metabolismo , Pessoa de Meia-Idade , Biologia de Sistemas/métodosRESUMO
CpG dinucleotides contribute to epigenetic mechanisms by being the only site for DNA methylation in mammalian somatic cells. They are also mutation hotspots and approximately 5-fold depleted genome-wide. We report here a study focused on CpG sites in the coding regions of Hox and other transcription factor genes, comparing methylated genomes of Homo sapiens, Mus musculus, and Danio rerio with nonmethylated genomes of Drosophila melanogaster and Caenorhabditis elegans. We analyzed 4-fold degenerate, synonymous codons with the potential for CpG. That is, we studied "silent" changes that do not affect protein products but could damage epigenetic marking. We find that DNA-binding transcription factors and other developmentally relevant genes show, only in methylated genomes, a bimodal distribution of CpG usage. Several genetic code-based tests indicate, again for methylated genomes only, that the frequency of silent CpGs in Hox genes is much greater than expectation. Also informative are NCG-GNN and NCC-GNN codon doublets, for which an unusually high rate of G to C and C to G transversions was observed at the third (silent) position of the first codon. Together these results are interpreted as evidence for strong "pro-epigenetic" selection acting to preserve CpG sites in coding regions of many genes controlling development. We also report that DNA-binding transcription factors and developmentally important genes are dramatically overrepresented in or near clusters of three or more CpG islands, suggesting a possible relationship between evolutionary preservation of CpG dinucleotides in both coding regions and CpG islands.
Assuntos
Ilhas de CpG/genética , Metilação de DNA , Proteínas de Homeodomínio/genética , Fases de Leitura Aberta/genética , Fatores de Transcrição/genética , Algoritmos , Aminoácidos/genética , Animais , Caenorhabditis elegans/genética , Códon/genética , Proteínas de Ligação a DNA/genética , Drosophila melanogaster/genética , Epigênese Genética , Evolução Molecular , Éxons/genética , Genoma/genética , Camundongos , Modelos Genéticos , Mutação Puntual , Seleção Genética , Peixe-Zebra/genéticaRESUMO
While there are currently over 40 replicated genes with mapped risk alleles for Late Onset Alzheimer's disease (LOAD), the Apolipoprotein E locus E4 haplotype is still the biggest driver of risk, with odds ratios for neuropathologically confirmed E44 carriers exceeding 30 (95% confidence interval 16.59-58.75). We sought to address whether the APOE E4 haplotype modifies expression globally through networks of expression to increase LOAD risk. We have used the Human Brainome data to build expression networks comparing APOE E4 carriers to non-carriers using scalable mixed-datatypes Bayesian network (BN) modeling. We have found that VGF had the greatest explanatory weight. High expression of VGF is a protective signal, even on the background of APOE E4 alleles. LOAD risk signals, considering an APOE background, include high levels of SPECC1L, HLA-DRA and RANBP3L. Our findings nominate several new transcripts, taking a combined approach to network building including known LOAD risk loci.
RESUMO
Cooperative interactions in protein-protein interfaces demonstrate the interdependency or the linked network-like behavior of interface interactions and their effect on the coupling of proteins. Cooperative interactions also could cause ripple or allosteric effects at a distance in protein-protein interfaces. Although they are critically important in protein-protein interfaces it is challenging to determine which amino acid pair interactions are cooperative. In this work we have used Bayesian network modeling, an interpretable machine learning method, combined with molecular dynamics trajectories to identify the residue pairs that show high cooperativity and their allosteric effect in the interface of G protein-coupled receptor (GPCR) complexes with G proteins. Our results reveal a strong co-dependency in the formation of interface GPCR:G protein contacts. This observation indicates that cooperativity of GPCR:G protein interactions is necessary for the coupling and selectivity of G proteins and is thus critical for receptor function. We have identified subnetworks containing polar and hydrophobic interactions that are common among multiple GPCRs coupling to different G protein subtypes (Gs, Gi and Gq). These common subnetworks along with G protein-specific subnetworks together confer selectivity to the G protein coupling. This work underscores the potential of data-driven Bayesian network modeling in elucidating the intricate dependencies and selectivity determinants in GPCR:G protein complexes, offering valuable insights into the dynamic nature of these essential cellular signaling components.
RESUMO
Modern artificial neural networks (ANNs) have long been designed on foundations of mathematics as opposed to their original foundations of biomimicry. However, the structure and function of these modern ANNs are often analogous to real-life biological networks. We propose that the ubiquitous information-theoretic principles underlying the development of ANNs are similar to the principles guiding the macro-evolution of biological networks and that insights gained from one field can be applied to the other. We generate hypotheses on the bow-tie network structure of the Janus kinase - signal transducers and activators of transcription (JAK-STAT) pathway, additionally informed by the evolutionary considerations, and carry out ANN simulation experiments to demonstrate that an increase in the network's input and output complexity does not necessarily require a more complex intermediate layer. This observation should guide novel biomarker discovery-namely, to prioritize sections of the biological networks in which information is most compressed as opposed to biomarkers representing the periphery of the network.
RESUMO
Chronic myeloid leukemia (CML) is initiated and maintained by BCR::ABL which is clinically targeted using tyrosine kinase inhibitors (TKIs). TKIs can induce long-term remission but are also not curative. Thus, CML is an ideal system to test our hypothesis that transcriptome-based state-transition models accurately predict cancer evolution and treatment response. We collected time-sequential blood samples from tetracycline-off (Tet-Off) BCR::ABL-inducible transgenic mice and wild-type controls. From the transcriptome, we constructed a CML state-space and a three-well leukemogenic potential landscape. The potential's stable critical points defined observable disease states. Early states were characterized by anti-CML genes opposing leukemia; late states were characterized by pro-CML genes. Genes with expression patterns shaped similarly to the potential landscape were identified as drivers of disease transition. Re-introduction of tetracycline to silence the BCR::ABL gene returned diseased mice transcriptomes to a near healthy state, without reaching it, suggesting parts of the transition are irreversible. TKI only reverted the transcriptome to an intermediate disease state, without approaching a state of health; disease relapse occurred soon after treatment. Using only the earliest time-point as initial conditions, our state-transition models accurately predicted both disease progression and treatment response, supporting this as a potentially valuable approach to time clinical intervention even before phenotypic changes become detectable.