Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 42
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Immunity ; 52(6): 1105-1118.e9, 2020 06 16.
Artículo en Inglés | MEDLINE | ID: mdl-32553173

RESUMEN

The challenges in recapitulating in vivo human T cell development in laboratory models have posed a barrier to understanding human thymopoiesis. Here, we used single-cell RNA sequencing (sRNA-seq) to interrogate the rare CD34+ progenitor and the more differentiated CD34- fractions in the human postnatal thymus. CD34+ thymic progenitors were comprised of a spectrum of specification and commitment states characterized by multilineage priming followed by gradual T cell commitment. The earliest progenitors in the differentiation trajectory were CD7- and expressed a stem-cell-like transcriptional profile, but had also initiated T cell priming. Clustering analysis identified a CD34+ subpopulation primed for the plasmacytoid dendritic lineage, suggesting an intrathymic dendritic specification pathway. CD2 expression defined T cell commitment stages where loss of B cell potential preceded that of myeloid potential. These datasets delineate gene expression profiles spanning key differentiation events in human thymopoiesis and provide a resource for the further study of human T cell development.


Asunto(s)
Diferenciación Celular/genética , Linaje de la Célula/genética , Linfopoyesis/genética , Linfocitos T/metabolismo , Timocitos/metabolismo , Animales , Biomarcadores , Biología Computacional , Perfilación de la Expresión Génica , Regulación del Desarrollo de la Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Inmunofenotipificación , Ratones , Análisis de la Célula Individual , Linfocitos T/citología , Timocitos/citología , Transcriptoma
2.
J Biol Chem ; 300(6): 107362, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38735478

RESUMEN

Cooperative interactions in protein-protein interfaces demonstrate the interdependency or the linked network-like behavior and their effect on the coupling of proteins. Cooperative interactions also could cause ripple or allosteric effects at a distance in protein-protein interfaces. Although they are critically important in protein-protein interfaces, it is challenging to determine which amino acid pair interactions are cooperative. In this work, we have used Bayesian network modeling, an interpretable machine learning method, combined with molecular dynamics trajectories to identify the residue pairs that show high cooperativity and their allosteric effect in the interface of G protein-coupled receptor (GPCR) complexes with Gα subunits. Our results reveal six GPCR:Gα contacts that are common to the different Gα subtypes and show strong cooperativity in the formation of interface. Both the C terminus helix5 and the core of the G protein are codependent entities and play an important role in GPCR coupling. We show that a promiscuous GPCR coupling to different Gα subtypes, makes all the GPCR:Gα contacts that are specific to each Gα subtype (Gαs, Gαi, and Gαq). This work underscores the potential of data-driven Bayesian network modeling in elucidating the intricate dependencies and selectivity determinants in GPCR:G protein complexes, offering valuable insights into the dynamic nature of these essential cellular signaling components.


Asunto(s)
Teorema de Bayes , Receptores Acoplados a Proteínas G , Receptores Acoplados a Proteínas G/metabolismo , Receptores Acoplados a Proteínas G/química , Humanos , Simulación de Dinámica Molecular , Unión Proteica , Subunidades alfa de la Proteína de Unión al GTP/metabolismo , Subunidades alfa de la Proteína de Unión al GTP/química , Subunidades alfa de la Proteína de Unión al GTP/genética
3.
Proc Natl Acad Sci U S A ; 119(16): e2112482119, 2022 04 19.
Artículo en Inglés | MEDLINE | ID: mdl-35412895

RESUMEN

MiR-126 and miR-155 are key microRNAs (miRNAs) that regulate, respectively, hematopoietic cell quiescence and proliferation. Herein we showed that in acute myeloid leukemia (AML), the biogenesis of these two miRNAs is interconnected through a network of regulatory loops driven by the FMS-like tyrosine kinase 3-internal tandem duplication (FLT3-ITD). In fact, FLT3-ITD induces the expression of miR-155 through a noncanonical mechanism of miRNA biogenesis that implicates cytoplasmic Drosha ribonuclease III (DROSHA). In turn, miR-155 down-regulates SH2-containing inositol phosphatase 1 (SHIP1), thereby increasing phosphor-protein kinase B (AKT) that in turn serine-phosphorylates, stabilizes, and activates Sprouty related EVH1 domain containing 1 (SPRED1). Activated SPRED1 inhibits the RAN/XPO5 complex and blocks the nucleus-to-cytoplasm transport of pre-miR-126, which cannot then complete the last steps of biogenesis. The net result is aberrantly low levels of mature miR-126 that allow quiescent leukemia blasts to be recruited into the cell cycle and proliferate. Thus, miR-126 down-regulation in proliferating AML blasts is downstream of FLT3-ITD­dependent miR-155 expression that initiates a complex circuit of concatenated regulatory feedback (i.e., miR-126/SPRED1, miR-155/human dead-box protein 3 [DDX3X]) and feed-forward (i.e., miR-155/SHIP1/AKT/miR-126) regulatory loops that eventually converge into an output signal for leukemic growth.


Asunto(s)
Leucemia Mieloide Aguda , MicroARNs , Tirosina Quinasa 3 Similar a fms , ARN Helicasas DEAD-box/metabolismo , Regulación hacia Abajo , Humanos , Leucemia Mieloide Aguda/genética , Leucemia Mieloide Aguda/metabolismo , MicroARNs/metabolismo , Mutación , Tirosina Quinasa 3 Similar a fms/genética , Tirosina Quinasa 3 Similar a fms/metabolismo
4.
Biophys J ; 2024 May 18.
Artículo en Inglés | MEDLINE | ID: mdl-38762754

RESUMEN

Cyclin-dependent kinase 12 (CDK12) is a critical regulatory protein involved in transcription and DNA repair processes. Dysregulation of CDK12 has been implicated in various diseases, including cancer. Understanding the CDK12 interactome is pivotal for elucidating its functional roles and potential therapeutic targets. Traditional methods for interactome prediction often rely on protein structure information, limiting applicability to CDK12 characterized by partly disordered terminal C region. In this study, we present a structure-independent machine-learning model that utilizes proteins' sequence and functional data to predict the CDK12 interactome. This approach is motivated by the disordered character of the CDK12 C-terminal region mitigating a structure-driven search for binding partners. Our approach incorporates multiple data sources, including protein-protein interaction networks, functional annotations, and sequence-based features, to construct a comprehensive CDK12 interactome prediction model. The ability to predict CDK12 interactions without relying on structural information is a significant advancement, as many potential interaction partners may lack crystallographic data. In conclusion, our structure-independent machine-learning model presents a powerful tool for predicting the CDK12 interactome and holds promise in advancing our understanding of CDK12 biology, identifying potential therapeutic targets, and facilitating precision-medicine approaches for CDK12-associated diseases.

5.
Biophys J ; 121(5): 830-840, 2022 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-35065915

RESUMEN

Olfactory receptors (ORs) belong to class A G-protein coupled receptors (GPCRs) and are activated by a variety of odorants. To date, there is no three-dimensional structure of an OR available. One of the major bottlenecks in obtaining purified protein for structural studies of ORs is their poor expression in heterologous cells. To design mutants that enhance expression and thereby enable protein purification, we first identified computable physical properties that recapitulate OR and class A GPCR expression and further conducted an iterative computational prediction-experimental test cycle and generated human OR mutants that express as high as biogenic amine receptors for which structures have been solved. In the process of developing the computational method to recapitulate the expression of ORs in membranes, we identified properties, such as amino acid sequence coevolution, and the strength of the interactions between intracellular loop 1 (ICL1) and the helix 8 region of ORs, to enhance their heterologous expression. We identified mutations that are directly located in these regions as well as other mutations not located in these regions but allosterically strengthen the ICL1-helix 8 enhance expression. These mutants also showed functional responses to known odorants. This method to enhance heterologous expression of mammalian ORs will facilitate high-throughput "deorphanization" of ORs, and enable OR purification for biochemical and structural studies to understand odorant-OR interactions.


Asunto(s)
Receptores Odorantes , Secuencia de Aminoácidos , Animales , Humanos , Mamíferos/metabolismo , Odorantes , Receptores Acoplados a Proteínas G , Receptores Odorantes/química , Receptores Odorantes/genética , Receptores Odorantes/metabolismo
6.
Phys Biol ; 19(3)2022 04 18.
Artículo en Inglés | MEDLINE | ID: mdl-35078159

RESUMEN

The role of plasticity and epigenetics in shaping cancer evolution and response to therapy has taken center stage with recent technological advances including single cell sequencing. This roadmap article is focused on state-of-the-art mathematical and experimental approaches to interrogate plasticity in cancer, and addresses the following themes and questions: is there a formal overarching framework that encompasses both non-genetic plasticity and mutation-driven somatic evolution? How do we measure and model the role of the microenvironment in influencing/controlling non-genetic plasticity? How can we experimentally study non-genetic plasticity? Which mathematical techniques are required or best suited? What are the clinical and practical applications and implications of these concepts?


Asunto(s)
Epigénesis Genética , Neoplasias , Epigenómica , Humanos , Mutación , Neoplasias/tratamiento farmacológico , Neoplasias/genética , Microambiente Tumoral
7.
Proc Natl Acad Sci U S A ; 115(44): E10379-E10386, 2018 10 30.
Artículo en Inglés | MEDLINE | ID: mdl-30322913

RESUMEN

Cellular mosaicism due to monoallelic autosomal expression (MAE), with cell selection during development, is becoming increasingly recognized as prevalent in mammals, leading to interest in understanding its extent and mechanism(s). We report here use of clonal cell lines derived from the CNS of adult female [Formula: see text] hybrid (C57BL/6 X JF1) mice to characterize MAE as neural stem cells (nscs) differentiate to astrocyte-like cells (asls). We found that different subsets of genes show MAE in the two populations of cells; in each case, there is strong enrichment for genes specific to the respective developmental state. Genes that exhibit MAE are 22% of nsc-specific genes and 26% of asl-specific genes. Moreover, the promoters of genes with MAE have reduced CpG dinucleotides but increased CpG differences between the two parental mouse strains. Extending the study of variability to wild populations of mice, we found evidence for balancing selection as a contributing force in evolution of those genes showing developmental specificity (i.e., expressed in either nsc or asl), not just for genes showing MAE. Furthermore, we found that genes showing skewed allelic expression (SKE) were similarly enriched among cell type-specific genes and also showed a heightened probability of balancing selection. Thus, developmental stage-specific genes and genes with MAE or SKE seem to make up overlapping classes subject to selection for increased diversity. The implications of these results for development and evolution are discussed in the context of a model with stochastic epigenetic modifications taking place only during a relatively brief developmental window.


Asunto(s)
Sistema Nervioso Central/fisiología , Regulación del Desarrollo de la Expresión Génica/genética , Genes del Desarrollo/genética , Selección Genética/genética , Alelos , Animales , Astrocitos/fisiología , Diferenciación Celular/genética , Femenino , Ratones , Ratones Endogámicos C57BL , Células-Madre Neurales/fisiología , Regiones Promotoras Genéticas/genética
8.
Proc Natl Acad Sci U S A ; 114(48): E10359-E10368, 2017 11 28.
Artículo en Inglés | MEDLINE | ID: mdl-29133398

RESUMEN

Long-range intrachromosomal interactions play an important role in 3D chromosome structure and function, but our understanding of how various factors contribute to the strength of these interactions remains poor. In this study we used a recently developed analysis framework for Bayesian network (BN) modeling to analyze publicly available datasets for intrachromosomal interactions. We investigated how 106 variables affect the pairwise interactions of over 10 million 5-kb DNA segments in the B-lymphocyte cell line GB12878. Strictly data-driven BN modeling indicates that the strength of intrachromosomal interactions (hic_strength) is directly influenced by only four types of factors: distance between segments, Rad21 or SMC3 (cohesin components),transcription at transcription start sites (TSS), and the number of CCCTC-binding factor (CTCF)-cohesin complexes between the interacting DNA segments. Subsequent studies confirmed that most high-intensity interactions have a CTCF-cohesin complex in at least one of the interacting segments. However, 46% have CTCF on only one side, and 32% are without CTCF. As expected, high-intensity interactions are strongly dependent on the orientation of the ctcf motif, and, moreover, we find that the interaction between enhancers and promoters is similarly dependent on ctcf motif orientation. Dependency relationships between transcription factors were also revealed, including known lineage-determining B-cell transcription factors (e.g., Ebf1) as well as potential novel relationships. Thus, BN analysis of large intrachromosomal interaction datasets is a useful tool for gaining insight into DNA-DNA, protein-DNA, and protein-protein interactions.


Asunto(s)
Teorema de Bayes , Cromatina/metabolismo , ADN/metabolismo , Modelos Moleculares , Linfocitos B , Sitios de Unión , Proteínas de Ciclo Celular/metabolismo , Línea Celular , Proteoglicanos Tipo Condroitín Sulfato/metabolismo , Cromatina/química , Proteínas Cromosómicas no Histona/metabolismo , Biología Computacional , ADN/química , Proteínas de Unión al ADN/metabolismo , Conjuntos de Datos como Asunto , Humanos , Conformación Molecular , Proteínas Nucleares/metabolismo , Motivos de Nucleótidos , Fosfoproteínas/metabolismo , Regiones Promotoras Genéticas , Mapeo de Interacción de Proteínas/métodos , Programas Informáticos , Factores de Transcripción/metabolismo , Sitio de Iniciación de la Transcripción , Transcripción Genética
9.
J Mol Evol ; 87(4-6): 184-198, 2019 07.
Artículo en Inglés | MEDLINE | ID: mdl-31302723

RESUMEN

Recent developments in sequencing and growth of bioinformatics resources provide us with vast depositories of protein network and single nucleotide polymorphism data. It allows us to re-examine, on a larger and more comprehensive scale, the relationship between protein-protein interactions and protein variability and evolutionary rates. This relationship has remained far from unambiguously resolved for quite a long time, reflecting shifting analysis approaches in the literature, and growing data availability. In this study, we utilized several public genomic databases to investigate this relationship in human, mouse, pig, chicken, and zebrafish. We observed strong non-linear relationship patterns (tending towards convex decreasing function shapes) between protein variability and the density of corresponding protein-protein interactions across all five species. To investigate further, we carried out stochastic simulations, modeling the interplay between protein connectivity and variability. Our results indicate that a simple negative linear correlation model, often suggested (or tacitly assumed) in the literature, as either a null or an alternative hypothesis, is not a good fit with the observed data. After considering different (but still relatively simple, and not overfitting) simulation models, we found that a convex decreasing protein variability-connectivity function (specifically, exponential decay) led to a much better fit with the real data. We conclude that simple correlation models might be inadequate for describing protein variability-connectivity interplay in vertebrates; they often tend towards false negatives (showing no more than marginal linear or rank correlation where there are in fact strong non-random patterns).


Asunto(s)
Evolución Molecular , Modelos Estadísticos , Procesos Estocásticos , Vertebrados/genética , Animales , Biología Computacional/métodos , Simulación por Computador , Bases de Datos de Proteínas , Humanos , Dominios y Motivos de Interacción de Proteínas/fisiología
10.
Proc Natl Acad Sci U S A ; 111(17): 6353-8, 2014 Apr 29.
Artículo en Inglés | MEDLINE | ID: mdl-24733912

RESUMEN

Evolution by gene duplication is generally accepted as one of the crucial driving forces for the gain of new complexity and functions, but the formation of pseudogenes remains a problem for this mechanism. Here we expand on earlier ideas that epigenetic modifications can drive neo- and subfunctionalization in evolution by gene duplication. We explore the effects of stochastic epigenetic modifications on the evolution (and thus development) of complex organisms in a constant environment. Modeling is done both using a modified genetic drift analytical treatment and computer simulations, which were found to agree. A transposon silencing model is also explored. Some key assumptions made include (i) stochastic, incomplete removal (or addition) of repressive epigenetic marks takes place during a window(s) of opportunity in the zygote and early embryo; (ii) there is no statistical variation of the marks after the window closes; and (iii) the genes affected are sensitive to dosage. Our genetic drift treatment takes into account that after gene duplication the prevailing case upon which selection operates is a duplicate/singlet heterozygote; to the best of our knowledge, this has not been considered in previous treatments. We conclude from our modeling that stochastic epigenetic modifications, with rates consistent with experimental observation, can both increase the rate of gene fixation and decrease pseudogenization, thus dramatically improving the efficacy of evolution by gene duplication. We also find that a transposon silencing model is advantageous for fixation of recessive genes in diploid organisms, especially with large effective population sizes.


Asunto(s)
Evolución Biológica , Embrión de Mamíferos/metabolismo , Epigénesis Genética , Animales , Simulación por Computador , Elementos Transponibles de ADN/genética , Difusión , Duplicación de Gen , Sitios Genéticos , Genotipo , Ratones , Modelos Genéticos , Fenotipo , Seudogenes , Procesos Estocásticos
11.
Sci Rep ; 14(1): 14954, 2024 06 28.
Artículo en Inglés | MEDLINE | ID: mdl-38942763

RESUMEN

While there are currently over 40 replicated genes with mapped risk alleles for Late Onset Alzheimer's disease (LOAD), the Apolipoprotein E locus E4 haplotype is still the biggest driver of risk, with odds ratios for neuropathologically confirmed E44 carriers exceeding 30 (95% confidence interval 16.59-58.75). We sought to address whether the APOE E4 haplotype modifies expression globally through networks of expression to increase LOAD risk. We have used the Human Brainome data to build expression networks comparing APOE E4 carriers to non-carriers using scalable mixed-datatypes Bayesian network (BN) modeling. We have found that VGF had the greatest explanatory weight. High expression of VGF is a protective signal, even on the background of APOE E4 alleles. LOAD risk signals, considering an APOE background, include high levels of SPECC1L, HLA-DRA and RANBP3L. Our findings nominate several new transcripts, taking a combined approach to network building including known LOAD risk loci.


Asunto(s)
Enfermedad de Alzheimer , Apolipoproteína E4 , Predisposición Genética a la Enfermedad , Humanos , Enfermedad de Alzheimer/genética , Enfermedad de Alzheimer/metabolismo , Apolipoproteína E4/genética , Cadenas alfa de HLA-DR/genética , Femenino , Masculino , Anciano , Proteínas Adaptadoras Transductoras de Señales/genética , Proteínas Adaptadoras Transductoras de Señales/metabolismo , Alelos , Haplotipos , Teorema de Bayes , Factores de Riesgo , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , Anciano de 80 o más Años
12.
NPJ Syst Biol Appl ; 10(1): 32, 2024 Mar 25.
Artículo en Inglés | MEDLINE | ID: mdl-38527998

RESUMEN

Acute myeloid leukemia (AML) is prevalent in both adult and pediatric patients. Despite advances in patient categorization, the heterogeneity of AML remains a challenge. Recent studies have explored the use of gene expression data to enhance AML diagnosis and prognosis, however, alternative approaches rooted in physics and chemistry may provide another level of insight into AML transformation. Utilizing publicly available databases, we analyze 884 human and mouse blood and bone marrow samples. We employ a personalized medicine strategy, combining state-transition theory and surprisal analysis, to assess the RNA transcriptome of individual patients. The transcriptome is transformed into physical parameters that represent each sample's steady state and the free energy change (FEC) from that steady state, which is the state with the lowest free energy.We found the transcriptome steady state was invariant across normal and AML samples. FEC, representing active molecular processes, varied significantly between samples and was used to create patient-specific barcodes to characterize the biology of the disease. We discovered that AML samples that were in a transition state had the highest FEC. This disease state may be characterized as the most unstable and hence the most therapeutically targetable since a change in free energy is a thermodynamic requirement for disease progression. We also found that distinct sets of ongoing processes may be at the root of otherwise similar clinical phenotypes, implying that our integrated analysis of transcriptome profiles may facilitate a personalized medicine approach to cure AML and restore a steady state in each patient.


Asunto(s)
Leucemia Mieloide Aguda , Transcriptoma , Adulto , Animales , Ratones , Humanos , Niño , Transcriptoma/genética , Perfilación de la Expresión Génica , Leucemia Mieloide Aguda/genética , Biomarcadores de Tumor/genética , Fenotipo
13.
Res Sq ; 2024 Apr 02.
Artículo en Inglés | MEDLINE | ID: mdl-38645262

RESUMEN

Enhancers are fundamental to gene regulation. Post-translational modifications by the small ubiquitin-like modifiers (SUMO) modify chromatin regulation enzymes, including histone acetylases and deacetylases. However, it remains unclear whether SUMOylation regulates enhancer marks, acetylation at the 27th lysine residue of the histone H3 protein (H3K27Ac). To investigate whether SUMOylation regulates H3K27Ac, we performed genome-wide ChIP-seq analyses and discovered that knockdown (KD) of the SUMO activating enzyme catalytic subunit UBA2 reduced H3K27Ac at most enhancers. Bioinformatic analysis revealed that TFAP2C-binding sites are enriched in enhancers whose H3K27Ac was reduced by UBA2 KD. ChIP-seq analysis in combination with molecular biological methods showed that TFAP2C binding to enhancers increased upon UBA2 KD or inhibition of SUMOylation by a small molecule SUMOylation inhibitor. However, this is not due to the SUMOylation of TFAP2C itself. Proteomics analysis of TFAP2C interactome on the chromatin identified histone deacetylation (HDAC) and RNA splicing machineries that contain many SUMOylation targets. TFAP2C KD reduced HDAC1 binding to chromatin and increased H3K27Ac marks at enhancer regions, suggesting that TFAP2C is important in recruiting HDAC machinery. Taken together, our findings provide insights into the regulation of enhancer marks by SUMOylation and TFAP2C and suggest that SUMOylation of proteins in the HDAC machinery regulates their recruitments to enhancers.

14.
Leukemia ; 38(4): 769-780, 2024 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-38307941

RESUMEN

Chronic myeloid leukemia (CML) is initiated and maintained by BCR::ABL which is clinically targeted using tyrosine kinase inhibitors (TKIs). TKIs can induce long-term remission but are also not curative. Thus, CML is an ideal system to test our hypothesis that transcriptome-based state-transition models accurately predict cancer evolution and treatment response. We collected time-sequential blood samples from tetracycline-off (Tet-Off) BCR::ABL-inducible transgenic mice and wild-type controls. From the transcriptome, we constructed a CML state-space and a three-well leukemogenic potential landscape. The potential's stable critical points defined observable disease states. Early states were characterized by anti-CML genes opposing leukemia; late states were characterized by pro-CML genes. Genes with expression patterns shaped similarly to the potential landscape were identified as drivers of disease transition. Re-introduction of tetracycline to silence the BCR::ABL gene returned diseased mice transcriptomes to a near healthy state, without reaching it, suggesting parts of the transition are irreversible. TKI only reverted the transcriptome to an intermediate disease state, without approaching a state of health; disease relapse occurred soon after treatment. Using only the earliest time-point as initial conditions, our state-transition models accurately predicted both disease progression and treatment response, supporting this as a potentially valuable approach to time clinical intervention, before phenotypic changes become detectable.


Asunto(s)
Leucemia Mielógena Crónica BCR-ABL Positiva , Transcriptoma , Ratones , Animales , Proteínas de Fusión bcr-abl/metabolismo , Inhibidores de Proteínas Quinasas/farmacología , Inhibidores de Proteínas Quinasas/uso terapéutico , Leucemia Mielógena Crónica BCR-ABL Positiva/tratamiento farmacológico , Leucemia Mielógena Crónica BCR-ABL Positiva/genética , Tetraciclinas/uso terapéutico , Resistencia a Antineoplásicos
15.
Proc Natl Acad Sci U S A ; 107(35): 15485-90, 2010 Aug 31.
Artículo en Inglés | MEDLINE | ID: mdl-20716685

RESUMEN

CpG dinucleotides contribute to epigenetic mechanisms by being the only site for DNA methylation in mammalian somatic cells. They are also mutation hotspots and approximately 5-fold depleted genome-wide. We report here a study focused on CpG sites in the coding regions of Hox and other transcription factor genes, comparing methylated genomes of Homo sapiens, Mus musculus, and Danio rerio with nonmethylated genomes of Drosophila melanogaster and Caenorhabditis elegans. We analyzed 4-fold degenerate, synonymous codons with the potential for CpG. That is, we studied "silent" changes that do not affect protein products but could damage epigenetic marking. We find that DNA-binding transcription factors and other developmentally relevant genes show, only in methylated genomes, a bimodal distribution of CpG usage. Several genetic code-based tests indicate, again for methylated genomes only, that the frequency of silent CpGs in Hox genes is much greater than expectation. Also informative are NCG-GNN and NCC-GNN codon doublets, for which an unusually high rate of G to C and C to G transversions was observed at the third (silent) position of the first codon. Together these results are interpreted as evidence for strong "pro-epigenetic" selection acting to preserve CpG sites in coding regions of many genes controlling development. We also report that DNA-binding transcription factors and developmentally important genes are dramatically overrepresented in or near clusters of three or more CpG islands, suggesting a possible relationship between evolutionary preservation of CpG dinucleotides in both coding regions and CpG islands.


Asunto(s)
Islas de CpG/genética , Metilación de ADN , Proteínas de Homeodominio/genética , Sistemas de Lectura Abierta/genética , Factores de Transcripción/genética , Algoritmos , Aminoácidos/genética , Animales , Caenorhabditis elegans/genética , Codón/genética , Proteínas de Unión al ADN/genética , Drosophila melanogaster/genética , Epigénesis Genética , Evolución Molecular , Exones/genética , Genoma/genética , Ratones , Modelos Genéticos , Mutación Puntual , Selección Genética , Pez Cebra/genética
16.
Res Sq ; 2023 Dec 14.
Artículo en Inglés | MEDLINE | ID: mdl-38168398

RESUMEN

While there are currently over 40 replicated genes with mapped risk alleles for Late Onset Alzheimer's disease (LOAD), the Apolipoprotein E locus E4 haplotype is still the biggest driver of risk, with odds ratios for neuropathologically confirmed E44 carriers exceeding 30 (95% confidence interval 16.59-58.75). We sought to address whether the APOE E4 haplotype modifies expression globally through networks of expression to increase LOAD risk. We have used the Human Brainome data to build expression networks comparing APOE E4 carriers to non-carriers using scalable mixed-datatypes Bayesian network (BN) modeling. We have found that VGF had the greatest explanatory weight. High expression of VGF is a protective signal, even on the background of APOE E4 alleles. LOAD risk signals, considering an APOE background, include high levels of SPECC1L, HLA-DRA and RANBP3L. Our findings nominate several new transcripts, taking a combined approach to network building including known LOAD risk loci.

17.
bioRxiv ; 2023 Oct 12.
Artículo en Inglés | MEDLINE | ID: mdl-37873104

RESUMEN

Cooperative interactions in protein-protein interfaces demonstrate the interdependency or the linked network-like behavior of interface interactions and their effect on the coupling of proteins. Cooperative interactions also could cause ripple or allosteric effects at a distance in protein-protein interfaces. Although they are critically important in protein-protein interfaces it is challenging to determine which amino acid pair interactions are cooperative. In this work we have used Bayesian network modeling, an interpretable machine learning method, combined with molecular dynamics trajectories to identify the residue pairs that show high cooperativity and their allosteric effect in the interface of G protein-coupled receptor (GPCR) complexes with G proteins. Our results reveal a strong co-dependency in the formation of interface GPCR:G protein contacts. This observation indicates that cooperativity of GPCR:G protein interactions is necessary for the coupling and selectivity of G proteins and is thus critical for receptor function. We have identified subnetworks containing polar and hydrophobic interactions that are common among multiple GPCRs coupling to different G protein subtypes (Gs, Gi and Gq). These common subnetworks along with G protein-specific subnetworks together confer selectivity to the G protein coupling. This work underscores the potential of data-driven Bayesian network modeling in elucidating the intricate dependencies and selectivity determinants in GPCR:G protein complexes, offering valuable insights into the dynamic nature of these essential cellular signaling components.

18.
iScience ; 26(2): 106041, 2023 Feb 17.
Artículo en Inglés | MEDLINE | ID: mdl-36818303

RESUMEN

Modern artificial neural networks (ANNs) have long been designed on foundations of mathematics as opposed to their original foundations of biomimicry. However, the structure and function of these modern ANNs are often analogous to real-life biological networks. We propose that the ubiquitous information-theoretic principles underlying the development of ANNs are similar to the principles guiding the macro-evolution of biological networks and that insights gained from one field can be applied to the other. We generate hypotheses on the bow-tie network structure of the Janus kinase - signal transducers and activators of transcription (JAK-STAT) pathway, additionally informed by the evolutionary considerations, and carry out ANN simulation experiments to demonstrate that an increase in the network's input and output complexity does not necessarily require a more complex intermediate layer. This observation should guide novel biomarker discovery-namely, to prioritize sections of the biological networks in which information is most compressed as opposed to biomarkers representing the periphery of the network.

19.
bioRxiv ; 2023 Nov 03.
Artículo en Inglés | MEDLINE | ID: mdl-37961682

RESUMEN

Cytokines mediate cell-to-cell communication across the immune system and therefore are critical to immunosurveillance in cancer and other diseases. Several cytokines show dysregulated abundance or signaling responses in breast cancer, associated with the disease and differences in survival and progression. Cytokines operate in a coordinated manner to affect immune surveillance and regulate one another, necessitating a systems approach for a complete picture of this dysregulation. Here, we profiled cytokine signaling responses of peripheral immune cells from breast cancer patients as compared to healthy controls in a multidimensional manner across ligands, cell populations, and responsive pathways. We find alterations in cytokine responsiveness across pathways and cell types that are best defined by integrated signatures across dimensions. Alterations in the abundance of a cytokine's cognate receptor do not explain differences in responsiveness. Rather, alterations in baseline signaling and receptor abundance suggesting immune cell reprogramming are associated with altered responses. These integrated features suggest a global reprogramming of immune cell communication in breast cancer.

20.
bioRxiv ; 2023 Dec 09.
Artículo en Inglés | MEDLINE | ID: mdl-37873185

RESUMEN

Chronic myeloid leukemia (CML) is initiated and maintained by BCR::ABL which is clinically targeted using tyrosine kinase inhibitors (TKIs). TKIs can induce long-term remission but are also not curative. Thus, CML is an ideal system to test our hypothesis that transcriptome-based state-transition models accurately predict cancer evolution and treatment response. We collected time-sequential blood samples from tetracycline-off (Tet-Off) BCR::ABL-inducible transgenic mice and wild-type controls. From the transcriptome, we constructed a CML state-space and a three-well leukemogenic potential landscape. The potential's stable critical points defined observable disease states. Early states were characterized by anti-CML genes opposing leukemia; late states were characterized by pro-CML genes. Genes with expression patterns shaped similarly to the potential landscape were identified as drivers of disease transition. Re-introduction of tetracycline to silence the BCR::ABL gene returned diseased mice transcriptomes to a near healthy state, without reaching it, suggesting parts of the transition are irreversible. TKI only reverted the transcriptome to an intermediate disease state, without approaching a state of health; disease relapse occurred soon after treatment. Using only the earliest time-point as initial conditions, our state-transition models accurately predicted both disease progression and treatment response, supporting this as a potentially valuable approach to time clinical intervention even before phenotypic changes become detectable.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA