Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 3.010
Filtrar
Más filtros

Intervalo de año de publicación
1.
Annu Rev Immunol ; 41: 317-342, 2023 04 26.
Artículo en Inglés | MEDLINE | ID: mdl-37126419

RESUMEN

Over the last decade, immunometabolism has emerged as a novel interdisciplinary field of research and yielded significant fundamental insights into the regulation of immune responses. Multiple classical approaches to interrogate immunometabolism, including bulk metabolic profiling and analysis of metabolic regulator expression, paved the way to appreciating the physiological complexity of immunometabolic regulation in vivo. Studying immunometabolism at the systems level raised the need to transition towards the next-generation technology for metabolic profiling and analysis. Spatially resolved metabolic imaging and computational algorithms for multi-modal data integration are new approaches to connecting metabolism and immunity. In this review, we discuss recent studies that highlight the complex physiological interplay between immune responses and metabolism and give an overview of technological developments that bear the promise of capturing this complexity most directly and comprehensively.


Asunto(s)
Alergia e Inmunología , Inmunidad , Metabolismo , Animales , Humanos , Biología de Sistemas
2.
Cell ; 187(1): 149-165.e23, 2024 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-38134933

RESUMEN

Deciphering the cell-state transitions underlying immune adaptation across time is fundamental for advancing biology. Empirical in vivo genomic technologies that capture cellular dynamics are currently lacking. We present Zman-seq, a single-cell technology recording transcriptomic dynamics across time by introducing time stamps into circulating immune cells, tracking them in tissues for days. Applying Zman-seq resolved cell-state and molecular trajectories of the dysfunctional immune microenvironment in glioblastoma. Within 24 hours of tumor infiltration, cytotoxic natural killer cells transitioned to a dysfunctional program regulated by TGFB1 signaling. Infiltrating monocytes differentiated into immunosuppressive macrophages, characterized by the upregulation of suppressive myeloid checkpoints Trem2, Il18bp, and Arg1, over 36 to 48 hours. Treatment with an antagonistic anti-TREM2 antibody reshaped the tumor microenvironment by redirecting the monocyte trajectory toward pro-inflammatory macrophages. Zman-seq is a broadly applicable technology, enabling empirical measurements of differentiation trajectories, which can enhance the development of more efficacious immunotherapies.


Asunto(s)
Glioblastoma , Humanos , Perfilación de la Expresión Génica , Glioblastoma/patología , Inmunoterapia , Células Asesinas Naturales , Macrófagos , Microambiente Tumoral , Análisis de la Célula Individual
3.
Cell ; 187(15): 3919-3935.e19, 2024 Jul 25.
Artículo en Inglés | MEDLINE | ID: mdl-38908368

RESUMEN

In aging, physiologic networks decline in function at rates that differ between individuals, producing a wide distribution of lifespan. Though 70% of human lifespan variance remains unexplained by heritable factors, little is known about the intrinsic sources of physiologic heterogeneity in aging. To understand how complex physiologic networks generate lifespan variation, new methods are needed. Here, we present Asynch-seq, an approach that uses gene-expression heterogeneity within isogenic populations to study the processes generating lifespan variation. By collecting thousands of single-individual transcriptomes, we capture the Caenorhabditis elegans "pan-transcriptome"-a highly resolved atlas of non-genetic variation. We use our atlas to guide a large-scale perturbation screen that identifies the decoupling of total mRNA content between germline and soma as the largest source of physiologic heterogeneity in aging, driven by pleiotropic genes whose knockdown dramatically reduces lifespan variance. Our work demonstrates how systematic mapping of physiologic heterogeneity can be applied to reduce inter-individual disparities in aging.


Asunto(s)
Envejecimiento , Caenorhabditis elegans , Redes Reguladoras de Genes , Longevidad , Transcriptoma , Caenorhabditis elegans/genética , Caenorhabditis elegans/fisiología , Animales , Envejecimiento/genética , Transcriptoma/genética , Longevidad/genética , Proteínas de Caenorhabditis elegans/metabolismo , Proteínas de Caenorhabditis elegans/genética , ARN Mensajero/metabolismo , ARN Mensajero/genética
4.
Cell ; 187(19): 5431-5452.e20, 2024 Sep 19.
Artículo en Inglés | MEDLINE | ID: mdl-39303691

RESUMEN

Breastfeeding and microbial colonization during infancy occur within a critical time window for development, and both are thought to influence the risk of respiratory illness. However, the mechanisms underlying the protective effects of breastfeeding and the regulation of microbial colonization are poorly understood. Here, we profiled the nasal and gut microbiomes, breastfeeding characteristics, and maternal milk composition of 2,227 children from the CHILD Cohort Study. We identified robust colonization patterns that, together with milk components, predict preschool asthma and mediate the protective effects of breastfeeding. We found that early cessation of breastfeeding (before 3 months) leads to the premature acquisition of microbial species and functions, including Ruminococcus gnavus and tryptophan biosynthesis, which were previously linked to immune modulation and asthma. Conversely, longer exclusive breastfeeding supports a paced microbial development, protecting against asthma. These findings underscore the importance of extended breastfeeding for respiratory health and highlight potential microbial targets for intervention.


Asunto(s)
Lactancia Materna , Leche Humana , Humanos , Femenino , Leche Humana/microbiología , Lactante , Preescolar , Asma/microbiología , Asma/prevención & control , Asma/inmunología , Microbiota , Microbioma Gastrointestinal , Masculino , Estudios de Cohortes , Recién Nacido
5.
Cell ; 186(25): 5440-5456.e26, 2023 12 07.
Artículo en Inglés | MEDLINE | ID: mdl-38065078

RESUMEN

Today's genomics workflows typically require alignment to a reference sequence, which limits discovery. We introduce a unifying paradigm, SPLASH (Statistically Primary aLignment Agnostic Sequence Homing), which directly analyzes raw sequencing data, using a statistical test to detect a signature of regulation: sample-specific sequence variation. SPLASH detects many types of variation and can be efficiently run at scale. We show that SPLASH identifies complex mutation patterns in SARS-CoV-2, discovers regulated RNA isoforms at the single-cell level, detects the vast sequence diversity of adaptive immune receptors, and uncovers biology in non-model organisms undocumented in their reference genomes: geographic and seasonal variation and diatom association in eelgrass, an oceanic plant impacted by climate change, and tissue-specific transcripts in octopus. SPLASH is a unifying approach to genomic analysis that enables expansive discovery without metadata or references.


Asunto(s)
Algoritmos , Genómica , Genoma , Análisis de Secuencia de ARN , Humanos , Antígenos HLA/genética , Análisis de la Célula Individual
6.
Cell ; 185(5): 881-895.e20, 2022 03 03.
Artículo en Inglés | MEDLINE | ID: mdl-35216672

RESUMEN

Post-acute sequelae of COVID-19 (PASC) represent an emerging global crisis. However, quantifiable risk factors for PASC and their biological associations are poorly resolved. We executed a deep multi-omic, longitudinal investigation of 309 COVID-19 patients from initial diagnosis to convalescence (2-3 months later), integrated with clinical data and patient-reported symptoms. We resolved four PASC-anticipating risk factors at the time of initial COVID-19 diagnosis: type 2 diabetes, SARS-CoV-2 RNAemia, Epstein-Barr virus viremia, and specific auto-antibodies. In patients with gastrointestinal PASC, SARS-CoV-2-specific and CMV-specific CD8+ T cells exhibited unique dynamics during recovery from COVID-19. Analysis of symptom-associated immunological signatures revealed coordinated immunity polarization into four endotypes, exhibiting divergent acute severity and PASC. We find that immunological associations between PASC factors diminish over time, leading to distinct convalescent immune states. Detectability of most PASC factors at COVID-19 diagnosis emphasizes the importance of early disease measurements for understanding emergent chronic conditions and suggests PASC treatment strategies.


Asunto(s)
COVID-19/complicaciones , COVID-19/diagnóstico , Convalecencia , Inmunidad Adaptativa/genética , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Autoanticuerpos/sangre , Biomarcadores/metabolismo , Proteínas Sanguíneas/metabolismo , Linfocitos T CD8-positivos/inmunología , Linfocitos T CD8-positivos/metabolismo , COVID-19/inmunología , COVID-19/patología , COVID-19/virología , Progresión de la Enfermedad , Femenino , Humanos , Inmunidad Innata/genética , Estudios Longitudinales , Masculino , Persona de Mediana Edad , Factores de Riesgo , SARS-CoV-2/aislamiento & purificación , Transcriptoma , Adulto Joven , Síndrome Post Agudo de COVID-19
7.
Cell ; 184(11): 3022-3040.e28, 2021 05 27.
Artículo en Inglés | MEDLINE | ID: mdl-33961781

RESUMEN

Thousands of interactions assemble proteins into modules that impart spatial and functional organization to the cellular proteome. Through affinity-purification mass spectrometry, we have created two proteome-scale, cell-line-specific interaction networks. The first, BioPlex 3.0, results from affinity purification of 10,128 human proteins-half the proteome-in 293T cells and includes 118,162 interactions among 14,586 proteins. The second results from 5,522 immunoprecipitations in HCT116 cells. These networks model the interactome whose structure encodes protein function, localization, and complex membership. Comparison across cell lines validates thousands of interactions and reveals extensive customization. Whereas shared interactions reside in core complexes and involve essential proteins, cell-specific interactions link these complexes, "rewiring" subnetworks within each cell's interactome. Interactions covary among proteins of shared function as the proteome remodels to produce each cell's phenotype. Viewable interactively online through BioPlexExplorer, these networks define principles of proteome organization and enable unknown protein characterization.


Asunto(s)
Mapeo de Interacción de Proteínas/métodos , Mapas de Interacción de Proteínas/genética , Proteoma/genética , Biología Computacional/métodos , Células HCT116/metabolismo , Células HEK293/metabolismo , Humanos , Espectrometría de Masas/métodos , Mapas de Interacción de Proteínas/fisiología , Proteoma/metabolismo , Proteómica/métodos
8.
Cell ; 178(6): 1465-1477.e17, 2019 09 05.
Artículo en Inglés | MEDLINE | ID: mdl-31491388

RESUMEN

Most human protein-coding genes are regulated by multiple, distinct promoters, suggesting that the choice of promoter is as important as its level of transcriptional activity. However, while a global change in transcription is recognized as a defining feature of cancer, the contribution of alternative promoters still remains largely unexplored. Here, we infer active promoters using RNA-seq data from 18,468 cancer and normal samples, demonstrating that alternative promoters are a major contributor to context-specific regulation of transcription. We find that promoters are deregulated across tissues, cancer types, and patients, affecting known cancer genes and novel candidates. For genes with independently regulated promoters, we demonstrate that promoter activity provides a more accurate predictor of patient survival than gene expression. Our study suggests that a dynamic landscape of active promoters shapes the cancer transcriptome, opening new diagnostic avenues and opportunities to further explore the interplay of regulatory mechanisms with transcriptional aberrations in cancer.


Asunto(s)
Biología Computacional/métodos , Regulación Neoplásica de la Expresión Génica/genética , Neoplasias/genética , Regiones Promotoras Genéticas/genética , Transcriptoma/genética , Bases de Datos Genéticas , Humanos , RNA-Seq/métodos
9.
Immunity ; 57(5): 1160-1176.e7, 2024 May 14.
Artículo en Inglés | MEDLINE | ID: mdl-38697118

RESUMEN

Multimodal single-cell profiling methods can capture immune cell variations unfolding over time at the molecular, cellular, and population levels. Transforming these data into biological insights remains challenging. Here, we introduce a framework to integrate variations at the human population and single-cell levels in vaccination responses. Comparing responses following AS03-adjuvanted versus unadjuvanted influenza vaccines with CITE-seq revealed AS03-specific early (day 1) response phenotypes, including a B cell signature of elevated germinal center competition. A correlated network of cell-type-specific transcriptional states defined the baseline immune status associated with high antibody responders to the unadjuvanted vaccine. Certain innate subsets in the network appeared "naturally adjuvanted," with transcriptional states resembling those induced uniquely by AS03-adjuvanted vaccination. Consistently, CD14+ monocytes from high responders at baseline had elevated phospho-signaling responses to lipopolysaccharide stimulation. Our findings link baseline immune setpoints to early vaccine responses, with positive implications for adjuvant development and immune response engineering.


Asunto(s)
Linfocitos B , Vacunas contra la Influenza , Análisis de la Célula Individual , Humanos , Vacunas contra la Influenza/inmunología , Linfocitos B/inmunología , Centro Germinal/inmunología , Gripe Humana/inmunología , Gripe Humana/prevención & control , Vacunación , Anticuerpos Antivirales/inmunología , Adyuvantes Inmunológicos , Adyuvantes de Vacunas , Monocitos/inmunología , Polisorbatos , Escualeno/inmunología , Inmunidad Innata/inmunología
10.
Cell ; 167(1): 158-170.e12, 2016 Sep 22.
Artículo en Inglés | MEDLINE | ID: mdl-27662088

RESUMEN

Protein flexibility ranges from simple hinge movements to functional disorder. Around half of all human proteins contain apparently disordered regions with little 3D or functional information, and many of these proteins are associated with disease. Building on the evolutionary couplings approach previously successful in predicting 3D states of ordered proteins and RNA, we developed a method to predict the potential for ordered states for all apparently disordered proteins with sufficiently rich evolutionary information. The approach is highly accurate (79%) for residue interactions as tested in more than 60 known disordered regions captured in a bound or specific condition. Assessing the potential for structure of more than 1,000 apparently disordered regions of human proteins reveals a continuum of structural order with at least 50% with clear propensity for three- or two-dimensional states. Co-evolutionary constraints reveal hitherto unseen structures of functional importance in apparently disordered proteins.


Asunto(s)
Proteínas Intrínsecamente Desordenadas/química , Evolución Molecular Dirigida/métodos , Genómica , Humanos , Proteínas Intrínsecamente Desordenadas/genética , Estructura Secundaria de Proteína , Estructura Terciaria de Proteína , Proteoma/química , Proteoma/genética
11.
Mol Cell ; 82(2): 260-273, 2022 01 20.
Artículo en Inglés | MEDLINE | ID: mdl-35016036

RESUMEN

Biological systems are composed of a vast web of multiscale molecular interactors and interactions. High-throughput technologies, both bulk and single cell, now allow for investigation of the properties and quantities of these interactors. Computational algorithms and machine learning methods then provide the tools to derive meaningful insights from the resulting data sets. One such approach is graphical network modeling, which provides a computational framework to explicitly model the molecular interactions within and between the cells comprising biological systems. These graphical networks aim to describe a putative chain of cause and effect between interacting molecules. This feature allows for determination of key molecules in a biological process, accelerated generation of mechanistic hypotheses, and simulation of experimental outcomes. We review the computational concepts and applications of graphical network models across molecular scales for both intracellular and intercellular regulatory biology, examples of successful applications, and the future directions needed to overcome current limitations.


Asunto(s)
Biología Computacional , Redes Reguladoras de Genes , Aprendizaje Automático , Mapas de Interacción de Proteínas , Animales , Regulación de la Expresión Génica , Humanos , Modelos Biológicos , Proyectos de Investigación , Transducción de Señal
12.
Brief Bioinform ; 25(5)2024 Jul 25.
Artículo en Inglés | MEDLINE | ID: mdl-39297879

RESUMEN

Structural variation (SV) refers to insertions, deletions, inversions, and duplications in human genomes. SVs are present in approximately 1.5% of the human genome. Still, this small subset of genetic variation has been implicated in the pathogenesis of psoriasis, Crohn's disease and other autoimmune disorders, autism spectrum and other neurodevelopmental disorders, and schizophrenia. Since identifying structural variants is an important problem in genetics, several specialized computational techniques have been developed to detect structural variants directly from sequencing data. With advances in whole-genome sequencing (WGS) technologies, a plethora of SV detection methods have been developed. However, dissecting SVs from WGS data remains a challenge, with the majority of SV detection methods prone to a high false-positive rate, and no existing method able to precisely detect a full range of SVs present in a sample. Previous studies have shown that none of the existing SV callers can maintain high accuracy across various SV lengths and genomic coverages. Here, we report an integrated structural variant calling framework, Variant Identification and Structural Variant Analysis (VISTA), that leverages the results of individual callers using a novel and robust filtering and merging algorithm. In contrast to existing consensus-based tools which ignore the length and coverage, VISTA overcomes this limitation by executing various combinations of top-performing callers based on variant length and genomic coverage to generate SV events with high accuracy. We evaluated the performance of VISTA on comprehensive gold-standard datasets across varying organisms and coverage. We benchmarked VISTA using the Genome-in-a-Bottle gold standard SV set, haplotype-resolved de novo assemblies from the Human Pangenome Reference Consortium, along with an in-house polymerase chain reaction (PCR)-validated mouse gold standard set. VISTA maintained the highest F1 score among top consensus-based tools measured using a comprehensive gold standard across both mouse and human genomes. VISTA also has an optimized mode, where the calls can be optimized for precision or recall. VISTA-optimized can attain 100% precision and the highest sensitivity among other variant callers. In conclusion, VISTA represents a significant advancement in structural variant calling, offering a robust and accurate framework that outperforms existing consensus-based tools and sets a new standard for SV detection in genomic research.


Asunto(s)
Genoma Humano , Variación Estructural del Genoma , Programas Informáticos , Humanos , Secuenciación Completa del Genoma/métodos , Algoritmos , Genómica/métodos , Biología Computacional/métodos , Variación Genética
13.
Brief Bioinform ; 25(4)2024 May 23.
Artículo en Inglés | MEDLINE | ID: mdl-38783705

RESUMEN

Tumor mutational signatures have gained prominence in cancer research, yet the lack of standardized methods hinders reproducibility and robustness. Leveraging colorectal cancer (CRC) as a model, we explored the influence of computational parameters on mutational signature analyses across 230 CRC cell lines and 152 CRC patients. Results were validated in three independent datasets: 483 endometrial cancer patients stratified by mismatch repair (MMR) status, 35 lung cancer patients by smoking status and 12 patient-derived organoids (PDOs) annotated for colibactin exposure. Assessing various bioinformatic tools, reference datasets and input data sizes including whole genome sequencing, whole exome sequencing and a pan-cancer gene panel, we demonstrated significant variability in the results. We report that the use of distinct algorithms and references led to statistically different results, highlighting how arbitrary choices may induce variability in the mutational signature contributions. Furthermore, we found a differential contribution of mutational signatures between coding and intergenic regions and defined the minimum number of somatic variants required for reliable mutational signature assignment. To facilitate the identification of the most suitable workflows, we developed Comparative Mutational Signature analysis on Coding and Extragenic Regions (CoMSCER), a bioinformatic tool which allows researchers to easily perform comparative mutational signature analysis by coupling the results from several tools and public reference datasets and to assess mutational signature contributions in coding and non-coding genomic regions. In conclusion, our study provides a comparative framework to elucidate the impact of distinct computational workflows on mutational signatures.


Asunto(s)
Neoplasias Colorrectales , Biología Computacional , Mutación , Humanos , Neoplasias Colorrectales/genética , Neoplasias Colorrectales/patología , Biología Computacional/métodos , Flujo de Trabajo , Línea Celular Tumoral , Secuenciación del Exoma/métodos , Femenino , Algoritmos
14.
Brief Bioinform ; 25(5)2024 Jul 25.
Artículo en Inglés | MEDLINE | ID: mdl-39129360

RESUMEN

The genetic blueprint for the essential functions of life is encoded in DNA, which is translated into proteins-the engines driving most of our metabolic processes. Recent advancements in genome sequencing have unveiled a vast diversity of protein families, but compared with the massive search space of all possible amino acid sequences, the set of known functional families is minimal. One could say nature has a limited protein "vocabulary." A major question for computational biologists, therefore, is whether this vocabulary can be expanded to include useful proteins that went extinct long ago or have never evolved (yet). By merging evolutionary algorithms, machine learning, and bioinformatics, we can develop highly customized "designer proteins." We dub the new subfield of computational evolution, which employs evolutionary algorithms with DNA string representations, biologically accurate molecular evolution, and bioinformatics-informed fitness functions, Evolutionary Algorithms Simulating Molecular Evolution.


Asunto(s)
Algoritmos , Biología Computacional , Evolución Molecular , Biología Computacional/métodos , Proteínas/genética , Proteínas/química , Proteínas/metabolismo , Simulación por Computador
15.
Proc Natl Acad Sci U S A ; 120(12): e2216805120, 2023 03 21.
Artículo en Inglés | MEDLINE | ID: mdl-36920920

RESUMEN

Homeostasis, the ability to maintain a relatively constant internal environment in the face of perturbations, is a hallmark of biological systems. It is believed that this constancy is achieved through multiple internal regulation and control processes. Given observations of a system, or even a detailed model of one, it is both valuable and extremely challenging to extract the control objectives of the homeostatic mechanisms. In this work, we develop a robust data-driven method to identify these objectives, namely to understand: "what does the system care about?". We propose an algorithm, Identifying Regulation with Adversarial Surrogates (IRAS), that receives an array of temporal measurements of the system and outputs a candidate for the control objective, expressed as a combination of observed variables. IRAS is an iterative algorithm consisting of two competing players. The first player, realized by an artificial deep neural network, aims to minimize a measure of invariance we refer to as the coefficient of regulation. The second player aims to render the task of the first player more difficult by forcing it to extract information about the temporal structure of the data, which is absent from similar "surrogate" data. We test the algorithm on four synthetic and one natural data set, demonstrating excellent empirical results. Interestingly, our approach can also be used to extract conserved quantities, e.g., energy and momentum, in purely physical systems, as we demonstrate empirically.


Asunto(s)
Algoritmos , Homeostasis
16.
Semin Cell Dev Biol ; 138: 68-80, 2023 03 30.
Artículo en Inglés | MEDLINE | ID: mdl-35260294

RESUMEN

Since the discovery of this cell population by His in 1850, the neural crest has been under intense study for its important role during vertebrate development. Much has been learned about the function and regulation of neural crest cell differentiation, and as a result, the neural crest has become a key model system for stem cell biology in general. The experiments performed in embryology, genetics, and cell biology in the last 150 years in the neural crest field has given rise to several big questions that have been debated intensely for many years: "How does positional information impact developmental potential? Are neural crest cells individually multipotent or a mixed population of committed progenitors? What are the key factors that regulate the acquisition of stem cell identity, and how does a stem cell decide to differentiate towards one cell fate versus another?" Recently, a marriage between single cell multi-omics, statistical modeling, and developmental biology has shed a substantial amount of light on these questions, and has paved a clear path for future researchers in the field.


Asunto(s)
Cresta Neural , Células Madre , Animales , Diferenciación Celular/genética , Vertebrados
17.
Semin Cell Dev Biol ; 147: 83-90, 2023 09 30.
Artículo en Inglés | MEDLINE | ID: mdl-36754751

RESUMEN

Understanding the mechanism by which cells coordinate their differentiation and migration is critical to our understanding of many fundamental processes such as wound healing, disease progression, and developmental biology. Mathematical models have been an essential tool for testing and developing our understanding, such as models of cells as soft spherical particles, reaction-diffusion systems that couple cell movement to environmental factors, and multi-scale multi-physics simulations that combine bottom-up rule-based models with continuum laws. However, mathematical models can often be loosely related to data or have so many parameters that model behaviour is weakly constrained. Recent methods in machine learning introduce new means by which models can be derived and deployed. In this review, we discuss examples of mathematical models of aspects of developmental biology, such as cell migration, and how these models can be combined with these recent machine learning methods.


Asunto(s)
Simulación por Computador , Biología Evolutiva , Modelos Biológicos , Morfogénesis , Biología Evolutiva/métodos , Biología Evolutiva/tendencias , Movimiento Celular , Simulación por Computador/tendencias , Aprendizaje Automático , Humanos , Animales
18.
Circulation ; 149(10): 774-787, 2024 03 05.
Artículo en Inglés | MEDLINE | ID: mdl-38018436

RESUMEN

BACKGROUND: Cholesterol efflux capacity (CEC) predicts cardiovascular disease independently of high-density lipoprotein (HDL) cholesterol levels. Isolated small HDL particles are potent promoters of macrophage CEC by the ABCA1 (ATP-binding cassette transporter A1) pathway, but the underlying mechanisms are unclear. METHODS: We used model system studies of reconstituted HDL and plasma from control and lecithin-cholesterol acyltransferase (LCAT)-deficient subjects to investigate the relationships among the sizes of HDL particles, the structure of APOA1 (apolipoprotein A1) in the different particles, and the CECs of plasma and isolated HDLs. RESULTS: We quantified macrophage and ABCA1 CEC of 4 distinct sizes of reconstituted HDL. CEC increased as particle size decreased. Tandem mass spectrometric analysis of chemically cross-linked peptides and molecular dynamics simulations of APOA1, the major protein of HDL, indicated that the mobility of C-terminus of that protein was markedly higher and flipped off the surface in the smallest particles. To explore the physiological relevance of the model system studies, we isolated HDL from LCAT-deficient subjects, whose small HDLs (like reconstituted HDLs) are discoidal and composed of APOA1, cholesterol, and phospholipid. Despite their very low plasma levels of HDL particles, these subjects had normal CEC. In both the LCAT-deficient subjects and control subjects, the CEC of isolated extra-small HDL (a mixture of extra-small and small HDL by calibrated ion mobility analysis) was 3- to 5-fold greater than that of the larger sizes of isolated HDL. Incubating LCAT-deficient plasma and control plasma with human LCAT converted extra-small and small HDL particles into larger particles, and it markedly inhibited CEC. CONCLUSIONS: We present a mechanism for the enhanced CEC of small HDLs. In smaller particles, the C-termini of the 2 antiparallel molecules of APOA1 are "flipped" off the lipid surface of HDL. This extended conformation allows them to engage with ABCA1. In contrast, the C-termini of larger HDLs are unable to interact productively with ABCA1 because they form a helical bundle that strongly adheres to the lipid on the particle. Enhanced CEC, as seen with the smaller particles, predicts decreased cardiovascular disease risk. Thus, extra-small and small HDLs may be key mediators and indicators of the cardioprotective effects of HDL.


Asunto(s)
Apolipoproteína A-I , Enfermedades Cardiovasculares , Humanos , Apolipoproteína A-I/metabolismo , Enfermedades Cardiovasculares/metabolismo , Lipoproteínas HDL/metabolismo , Colesterol , Transportador 1 de Casete de Unión a ATP/genética , Transportador 1 de Casete de Unión a ATP/metabolismo , Macrófagos/metabolismo , HDL-Colesterol
19.
Brief Bioinform ; 24(3)2023 05 19.
Artículo en Inglés | MEDLINE | ID: mdl-37031956

RESUMEN

MOTIVATION: Interpretability has become a necessary feature for machine learning models deployed in critical scenarios, e.g. legal system, healthcare. In these situations, algorithmic decisions may have (potentially negative) long-lasting effects on the end-user affected by the decision. While deep learning models achieve impressive results, they often function as a black-box. Inspired by linear models, we propose a novel class of structurally constrained deep neural networks, which we call FLAN (Feature-wise Latent Additive Networks). Crucially, FLANs process each input feature separately, computing for each of them a representation in a common latent space. These feature-wise latent representations are then simply summed, and the aggregated representation is used for the prediction. These feature-wise representations allow a user to estimate the effect of each individual feature independently from the others, similarly to the way linear models are interpreted. RESULTS: We demonstrate FLAN on a series of benchmark datasets in different biological domains. Our experiments show that FLAN achieves good performances even in complex datasets (e.g. TCR-epitope binding prediction), despite the structural constraint we imposed. On the other hand, this constraint enables us to interpret FLAN by deciphering its decision process, as well as obtaining biological insights (e.g. by identifying the marker genes of different cell populations). In supplementary experiments, we show similar performances also on non-biological datasets. CODE AND DATA AVAILABILITY: Code and example data are available at https://github.com/phineasng/flan_bio.


Asunto(s)
Aprendizaje Automático , Redes Neurales de la Computación , Unión Proteica
20.
Brief Bioinform ; 24(6)2023 09 22.
Artículo en Inglés | MEDLINE | ID: mdl-37974507

RESUMEN

In recent years, there has been an explosion of research on the application of deep learning to the prediction of various peptide properties, due to the significant development and market potential of peptides. Molecular dynamics has enabled the efficient collection of large peptide datasets, providing reliable training data for deep learning. However, the lack of systematic analysis of the peptide encoding, which is essential for artificial intelligence-assisted peptide-related tasks, makes it an urgent problem to be solved for the improvement of prediction accuracy. To address this issue, we first collect a high-quality, colossal simulation dataset of peptide self-assembly containing over 62 000 samples generated by coarse-grained molecular dynamics. Then, we systematically investigate the effect of peptide encoding of amino acids into sequences and molecular graphs using state-of-the-art sequential (i.e. recurrent neural network, long short-term memory and Transformer) and structural deep learning models (i.e. graph convolutional network, graph attention network and GraphSAGE), on the accuracy of peptide self-assembly prediction, an essential physiochemical process prior to any peptide-related applications. Extensive benchmarking studies have proven Transformer to be the most powerful sequence-encoding-based deep learning model, pushing the limit of peptide self-assembly prediction to decapeptides. In summary, this work provides a comprehensive benchmark analysis of peptide encoding with advanced deep learning models, serving as a guide for a wide range of peptide-related predictions such as isoelectric points, hydration free energy, etc.


Asunto(s)
Inteligencia Artificial , Redes Neurales de la Computación , Péptidos/metabolismo , Aminoácidos , Simulación por Computador
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA