Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 26
Filter
Add more filters










Publication year range
1.
Proc Natl Acad Sci U S A ; 121(6): e2300838121, 2024 Feb 06.
Article in English | MEDLINE | ID: mdl-38300863

ABSTRACT

Proteins play a central role in biology from immune recognition to brain activity. While major advances in machine learning have improved our ability to predict protein structure from sequence, determining protein function from its sequence or structure remains a major challenge. Here, we introduce holographic convolutional neural network (H-CNN) for proteins, which is a physically motivated machine learning approach to model amino acid preferences in protein structures. H-CNN reflects physical interactions in a protein structure and recapitulates the functional information stored in evolutionary data. H-CNN accurately predicts the impact of mutations on protein stability and binding of protein complexes. Our interpretable computational model for protein structure-function maps could guide design of novel proteins with desired function.


Subject(s)
Algorithms , Neural Networks, Computer , Proteins/genetics , Machine Learning , Amino Acids
2.
Immunity ; 57(2): 271-286.e13, 2024 Feb 13.
Article in English | MEDLINE | ID: mdl-38301652

ABSTRACT

The immune system encodes information about the severity of a pathogenic threat in the quantity and type of memory cells it forms. This encoding emerges from lymphocyte decisions to maintain or lose self-renewal and memory potential during a challenge. By tracking CD8+ T cells at the single-cell and clonal lineage level using time-resolved transcriptomics, quantitative live imaging, and an acute infection model, we find that T cells will maintain or lose memory potential early after antigen recognition. However, following pathogen clearance, T cells may regain memory potential if initially lost. Mechanistically, this flexibility is implemented by a stochastic cis-epigenetic switch that tunably and reversibly silences the memory regulator, TCF1, in response to stimulation. Mathematical modeling shows how this flexibility allows memory T cell numbers to scale robustly with pathogen virulence and immune response magnitudes. We propose that flexibility and stochasticity in cellular decisions ensure optimal immune responses against diverse threats.


Subject(s)
CD8-Positive T-Lymphocytes , Memory T Cells , Epigenesis, Genetic , Clone Cells , Immunologic Memory , Cell Differentiation
3.
Nat Commun ; 14(1): 7137, 2023 11 06.
Article in English | MEDLINE | ID: mdl-37932288

ABSTRACT

HIV-1 broadly neutralizing antibodies (bNAbs) are able to suppress viremia and prevent infection. Their induction by vaccination is therefore a major goal. However, in contrast to antibodies that neutralize other pathogens, HIV-1-specific bNAbs frequently carry uncommon molecular characteristics that might prevent their induction. Here, we perform unbiased sequence analyses of B cell receptor repertoires from 57 uninfected and 46 chronically HIV-1- or HCV-infected individuals and learn probabilistic models to predict the likelihood of bNAb development. We formally show that lower probabilities for bNAbs are predictive of higher HIV-1 neutralization activity. Moreover, ranking bNAbs by their probabilities allows to identify highly potent antibodies with superior generation probabilities as preferential targets for vaccination approaches. Importantly, we find equal bNAb probabilities across infected and uninfected individuals. This implies that chronic infection is not a prerequisite for the generation of bNAbs, fostering the hope that HIV-1 vaccines can induce bNAb development in uninfected people.


Subject(s)
AIDS Vaccines , HIV Infections , HIV-1 , Humans , Broadly Neutralizing Antibodies , HIV Antibodies , Antibodies, Neutralizing
4.
ArXiv ; 2023 Nov 28.
Article in English | MEDLINE | ID: mdl-38013891

ABSTRACT

Accurately modeling protein 3D structure is essential for the design of functional proteins. An important sub-task of structure modeling is protein side-chain packing: predicting the conformation of side-chains (rotamers) given the protein's backbone structure and amino-acid sequence. Conventional approaches for this task rely on expensive sampling procedures over hand-crafted energy functions and rotamer libraries. Recently, several deep learning methods have been developed to tackle the problem in a data-driven way, albeit with vastly different formulations (from image-to-image translation to directly predicting atomic coordinates). Here, we frame the problem as a joint regression over the side-chains' true degrees of freedom: the dihedral χ angles. We carefully study possible objective functions for this task, while accounting for the underlying symmetries of the task. We propose Holographic Packer (H-Packer), a novel two-stage algorithm for side-chain packing built on top of two light-weight rotationally equivariant neural networks. We evaluate our method on CASP13 and CASP14 targets. H-Packer is computationally efficient and shows favorable performance against conventional physics-based algorithms and is competitive against alternative deep learning solutions.

5.
Nat Rev Genet ; 24(12): 851-867, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37400577

ABSTRACT

Control interventions steer the evolution of molecules, viruses, microorganisms or other cells towards a desired outcome. Applications range from engineering biomolecules and synthetic organisms to drug, therapy and vaccine design against pathogens and cancer. In all these instances, a control system alters the eco-evolutionary trajectory of a target system, inducing new functions or suppressing escape evolution. Here, we synthesize the objectives, mechanisms and dynamics of eco-evolutionary control in different biological systems. We discuss how the control system learns and processes information about the target system by sensing or measuring, through adaptive evolution or computational prediction of future trajectories. This information flow distinguishes pre-emptive control strategies by humans from feedback control in biotic systems. We establish a cost-benefit calculus to gauge and optimize control protocols, highlighting the fundamental link between predictability of evolution and efficacy of pre-emptive control.


Subject(s)
Bioengineering , Biological Evolution , Humans
6.
Elife ; 112022 07 19.
Article in English | MEDLINE | ID: mdl-35852143

ABSTRACT

Infusion of broadly neutralizing antibodies (bNAbs) has shown promise as an alternative to anti-retroviral therapy against HIV. A key challenge is to suppress viral escape, which is more effectively achieved with a combination of bNAbs. Here, we propose a computational approach to predict the efficacy of a bNAb therapy based on the population genetics of HIV escape, which we parametrize using high-throughput HIV sequence data from bNAb-naive patients. By quantifying the mutational target size and the fitness cost of HIV-1 escape from bNAbs, we predict the distribution of rebound times in three clinical trials. We show that a cocktail of three bNAbs is necessary to effectively suppress viral escape, and predict the optimal composition of such bNAb cocktail. Our results offer a rational therapy design for HIV, and show how genetic data can be used to predict treatment outcomes and design new approaches to pathogenic control.


Subject(s)
HIV Infections , HIV-1 , Antibodies, Neutralizing , Broadly Neutralizing Antibodies , HIV Antibodies , HIV Infections/drug therapy , HIV-1/genetics , Humans
7.
Phys Rev E ; 105(5-2): 055309, 2022 May.
Article in English | MEDLINE | ID: mdl-35706287

ABSTRACT

Simulation-based inference enables learning the parameters of a model even when its likelihood cannot be computed in practice. One class of methods uses data simulated with different parameters to infer models of the likelihood-to-evidence ratio, or equivalently the posterior function. Here we frame the inference task as an estimation of an energy function parametrized with an artificial neural network. We present an intuitive approach, named MINIMALIST, in which the optimal model of the likelihood-to-evidence ratio is found by maximizing the likelihood of simulated data. Within this framework, the connection between the task of simulation-based inference and mutual information maximization is clear, and we show how several known methods of posterior estimation relate to alternative lower bounds to mutual information. These distinct objective functions aim at the same optimal energy form and therefore can be directly benchmarked. We compare their accuracy in the inference of model parameters, focusing on four dynamical systems that encompass common challenges in time series analysis: dynamics driven by multiplicative noise, nonlinear interactions, chaotic behavior, and high-dimensional parameter space.

8.
Science ; 376(6595): 796-797, 2022 05 20.
Article in English | MEDLINE | ID: mdl-35587975

ABSTRACT

A machine-learning approach reveals antigen encoding that predicts T cell responses.


Subject(s)
Adaptive Immunity , Antigens , Machine Learning , T-Lymphocytes , Animals , Antigen Presentation , Antigens/classification , Antigens/immunology , Humans , Mice , T-Lymphocytes/immunology
9.
Elife ; 102021 12 08.
Article in English | MEDLINE | ID: mdl-34878407

ABSTRACT

As one of the main influenza antigens, neuraminidase (NA) in H3N2 virus has evolved extensively for more than 50 years due to continuous immune pressure. While NA has recently emerged as an effective vaccine target, biophysical constraints on the antigenic evolution of NA remain largely elusive. Here, we apply combinatorial mutagenesis and next-generation sequencing to characterize the local fitness landscape in an antigenic region of NA in six different human H3N2 strains that were isolated around 10 years apart. The local fitness landscape correlates well among strains and the pairwise epistasis is highly conserved. Our analysis further demonstrates that local net charge governs the pairwise epistasis in this antigenic region. In addition, we show that residue coevolution in this antigenic region is correlated with the pairwise epistasis between charge states. Overall, this study demonstrates the importance of quantifying epistasis and the underlying biophysical constraint for building a model of influenza evolution.


Subject(s)
Antigens, Viral/immunology , Evolution, Molecular , Influenza A Virus, H3N2 Subtype/immunology , Neuraminidase/genetics , Viral Proteins/genetics , Humans , Influenza, Human/immunology , Neuraminidase/immunology , Viral Proteins/immunology
10.
Cell Rep ; 35(8): 109173, 2021 05 25.
Article in English | MEDLINE | ID: mdl-33991510

ABSTRACT

Individuals with the 2019 coronavirus disease (COVID-19) show varying severity of the disease, ranging from asymptomatic to requiring intensive care. Although monoclonal antibodies specific to the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) have been identified, we still lack an understanding of the overall landscape of B cell receptor (BCR) repertoires in individuals with COVID-19. We use high-throughput sequencing of bulk and plasma B cells collected at multiple time points during infection to characterize signatures of the B cell response to SARS-CoV-2 in 19 individuals. Using principled statistical approaches, we associate differential features of BCRs with different disease severity. We identify 38 significantly expanded clonal lineages shared among individuals as candidates for responses specific to SARS-CoV-2. Using single-cell sequencing, we verify the reactivity of BCRs shared among individuals to SARS-CoV-2 epitopes. Moreover, we identify the natural emergence of a BCR with cross-reactivity to SARS-CoV-1 and SARS-CoV-2 in some individuals. Our results provide insights important for development of rational therapies and vaccines against COVID-19.


Subject(s)
B-Lymphocytes/immunology , COVID-19/immunology , Cross Reactions , Receptors, Antigen, B-Cell/genetics , Receptors, Antigen, B-Cell/immunology , SARS-CoV-2/immunology , Animals , Antibodies, Viral/immunology , COVID-19/genetics , Epitopes , High-Throughput Nucleotide Sequencing , Humans , Severity of Illness Index , Sf9 Cells , Single-Cell Analysis , Spike Glycoprotein, Coronavirus/immunology
11.
Proc Natl Acad Sci U S A ; 118(14)2021 04 06.
Article in English | MEDLINE | ID: mdl-33795515

ABSTRACT

Subclasses of lymphocytes carry different functional roles to work together and produce an immune response and lasting immunity. Additionally to these functional roles, T and B cell lymphocytes rely on the diversity of their receptor chains to recognize different pathogens. The lymphocyte subclasses emerge from common ancestors generated with the same diversity of receptors during selection processes. Here, we leverage biophysical models of receptor generation with machine learning models of selection to identify specific sequence features characteristic of functional lymphocyte repertoires and subrepertoires. Specifically, using only repertoire-level sequence information, we classify CD4+ and CD8+ T cells, find correlations between receptor chains arising during selection, and identify T cell subsets that are targets of pathogenic epitopes. We also show examples of when simple linear classifiers do as well as more complex machine learning methods.


Subject(s)
B-Lymphocytes/immunology , Machine Learning , Receptors, Immunologic/chemistry , T-Lymphocytes/immunology , Epitopes/chemistry , Epitopes/immunology , Humans , Receptors, Immunologic/classification , Receptors, Immunologic/immunology
12.
Elife ; 102021 04 28.
Article in English | MEDLINE | ID: mdl-33908347

ABSTRACT

The adaptive immune system provides a diverse set of molecules that can mount specific responses against a multitude of pathogens. Memory is a key feature of adaptive immunity, which allows organisms to respond more readily upon re-infections. However, differentiation of memory cells is still one of the least understood cell fate decisions. Here, we introduce a mathematical framework to characterize optimal strategies to store memory to maximize the utility of immune response over an organism's lifetime. We show that memory production should be actively regulated to balance between affinity and cross-reactivity of immune receptors for an effective protection against evolving pathogens. Moreover, we predict that specificity of memory should depend on the organism's lifespan, and shorter lived organisms with fewer pathogenic encounters should store more cross-reactive memory. Our framework provides a baseline to gauge the efficacy of immune memory in light of an organism's coevolutionary history with pathogens.


Subject(s)
Biological Evolution , Immunologic Memory , Adaptive Immunity , Animals , Humans , Models, Theoretical
13.
ArXiv ; 2021 Apr 06.
Article in English | MEDLINE | ID: mdl-32699813

ABSTRACT

COVID-19 patients show varying severity of the disease ranging from asymptomatic to requiring intensive care. Although a number of SARS-CoV-2 specific monoclonal antibodies have been identified, we still lack an understanding of the overall landscape of B-cell receptor (BCR) repertoires in COVID-19 patients. Here, we used high-throughput sequencing of bulk and plasma B-cells collected over multiple time points during infection to characterize signatures of B-cell response to SARS-CoV-2 in 19 patients. Using principled statistical approaches, we determined differential features of BCRs associated with different disease severity. We identified 38 significantly expanded clonal lineages shared among patients as candidates for specific responses to SARS-CoV-2. Using single-cell sequencing, we verified reactivity of BCRs shared among individuals to SARS-CoV-2 epitopes. Moreover, we identified natural emergence of a BCR with cross-reactivity to SARS-CoV-1 and SARS-CoV-2 in a number of patients. Our results provide important insights for development of rational therapies and vaccines against COVID-19.

14.
medRxiv ; 2021 Apr 05.
Article in English | MEDLINE | ID: mdl-32699862

ABSTRACT

COVID-19 patients show varying severity of the disease ranging from asymptomatic to requiring intensive care. Although a number of SARS-CoV-2 specific monoclonal antibodies have been identified, we still lack an understanding of the overall landscape of B-cell receptor (BCR) repertoires in COVID-19 patients. Here, we used high-throughput sequencing of bulk and plasma B-cells collected over multiple time points during infection to characterize signatures of B-cell response to SARS-CoV-2 in 19 patients. Using principled statistical approaches, we determined differential features of BCRs associated with different disease severity. We identified 38 significantly expanded clonal lineages shared among patients as candidates for specific responses to SARS-CoV-2. Using single-cell sequencing, we verified reactivity of BCRs shared among individuals to SARS-CoV-2 epitopes. Moreover, we identified natural emergence of a BCR with cross-reactivity to SARS-CoV-1 and SARS-CoV-2 in a number of patients. Our results provide important insights for development of rational therapies and vaccines against COVID-19.

15.
Entropy (Basel) ; 22(9)2020 Aug 31.
Article in English | MEDLINE | ID: mdl-33286736

ABSTRACT

Evolutionary algorithms, inspired by natural evolution, aim to optimize difficult objective functions without computing derivatives. Here we detail the relationship between classical population genetics of quantitative traits and evolutionary optimization, and formulate a new evolutionary algorithm. Optimization of a continuous objective function is analogous to searching for high fitness phenotypes on a fitness landscape. We describe how natural selection moves a population along the non-Euclidean gradient that is induced by the population on the fitness landscape (the natural gradient). We show how selection is related to Newton's method in optimization under quadratic fitness landscapes, and how selection increases fitness at the cost of reducing diversity. We describe the generation of new phenotypes and introduce an operator that recombines the whole population to generate variants. Finally, we introduce a proof-of-principle algorithm that combines natural selection, our recombination operator, and an adaptive method to increase selection and find the optimum. The algorithm is extremely simple in implementation; it has no matrix inversion or factorization, does not require storing a covariance matrix, and may form the basis of more general model-based optimization algorithms with natural gradient updates.

16.
Bioinformatics ; 36(16): 4510-4512, 2020 08 15.
Article in English | MEDLINE | ID: mdl-32814974

ABSTRACT

SUMMARY: Recent advances in modelling VDJ recombination and subsequent selection of T- and B-cell receptors provide useful tools to analyse and compare immune repertoires across time, individuals and tissues. A suite of tools-IGoR, OLGA and SONIA-have been publicly released to the community that allow for the inference of generative and selection models from high-throughput sequencing data. However, using these tools requires some scripting or command-line skills and familiarity with complex datasets. As a result, the application of the above models has not been available to a broad audience. In this application note, we fill this gap by presenting Simple OLGA & SONIA (SOS), a web-based interface where users with no coding skills can compute the generation and post-selection probabilities of their sequences, as well as generate batches of synthetic sequences. The application also functions on mobile phones. AVAILABILITY AND IMPLEMENTATION: SOS is freely available to use at sites.google.com/view/statbiophysens/sos with source code at github.com/statbiophys/sos.


Subject(s)
High-Throughput Nucleotide Sequencing , Software , Family Characteristics , Humans , Probability , Receptors, Antigen, B-Cell/genetics
17.
Phys Rev E ; 101(6-1): 062414, 2020 Jun.
Article in English | MEDLINE | ID: mdl-32688532

ABSTRACT

T-cell receptors (TCR) are key proteins of the adaptive immune system, generated randomly in each individual, whose diversity underlies our ability to recognize infections and malignancies. Modeling the distribution of TCR sequences is of key importance for immunology and medical applications. Here, we compare two inference methods trained on high-throughput sequencing data: a knowledge-guided approach, which accounts for the details of sequence generation, supplemented by a physics-inspired model of selection; and a knowledge-free variational autoencoder based on deep artificial neural networks. We show that the knowledge-guided model outperforms the deep network approach at predicting TCR probabilities, while being more interpretable, at a lower computational cost.


Subject(s)
Models, Biological , Receptors, Antigen, T-Cell/chemistry , Receptors, Antigen, T-Cell/metabolism , Amino Acid Sequence , Deep Learning , Ligands
18.
Nat Commun ; 11(1): 1233, 2020 03 06.
Article in English | MEDLINE | ID: mdl-32144244

ABSTRACT

Antigenic drift of influenza virus hemagglutinin (HA) is enabled by facile evolvability. However, HA antigenic site B, which has become immunodominant in recent human H3N2 influenza viruses, is also evolutionarily constrained by its involvement in receptor binding. Here, we employ deep mutational scanning to probe the local fitness landscape of HA antigenic site B in six different human H3N2 strains spanning from 1968 to 2016. We observe that the fitness landscape of HA antigenic site B can be very different between strains. Sequence variants that exhibit high fitness in one strain can be deleterious in another, indicating that the evolutionary constraints of antigenic site B have changed over time. Structural analysis suggests that the local fitness landscape of antigenic site B can be reshaped by natural mutations via modulation of the receptor-binding mode. Overall, these findings elucidate how influenza virus continues to explore new antigenic space despite strong functional constraints.


Subject(s)
Antigens, Viral/genetics , Evolution, Molecular , Hemagglutinin Glycoproteins, Influenza Virus/genetics , Influenza A Virus, H3N2 Subtype/genetics , Receptors, Cell Surface/metabolism , Animals , Antigens, Viral/immunology , Antigens, Viral/metabolism , Binding Sites/genetics , Crystallography, X-Ray , DNA Mutational Analysis , Dogs , HEK293 Cells , Hemagglutinin Glycoproteins, Influenza Virus/immunology , Hemagglutinin Glycoproteins, Influenza Virus/metabolism , Humans , Influenza A Virus, H3N2 Subtype/immunology , Influenza A Virus, H3N2 Subtype/metabolism , Madin Darby Canine Kidney Cells , Mutation , Protein Domains/genetics , Protein Domains/immunology , RNA, Viral/genetics , RNA, Viral/isolation & purification , Reverse Transcriptase Polymerase Chain Reaction , Sequence Analysis, DNA
19.
Proc Natl Acad Sci U S A ; 117(10): 5144-5151, 2020 03 10.
Article in English | MEDLINE | ID: mdl-32071241

ABSTRACT

Some bacteria and archaea possess an immune system, based on the CRISPR-Cas mechanism, that confers adaptive immunity against viruses. In such species, individual prokaryotes maintain cassettes of viral DNA elements called spacers as a memory of past infections. Typically, the cassettes contain several dozen expressed spacers. Given that bacteria can have very large genomes and since having more spacers should confer a better memory, it is puzzling that so little genetic space would be devoted by prokaryotes to their adaptive immune systems. Here, assuming that CRISPR functions as a long-term memory-based defense against a diverse landscape of viral species, we identify a fundamental tradeoff between the amount of immune memory and effectiveness of response to a given threat. This tradeoff implies an optimal size for the prokaryotic immune repertoire in the observational range.


Subject(s)
Adaptive Immunity , Bacteria/genetics , Bacteria/virology , Bacteriophages , CRISPR-Cas Systems/physiology
20.
Mol Biol Evol ; 36(10): 2184-2194, 2019 10 01.
Article in English | MEDLINE | ID: mdl-31209469

ABSTRACT

During chronic infection, HIV-1 engages in a rapid coevolutionary arms race with the host's adaptive immune system. While it is clear that HIV exerts strong selection on the adaptive immune system, the characteristics of the somatic evolution that shape the immune response are still unknown. Traditional population genetics methods fail to distinguish chronic immune response from healthy repertoire evolution. Here, we infer the evolutionary modes of B-cell repertoires and identify complex dynamics with a constant production of better B-cell receptor (BCR) mutants that compete, maintaining large clonal diversity and potentially slowing down adaptation. A substantial fraction of mutations that rise to high frequencies in pathogen-engaging CDRs of BCRs are beneficial, in contrast to many such changes in structurally relevant frameworks that are deleterious and circulate by hitchhiking. We identify a pattern where BCRs in patients who experience larger viral expansions undergo stronger selection with a rapid turnover of beneficial mutations due to clonal interference in their CDR3 regions. Using population genetics modeling, we show that the extinction of these beneficial mutations can be attributed to the rise of competing beneficial alleles and clonal interference. The picture is of a dynamic repertoire, where better clones may be outcompeted by new mutants before they fix.


Subject(s)
Adaptive Immunity , HIV Infections/immunology , HIV-1/immunology , Receptors, Antigen, B-Cell/genetics , Selection, Genetic , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...