Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
1.
Proc Natl Acad Sci U S A ; 121(6): e2300838121, 2024 Feb 06.
Artigo em Inglês | MEDLINE | ID: mdl-38300863

RESUMO

Proteins play a central role in biology from immune recognition to brain activity. While major advances in machine learning have improved our ability to predict protein structure from sequence, determining protein function from its sequence or structure remains a major challenge. Here, we introduce holographic convolutional neural network (H-CNN) for proteins, which is a physically motivated machine learning approach to model amino acid preferences in protein structures. H-CNN reflects physical interactions in a protein structure and recapitulates the functional information stored in evolutionary data. H-CNN accurately predicts the impact of mutations on protein stability and binding of protein complexes. Our interpretable computational model for protein structure-function maps could guide design of novel proteins with desired function.


Assuntos
Algoritmos , Redes Neurais de Computação , Proteínas/genética , Aprendizado de Máquina , Aminoácidos
2.
Proc Natl Acad Sci U S A ; 115(32): E7550-E7558, 2018 08 07.
Artigo em Inglês | MEDLINE | ID: mdl-30037990

RESUMO

Genotype-phenotype relationships are notoriously complicated. Idiosyncratic interactions between specific combinations of mutations occur and are difficult to predict. Yet it is increasingly clear that many interactions can be understood in terms of global epistasis. That is, mutations may act additively on some underlying, unobserved trait, and this trait is then transformed via a nonlinear function to the observed phenotype as a result of subsequent biophysical and cellular processes. Here we infer the shape of such global epistasis in three proteins, based on published high-throughput mutagenesis data. To do so, we develop a maximum-likelihood inference procedure using a flexible family of monotonic nonlinear functions spanned by an I-spline basis. Our analysis uncovers dramatic nonlinearities in all three proteins; in some proteins a model with global epistasis accounts for virtually all of the measured variation, whereas in others we find substantial local epistasis as well. This method allows us to test hypotheses about the form of global epistasis and to distinguish variance components attributable to global epistasis, local epistasis, and measurement error.


Assuntos
Epistasia Genética , Evolução Molecular , Aptidão Genética , Modelos Genéticos , Genótipo , Modelos Estatísticos , Mutação , Dinâmica não Linear , Fenótipo
3.
Mol Biol Evol ; 36(10): 2184-2194, 2019 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-31209469

RESUMO

During chronic infection, HIV-1 engages in a rapid coevolutionary arms race with the host's adaptive immune system. While it is clear that HIV exerts strong selection on the adaptive immune system, the characteristics of the somatic evolution that shape the immune response are still unknown. Traditional population genetics methods fail to distinguish chronic immune response from healthy repertoire evolution. Here, we infer the evolutionary modes of B-cell repertoires and identify complex dynamics with a constant production of better B-cell receptor (BCR) mutants that compete, maintaining large clonal diversity and potentially slowing down adaptation. A substantial fraction of mutations that rise to high frequencies in pathogen-engaging CDRs of BCRs are beneficial, in contrast to many such changes in structurally relevant frameworks that are deleterious and circulate by hitchhiking. We identify a pattern where BCRs in patients who experience larger viral expansions undergo stronger selection with a rapid turnover of beneficial mutations due to clonal interference in their CDR3 regions. Using population genetics modeling, we show that the extinction of these beneficial mutations can be attributed to the rise of competing beneficial alleles and clonal interference. The picture is of a dynamic repertoire, where better clones may be outcompeted by new mutants before they fix.


Assuntos
Imunidade Adaptativa , Infecções por HIV/imunologia , HIV-1/imunologia , Receptores de Antígenos de Linfócitos B/genética , Seleção Genética , Humanos
4.
Entropy (Basel) ; 22(9)2020 Aug 31.
Artigo em Inglês | MEDLINE | ID: mdl-33286736

RESUMO

Evolutionary algorithms, inspired by natural evolution, aim to optimize difficult objective functions without computing derivatives. Here we detail the relationship between classical population genetics of quantitative traits and evolutionary optimization, and formulate a new evolutionary algorithm. Optimization of a continuous objective function is analogous to searching for high fitness phenotypes on a fitness landscape. We describe how natural selection moves a population along the non-Euclidean gradient that is induced by the population on the fitness landscape (the natural gradient). We show how selection is related to Newton's method in optimization under quadratic fitness landscapes, and how selection increases fitness at the cost of reducing diversity. We describe the generation of new phenotypes and introduce an operator that recombines the whole population to generate variants. Finally, we introduce a proof-of-principle algorithm that combines natural selection, our recombination operator, and an adaptive method to increase selection and find the optimum. The algorithm is extremely simple in implementation; it has no matrix inversion or factorization, does not require storing a covariance matrix, and may form the basis of more general model-based optimization algorithms with natural gradient updates.

5.
Mol Biol Evol ; 35(10): 2345-2354, 2018 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-30085303

RESUMO

Understanding the relationship between protein sequence, function, and stability is a fundamental problem in biology. The essential function of many proteins that fold into a specific structure is their ability to bind to a ligand, which can be assayed for thousands of mutated variants. However, binding assays do not distinguish whether mutations affect the stability of the binding interface or the overall fold. Here, we introduce a statistical method to infer a detailed energy landscape of how a protein folds and binds to a ligand by combining information from many mutated variants. We fit a thermodynamic model describing the bound, unbound, and unfolded states to high quality data of protein G domain B1 binding to IgG-Fc. We infer distinct folding and binding energies for each mutation providing a detailed view of how mutations affect binding and stability across the protein. We accurately infer the folding energy of each variant in physical units, validated by independent data, whereas previous high-throughput methods could only measure indirect changes in stability. While we assume an additive sequence-energy relationship, the binding fraction is epistatic due its nonlinear relation to energy. Despite having no epistasis in energy, our model explains much of the observed epistasis in binding fraction, with the remaining epistasis identifying conformationally dynamic regions.


Assuntos
Estabilidade Proteica , Proteínas/genética , Proteínas/fisiologia , Sequência de Aminoácidos , Animais , Simulação por Computador/estatística & dados numéricos , Epistasia Genética/fisiologia , Evolução Molecular , Humanos , Ligantes , Mutação , Conformação Proteica , Dobramento de Proteína , Relação Estrutura-Atividade , Termodinâmica
6.
PLoS Genet ; 12(7): e1006171, 2016 07.
Artigo em Inglês | MEDLINE | ID: mdl-27442127

RESUMO

The vertebrate adaptive immune system provides a flexible and diverse set of molecules to neutralize pathogens. Yet, viruses such as HIV can cause chronic infections by evolving as quickly as the adaptive immune system, forming an evolutionary arms race. Here we introduce a mathematical framework to study the coevolutionary dynamics between antibodies and antigens within a host. We focus on changes in the binding interactions between the antibody and antigen populations, which result from the underlying stochastic evolution of genotype frequencies driven by mutation, selection, and drift. We identify the critical viral and immune parameters that determine the distribution of antibody-antigen binding affinities. We also identify definitive signatures of coevolution that measure the reciprocal response between antibodies and viruses, and we introduce experimentally measurable quantities that quantify the extent of adaptation during continual coevolution of the two opposing populations. Using this analytical framework, we infer rates of viral and immune adaptation based on time-shifted neutralization assays in two HIV-infected patients. Finally, we analyze competition between clonal lineages of antibodies and characterize the fate of a given lineage in terms of the state of the antibody and viral populations. In particular, we derive the conditions that favor the emergence of broadly neutralizing antibodies, which may have relevance to vaccine design against HIV.


Assuntos
Imunidade Adaptativa/genética , Anticorpos Neutralizantes/genética , Infecções por HIV/imunologia , Antígenos/imunologia , Evolução Molecular , Infecções por HIV/virologia , HIV-1/imunologia , Interações Hospedeiro-Patógeno , Humanos , Modelos Genéticos
7.
Proc Natl Acad Sci U S A ; 111(22): E2301-9, 2014 Jun 03.
Artigo em Inglês | MEDLINE | ID: mdl-24843135

RESUMO

The genotype-fitness map plays a fundamental role in shaping the dynamics of evolution. However, it is difficult to directly measure a fitness landscape in practice, because the number of possible genotypes is astronomical. One approach is to sample as many genotypes as possible, measure their fitnesses, and fit a statistical model of the landscape that includes additive and pairwise interactive effects between loci. Here, we elucidate the pitfalls of using such regressions by studying artificial but mathematically convenient fitness landscapes. We identify two sources of bias inherent in these regression procedures, each of which tends to underestimate high fitnesses and overestimate low fitnesses. We characterize these biases for random sampling of genotypes as well as samples drawn from a population under selection in the Wright-Fisher model of evolutionary dynamics. We show that common measures of epistasis, such as the number of monotonically increasing paths between ancestral and derived genotypes, the prevalence of sign epistasis, and the number of local fitness maxima, are distorted in the inferred landscape. As a result, the inferred landscape will provide systematically biased predictions for the dynamics of adaptation. We identify the same biases in a computational RNA-folding landscape as well as regulatory sequence binding data treated with the same fitting procedure. Finally, we present a method to ameliorate these biases in some cases.


Assuntos
Epistasia Genética/genética , Evolução Molecular , Aptidão Genética/genética , Genótipo , Modelos Genéticos , Resistência a Medicamentos/genética , Dobramento de RNA/genética , Análise de Regressão , Seleção Genética/genética
8.
Phys Biol ; 11(5): 056003, 2014 Aug 26.
Artigo em Inglês | MEDLINE | ID: mdl-25156977

RESUMO

Competition between independently arising beneficial mutations is enhanced in spatial populations due to the linear rather than exponential growth of clones. Recent theoretical studies have pointed out that the resulting fitness dynamics is analogous to a surface growth process, where new layers nucleate and spread stochastically, leading to the build up of scale-invariant roughness. This scenario differs qualitatively from the standard view of adaptation in that the speed of adaptation becomes independent of population size while the fitness variance does not. Here we exploit recent progress in the understanding of surface growth processes to obtain precise predictions for the universal, non-Gaussian shape of the fitness distribution for one-dimensional habitats, which are verified by simulations. When the mutations are deleterious rather than beneficial the problem becomes a spatial version of Muller's ratchet. In contrast to the case of well-mixed populations, the rate of fitness decline remains finite even in the limit of an infinite habitat, provided the ratio [Formula: see text] between the deleterious mutation rate and the square of the (negative) selection coefficient is sufficiently large. Using, again, an analogy to surface growth models we show that the transition between the stationary and the moving state of the ratchet is governed by directed percolation.


Assuntos
Adaptação Biológica , Evolução Biológica , Ecossistema , Modelos Genéticos , Mutação , Aptidão Genética , Variação Genética
9.
Elife ; 112022 07 19.
Artigo em Inglês | MEDLINE | ID: mdl-35852143

RESUMO

Infusion of broadly neutralizing antibodies (bNAbs) has shown promise as an alternative to anti-retroviral therapy against HIV. A key challenge is to suppress viral escape, which is more effectively achieved with a combination of bNAbs. Here, we propose a computational approach to predict the efficacy of a bNAb therapy based on the population genetics of HIV escape, which we parametrize using high-throughput HIV sequence data from bNAb-naive patients. By quantifying the mutational target size and the fitness cost of HIV-1 escape from bNAbs, we predict the distribution of rebound times in three clinical trials. We show that a cocktail of three bNAbs is necessary to effectively suppress viral escape, and predict the optimal composition of such bNAb cocktail. Our results offer a rational therapy design for HIV, and show how genetic data can be used to predict treatment outcomes and design new approaches to pathogenic control.


Assuntos
Infecções por HIV , HIV-1 , Anticorpos Neutralizantes , Anticorpos Amplamente Neutralizantes , Anticorpos Anti-HIV , Infecções por HIV/tratamento farmacológico , HIV-1/genética , Humanos
10.
Cell Rep ; 35(8): 109173, 2021 05 25.
Artigo em Inglês | MEDLINE | ID: mdl-33991510

RESUMO

Individuals with the 2019 coronavirus disease (COVID-19) show varying severity of the disease, ranging from asymptomatic to requiring intensive care. Although monoclonal antibodies specific to the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) have been identified, we still lack an understanding of the overall landscape of B cell receptor (BCR) repertoires in individuals with COVID-19. We use high-throughput sequencing of bulk and plasma B cells collected at multiple time points during infection to characterize signatures of the B cell response to SARS-CoV-2 in 19 individuals. Using principled statistical approaches, we associate differential features of BCRs with different disease severity. We identify 38 significantly expanded clonal lineages shared among individuals as candidates for responses specific to SARS-CoV-2. Using single-cell sequencing, we verify the reactivity of BCRs shared among individuals to SARS-CoV-2 epitopes. Moreover, we identify the natural emergence of a BCR with cross-reactivity to SARS-CoV-1 and SARS-CoV-2 in some individuals. Our results provide insights important for development of rational therapies and vaccines against COVID-19.


Assuntos
Linfócitos B/imunologia , COVID-19/imunologia , Reações Cruzadas , Receptores de Antígenos de Linfócitos B/genética , Receptores de Antígenos de Linfócitos B/imunologia , SARS-CoV-2/imunologia , Animais , Anticorpos Antivirais/imunologia , COVID-19/genética , Epitopos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Índice de Gravidade de Doença , Células Sf9 , Análise de Célula Única , Glicoproteína da Espícula de Coronavírus/imunologia
11.
ArXiv ; 2021 Apr 06.
Artigo em Inglês | MEDLINE | ID: mdl-32699813

RESUMO

COVID-19 patients show varying severity of the disease ranging from asymptomatic to requiring intensive care. Although a number of SARS-CoV-2 specific monoclonal antibodies have been identified, we still lack an understanding of the overall landscape of B-cell receptor (BCR) repertoires in COVID-19 patients. Here, we used high-throughput sequencing of bulk and plasma B-cells collected over multiple time points during infection to characterize signatures of B-cell response to SARS-CoV-2 in 19 patients. Using principled statistical approaches, we determined differential features of BCRs associated with different disease severity. We identified 38 significantly expanded clonal lineages shared among patients as candidates for specific responses to SARS-CoV-2. Using single-cell sequencing, we verified reactivity of BCRs shared among individuals to SARS-CoV-2 epitopes. Moreover, we identified natural emergence of a BCR with cross-reactivity to SARS-CoV-1 and SARS-CoV-2 in a number of patients. Our results provide important insights for development of rational therapies and vaccines against COVID-19.

12.
medRxiv ; 2021 Apr 05.
Artigo em Inglês | MEDLINE | ID: mdl-32699862

RESUMO

COVID-19 patients show varying severity of the disease ranging from asymptomatic to requiring intensive care. Although a number of SARS-CoV-2 specific monoclonal antibodies have been identified, we still lack an understanding of the overall landscape of B-cell receptor (BCR) repertoires in COVID-19 patients. Here, we used high-throughput sequencing of bulk and plasma B-cells collected over multiple time points during infection to characterize signatures of B-cell response to SARS-CoV-2 in 19 patients. Using principled statistical approaches, we determined differential features of BCRs associated with different disease severity. We identified 38 significantly expanded clonal lineages shared among patients as candidates for specific responses to SARS-CoV-2. Using single-cell sequencing, we verified reactivity of BCRs shared among individuals to SARS-CoV-2 epitopes. Moreover, we identified natural emergence of a BCR with cross-reactivity to SARS-CoV-1 and SARS-CoV-2 in a number of patients. Our results provide important insights for development of rational therapies and vaccines against COVID-19.

13.
Nat Commun ; 11(1): 1233, 2020 03 06.
Artigo em Inglês | MEDLINE | ID: mdl-32144244

RESUMO

Antigenic drift of influenza virus hemagglutinin (HA) is enabled by facile evolvability. However, HA antigenic site B, which has become immunodominant in recent human H3N2 influenza viruses, is also evolutionarily constrained by its involvement in receptor binding. Here, we employ deep mutational scanning to probe the local fitness landscape of HA antigenic site B in six different human H3N2 strains spanning from 1968 to 2016. We observe that the fitness landscape of HA antigenic site B can be very different between strains. Sequence variants that exhibit high fitness in one strain can be deleterious in another, indicating that the evolutionary constraints of antigenic site B have changed over time. Structural analysis suggests that the local fitness landscape of antigenic site B can be reshaped by natural mutations via modulation of the receptor-binding mode. Overall, these findings elucidate how influenza virus continues to explore new antigenic space despite strong functional constraints.


Assuntos
Antígenos Virais/genética , Evolução Molecular , Glicoproteínas de Hemaglutininação de Vírus da Influenza/genética , Vírus da Influenza A Subtipo H3N2/genética , Receptores de Superfície Celular/metabolismo , Animais , Antígenos Virais/imunologia , Antígenos Virais/metabolismo , Sítios de Ligação/genética , Cristalografia por Raios X , Análise Mutacional de DNA , Cães , Células HEK293 , Glicoproteínas de Hemaglutininação de Vírus da Influenza/imunologia , Glicoproteínas de Hemaglutininação de Vírus da Influenza/metabolismo , Humanos , Vírus da Influenza A Subtipo H3N2/imunologia , Vírus da Influenza A Subtipo H3N2/metabolismo , Células Madin Darby de Rim Canino , Mutação , Domínios Proteicos/genética , Domínios Proteicos/imunologia , RNA Viral/genética , RNA Viral/isolamento & purificação , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Análise de Sequência de DNA
14.
Phys Rev Lett ; 103(15): 156101, 2009 Oct 09.
Artigo em Inglês | MEDLINE | ID: mdl-19905653

RESUMO

We present a refractive-index-matched colloidal system that allows direct observation of critical Casimir induced aggregation with a confocal microscope. We show that in this system, in which van der Waals forces are negligible, a simple competition between repulsive screened Coulomb and attractive critical Casimir forces can account quantitatively for the reversible aggregation. Above the temperature T(a), the critical Casimir force drives aggregation of the particles into fractal clusters, while below T(a), the electrostatic repulsion between the particles breaks up the clusters, and the particles resuspend by thermal diffusion. The aggregation is observed in a remarkably wide temperature range of as much as 15 degrees. We derive a simple expression for the particle pair potential that accounts quantitatively for the temperature-dependent aggregation and aggregate breakup.

15.
Evolution ; 69(9): 2359-70, 2015 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-26194030

RESUMO

The role that epistasis plays during adaptation remains an outstanding problem, which has received considerable attention in recent years. Most of the recent empirical studies are based on ensembles of replicate populations that adapt in a fixed, laboratory controlled condition. Researchers often seek to infer the presence and form of epistasis in the fitness landscape from the time evolution of various statistics averaged across the ensemble of populations. Here, we provide a rigorous analysis of what quantities, drawn from time series of such ensembles, can be used to infer epistasis for populations evolving under weak mutation on finite-site fitness landscapes. First, we analyze the mean fitness trajectory-that is, the time course of the ensemble average fitness. We show that for any epistatic fitness landscape and starting genotype, there always exists a non-epistatic fitness landscape that produces the exact same mean fitness trajectory. Thus, the presence of epistasis is not identifiable from the mean fitness trajectory. By contrast, we show that two other ensemble statistics-the time evolution of the fitness variance across populations, and the time evolution of the mean number of substitutions-can detect certain forms of epistasis in the underlying fitness landscape.


Assuntos
Adaptação Biológica/genética , Epistasia Genética , Aptidão Genética , Modelos Genéticos , Mutação , Genética Populacional
16.
PLoS One ; 8(5): e61570, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23650500

RESUMO

Genotype-to-phenotype maps and the related fitness landscapes that include epistatic interactions are difficult to measure because of their high dimensional structure. Here we construct such a map using the recently collected corpora of high-throughput sequence data from the 75 base pairs long mutagenized E. coli lac promoter region, where each sequence is associated with its phenotype, the induced transcriptional activity measured by a fluorescent reporter. We find that the additive (non-epistatic) contributions of individual mutations account for about two-thirds of the explainable phenotype variance, while pairwise epistasis explains about 7% of the variance for the full mutagenized sequence and about 15% for the subsequence associated with protein binding sites. Surprisingly, there is no evidence for third order epistatic contributions, and our inferred fitness landscape is essentially single peaked, with a small amount of antagonistic epistasis. There is a significant selective pressure on the wild type, which we deduce to be multi-objective optimal for gene expression in environments with different nutrient sources. We identify transcription factor (CRP) and RNA polymerase binding sites in the promotor region and their interactions without difficult optimization steps. In particular, we observe evidence for previously unexplored genetic regulatory mechanisms, possibly kinetic in nature. We conclude with a cautionary note that inferred properties of fitness landscapes may be severely influenced by biases in the sequence data.


Assuntos
Escherichia coli/genética , Óperon Lac , Modelos Genéticos , Regiões Promotoras Genéticas , Algoritmos , Epistasia Genética , Regulação Bacteriana da Expressão Gênica , Estudos de Associação Genética , Aptidão Genética , Genótipo
17.
Phys Rev E Stat Nonlin Soft Matter Phys ; 84(1 Pt 1): 011925, 2011 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-21867231

RESUMO

When beneficial mutations are relatively common, competition between multiple unfixed mutations can reduce the rate of fixation in well-mixed asexual populations. We introduce a one-dimensional model with a steady accumulation of beneficial mutations. We find a transition between periodic selection and multiple-mutation regimes. In the multiple-mutation regime, the increase of fitness along the lattice bears a striking similarity to surface growth phenomena, with power-law growth and saturation of the interface width. We also find significant differences compared to the well-mixed model. In our lattice model, the transition between regimes happens at a much lower mutation rate due to slower fixation times in one dimension. Also, the rate of fixation is reduced with increasing mutation rate due to the more intense competition, and it saturates with large population size.


Assuntos
Análise Mutacional de DNA/métodos , Mutação , Algoritmos , Animais , Evolução Molecular , Genética , Genética Populacional , Humanos , Modelos Genéticos , Modelos Estatísticos , Fatores de Tempo
18.
J Stat Phys ; 144(2): 367-378, 2011 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-22822270

RESUMO

We consider a fixed size population that undergoes an evolutionary adaptation in the weak mutation rate limit, which we model as a biased Langevin process in the genotype space. We show analytically and numerically that, if the fitness landscape has a small highly epistatic (rough) and time-varying component, then the population genotype exhibits a high effective diffusion in the genotype space and is able to escape local fitness minima with a large probability. We argue that our principal finding that even very small time-dependent fluctuations of fitness can substantially speed up evolution is valid for a wide class of models.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA