Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Proc Natl Acad Sci U S A ; 121(6): e2300838121, 2024 Feb 06.
Artículo en Inglés | MEDLINE | ID: mdl-38300863

RESUMEN

Proteins play a central role in biology from immune recognition to brain activity. While major advances in machine learning have improved our ability to predict protein structure from sequence, determining protein function from its sequence or structure remains a major challenge. Here, we introduce holographic convolutional neural network (H-CNN) for proteins, which is a physically motivated machine learning approach to model amino acid preferences in protein structures. H-CNN reflects physical interactions in a protein structure and recapitulates the functional information stored in evolutionary data. H-CNN accurately predicts the impact of mutations on protein stability and binding of protein complexes. Our interpretable computational model for protein structure-function maps could guide design of novel proteins with desired function.


Asunto(s)
Algoritmos , Redes Neurales de la Computación , Proteínas/genética , Aprendizaje Automático , Aminoácidos
2.
Elife ; 112022 07 19.
Artículo en Inglés | MEDLINE | ID: mdl-35852143

RESUMEN

Infusion of broadly neutralizing antibodies (bNAbs) has shown promise as an alternative to anti-retroviral therapy against HIV. A key challenge is to suppress viral escape, which is more effectively achieved with a combination of bNAbs. Here, we propose a computational approach to predict the efficacy of a bNAb therapy based on the population genetics of HIV escape, which we parametrize using high-throughput HIV sequence data from bNAb-naive patients. By quantifying the mutational target size and the fitness cost of HIV-1 escape from bNAbs, we predict the distribution of rebound times in three clinical trials. We show that a cocktail of three bNAbs is necessary to effectively suppress viral escape, and predict the optimal composition of such bNAb cocktail. Our results offer a rational therapy design for HIV, and show how genetic data can be used to predict treatment outcomes and design new approaches to pathogenic control.


Asunto(s)
Infecciones por VIH , VIH-1 , Anticuerpos Neutralizantes , Anticuerpos ampliamente neutralizantes , Anticuerpos Anti-VIH , Infecciones por VIH/tratamiento farmacológico , VIH-1/genética , Humanos
3.
Cell Rep ; 35(8): 109173, 2021 05 25.
Artículo en Inglés | MEDLINE | ID: mdl-33991510

RESUMEN

Individuals with the 2019 coronavirus disease (COVID-19) show varying severity of the disease, ranging from asymptomatic to requiring intensive care. Although monoclonal antibodies specific to the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) have been identified, we still lack an understanding of the overall landscape of B cell receptor (BCR) repertoires in individuals with COVID-19. We use high-throughput sequencing of bulk and plasma B cells collected at multiple time points during infection to characterize signatures of the B cell response to SARS-CoV-2 in 19 individuals. Using principled statistical approaches, we associate differential features of BCRs with different disease severity. We identify 38 significantly expanded clonal lineages shared among individuals as candidates for responses specific to SARS-CoV-2. Using single-cell sequencing, we verify the reactivity of BCRs shared among individuals to SARS-CoV-2 epitopes. Moreover, we identify the natural emergence of a BCR with cross-reactivity to SARS-CoV-1 and SARS-CoV-2 in some individuals. Our results provide insights important for development of rational therapies and vaccines against COVID-19.


Asunto(s)
Linfocitos B/inmunología , COVID-19/inmunología , Reacciones Cruzadas , Receptores de Antígenos de Linfocitos B/genética , Receptores de Antígenos de Linfocitos B/inmunología , SARS-CoV-2/inmunología , Animales , Anticuerpos Antivirales/inmunología , COVID-19/genética , Epítopos , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Índice de Severidad de la Enfermedad , Células Sf9 , Análisis de la Célula Individual , Glicoproteína de la Espiga del Coronavirus/inmunología
4.
ArXiv ; 2021 Apr 06.
Artículo en Inglés | MEDLINE | ID: mdl-32699813

RESUMEN

COVID-19 patients show varying severity of the disease ranging from asymptomatic to requiring intensive care. Although a number of SARS-CoV-2 specific monoclonal antibodies have been identified, we still lack an understanding of the overall landscape of B-cell receptor (BCR) repertoires in COVID-19 patients. Here, we used high-throughput sequencing of bulk and plasma B-cells collected over multiple time points during infection to characterize signatures of B-cell response to SARS-CoV-2 in 19 patients. Using principled statistical approaches, we determined differential features of BCRs associated with different disease severity. We identified 38 significantly expanded clonal lineages shared among patients as candidates for specific responses to SARS-CoV-2. Using single-cell sequencing, we verified reactivity of BCRs shared among individuals to SARS-CoV-2 epitopes. Moreover, we identified natural emergence of a BCR with cross-reactivity to SARS-CoV-1 and SARS-CoV-2 in a number of patients. Our results provide important insights for development of rational therapies and vaccines against COVID-19.

5.
medRxiv ; 2021 Apr 05.
Artículo en Inglés | MEDLINE | ID: mdl-32699862

RESUMEN

COVID-19 patients show varying severity of the disease ranging from asymptomatic to requiring intensive care. Although a number of SARS-CoV-2 specific monoclonal antibodies have been identified, we still lack an understanding of the overall landscape of B-cell receptor (BCR) repertoires in COVID-19 patients. Here, we used high-throughput sequencing of bulk and plasma B-cells collected over multiple time points during infection to characterize signatures of B-cell response to SARS-CoV-2 in 19 patients. Using principled statistical approaches, we determined differential features of BCRs associated with different disease severity. We identified 38 significantly expanded clonal lineages shared among patients as candidates for specific responses to SARS-CoV-2. Using single-cell sequencing, we verified reactivity of BCRs shared among individuals to SARS-CoV-2 epitopes. Moreover, we identified natural emergence of a BCR with cross-reactivity to SARS-CoV-1 and SARS-CoV-2 in a number of patients. Our results provide important insights for development of rational therapies and vaccines against COVID-19.

6.
Entropy (Basel) ; 22(9)2020 Aug 31.
Artículo en Inglés | MEDLINE | ID: mdl-33286736

RESUMEN

Evolutionary algorithms, inspired by natural evolution, aim to optimize difficult objective functions without computing derivatives. Here we detail the relationship between classical population genetics of quantitative traits and evolutionary optimization, and formulate a new evolutionary algorithm. Optimization of a continuous objective function is analogous to searching for high fitness phenotypes on a fitness landscape. We describe how natural selection moves a population along the non-Euclidean gradient that is induced by the population on the fitness landscape (the natural gradient). We show how selection is related to Newton's method in optimization under quadratic fitness landscapes, and how selection increases fitness at the cost of reducing diversity. We describe the generation of new phenotypes and introduce an operator that recombines the whole population to generate variants. Finally, we introduce a proof-of-principle algorithm that combines natural selection, our recombination operator, and an adaptive method to increase selection and find the optimum. The algorithm is extremely simple in implementation; it has no matrix inversion or factorization, does not require storing a covariance matrix, and may form the basis of more general model-based optimization algorithms with natural gradient updates.

7.
Nat Commun ; 11(1): 1233, 2020 03 06.
Artículo en Inglés | MEDLINE | ID: mdl-32144244

RESUMEN

Antigenic drift of influenza virus hemagglutinin (HA) is enabled by facile evolvability. However, HA antigenic site B, which has become immunodominant in recent human H3N2 influenza viruses, is also evolutionarily constrained by its involvement in receptor binding. Here, we employ deep mutational scanning to probe the local fitness landscape of HA antigenic site B in six different human H3N2 strains spanning from 1968 to 2016. We observe that the fitness landscape of HA antigenic site B can be very different between strains. Sequence variants that exhibit high fitness in one strain can be deleterious in another, indicating that the evolutionary constraints of antigenic site B have changed over time. Structural analysis suggests that the local fitness landscape of antigenic site B can be reshaped by natural mutations via modulation of the receptor-binding mode. Overall, these findings elucidate how influenza virus continues to explore new antigenic space despite strong functional constraints.


Asunto(s)
Antígenos Virales/genética , Evolución Molecular , Glicoproteínas Hemaglutininas del Virus de la Influenza/genética , Subtipo H3N2 del Virus de la Influenza A/genética , Receptores de Superficie Celular/metabolismo , Animales , Antígenos Virales/inmunología , Antígenos Virales/metabolismo , Sitios de Unión/genética , Cristalografía por Rayos X , Análisis Mutacional de ADN , Perros , Células HEK293 , Glicoproteínas Hemaglutininas del Virus de la Influenza/inmunología , Glicoproteínas Hemaglutininas del Virus de la Influenza/metabolismo , Humanos , Subtipo H3N2 del Virus de la Influenza A/inmunología , Subtipo H3N2 del Virus de la Influenza A/metabolismo , Células de Riñón Canino Madin Darby , Mutación , Dominios Proteicos/genética , Dominios Proteicos/inmunología , ARN Viral/genética , ARN Viral/aislamiento & purificación , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Análisis de Secuencia de ADN
8.
Mol Biol Evol ; 36(10): 2184-2194, 2019 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-31209469

RESUMEN

During chronic infection, HIV-1 engages in a rapid coevolutionary arms race with the host's adaptive immune system. While it is clear that HIV exerts strong selection on the adaptive immune system, the characteristics of the somatic evolution that shape the immune response are still unknown. Traditional population genetics methods fail to distinguish chronic immune response from healthy repertoire evolution. Here, we infer the evolutionary modes of B-cell repertoires and identify complex dynamics with a constant production of better B-cell receptor (BCR) mutants that compete, maintaining large clonal diversity and potentially slowing down adaptation. A substantial fraction of mutations that rise to high frequencies in pathogen-engaging CDRs of BCRs are beneficial, in contrast to many such changes in structurally relevant frameworks that are deleterious and circulate by hitchhiking. We identify a pattern where BCRs in patients who experience larger viral expansions undergo stronger selection with a rapid turnover of beneficial mutations due to clonal interference in their CDR3 regions. Using population genetics modeling, we show that the extinction of these beneficial mutations can be attributed to the rise of competing beneficial alleles and clonal interference. The picture is of a dynamic repertoire, where better clones may be outcompeted by new mutants before they fix.


Asunto(s)
Inmunidad Adaptativa , Infecciones por VIH/inmunología , VIH-1/inmunología , Receptores de Antígenos de Linfocitos B/genética , Selección Genética , Humanos
9.
Mol Biol Evol ; 35(10): 2345-2354, 2018 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-30085303

RESUMEN

Understanding the relationship between protein sequence, function, and stability is a fundamental problem in biology. The essential function of many proteins that fold into a specific structure is their ability to bind to a ligand, which can be assayed for thousands of mutated variants. However, binding assays do not distinguish whether mutations affect the stability of the binding interface or the overall fold. Here, we introduce a statistical method to infer a detailed energy landscape of how a protein folds and binds to a ligand by combining information from many mutated variants. We fit a thermodynamic model describing the bound, unbound, and unfolded states to high quality data of protein G domain B1 binding to IgG-Fc. We infer distinct folding and binding energies for each mutation providing a detailed view of how mutations affect binding and stability across the protein. We accurately infer the folding energy of each variant in physical units, validated by independent data, whereas previous high-throughput methods could only measure indirect changes in stability. While we assume an additive sequence-energy relationship, the binding fraction is epistatic due its nonlinear relation to energy. Despite having no epistasis in energy, our model explains much of the observed epistasis in binding fraction, with the remaining epistasis identifying conformationally dynamic regions.


Asunto(s)
Estabilidad Proteica , Proteínas/genética , Proteínas/fisiología , Secuencia de Aminoácidos , Animales , Simulación por Computador/estadística & datos numéricos , Epistasis Genética/fisiología , Evolución Molecular , Humanos , Ligandos , Mutación , Conformación Proteica , Pliegue de Proteína , Relación Estructura-Actividad , Termodinámica
10.
Proc Natl Acad Sci U S A ; 115(32): E7550-E7558, 2018 08 07.
Artículo en Inglés | MEDLINE | ID: mdl-30037990

RESUMEN

Genotype-phenotype relationships are notoriously complicated. Idiosyncratic interactions between specific combinations of mutations occur and are difficult to predict. Yet it is increasingly clear that many interactions can be understood in terms of global epistasis. That is, mutations may act additively on some underlying, unobserved trait, and this trait is then transformed via a nonlinear function to the observed phenotype as a result of subsequent biophysical and cellular processes. Here we infer the shape of such global epistasis in three proteins, based on published high-throughput mutagenesis data. To do so, we develop a maximum-likelihood inference procedure using a flexible family of monotonic nonlinear functions spanned by an I-spline basis. Our analysis uncovers dramatic nonlinearities in all three proteins; in some proteins a model with global epistasis accounts for virtually all of the measured variation, whereas in others we find substantial local epistasis as well. This method allows us to test hypotheses about the form of global epistasis and to distinguish variance components attributable to global epistasis, local epistasis, and measurement error.


Asunto(s)
Epistasis Genética , Evolución Molecular , Aptitud Genética , Modelos Genéticos , Genotipo , Modelos Estadísticos , Mutación , Dinámicas no Lineales , Fenotipo
11.
PLoS Genet ; 12(7): e1006171, 2016 07.
Artículo en Inglés | MEDLINE | ID: mdl-27442127

RESUMEN

The vertebrate adaptive immune system provides a flexible and diverse set of molecules to neutralize pathogens. Yet, viruses such as HIV can cause chronic infections by evolving as quickly as the adaptive immune system, forming an evolutionary arms race. Here we introduce a mathematical framework to study the coevolutionary dynamics between antibodies and antigens within a host. We focus on changes in the binding interactions between the antibody and antigen populations, which result from the underlying stochastic evolution of genotype frequencies driven by mutation, selection, and drift. We identify the critical viral and immune parameters that determine the distribution of antibody-antigen binding affinities. We also identify definitive signatures of coevolution that measure the reciprocal response between antibodies and viruses, and we introduce experimentally measurable quantities that quantify the extent of adaptation during continual coevolution of the two opposing populations. Using this analytical framework, we infer rates of viral and immune adaptation based on time-shifted neutralization assays in two HIV-infected patients. Finally, we analyze competition between clonal lineages of antibodies and characterize the fate of a given lineage in terms of the state of the antibody and viral populations. In particular, we derive the conditions that favor the emergence of broadly neutralizing antibodies, which may have relevance to vaccine design against HIV.


Asunto(s)
Inmunidad Adaptativa/genética , Anticuerpos Neutralizantes/genética , Infecciones por VIH/inmunología , Antígenos/inmunología , Evolución Molecular , Infecciones por VIH/virología , VIH-1/inmunología , Interacciones Huésped-Patógeno , Humanos , Modelos Genéticos
12.
Evolution ; 69(9): 2359-70, 2015 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-26194030

RESUMEN

The role that epistasis plays during adaptation remains an outstanding problem, which has received considerable attention in recent years. Most of the recent empirical studies are based on ensembles of replicate populations that adapt in a fixed, laboratory controlled condition. Researchers often seek to infer the presence and form of epistasis in the fitness landscape from the time evolution of various statistics averaged across the ensemble of populations. Here, we provide a rigorous analysis of what quantities, drawn from time series of such ensembles, can be used to infer epistasis for populations evolving under weak mutation on finite-site fitness landscapes. First, we analyze the mean fitness trajectory-that is, the time course of the ensemble average fitness. We show that for any epistatic fitness landscape and starting genotype, there always exists a non-epistatic fitness landscape that produces the exact same mean fitness trajectory. Thus, the presence of epistasis is not identifiable from the mean fitness trajectory. By contrast, we show that two other ensemble statistics-the time evolution of the fitness variance across populations, and the time evolution of the mean number of substitutions-can detect certain forms of epistasis in the underlying fitness landscape.


Asunto(s)
Adaptación Biológica/genética , Epistasis Genética , Aptitud Genética , Modelos Genéticos , Mutación , Genética de Población
13.
Phys Biol ; 11(5): 056003, 2014 Aug 26.
Artículo en Inglés | MEDLINE | ID: mdl-25156977

RESUMEN

Competition between independently arising beneficial mutations is enhanced in spatial populations due to the linear rather than exponential growth of clones. Recent theoretical studies have pointed out that the resulting fitness dynamics is analogous to a surface growth process, where new layers nucleate and spread stochastically, leading to the build up of scale-invariant roughness. This scenario differs qualitatively from the standard view of adaptation in that the speed of adaptation becomes independent of population size while the fitness variance does not. Here we exploit recent progress in the understanding of surface growth processes to obtain precise predictions for the universal, non-Gaussian shape of the fitness distribution for one-dimensional habitats, which are verified by simulations. When the mutations are deleterious rather than beneficial the problem becomes a spatial version of Muller's ratchet. In contrast to the case of well-mixed populations, the rate of fitness decline remains finite even in the limit of an infinite habitat, provided the ratio [Formula: see text] between the deleterious mutation rate and the square of the (negative) selection coefficient is sufficiently large. Using, again, an analogy to surface growth models we show that the transition between the stationary and the moving state of the ratchet is governed by directed percolation.


Asunto(s)
Adaptación Biológica , Evolución Biológica , Ecosistema , Modelos Genéticos , Mutación , Aptitud Genética , Variación Genética
14.
Proc Natl Acad Sci U S A ; 111(22): E2301-9, 2014 Jun 03.
Artículo en Inglés | MEDLINE | ID: mdl-24843135

RESUMEN

The genotype-fitness map plays a fundamental role in shaping the dynamics of evolution. However, it is difficult to directly measure a fitness landscape in practice, because the number of possible genotypes is astronomical. One approach is to sample as many genotypes as possible, measure their fitnesses, and fit a statistical model of the landscape that includes additive and pairwise interactive effects between loci. Here, we elucidate the pitfalls of using such regressions by studying artificial but mathematically convenient fitness landscapes. We identify two sources of bias inherent in these regression procedures, each of which tends to underestimate high fitnesses and overestimate low fitnesses. We characterize these biases for random sampling of genotypes as well as samples drawn from a population under selection in the Wright-Fisher model of evolutionary dynamics. We show that common measures of epistasis, such as the number of monotonically increasing paths between ancestral and derived genotypes, the prevalence of sign epistasis, and the number of local fitness maxima, are distorted in the inferred landscape. As a result, the inferred landscape will provide systematically biased predictions for the dynamics of adaptation. We identify the same biases in a computational RNA-folding landscape as well as regulatory sequence binding data treated with the same fitting procedure. Finally, we present a method to ameliorate these biases in some cases.


Asunto(s)
Epistasis Genética/genética , Evolución Molecular , Aptitud Genética/genética , Genotipo , Modelos Genéticos , Resistencia a Medicamentos/genética , Pliegue del ARN/genética , Análisis de Regresión , Selección Genética/genética
15.
PLoS One ; 8(5): e61570, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23650500

RESUMEN

Genotype-to-phenotype maps and the related fitness landscapes that include epistatic interactions are difficult to measure because of their high dimensional structure. Here we construct such a map using the recently collected corpora of high-throughput sequence data from the 75 base pairs long mutagenized E. coli lac promoter region, where each sequence is associated with its phenotype, the induced transcriptional activity measured by a fluorescent reporter. We find that the additive (non-epistatic) contributions of individual mutations account for about two-thirds of the explainable phenotype variance, while pairwise epistasis explains about 7% of the variance for the full mutagenized sequence and about 15% for the subsequence associated with protein binding sites. Surprisingly, there is no evidence for third order epistatic contributions, and our inferred fitness landscape is essentially single peaked, with a small amount of antagonistic epistasis. There is a significant selective pressure on the wild type, which we deduce to be multi-objective optimal for gene expression in environments with different nutrient sources. We identify transcription factor (CRP) and RNA polymerase binding sites in the promotor region and their interactions without difficult optimization steps. In particular, we observe evidence for previously unexplored genetic regulatory mechanisms, possibly kinetic in nature. We conclude with a cautionary note that inferred properties of fitness landscapes may be severely influenced by biases in the sequence data.


Asunto(s)
Escherichia coli/genética , Operón Lac , Modelos Genéticos , Regiones Promotoras Genéticas , Algoritmos , Epistasis Genética , Regulación Bacteriana de la Expresión Génica , Estudios de Asociación Genética , Aptitud Genética , Genotipo
16.
Phys Rev E Stat Nonlin Soft Matter Phys ; 84(1 Pt 1): 011925, 2011 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-21867231

RESUMEN

When beneficial mutations are relatively common, competition between multiple unfixed mutations can reduce the rate of fixation in well-mixed asexual populations. We introduce a one-dimensional model with a steady accumulation of beneficial mutations. We find a transition between periodic selection and multiple-mutation regimes. In the multiple-mutation regime, the increase of fitness along the lattice bears a striking similarity to surface growth phenomena, with power-law growth and saturation of the interface width. We also find significant differences compared to the well-mixed model. In our lattice model, the transition between regimes happens at a much lower mutation rate due to slower fixation times in one dimension. Also, the rate of fixation is reduced with increasing mutation rate due to the more intense competition, and it saturates with large population size.


Asunto(s)
Análisis Mutacional de ADN/métodos , Mutación , Algoritmos , Animales , Evolución Molecular , Genética , Genética de Población , Humanos , Modelos Genéticos , Modelos Estadísticos , Factores de Tiempo
17.
J Stat Phys ; 144(2): 367-378, 2011 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-22822270

RESUMEN

We consider a fixed size population that undergoes an evolutionary adaptation in the weak mutation rate limit, which we model as a biased Langevin process in the genotype space. We show analytically and numerically that, if the fitness landscape has a small highly epistatic (rough) and time-varying component, then the population genotype exhibits a high effective diffusion in the genotype space and is able to escape local fitness minima with a large probability. We argue that our principal finding that even very small time-dependent fluctuations of fitness can substantially speed up evolution is valid for a wide class of models.

18.
Phys Rev Lett ; 103(15): 156101, 2009 Oct 09.
Artículo en Inglés | MEDLINE | ID: mdl-19905653

RESUMEN

We present a refractive-index-matched colloidal system that allows direct observation of critical Casimir induced aggregation with a confocal microscope. We show that in this system, in which van der Waals forces are negligible, a simple competition between repulsive screened Coulomb and attractive critical Casimir forces can account quantitatively for the reversible aggregation. Above the temperature T(a), the critical Casimir force drives aggregation of the particles into fractal clusters, while below T(a), the electrostatic repulsion between the particles breaks up the clusters, and the particles resuspend by thermal diffusion. The aggregation is observed in a remarkably wide temperature range of as much as 15 degrees. We derive a simple expression for the particle pair potential that accounts quantitatively for the temperature-dependent aggregation and aggregate breakup.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...