Pesquisa | Biblioteca Virtual em Saúde

1.

Comparative genomics uncovers the evolutionary history, demography, and molecular adaptations of South American canids.

Chavez, Daniel E; Gronau, Ilan; Hains, Taylor; Dikow, Rebecca B; Frandsen, Paul B; Figueiró, Henrique V; Garcez, Fabrício S; Tchaicka, Ligia; de Paula, Rogério C; Rodrigues, Flávio H G; Jorge, Rodrigo S P; Lima, Edson S; Songsasen, Nucharin; Johnson, Warren E; Eizirik, Eduardo; Koepfli, Klaus-Peter; Wayne, Robert K.

Proc Natl Acad Sci U S A ; 119(34): e2205986119, 2022 08 23.

Artigo em Inglês | MEDLINE | ID: mdl-35969758

RESUMO

The remarkable radiation of South American (SA) canids produced 10 extant species distributed across diverse habitats, including disparate forms such as the short-legged, hypercarnivorous bush dog and the long-legged, largely frugivorous maned wolf. Despite considerable research spanning nearly two centuries, many aspects of their evolutionary history remain unknown. Here, we analyzed 31 whole genomes encompassing all extant SA canid species to assess phylogenetic relationships, interspecific hybridization, historical demography, current genetic diversity, and the molecular bases of adaptations in the bush dog and maned wolf. We found that SA canids originated from a single ancestor that colonized South America 3.9 to 3.5 Mya, followed by diversification east of the Andes and then a single colonization event and radiation of Lycalopex species west of the Andes. We detected extensive historical gene flow between recently diverged lineages and observed distinct patterns of genomic diversity and demographic history in SA canids, likely induced by past climatic cycles compounded by human-induced population declines. Genome-wide scans of selection showed that disparate limb proportions in the bush dog and maned wolf may derive from mutations in genes regulating chondrocyte proliferation and enlargement. Further, frugivory in the maned wolf may have been enabled by variants in genes associated with energy intake from short-chain fatty acids. In contrast, unique genetic variants detected in the bush dog may underlie interdigital webbing and dental adaptations for hypercarnivory. Our analyses shed light on the evolution of a unique carnivoran radiation and how it was shaped by South American topography and climate change.

Assuntos

Adaptação Fisiológica , Canidae , Filogenia , Adaptação Fisiológica/genética , Animais , Canidae/classificação , Canidae/genética , Demografia , Variação Genética , Genômica , América do Sul

2.

Genomic islands of differentiation in a rapid avian radiation have been driven by recent selective sweeps.

Hejase, Hussein A; Salman-Minkov, Ayelet; Campagna, Leonardo; Hubisz, Melissa J; Lovette, Irby J; Gronau, Ilan; Siepel, Adam.

Proc Natl Acad Sci U S A ; 117(48): 30554-30565, 2020 12 01.

Artigo em Inglês | MEDLINE | ID: mdl-33199636

RESUMO

Numerous studies of emerging species have identified genomic "islands" of elevated differentiation against a background of relative homogeneity. The causes of these islands remain unclear, however, with some signs pointing toward "speciation genes" that locally restrict gene flow and others suggesting selective sweeps that have occurred within nascent species after speciation. Here, we examine this question through the lens of genome sequence data for five species of southern capuchino seedeaters, finch-like birds from South America that have undergone a species radiation during the last â¼50,000 generations. By applying newly developed statistical methods for ancestral recombination graph inference and machine-learning methods for the prediction of selective sweeps, we show that previously identified islands of differentiation in these birds appear to be generally associated with relatively recent, species-specific selective sweeps, most of which are predicted to be soft sweeps acting on standing genetic variation. Many of these sweeps coincide with genes associated with melanin-based variation in plumage, suggesting a prominent role for sexual selection. At the same time, a few loci also exhibit indications of possible selection against gene flow. These observations shed light on the complex manner in which natural selection shapes genome sequences during speciation.

Assuntos

Ilhas Genômicas , Modelos Genéticos , Animais , Biodiversidade , Variação Genética , Aprendizado de Máquina

3.

Ancient gene flow from early modern humans into Eastern Neanderthals.

Kuhlwilm, Martin; Gronau, Ilan; Hubisz, Melissa J; de Filippo, Cesare; Prado-Martinez, Javier; Kircher, Martin; Fu, Qiaomei; Burbano, Hernán A; Lalueza-Fox, Carles; de la Rasilla, Marco; Rosas, Antonio; Rudan, Pavao; Brajkovic, Dejana; Kucan, Zeljko; Gusic, Ivan; Marques-Bonet, Tomas; Andrés, Aida M; Viola, Bence; Pääbo, Svante; Meyer, Matthias; Siepel, Adam; Castellano, Sergi.

Nature ; 530(7591): 429-33, 2016 Feb 25.

Artigo em Inglês | MEDLINE | ID: mdl-26886800

RESUMO

It has been shown that Neanderthals contributed genetically to modern humans outside Africa 47,000-65,000 years ago. Here we analyse the genomes of a Neanderthal and a Denisovan from the Altai Mountains in Siberia together with the sequences of chromosome 21 of two Neanderthals from Spain and Croatia. We find that a population that diverged early from other modern humans in Africa contributed genetically to the ancestors of Neanderthals from the Altai Mountains roughly 100,000 years ago. By contrast, we do not detect such a genetic contribution in the Denisovan or the two European Neanderthals. We conclude that in addition to later interbreeding events, the ancestors of Neanderthals from the Altai Mountains and early modern humans met and interbred, possibly in the Near East, many thousands of years earlier than previously thought.

Assuntos

Fluxo Gênico/genética , Homem de Neandertal/genética , Altitude , Animais , Teorema de Bayes , Cromossomos Humanos Par 21/genética , Croácia/etnologia , Genoma Humano/genética , Genômica , Haplótipos/genética , Heterozigoto , Humanos , Hibridização Genética/genética , Filogenia , Densidade Demográfica , Sibéria , Espanha/etnologia , Fatores de Tempo

4.

Worldwide patterns of genomic variation and admixture in gray wolves.

Fan, Zhenxin; Silva, Pedro; Gronau, Ilan; Wang, Shuoguo; Armero, Aitor Serres; Schweizer, Rena M; Ramirez, Oscar; Pollinger, John; Galaverni, Marco; Ortega Del-Vecchyo, Diego; Du, Lianming; Zhang, Wenping; Zhang, Zhihe; Xing, Jinchuan; Vilà, Carles; Marques-Bonet, Tomas; Godinho, Raquel; Yue, Bisong; Wayne, Robert K.

Genome Res ; 26(2): 163-73, 2016 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-26680994

RESUMO

The gray wolf (Canis lupus) is a widely distributed top predator and ancestor of the domestic dog. To address questions about wolf relationships to each other and dogs, we assembled and analyzed a data set of 34 canine genomes. The divergence between New and Old World wolves is the earliest branching event and is followed by the divergence of Old World wolves and dogs, confirming that the dog was domesticated in the Old World. However, no single wolf population is more closely related to dogs, supporting the hypothesis that dogs were derived from an extinct wolf population. All extant wolves have a surprisingly recent common ancestry and experienced a dramatic population decline beginning at least â¼30 thousand years ago (kya). We suggest this crisis was related to the colonization of Eurasia by modern human hunter-gatherers, who competed with wolves for limited prey but also domesticated them, leading to a compensatory population expansion of dogs. We found extensive admixture between dogs and wolves, with up to 25% of Eurasian wolf genomes showing signs of dog ancestry. Dogs have influenced the recent history of wolves through admixture and vice versa, potentially enhancing adaptation. Simple scenarios of dog domestication are confounded by admixture, and studies that do not take admixture into account with specific demographic models are problematic.

Assuntos

Cães/genética , Lobos/genética , Animais , Teorema de Bayes , DNA Mitocondrial/genética , Feminino , Genoma , Hibridização Genética , Masculino , Cadeias de Markov , Modelos Genéticos , Filogenia , Polimorfismo de Nucleotídeo Único , Análise de Componente Principal , Análise de Sequência de DNA

5.

Demographically-Based Evaluation of Genomic Regions under Selection in Domestic Dogs.

Freedman, Adam H; Schweizer, Rena M; Ortega-Del Vecchyo, Diego; Han, Eunjung; Davis, Brian W; Gronau, Ilan; Silva, Pedro M; Galaverni, Marco; Fan, Zhenxin; Marx, Peter; Lorente-Galdos, Belen; Ramirez, Oscar; Hormozdiari, Farhad; Alkan, Can; Vilà, Carles; Squire, Kevin; Geffen, Eli; Kusak, Josip; Boyko, Adam R; Parker, Heidi G; Lee, Clarence; Tadigotla, Vasisht; Siepel, Adam; Bustamante, Carlos D; Harkins, Timothy T; Nelson, Stanley F; Marques-Bonet, Tomas; Ostrander, Elaine A; Wayne, Robert K; Novembre, John.

PLoS Genet ; 12(3): e1005851, 2016 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-26943675

RESUMO

Controlling for background demographic effects is important for accurately identifying loci that have recently undergone positive selection. To date, the effects of demography have not yet been explicitly considered when identifying loci under selection during dog domestication. To investigate positive selection on the dog lineage early in the domestication, we examined patterns of polymorphism in six canid genomes that were previously used to infer a demographic model of dog domestication. Using an inferred demographic model, we computed false discovery rates (FDR) and identified 349 outlier regions consistent with positive selection at a low FDR. The signals in the top 100 regions were frequently centered on candidate genes related to brain function and behavior, including LHFPL3, CADM2, GRIK3, SH3GL2, MBP, PDE7B, NTAN1, and GLRA1. These regions contained significant enrichments in behavioral ontology categories. The 3rd top hit, CCRN4L, plays a major role in lipid metabolism, that is supported by additional metabolism related candidates revealed in our scan, including SCP2D1 and PDXC1. Comparing our method to an empirical outlier approach that does not directly account for demography, we found only modest overlaps between the two methods, with 60% of empirical outliers having no overlap with our demography-based outlier detection approach. Demography-aware approaches have lower-rates of false discovery. Our top candidates for selection, in addition to expanding the set of neurobehavioral candidate genes, include genes related to lipid metabolism, suggesting a dietary target of selection that was important during the period when proto-dogs hunted and fed alongside hunter-gatherers.

Assuntos

Genética Populacional , Genômica , Metabolismo dos Lipídeos/genética , Seleção Genética , Animais , Demografia , Cães , Genoma , Polimorfismo de Nucleotídeo Único

6.

Comparing evolutionary distances via adaptive distance functions.

Damti, Yanir; Gronau, Ilan; Moran, Shlomo; Yavneh, Irad.

J Theor Biol ; 440: 88-99, 2018 03 07.

Artigo em Inglês | MEDLINE | ID: mdl-29277603

RESUMO

Distance-based methods for phylogenetic reconstruction are based on a two-step approach: first, pairwise distances are computed from DNA sequences associated with a given set of taxa, and then these distances are used to reconstruct the phylogenetic relationships between taxa. Because the estimated distances are based on finite sequences, they are inherently noisy, and this noise may result in reconstruction errors. Previous attempts to improve reconstruction accuracy focused either on improving the robustness of reconstruction algorithms to this stochastic noise, or on improving the accuracy of the distance estimates. Here, we aim to further improve reconstruction accuracy by utilizing the basic observation that reconstruction algorithms are based on a series of comparisons between distances (or linear combinations of distances). We start by examining the relationship between the stochastic noise in the sequence data and the accuracy of the comparisons between pairwise distance estimates. This examination results in improved methods for distance comparison, which are shown to be as accurate as likelihood-based methods, while being much simpler and more efficient to compute. We then extend these methods to improve reconstruction accuracy of quartet trees, and examine some of the challenges moving forward.

Assuntos

Algoritmos , Modelos Genéticos , Filogenia , Alinhamento de Sequência , Análise de Sequência de DNA , Evolução Molecular , Funções Verossimilhança , Processos Estocásticos

7.

Genome-wide inference of ancestral recombination graphs.

Rasmussen, Matthew D; Hubisz, Melissa J; Gronau, Ilan; Siepel, Adam.

PLoS Genet ; 10(5): e1004342, 2014.

Artigo em Inglês | MEDLINE | ID: mdl-24831947

RESUMO

The complex correlation structure of a collection of orthologous DNA sequences is uniquely captured by the "ancestral recombination graph" (ARG), a complete record of coalescence and recombination events in the history of the sample. However, existing methods for ARG inference are computationally intensive, highly approximate, or limited to small numbers of sequences, and, as a consequence, explicit ARG inference is rarely used in applied population genomics. Here, we introduce a new algorithm for ARG inference that is efficient enough to apply to dozens of complete mammalian genomes. The key idea of our approach is to sample an ARG of [Formula: see text] chromosomes conditional on an ARG of [Formula: see text] chromosomes, an operation we call "threading." Using techniques based on hidden Markov models, we can perform this threading operation exactly, up to the assumptions of the sequentially Markov coalescent and a discretization of time. An extension allows for threading of subtrees instead of individual sequences. Repeated application of these threading operations results in highly efficient Markov chain Monte Carlo samplers for ARGs. We have implemented these methods in a computer program called ARGweaver. Experiments with simulated data indicate that ARGweaver converges rapidly to the posterior distribution over ARGs and is effective in recovering various features of the ARG for dozens of sequences generated under realistic parameters for human populations. In applications of ARGweaver to 54 human genome sequences from Complete Genomics, we find clear signatures of natural selection, including regions of unusually ancient ancestry associated with balancing selection and reductions in allele age in sites under directional selection. The patterns we observe near protein-coding genes are consistent with a primary influence from background selection rather than hitchhiking, although we cannot rule out a contribution from recurrent selective sweeps.

Assuntos

Evolução Molecular , Genoma Humano , Recombinação Genética , Seleção Genética/genética , Algoritmos , Simulação por Computador , Humanos , Cadeias de Markov , Modelos Genéticos , Método de Monte Carlo

8.

Genome sequencing highlights the dynamic early history of dogs.

Freedman, Adam H; Gronau, Ilan; Schweizer, Rena M; Ortega-Del Vecchyo, Diego; Han, Eunjung; Silva, Pedro M; Galaverni, Marco; Fan, Zhenxin; Marx, Peter; Lorente-Galdos, Belen; Beale, Holly; Ramirez, Oscar; Hormozdiari, Farhad; Alkan, Can; Vilà, Carles; Squire, Kevin; Geffen, Eli; Kusak, Josip; Boyko, Adam R; Parker, Heidi G; Lee, Clarence; Tadigotla, Vasisht; Wilton, Alan; Siepel, Adam; Bustamante, Carlos D; Harkins, Timothy T; Nelson, Stanley F; Ostrander, Elaine A; Marques-Bonet, Tomas; Wayne, Robert K; Novembre, John.

PLoS Genet ; 10(1): e1004016, 2014 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-24453982

RESUMO

To identify genetic changes underlying dog domestication and reconstruct their early evolutionary history, we generated high-quality genome sequences from three gray wolves, one from each of the three putative centers of dog domestication, two basal dog lineages (Basenji and Dingo) and a golden jackal as an outgroup. Analysis of these sequences supports a demographic model in which dogs and wolves diverged through a dynamic process involving population bottlenecks in both lineages and post-divergence gene flow. In dogs, the domestication bottleneck involved at least a 16-fold reduction in population size, a much more severe bottleneck than estimated previously. A sharp bottleneck in wolves occurred soon after their divergence from dogs, implying that the pool of diversity from which dogs arose was substantially larger than represented by modern wolf populations. We narrow the plausible range for the date of initial dog domestication to an interval spanning 11-16 thousand years ago, predating the rise of agriculture. In light of this finding, we expand upon previous work regarding the increase in copy number of the amylase gene (AMY2B) in dogs, which is believed to have aided digestion of starch in agricultural refuse. We find standing variation for amylase copy number variation in wolves and little or no copy number increase in the Dingo and Husky lineages. In conjunction with the estimated timing of dog origins, these results provide additional support to archaeological finds, suggesting the earliest dogs arose alongside hunter-gathers rather than agriculturists. Regarding the geographic origin of dogs, we find that, surprisingly, none of the extant wolf lineages from putative domestication centers is more closely related to dogs, and, instead, the sampled wolves form a sister monophyletic clade. This result, in combination with dog-wolf admixture during the process of domestication, suggests that a re-evaluation of past hypotheses regarding dog origins is necessary.

Assuntos

Amilases/genética , Animais Domésticos/genética , Variações do Número de Cópias de DNA/genética , Evolução Molecular , Animais , DNA Mitocondrial/genética , Dieta , Cães , Variação Genética , Filogenia , Densidade Demográfica , Lobos/classificação , Lobos/genética

9.

Adaptive evolution of testis-specific, recently evolved, clustered miRNAs in Drosophila.

Mohammed, Jaaved; Bortolamiol-Becet, Diane; Flynt, Alex S; Gronau, Ilan; Siepel, Adam; Lai, Eric C.

RNA ; 20(8): 1195-209, 2014 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-24942624

RESUMO

The propensity of animal miRNAs to regulate targets bearing modest complementarity, most notably via pairing with miRNA positions â¼2-8 (the "seed"), is believed to drive major aspects of miRNA evolution. First, minimal targeting requirements have allowed most conserved miRNAs to acquire large target cohorts, thus imposing strong selection on miRNAs to maintain their seed sequences. Second, the modest pairing needed for repression suggests that evolutionarily nascent miRNAs may generally induce net detrimental, rather than beneficial, regulatory effects. Hence, levels and activities of newly emerged miRNAs are expected to be limited to preserve the status quo of gene expression. In this study, we unexpectedly show that Drosophila testes specifically express a substantial miRNA population that contravenes these tenets. We find that multiple genomic clusters of testis-restricted miRNAs harbor recently evolved miRNAs, whose experimentally verified orthologs exhibit divergent sequences, even within seed regions. Moreover, this class of miRNAs exhibits higher expression and greater phenotypic capacities in transgenic misexpression assays than do non-testis-restricted miRNAs of similar evolutionary age. These observations suggest that these testis-restricted miRNAs may be evolving adaptively, and several methods of evolutionary analysis provide strong support for this notion. Consistent with this, proof-of-principle tests show that orthologous miRNAs with divergent seeds can distinguish target sensors in a species-cognate manner. Finally, we observe that testis-restricted miRNA clusters exhibit extraordinary dynamics of miRNA gene flux in other Drosophila species. Altogether, our findings reveal a surprising tissue-directed influence of miRNA evolution, involving a distinct mode of miRNA function connected to adaptive gene regulation in the testis.

Assuntos

Adaptação Biológica , Evolução Biológica , Drosophila/genética , Drosophila/metabolismo , MicroRNAs/genética , Família Multigênica , Testículo/metabolismo , Animais , Sequência de Bases , Análise por Conglomerados , Sequência Conservada , Evolução Molecular , Expressão Gênica , Perfilação da Expressão Gênica , Variação Genética , Masculino , Dados de Sequência Molecular , Especificidade de Órgãos/genética , Fenótipo , Alinhamento de Sequência

10.

Distinguishing noise from signal in patterns of genomic divergence in a highly polymorphic avian radiation.

Campagna, Leonardo; Gronau, Ilan; Silveira, Luís Fábio; Siepel, Adam; Lovette, Irby J.

Mol Ecol ; 24(16): 4238-51, 2015 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-26175196

RESUMO

Recently diverged taxa provide the opportunity to search for the genetic basis of the phenotypes that distinguish them. Genomic scans aim to identify loci that are diverged with respect to an otherwise weakly differentiated genetic background. These loci are candidates for being past targets of selection because they behave differently from the rest of the genome that has either not yet differentiated or that may cross species barriers through introgressive hybridization. Here we use a reduced-representation genomic approach to explore divergence among six species of southern capuchino seedeaters, a group of recently radiated sympatric passerine birds in the genus Sporophila. For the first time in these taxa, we discovered a small proportion of markers that appeared differentiated among species. However, when assessing the significance of these signatures of divergence, we found that similar patterns can also be recovered from random grouping of individuals representing different species. A detailed demographic inference indicates that genetic differences among Sporophila species could be the consequence of neutral processes, which include a very large ancestral effective population size that accentuates the effects of incomplete lineage sorting. As these neutral phenomena can generate genomic scan patterns that mimic those of markers involved in speciation and phenotypic differentiation, they highlight the need for caution when ascertaining and interpreting differentiated markers between species, especially when large numbers of markers are surveyed. Our study provides new insights into the demography of the southern capuchino radiation and proposes controls to distinguish signal from noise in similar genomic scans.

Assuntos

Especiação Genética , Passeriformes/genética , Simpatria , Animais , Teorema de Bayes , Feminino , Fluxo Gênico , Loci Gênicos , Genética Populacional , Genômica , Masculino , Modelos Genéticos , Passeriformes/classificação , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA , América do Sul

11.

Inference of natural selection from interspersed genomic elements based on polymorphism and divergence.

Gronau, Ilan; Arbiza, Leonardo; Mohammed, Jaaved; Siepel, Adam.

Mol Biol Evol ; 30(5): 1159-71, 2013 May.

Artigo em Inglês | MEDLINE | ID: mdl-23386628

RESUMO

Complete genome sequences contain valuable information about natural selection, but this information is difficult to access for short, widely scattered noncoding elements such as transcription factor binding sites or small noncoding RNAs. Here, we introduce a new computational method, called Inference of Natural Selection from Interspersed Genomically coHerent elemenTs (INSIGHT), for measuring the influence of natural selection on such elements. INSIGHT uses a generative probabilistic model to contrast patterns of polymorphism and divergence in the elements of interest with those in flanking neutral sites, pooling weak information from many short elements in a manner that accounts for variation among loci in mutation rates and coalescent times. The method is able to disentangle the contributions of weak negative, strong negative, and positive selection based on their distinct effects on patterns of polymorphism and divergence. It obtains information about divergence from multiple outgroup genomes using a general statistical phylogenetic approach. The INSIGHT model is efficiently fitted to genome-wide data using an approximate expectation maximization algorithm. Using simulations, we show that the method can accurately estimate the parameters of interest even in complex demographic scenarios, and that it significantly improves on methods based on summary statistics describing polymorphism and divergence. To demonstrate the usefulness of INSIGHT, we apply it to several classes of human noncoding RNAs and to GATA2-binding sites in the human genome.

Assuntos

Evolução Molecular , Polimorfismo Genético/genética , Seleção Genética/genética , DNA/genética , Variação Genética/genética , Genética Populacional , Humanos , Filogenia , Sequências Reguladoras de Ácido Nucleico/genética

12.

Replacing and additive horizontal gene transfer in Streptococcus.

Choi, Sang Chul; Rasmussen, Matthew D; Hubisz, Melissa J; Gronau, Ilan; Stanhope, Michael J; Siepel, Adam.

Mol Biol Evol ; 29(11): 3309-20, 2012 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-22617954

RESUMO

The prominent role of Horizontal Gene Transfer (HGT) in the evolution of bacteria is now well documented, but few studies have differentiated between evolutionary events that predominantly cause genes in one lineage to be replaced by homologs from another lineage ("replacing HGT") and events that result in the addition of substantial new genomic material ("additive HGT"). Here in, we make use of the distinct phylogenetic signatures of replacing and additive HGTs in a genome-wide study of the important human pathogen Streptococcus pyogenes (SPY) and its close relatives S. dysgalactiae subspecies equisimilis (SDE) and S. dysgalactiae subspecies dysgalactiae (SDD). Using recently developed statistical models and computational methods, we find evidence for abundant gene flow of both kinds within each of the SPY and SDE clades and of reduced levels of exchange between SPY and SDD. In addition, our analysis strongly supports a pronounced asymmetry in SPY-SDE gene flow, favoring the SPY-to-SDE direction. This finding is of particular interest in light of the recent increase in virulence of pathogenic SDE. We find much stronger evidence for SPY-SDE gene flow among replacing than among additive transfers, suggesting a primary influence from homologous recombination between co-occurring SPY and SDE cells in human hosts. Putative virulence genes are correlated with transfer events, but this correlation is found to be driven by additive, not replacing, HGTs. The genes affected by additive HGTs are enriched for functions having to do with transposition, recombination, and DNA integration, consistent with previous findings, whereas replacing HGTs seen to influence a more diverse set of genes. Additive transfers are also found to be associated with evidence of positive selection. These findings shed new light on the manner in which HGT has shaped pathogenic bacterial genomes.

Assuntos

Transferência Genética Horizontal/genética , Filogenia , Streptococcus/genética , Duplicação Gênica/genética , Genes Bacterianos/genética , Genes Essenciais/genética , Humanos , Modelos Genéticos , Seleção Genética

13.

SLIViT: a general AI framework for clinical-feature diagnosis from limited 3D biomedical-imaging data.

Avram, Oren; Durmus, Berkin; Rakocz, Nadav; Corradetti, Giulia; An, Ulzee; Nitalla, Muneeswar G; Rudas, Akos; Wakatsuki, Yu; Hirabayashi, Kazutaka; Velaga, Swetha; Tiosano, Liran; Corvi, Federico; Verma, Aditya; Karamat, Ayesha; Lindenberg, Sophiana; Oncel, Deniz; Almidani, Louay; Hull, Victoria; Fasih-Ahmad, Sohaib; Esmaeilkhanian, Houri; Wykoff, Charles C; Rahmani, Elior; Arnold, Corey W; Zhou, Bolei; Zaitlen, Noah; Gronau, Ilan; Sankararaman, Sriram; Chiang, Jeffrey N; Sadda, Srinivas R; Halperin, Eran.

Res Sq ; 2023 Nov 21.

Artigo em Inglês | MEDLINE | ID: mdl-38045283

RESUMO

We present SLIViT, a deep-learning framework that accurately measures disease-related risk factors in volumetric biomedical imaging, such as magnetic resonance imaging (MRI) scans, optical coherence tomography (OCT) scans, and ultrasound videos. To evaluate SLIViT, we applied it to five different datasets of these three different data modalities tackling seven learning tasks (including both classification and regression) and found that it consistently and significantly outperforms domain-specific state-of-the-art models, typically improving performance (ROC AUC or correlation) by 0.1-0.4. Notably, compared to existing approaches, SLIViT can be applied even when only a small number of annotated training samples is available, which is often a constraint in medical applications. When trained on less than 700 annotated volumes, SLIViT obtained accuracy comparable to trained clinical specialists while reducing annotation time by a factor of 5,000 demonstrating its utility to automate and expedite ongoing research and other practical clinical scenarios.

14.

Expanding the stdpopsim species catalog, and lessons learned for realistic genome simulations.

Lauterbur, M Elise; Cavassim, Maria Izabel A; Gladstein, Ariella L; Gower, Graham; Pope, Nathaniel S; Tsambos, Georgia; Adrion, Jeffrey; Belsare, Saurabh; Biddanda, Arjun; Caudill, Victoria; Cury, Jean; Echevarria, Ignacio; Haller, Benjamin C; Hasan, Ahmed R; Huang, Xin; Iasi, Leonardo Nicola Martin; Noskova, Ekaterina; Obsteter, Jana; Pavinato, Vitor Antonio Correa; Pearson, Alice; Peede, David; Perez, Manolo F; Rodrigues, Murillo F; Smith, Chris C R; Spence, Jeffrey P; Teterina, Anastasia; Tittes, Silas; Unneberg, Per; Vazquez, Juan Manuel; Waples, Ryan K; Wohns, Anthony Wilder; Wong, Yan; Baumdicker, Franz; Cartwright, Reed A; Gorjanc, Gregor; Gutenkunst, Ryan N; Kelleher, Jerome; Kern, Andrew D; Ragsdale, Aaron P; Ralph, Peter L; Schrider, Daniel R; Gronau, Ilan.

Elife ; 122023 06 21.

Artigo em Inglês | MEDLINE | ID: mdl-37342968

RESUMO

Simulation is a key tool in population genetics for both methods development and empirical research, but producing simulations that recapitulate the main features of genomic datasets remains a major obstacle. Today, more realistic simulations are possible thanks to large increases in the quantity and quality of available genetic data, and the sophistication of inference and simulation software. However, implementing these simulations still requires substantial time and specialized knowledge. These challenges are especially pronounced for simulating genomes for species that are not well-studied, since it is not always clear what information is required to produce simulations with a level of realism sufficient to confidently answer a given question. The community-developed framework stdpopsim seeks to lower this barrier by facilitating the simulation of complex population genetic models using up-to-date information. The initial version of stdpopsim focused on establishing this framework using six well-characterized model species (Adrion et al., 2020). Here, we report on major improvements made in the new release of stdpopsim (version 0.2), which includes a significant expansion of the species catalog and substantial additions to simulation capabilities. Features added to improve the realism of the simulated genomes include non-crossover recombination and provision of species-specific genomic annotations. Through community-driven efforts, we expanded the number of species in the catalog more than threefold and broadened coverage across the tree of life. During the process of expanding the catalog, we have identified common sticking points and developed the best practices for setting up genome-scale simulations. We describe the input data required for generating a realistic simulation, suggest good practices for obtaining the relevant information from the literature, and discuss common pitfalls and major considerations. These improvements to stdpopsim aim to further promote the use of realistic whole-genome population genetic simulations, especially in non-model organisms, making them available, transparent, and accessible to everyone.

Assuntos

Genoma , Software , Simulação por Computador , Genética Populacional , Genômica

15.

Optimal Design of Synthetic DNA Sequences Without Unwanted Binding Sites.

Leibovich, Zehavit; Gronau, Ilan.

J Comput Biol ; 29(9): 974-986, 2022 09.

Artigo em Inglês | MEDLINE | ID: mdl-35648072

RESUMO

Synthesizing DNA molecules by design has become an essential tool in molecular biology and is expected to become ubiquitous in the coming decade. Successful design of a synthetic DNA molecule often requires satisfying multiple objectives, some of which may conflict with others. One particularly important objective is the elimination of unwanted protein binding sites, which may interfere with the desired function of the synthesized molecule. While most design tools offer this fundamental capability, they do not follow a systematic approach that guarantees elimination of all unwanted sites whenever a feasible solution exists. Furthermore, the algorithms these tools use (when published) are often quite naive and inefficient. We present a formal description of the binding site elimination problem and suggest several efficient algorithms that eliminate unwanted patterns with minimum interference to the desired function of the synthesized sequence. These algorithms are simple, efficient, and flexible and, therefore, can be easily incorporated in all existing DNA design tools, enhancing their design capabilities.

Assuntos

Algoritmos , DNA , Sequência de Bases , Sítios de Ligação/genética , Biologia Computacional , DNA/química , Ligação Proteica

16.

A community-maintained standard library of population genetic models.

Adrion, Jeffrey R; Cole, Christopher B; Dukler, Noah; Galloway, Jared G; Gladstein, Ariella L; Gower, Graham; Kyriazis, Christopher C; Ragsdale, Aaron P; Tsambos, Georgia; Baumdicker, Franz; Carlson, Jedidiah; Cartwright, Reed A; Durvasula, Arun; Gronau, Ilan; Kim, Bernard Y; McKenzie, Patrick; Messer, Philipp W; Noskova, Ekaterina; Ortega-Del Vecchyo, Diego; Racimo, Fernando; Struck, Travis J; Gravel, Simon; Gutenkunst, Ryan N; Lohmueller, Kirk E; Ralph, Peter L; Schrider, Daniel R; Siepel, Adam; Kelleher, Jerome; Kern, Andrew D.

Elife ; 92020 06 23.

Artigo em Inglês | MEDLINE | ID: mdl-32573438

RESUMO

The explosion in population genomic data demands ever more complex modes of analysis, and increasingly, these analyses depend on sophisticated simulations. Recent advances in population genetic simulation have made it possible to simulate large and complex models, but specifying such models for a particular simulation engine remains a difficult and error-prone task. Computational genetics researchers currently re-implement simulation models independently, leading to inconsistency and duplication of effort. This situation presents a major barrier to empirical researchers seeking to use simulations for power analyses of upcoming studies or sanity checks on existing genomic data. Population genetics, as a field, also lacks standard benchmarks by which new tools for inference might be measured. Here, we describe a new resource, stdpopsim, that attempts to rectify this situation. Stdpopsim is a community-driven open source project, which provides easy access to a growing catalog of published simulation models from a range of organisms and supports multiple simulation engine backends. This resource is available as a well-documented python library with a simple command-line interface. We share some examples demonstrating how stdpopsim can be used to systematically compare demographic inference methods, and we encourage a broader community of developers to contribute to this growing resource.

Assuntos

Genética Populacional , Biblioteca Genômica , Modelos Genéticos , Animais , Arabidopsis/genética , Cães/genética , Drosophila melanogaster/genética , Escherichia coli/genética , Genética Populacional/métodos , Genética Populacional/organização & administração , Genoma/genética , Genoma Humano/genética , Humanos , Pongo abelii/genética

17.

Recursive construction of perfect DNA molecules from imperfect oligonucleotides.

Linshiz, Gregory; Yehezkel, Tuval Ben; Kaplan, Shai; Gronau, Ilan; Ravid, Sivan; Adar, Rivka; Shapiro, Ehud.

Mol Syst Biol ; 4: 191, 2008.

Artigo em Inglês | MEDLINE | ID: mdl-18463615

RESUMO

Making faultless complex objects from potentially faulty building blocks is a fundamental challenge in computer engineering, nanotechnology and synthetic biology. Here, we show for the first time how recursion can be used to address this challenge and demonstrate a recursive procedure that constructs error-free DNA molecules and their libraries from error-prone oligonucleotides. Divide and Conquer (D&C), the quintessential recursive problem-solving technique, is applied in silico to divide the target DNA sequence into overlapping oligonucleotides short enough to be synthesized directly, albeit with errors; error-prone oligonucleotides are recursively combined in vitro, forming error-prone DNA molecules; error-free fragments of these molecules are then identified, extracted and used as new, typically longer and more accurate, inputs to another iteration of the recursive construction procedure; the entire process repeats until an error-free target molecule is formed. Our recursive construction procedure surpasses existing methods for de novo DNA synthesis in speed, precision, amenability to automation, ease of combining synthetic and natural DNA fragments, and ability to construct designer DNA libraries. It thus provides a novel and robust foundation for the design and construction of synthetic biological molecules and organisms.

Assuntos

DNA/metabolismo , Oligonucleotídeos/metabolismo , Biblioteca Gênica , Proteínas de Fluorescência Verde/metabolismo , Proteínas Mutantes/metabolismo , Proteína Supressora de Tumor p53/metabolismo

18.

Towards optimal distance functions for stochastic substitution models.

Gronau, Ilan; Moran, Shlomo; Yavneh, Irad.

J Theor Biol ; 260(2): 294-307, 2009 Sep 21.

Artigo em Inglês | MEDLINE | ID: mdl-19501101

RESUMO

Distance based reconstruction methods of phylogenetic trees consist of two independent parts: first, inter-species distances are inferred assuming some stochastic model of sequence evolution; then the inferred distances are used to construct a tree. In this paper we concentrate on the task of inter-species distance estimation. Specifically, we characterize the family of valid distance functions for the assumed substitution model and show that deliberate selection of distance function significantly improves the accuracy of distance estimates and, consequently, also improves the accuracy of the reconstructed tree. Our contribution consists of three parts: first, we present a general framework for constructing families of additive distance functions for stochastic evolutionary models. Then, we present a method for selecting (near) optimal distance functions, and we conclude by presenting simulation results which support our theoretical analysis.

Assuntos

Modelos Genéticos , Filogenia , Sequência de Bases , DNA/genética , Evolução Molecular , Cadeias de Markov , Processos Estocásticos

19.

Comparative genomics provides new insights into the remarkable adaptations of the African wild dog (Lycaon pictus).

Chavez, Daniel E; Gronau, Ilan; Hains, Taylor; Kliver, Sergei; Koepfli, Klaus-Peter; Wayne, Robert K.

Sci Rep ; 9(1): 8329, 2019 06 06.

Artigo em Inglês | MEDLINE | ID: mdl-31171819

RESUMO

Within the Canidae, the African wild dog (Lycaon pictus) is the most specialized with regards to cursorial adaptations (specialized for running), having only four digits on their forefeet. In addition, this species is one of the few canids considered to be an obligate meat-eater, possessing a robust dentition for taking down large prey, and displays one of the most variable coat colorations amongst mammals. Here, we used comparative genomic analysis to investigate the evolutionary history and genetic basis for adaptations associated with cursoriality, hypercanivory, and coat color variation in African wild dogs. Genome-wide scans revealed unique amino acid deletions that suggest a mode of evolutionary digit loss through expanded apoptosis in the developing first digit. African wild dog-specific signals of positive selection also uncovered a putative mechanism of molar cusp modification through changes in genes associated with the sonic hedgehog (SHH) signaling pathway, required for spatial patterning of teeth, and three genes associated with pigmentation. Divergence time analyses suggest the suite of genomic changes we identified evolved ~1.7 Mya, coinciding with the diversification of large-bodied ungulates. Our results show that comparative genomics is a powerful tool for identifying the genetic basis of evolutionary changes in Canidae.

Assuntos

Adaptação Fisiológica , Evolução Biológica , Canidae/genética , Genômica , Animais , Animais Selvagens/genética , Padronização Corporal , Biologia Computacional , DNA/análise , Dieta , Feminino , Genótipo , Proteínas Hedgehog/genética , Dente Molar , Método de Monte Carlo , Pigmentação , Comportamento Predatório

20.

Neighbor joining algorithms for inferring phylogenies via LCA distances.

Gronau, Ilan; Moran, Shlomo.

J Comput Biol ; 14(1): 1-15, 2007.

Artigo em Inglês | MEDLINE | ID: mdl-17381342

RESUMO

Reconstructing phylogenetic trees efficiently and accurately from distance estimates is an ongoing challenge in computational biology from both practical and theoretical considerations. We study algorithms which are based on a characterization of edge-weighted trees by distances to LCAs (Least Common Ancestors). This characterization enables a direct application of ultrametric reconstruction techniques to trees which are not necessarily ultrametric. A simple and natural neighbor joining criterion based on this observation is used to provide a family of efficient neighbor-joining algorithms. These algorithms are shown to reconstruct a refinement of the Buneman tree, which implies optimal robustness to noise under criteria defined by Atteson. In this sense, they outperform many popular algorithms such as Saitou and Nei's NJ. One member of this family is used to provide a new simple version of the 3-approximation algorithm for the closest additive metric under the iota (infinity) norm. A byproduct of our work is a novel technique which yields a time optimal O (n (2)) implementation of common clustering algorithms such as UPGMA.

Assuntos

Algoritmos , Evolução Molecular , Modelos Genéticos , Filogenia

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA