Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 96
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
BMC Biol ; 19(1): 203, 2021 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-34526021

RESUMO

BACKGROUND: Silencing of transposable elements (TEs) is essential for maintaining genome stability. Plants use small RNAs (sRNAs) to direct DNA methylation to TEs (RNA-directed DNA methylation; RdDM). Similar mechanisms of epigenetic silencing in the fungal kingdom have remained elusive. RESULTS: We use sRNA sequencing and methylation data to gain insight into epigenetics in the dikaryotic fungus Puccinia graminis f. sp. tritici (Pgt), which causes the devastating stem rust disease on wheat. We use Hi-C data to define the Pgt centromeres and show that they are repeat-rich regions (~250 kb) that are highly diverse in sequence between haplotypes and, like in plants, are enriched for young TEs. DNA cytosine methylation is particularly active at centromeres but also associated with genome-wide control of young TE insertions. Strikingly, over 90% of Pgt sRNAs and several RNAi genes are differentially expressed during infection. Pgt induces waves of functionally diversified sRNAs during infection. The early wave sRNAs are predominantly 21 nts with a 5' uracil derived from genes. In contrast, the late wave sRNAs are mainly 22-nt sRNAs with a 5' adenine and are strongly induced from centromeric regions. TEs that overlap with late wave sRNAs are more likely to be methylated, both inside and outside the centromeres, and methylated TEs exhibit a silencing effect on nearby genes. CONCLUSIONS: We conclude that rust fungi use an epigenetic silencing pathway that might have similarity with RdDM in plants. The Pgt RNAi machinery and sRNAs are under tight temporal control throughout infection and might ensure genome stability during sporulation.


Assuntos
Basidiomycota , Metilação de DNA , Puccinia , Basidiomycota/genética , Centrômero , Metilação de DNA/genética , Elementos de DNA Transponíveis , Instabilidade Genômica , Humanos , Doenças das Plantas/genética , Puccinia/patogenicidade , RNA
2.
Genome Res ; 25(5): 762-74, 2015 May.
Artigo em Inglês | MEDLINE | ID: mdl-25840857

RESUMO

Saccharomyces cerevisiae, a well-established model for species as diverse as humans and pathogenic fungi, is more recently a model for population and quantitative genetics. S. cerevisiae is found in multiple environments-one of which is the human body-as an opportunistic pathogen. To aid in the understanding of the S. cerevisiae population and quantitative genetics, as well as its emergence as an opportunistic pathogen, we sequenced, de novo assembled, and extensively manually edited and annotated the genomes of 93 S. cerevisiae strains from multiple geographic and environmental origins, including many clinical origin strains. These 93 S. cerevisiae strains, the genomes of which are near-reference quality, together with seven previously sequenced strains, constitute a novel genetic resource, the "100-genomes" strains. Our sequencing coverage, high-quality assemblies, and annotation provide unprecedented opportunities for detailed interrogation of complex genomic loci, examples of which we demonstrate. We found most phenotypic variation to be quantitative and identified population, genotype, and phenotype associations. Importantly, we identified clinical origin associations. For example, we found that an introgressed PDR5 was present exclusively in clinical origin mosaic group strains; that the mosaic group was significantly enriched for clinical origin strains; and that clinical origin strains were much more copper resistant, suggesting that copper resistance contributes to fitness in the human host. The 100-genomes strains are a novel, multipurpose resource to advance the study of S. cerevisiae population genetics, quantitative genetics, and the emergence of an opportunistic pathogen.


Assuntos
Mapeamento de Sequências Contíguas/métodos , Genoma Fúngico , Genótipo , Fenótipo , Polimorfismo Genético , Saccharomyces cerevisiae/genética , Alinhamento de Sequência/métodos , Filogenia , Saccharomyces cerevisiae/classificação , Saccharomyces cerevisiae/patogenicidade , Virulência/genética
3.
Nature ; 482(7384): 173-8, 2012 Feb 08.
Artigo em Inglês | MEDLINE | ID: mdl-22318601

RESUMO

A major challenge of biology is understanding the relationship between molecular genetic variation and variation in quantitative traits, including fitness. This relationship determines our ability to predict phenotypes from genotypes and to understand how evolutionary forces shape variation within and between species. Previous efforts to dissect the genotype-phenotype map were based on incomplete genotypic information. Here, we describe the Drosophila melanogaster Genetic Reference Panel (DGRP), a community resource for analysis of population genomics and quantitative traits. The DGRP consists of fully sequenced inbred lines derived from a natural population. Population genomic analyses reveal reduced polymorphism in centromeric autosomal regions and the X chromosome, evidence for positive and negative selection, and rapid evolution of the X chromosome. Many variants in novel genes, most at low frequency, are associated with quantitative traits and explain a large fraction of the phenotypic variance. The DGRP facilitates genotype-phenotype mapping using the power of Drosophila genetics.


Assuntos
Drosophila melanogaster/genética , Estudo de Associação Genômica Ampla , Genômica , Locos de Características Quantitativas/genética , Alelos , Animais , Centrômero/genética , Cromossomos de Insetos/genética , Genótipo , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Seleção Genética/genética , Inanição/genética , Telômero/genética , Cromossomo X/genética
4.
Proc Natl Acad Sci U S A ; 112(44): E6010-9, 2015 Nov 03.
Artigo em Inglês | MEDLINE | ID: mdl-26483487

RESUMO

Understanding how DNA sequence variation is translated into variation for complex phenotypes has remained elusive but is essential for predicting adaptive evolution, for selecting agriculturally important animals and crops, and for personalized medicine. Gene expression may provide a link between variation in DNA sequence and organismal phenotypes, and its abundance can be measured efficiently and accurately. Here we quantified genome-wide variation in gene expression in the sequenced inbred lines of the Drosophila melanogaster Genetic Reference Panel (DGRP), increasing the annotated Drosophila transcriptome by 11%, including thousands of novel transcribed regions (NTRs). We found that 42% of the Drosophila transcriptome is genetically variable in males and females, including the NTRs, and is organized into modules of genetically correlated transcripts. We found that NTRs often were negatively correlated with the expression of protein-coding genes, which we exploited to annotate NTRs functionally. We identified regulatory variants for the mean and variance of gene expression, which have largely independent genetic control. Expression quantitative trait loci (eQTLs) for the mean, but not for the variance, of gene expression were concentrated near genes. Notably, the variance eQTLs often interacted epistatically with local variants in these genes to regulate gene expression. This comprehensive characterization of population-scale diversity of transcriptomes and its genetic basis in the DGRP is critically important for a systems understanding of quantitative trait variation.


Assuntos
Drosophila melanogaster/genética , Transcriptoma , Animais , Epistasia Genética , Locos de Características Quantitativas
5.
Genome Res ; 24(7): 1193-208, 2014 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-24714809

RESUMO

The Drosophila melanogaster Genetic Reference Panel (DGRP) is a community resource of 205 sequenced inbred lines, derived to improve our understanding of the effects of naturally occurring genetic variation on molecular and organismal phenotypes. We used an integrated genotyping strategy to identify 4,853,802 single nucleotide polymorphisms (SNPs) and 1,296,080 non-SNP variants. Our molecular population genomic analyses show higher deletion than insertion mutation rates and stronger purifying selection on deletions. Weaker selection on insertions than deletions is consistent with our observed distribution of genome size determined by flow cytometry, which is skewed toward larger genomes. Insertion/deletion and single nucleotide polymorphisms are positively correlated with each other and with local recombination, suggesting that their nonrandom distributions are due to hitchhiking and background selection. Our cytogenetic analysis identified 16 polymorphic inversions in the DGRP. Common inverted and standard karyotypes are genetically divergent and account for most of the variation in relatedness among the DGRP lines. Intriguingly, variation in genome size and many quantitative traits are significantly associated with inversions. Approximately 50% of the DGRP lines are infected with Wolbachia, and four lines have germline insertions of Wolbachia sequences, but effects of Wolbachia infection on quantitative traits are rarely significant. The DGRP complements ongoing efforts to functionally annotate the Drosophila genome. Indeed, 15% of all D. melanogaster genes segregate for potentially damaged proteins in the DGRP, and genome-wide analyses of quantitative traits identify novel candidate genes. The DGRP lines, sequence data, genotypes, quality scores, phenotypes, and analysis and visualization tools are publicly available.


Assuntos
Drosophila melanogaster/genética , Variação Genética , Genoma de Inseto , Fenótipo , Animais , Cromatina/genética , Cromatina/metabolismo , Drosophila melanogaster/microbiologia , Feminino , Ligação Genética , Tamanho do Genoma , Estudo de Associação Genômica Ampla , Genótipo , Técnicas de Genotipagem , Sequenciamento de Nucleotídeos em Larga Escala , Mutação INDEL , Desequilíbrio de Ligação , Masculino , Anotação de Sequência Molecular , Polimorfismo de Nucleotídeo Único , Característica Quantitativa Herdável , Reprodutibilidade dos Testes
6.
Mol Biol Evol ; 31(2): 425-33, 2014 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-24214536

RESUMO

Gene conversion is the nonreciprocal exchange of genetic material between homologous chromosomes. Multiple lines of evidence from a variety of taxa strongly suggest that gene conversion events are biased toward GC-bearing alleles. However, in Drosophila, the data have largely been indirect and unclear, with some studies supporting the predictions of a GC-biased gene conversion model and other data showing contradictory findings. Here, we test whether gene conversion events are GC-biased in Drosophila melanogaster using whole-genome polymorphism and divergence data. Our results provide no support for GC-biased gene conversion and thus suggest that this process is unlikely to significantly contribute to patterns of polymorphism and divergence in this system.


Assuntos
Citosina/metabolismo , Drosophila melanogaster/genética , Conversão Gênica , Guanina/metabolismo , Alelos , Animais , Cromossomos de Insetos , Evolução Molecular , Genoma de Inseto , Genômica , Taxa de Mutação , Filogenia , Polimorfismo Genético
7.
Genome Res ; 22(5): 966-74, 2012 May.
Artigo em Inglês | MEDLINE | ID: mdl-22367192

RESUMO

High-throughput sequencing is enabling remarkably deep surveys of genomic variation. It is now possible to completely sequence multiple individuals from a single species, yet the identification of variation among them remains an evolving computational challenge. This challenge is compounded for experimental organisms when strains are studied instead of individuals. In response, we present the Joint Genotyper for Inbred Lines (JGIL) as a method for obtaining genotypes and identifying variation among a large panel of inbred strains or lines. JGIL inputs the sequence reads from each line after their alignment to a common reference. Its probabilistic model includes site-specific parameters common to all lines that describe the frequency of nucleotides segregating in the population from which the inbred panel was derived. The distribution of line genotypes is conditional on these parameters and reflects the experimental design. Site-specific error probabilities, also common to all lines, parameterize the distribution of reads conditional on line genotype and realized coverage. Both sets of parameters are estimated per site from the aggregate read data, and posterior probabilities are calculated to decode the genotype of each line. We present an application of JGIL to 162 inbred Drosophila melanogaster lines from the Drosophila Genetic Reference Panel. We explore by simulation the effect of varying coverage, sequencing error, mapping error, and the number of lines. In doing so, we illustrate how JGIL is robust to moderate levels of error. Supported by these analyses, we advocate the importance of modeling the data and the experimental design when possible.


Assuntos
Drosophila melanogaster/genética , Variação Genética , Técnicas de Genotipagem , Algoritmos , Animais , Mapeamento Cromossômico , Simulação por Computador , Técnicas de Genotipagem/normas , Endogamia , Funções Verossimilhança , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Padrões de Referência , Análise de Sequência de DNA
8.
PLoS Pathog ; 9(8): e1003574, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24009506

RESUMO

Aflatoxins are produced by Aspergillus flavus and A. parasiticus in oil-rich seed and grain crops and are a serious problem in agriculture, with aflatoxin B1 being the most carcinogenic natural compound known. Sexual reproduction in these species occurs between individuals belonging to different vegetative compatibility groups (VCGs). We examined natural genetic variation in 758 isolates of A. flavus, A. parasiticus and A. minisclerotigenes sampled from single peanut fields in the United States (Georgia), Africa (Benin), Argentina (Córdoba), Australia (Queensland) and India (Karnataka). Analysis of DNA sequence variation across multiple intergenic regions in the aflatoxin gene clusters of A. flavus, A. parasiticus and A. minisclerotigenes revealed significant linkage disequilibrium (LD) organized into distinct blocks that are conserved across different localities, suggesting that genetic recombination is nonrandom and a global occurrence. To assess the contributions of asexual and sexual reproduction to fixation and maintenance of toxin chemotype diversity in populations from each locality/species, we tested the null hypothesis of an equal number of MAT1-1 and MAT1-2 mating-type individuals, which is indicative of a sexually recombining population. All samples were clone-corrected using multi-locus sequence typing which associates closely with VCG. For both A. flavus and A. parasiticus, when the proportions of MAT1-1 and MAT1-2 were significantly different, there was more extensive LD in the aflatoxin cluster and populations were fixed for specific toxin chemotype classes, either the non-aflatoxigenic class in A. flavus or the B1-dominant and G1-dominant classes in A. parasiticus. A mating type ratio close to 1∶1 in A. flavus, A. parasiticus and A. minisclerotigenes was associated with higher recombination rates in the aflatoxin cluster and less pronounced chemotype differences in populations. This work shows that the reproductive nature of the population (more sexual versus more asexual) is predictive of aflatoxin chemotype diversity in these agriculturally important fungi.


Assuntos
Aflatoxinas/biossíntese , Aspergillus flavus/metabolismo , Proteínas Fúngicas/metabolismo , Genes Fúngicos/fisiologia , Família Multigênica/fisiologia , Proteínas Repressoras/metabolismo , Aflatoxinas/genética , Aspergillus flavus/genética , Proteínas Fúngicas/genética , Proteínas Repressoras/genética , Especificidade da Espécie
9.
Mol Ecol ; 24(8): 1889-909, 2015 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-25773520

RESUMO

Aspergillus flavus and A. parasiticus are the two most important aflatoxin-producing fungi responsible for the contamination of agricultural commodities worldwide. Both species are heterothallic and undergo sexual reproduction in laboratory crosses. Here we examine the possibility of interspecific matings between A. flavus and A. parasiticus. These species can be distinguished morphologically and genetically, as well as by their mycotoxin profiles. Aspergillus flavus produces both B aflatoxins and cyclopiazonic acid (CPA), B aflatoxins or CPA alone, or neither mycotoxin; Aspergillus parasiticus produces B and G aflatoxins or the aflatoxin precursor O-methylsterigmatocystin, but not CPA. Only four of forty-five attempted interspecific crosses between opposite mating types of A. flavus and A. parasiticus were fertile and produced viable ascospores. Single ascospore strains from each cross were shown to be recombinant hybrids using multilocus genotyping and array comparative genome hybridization. Conidia of parents and their hybrid progeny were haploid and predominantly monokaryons and dikaryons based on flow cytometry. Multilocus phylogenetic inference showed that experimental hybrid progeny were grouped with naturally occurring A. flavus L strain and A. parasiticus. Higher total aflatoxin concentrations in some F1 progeny strains compared to midpoint parent aflatoxin levels indicate synergism in aflatoxin production; moreover, three progeny strains synthesized G aflatoxins that were not produced by the parents, and there was evidence of allopolyploidization in one strain. These results suggest that hybridization is an important diversifying force resulting in the genesis of novel toxin profiles in these agriculturally important fungi.


Assuntos
Aflatoxinas/biossíntese , Aspergillus flavus/genética , Aspergillus/genética , Hibridização Genética , Aspergillus/classificação , Hibridização Genômica Comparativa , Genes Fúngicos Tipo Acasalamento , Genótipo , Técnicas de Genotipagem , Dados de Sequência Molecular , Fenótipo , Filogenia , Análise de Sequência de DNA , Esterigmatocistina/análogos & derivados , Esterigmatocistina/biossíntese
10.
Nat Rev Genet ; 10(8): 565-77, 2009 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-19584810

RESUMO

A major challenge in current biology is to understand the genetic basis of variation for quantitative traits. We review the principles of quantitative trait locus mapping and summarize insights about the genetic architecture of quantitative traits that have been obtained over the past decades. We are currently in the midst of a genomic revolution, which enables us to incorporate genetic variation in transcript abundance and other intermediate molecular phenotypes into a quantitative trait locus mapping framework. This systems genetics approach enables us to understand the biology inside the 'black box' that lies between genotype and phenotype in terms of causal networks of interacting genes.


Assuntos
Ligação Genética , Característica Quantitativa Herdável , Animais , Mapeamento Cromossômico , Humanos
11.
PLoS Genet ; 8(3): e1002593, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22479193

RESUMO

Phenotypic plasticity is the ability of a single genotype to produce different phenotypes in response to changing environments. We assessed variation in genome-wide gene expression and four fitness-related phenotypes of an outbred Drosophila melanogaster population under 20 different physiological, social, nutritional, chemical, and physical environments; and we compared the phenotypically plastic transcripts to genetically variable transcripts in a single environment. The environmentally sensitive transcriptome consists of two transcript categories, which comprise ∼15% of expressed transcripts. Class I transcripts are genetically variable and associated with detoxification, metabolism, proteolysis, heat shock proteins, and transcriptional regulation. Class II transcripts have low genetic variance and show sexually dimorphic expression enriched for reproductive functions. Clustering analysis of Class I transcripts reveals a fragmented modular organization and distinct environmentally responsive transcriptional signatures for the four fitness-related traits. Our analysis suggests that a restricted environmentally responsive segment of the transcriptome preserves the balance between phenotypic plasticity and environmental canalization.


Assuntos
Drosophila melanogaster , Interação Gene-Ambiente , Aptidão Genética , Transcriptoma , Animais , Análise por Conglomerados , Drosophila melanogaster/genética , Drosophila melanogaster/fisiologia , Regulação da Expressão Gênica , Genoma de Inseto , Fenótipo
12.
PLoS Genet ; 8(11): e1003055, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-23189034

RESUMO

Understanding the relationship between genetic and phenotypic variation is one of the great outstanding challenges in biology. To meet this challenge, comprehensive genomic variation maps of human as well as of model organism populations are required. Here, we present a nucleotide resolution catalog of single-nucleotide, multi-nucleotide, and structural variants in 39 Drosophila melanogaster Genetic Reference Panel inbred lines. Using an integrative, local assembly-based approach for variant discovery, we identify more than 3.6 million distinct variants, among which were more than 800,000 unique insertions, deletions (indels), and complex variants (1 to 6,000 bp). While the SNP density is higher near other variants, we find that variants themselves are not mutagenic, nor are regions with high variant density particularly mutation-prone. Rather, our data suggest that the elevated SNP density around variants is mainly due to population-level processes. We also provide insights into the regulatory architecture of gene expression variation in adult flies by mapping cis-expression quantitative trait loci (cis-eQTLs) for more than 2,000 genes. Indels comprise around 10% of all cis-eQTLs and show larger effects than SNP cis-eQTLs. In addition, we identified two-fold more gene associations in males as compared to females and found that most cis-eQTLs are sex-specific, revealing a partial decoupling of the genomic architecture between the sexes as well as the importance of genetic factors in mediating sex-biased gene expression. Finally, we performed RNA-seq-based allelic expression imbalance analyses in the offspring of crosses between sequenced lines, which revealed that the majority of strong cis-eQTLs can be validated in heterozygous individuals.


Assuntos
Drosophila melanogaster/genética , Expressão Gênica , Variação Genética , Genoma , Desequilíbrio Alélico/genética , Animais , Mapeamento Cromossômico , Mutação INDEL , Fenótipo , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas/genética
13.
PLoS Genet ; 8(5): e1002685, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22570636

RESUMO

Predicting organismal phenotypes from genotype data is important for plant and animal breeding, medicine, and evolutionary biology. Genomic-based phenotype prediction has been applied for single-nucleotide polymorphism (SNP) genotyping platforms, but not using complete genome sequences. Here, we report genomic prediction for starvation stress resistance and startle response in Drosophila melanogaster, using ∼2.5 million SNPs determined by sequencing the Drosophila Genetic Reference Panel population of inbred lines. We constructed a genomic relationship matrix from the SNP data and used it in a genomic best linear unbiased prediction (GBLUP) model. We assessed predictive ability as the correlation between predicted genetic values and observed phenotypes by cross-validation, and found a predictive ability of 0.239±0.008 (0.230±0.012) for starvation resistance (startle response). The predictive ability of BayesB, a Bayesian method with internal SNP selection, was not greater than GBLUP. Selection of the 5% SNPs with either the highest absolute effect or variance explained did not improve predictive ability. Predictive ability decreased only when fewer than 150,000 SNPs were used to construct the genomic relationship matrix. We hypothesize that predictive power in this population stems from the SNP-based modeling of the subtle relationship structure caused by long-range linkage disequilibrium and not from population structure or SNPs in linkage disequilibrium with causal variants. We discuss the implications of these results for genomic prediction in other organisms.


Assuntos
Drosophila melanogaster/genética , Genoma de Inseto , Genótipo , Locos de Características Quantitativas , Animais , Teorema de Bayes , Mapeamento Cromossômico , Genética Populacional , Desequilíbrio de Ligação , Modelos Genéticos , Modelos Teóricos , Fenótipo , Polimorfismo de Nucleotídeo Único , Seleção Genética , Análise de Sequência de DNA
14.
Proc Natl Acad Sci U S A ; 109(39): 15553-9, 2012 Sep 25.
Artigo em Inglês | MEDLINE | ID: mdl-22949659

RESUMO

Epistasis-nonlinear genetic interactions between polymorphic loci-is the genetic basis of canalization and speciation, and epistatic interactions can be used to infer genetic networks affecting quantitative traits. However, the role that epistasis plays in the genetic architecture of quantitative traits is controversial. Here, we compared the genetic architecture of three Drosophila life history traits in the sequenced inbred lines of the Drosophila melanogaster Genetic Reference Panel (DGRP) and a large outbred, advanced intercross population derived from 40 DGRP lines (Flyland). We assessed allele frequency changes between pools of individuals at the extremes of the distribution for each trait in the Flyland population by deep DNA sequencing. The genetic architecture of all traits was highly polygenic in both analyses. Surprisingly, none of the SNPs associated with the traits in Flyland replicated in the DGRP and vice versa. However, the majority of these SNPs participated in at least one epistatic interaction in the DGRP. Despite apparent additive effects at largely distinct loci in the two populations, the epistatic interactions perturbed common, biologically plausible, and highly connected genetic networks. Our analysis underscores the importance of epistasis as a principal factor that determines variation for quantitative traits and provides a means to uncover genetic networks affecting these traits. Knowledge of epistatic networks will contribute to our understanding of the genetic basis of evolutionarily and clinically important traits and enhance predictive ability at an individualized level in medicine and agriculture.


Assuntos
Epistasia Genética/fisiologia , Genes de Insetos/fisiologia , Característica Quantitativa Herdável , Animais , Drosophila melanogaster , Polimorfismo de Nucleotídeo Único
15.
PLoS Genet ; 7(2): e1001318, 2011 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-21383861

RESUMO

Deep resequencing of functional regions in human genomes is key to identifying potentially causal rare variants for complex disorders. Here, we present the results from a large-sample resequencing (n  =  285 patients) study of candidate genes coupled with population genetics and statistical methods to identify rare variants associated with Autism Spectrum Disorder and Schizophrenia. Three genes, MAP1A, GRIN2B, and CACNA1F, were consistently identified by different methods as having significant excess of rare missense mutations in either one or both disease cohorts. In a broader context, we also found that the overall site frequency spectrum of variation in these cases is best explained by population models of both selection and complex demography rather than neutral models or models accounting for complex demography alone. Mutations in the three disease-associated genes explained much of the difference in the overall site frequency spectrum among the cases versus controls. This study demonstrates that genes associated with complex disorders can be mapped using resequencing and analytical methods with sample sizes far smaller than those required by genome-wide association studies. Additionally, our findings support the hypothesis that rare mutations account for a proportion of the phenotypic variance of these complex disorders.


Assuntos
Transtornos Globais do Desenvolvimento Infantil/genética , Genética Populacional , Esquizofrenia/genética , Criança , Mapeamento Cromossômico , Estudos de Coortes , Feminino , Loci Gênicos , Humanos , Masculino , Mutação , Polimorfismo de Nucleotídeo Único , Seleção Genética , Análise de Sequência de DNA
16.
Discrete Appl Math ; 117: 152-157, 2014 Nov 20.
Artigo em Inglês | MEDLINE | ID: mdl-25400306

RESUMO

It is well known that information about the structure of a graph is contained within its minimum cut. Here we investigate how the minimum cut of one graph informs the structure of a second, related graph. We consider pairs of graphs G and H, with respective Laplacian matrices L and M, and call H partially supplied provided M is a Schur complement of L. Our results show how the minimum cut of H relates to the structure of the larger graph G.

17.
Nat Med ; 30(7): 1952-1961, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38760587

RESUMO

Precision in the diagnosis of diverse central nervous system (CNS) tumor types is crucial for optimal treatment. DNA methylation profiles, which capture the methylation status of thousands of individual CpG sites, are state-of-the-art data-driven means to enhance diagnostic accuracy but are also time consuming and not widely available. Here, to address these limitations, we developed Deep lEarning from histoPathoLOgy and methYlation (DEPLOY), a deep learning model that classifies CNS tumors to ten major categories from histopathology. DEPLOY integrates three distinct components: the first classifies CNS tumors directly from slide images ('direct model'), the second initially generates predictions for DNA methylation beta values, which are subsequently used for tumor classification ('indirect model'), and the third classifies tumor types directly from routinely available patient demographics. First, we find that DEPLOY accurately predicts beta values from histopathology images. Second, using a ten-class model trained on an internal dataset of 1,796 patients, we predict the tumor categories in three independent external test datasets including 2,156 patients, achieving an overall accuracy of 95% and balanced accuracy of 91% on samples that are predicted with high confidence. These results showcase the potential future use of DEPLOY to assist pathologists in diagnosing CNS tumors within a clinically relevant short time frame.


Assuntos
Neoplasias do Sistema Nervoso Central , Metilação de DNA , Aprendizado Profundo , Humanos , Neoplasias do Sistema Nervoso Central/genética , Neoplasias do Sistema Nervoso Central/patologia , Ilhas de CpG/genética , Feminino , Masculino
18.
Nat Cancer ; 5(9): 1305-1317, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-38961276

RESUMO

Advances in artificial intelligence have paved the way for leveraging hematoxylin and eosin-stained tumor slides for precision oncology. We present ENLIGHT-DeepPT, an indirect two-step approach consisting of (1) DeepPT, a deep-learning framework that predicts genome-wide tumor mRNA expression from slides, and (2) ENLIGHT, which predicts response to targeted and immune therapies from the inferred expression values. We show that DeepPT successfully predicts transcriptomics in all 16 The Cancer Genome Atlas cohorts tested and generalizes well to two independent datasets. ENLIGHT-DeepPT successfully predicts true responders in five independent patient cohorts involving four different treatments spanning six cancer types, with an overall odds ratio of 2.28 and a 39.5% increased response rate among predicted responders versus the baseline rate. Notably, its prediction accuracy, obtained without any training on the treatment data, is comparable to that achieved by directly predicting the response from the images, which requires specific training on the treatment evaluation cohorts.


Assuntos
Aprendizado Profundo , Neoplasias , Transcriptoma , Humanos , Neoplasias/genética , Neoplasias/patologia , Neoplasias/terapia , Perfilação da Expressão Gênica/métodos , Resultado do Tratamento , Medicina de Precisão/métodos
19.
Am J Hum Genet ; 87(3): 316-24, 2010 Sep 10.
Artigo em Inglês | MEDLINE | ID: mdl-20797689

RESUMO

The role of de novo mutations (DNMs) in common diseases remains largely unknown. Nonetheless, the rate of de novo deleterious mutations and the strength of selection against de novo mutations are critical to understanding the genetic architecture of a disease. Discovery of high-impact DNMs requires substantial high-resolution interrogation of partial or complete genomes of families via resequencing. We hypothesized that deleterious DNMs may play a role in cases of autism spectrum disorders (ASD) and schizophrenia (SCZ), two etiologically heterogeneous disorders with significantly reduced reproductive fitness. We present a direct measure of the de novo mutation rate (µ) and selective constraints from DNMs estimated from a deep resequencing data set generated from a large cohort of ASD and SCZ cases (n = 285) and population control individuals (n = 285) with available parental DNA. A survey of ∼430 Mb of DNA from 401 synapse-expressed genes across all cases and 25 Mb of DNA in controls found 28 candidate DNMs, 13 of which were cell line artifacts. Our calculated direct neutral mutation rate (1.36 × 10(-8)) is similar to previous indirect estimates, but we observed a significant excess of potentially deleterious DNMs in ASD and SCZ individuals. Our results emphasize the importance of DNMs as genetic mechanisms in ASD and SCZ and the limitations of using DNA from archived cell lines to identify functional variants.


Assuntos
Transtorno Autístico/genética , Análise Mutacional de DNA/métodos , Mutagênese/genética , Mutação/genética , Esquizofrenia/genética , Pareamento de Bases/genética , Linhagem Celular , Segregação de Cromossomos/genética , Estudos de Coortes , Família , Feminino , Regulação da Expressão Gênica , Humanos , Masculino
20.
Genome Res ; 20(1): 142-54, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19846609

RESUMO

ProPhylER (Protein Phylogeny and Evolutionary Rates) is a next-generation curated proteome resource that uses comparative sequence analysis to predict constraint and mutation impact for eukaryotic proteins. Its purpose is to inform any research program for which protein function and structure are relevant, by the predictive power of evolutionary constraint analyses. ProPhylER currently has nearly 9000 clusters of related proteins, including more than 200,000 sequences. It serves data via two interfaces. The "ProPhylER Interface" displays predictive analyses in sequence space; the "CrystalPainter" maps evolutionary constraints onto solved protein structures. Here we summarize ProPhylER's data content and analysis pipeline, demonstrate the use of ProPhylER's interfaces, and evaluate ProPhylER's unique regional analysis of evolutionary constraint. The high accuracy of ProPhylER's regional analysis complements the high resolution of its single-site analysis to effectively guide and inform structure-function investigations and predict the impact of polymorphisms.


Assuntos
Bases de Dados de Proteínas , Eucariotos , Evolução Molecular , Internet , Filogenia , Proteínas , Eucariotos/genética , Eucariotos/metabolismo , Polimorfismo de Nucleotídeo Único , Proteínas/química , Proteínas/genética , Proteínas/metabolismo , Relação Estrutura-Atividade , Interface Usuário-Computador
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA