RESUMO
The mutational landscape is shaped by many processes. Genic regions are vulnerable to mutation but are preferentially protected by transcription-coupled repair1. In microorganisms, transcription has been demonstrated to be mutagenic2,3; however, the impact of transcription-associated mutagenesis remains to be established in higher eukaryotes4. Here we show that ID4-a cancer insertion-deletion (indel) mutation signature of unknown aetiology5 characterized by short (2 to 5 base pair) deletions -is due to a transcription-associated mutagenesis process. We demonstrate that defective ribonucleotide excision repair in mammals is associated with the ID4 signature, with mutations occurring at a TNT sequence motif, implicating topoisomerase 1 (TOP1) activity at sites of genome-embedded ribonucleotides as a mechanistic basis. Such TOP1-mediated deletions occur somatically in cancer, and the ID-TOP1 signature is also found in physiological settings, contributing to genic de novo indel mutations in the germline. Thus, although topoisomerases protect against genome instability by relieving topological stress6, their activity may also be an important source of mutations in the human genome.
Assuntos
DNA Topoisomerases Tipo I , Células Germinativas , Mutagênese , Neoplasias , Animais , Reparo do DNA/genética , DNA Topoisomerases Tipo I/metabolismo , Células Germinativas/metabolismo , Humanos , Mutagênese/genética , Mutação , Neoplasias/genética , Ribonucleotídeos/genéticaRESUMO
The 100 000 Genome Project aims to develop a diagnostics platform by introducing whole genome sequencing (WGS) into clinical practice. Samples from patients with chronic lymphocytic leukaemia were subjected to WGS. WGS detection of single nucleotide variants and insertion/deletions were validated by targeted next generation sequencing showing high concordance (96·3%), also for detection of sub-clonal variants and low-frequency TP53 variants. Copy number alteration detection was verified by fluorescent in situ hybridisation and genome-wide single nucleotide polymorphism array (concordances of 86·7% and 92·9%, respectively), confirming adequate sensitivity by WGS. Our results confirm that WGS can provide comprehensive genomic characterisation for clinical trials, drug discovery and, ultimately, precision medicine.
Assuntos
Leucemia Linfocítica Crônica de Células B/genética , Sequenciamento Completo do Genoma/normas , Adulto , Idoso , Variações do Número de Cópias de DNA/genética , Feminino , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Mutação INDEL/genética , Masculino , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
PURPOSE: Fresh-frozen (FF) tissue is the optimal source of DNA for whole-genome sequencing (WGS) of cancer patients. However, it is not always available, limiting the widespread application of WGS in clinical practice. We explored the viability of using formalin-fixed, paraffin-embedded (FFPE) tissues, available routinely for cancer patients, as a source of DNA for clinical WGS. METHODS: We conducted a prospective study using DNAs from matched FF, FFPE, and peripheral blood germ-line specimens collected from 52 cancer patients (156 samples) following routine diagnostic protocols. We compared somatic variants detected in FFPE and matching FF samples. RESULTS: We found the single-nucleotide variant agreement reached 71% across the genome and somatic copy-number alterations (CNAs) detection from FFPE samples was suboptimal (0.44 median correlation with FF) due to nonuniform coverage. CNA detection was improved significantly with lower reverse crosslinking temperature in FFPE DNA extraction (80 °C or 65 °C depending on the methods). Our final data showed somatic variant detection from FFPE for clinical decision making is possible. We detected 98% of clinically actionable variants (including 30/31 CNAs). CONCLUSION: We present the first prospective WGS study of cancer patients using FFPE specimens collected in a routine clinical environment proving WGS can be applied in the clinic.
Assuntos
Variações do Número de Cópias de DNA/genética , Genoma Humano/genética , Neoplasias/genética , Sequenciamento Completo do Genoma/métodos , Tomada de Decisões , Feminino , Humanos , Masculino , Neoplasias/sangue , Neoplasias/patologia , Inclusão em Parafina , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
The nonrecombining regions of animal Y chromosomes are known to undergo genetic degeneration, but previous work has failed to reveal large-scale gene degeneration on plant Y chromosomes. Here, we uncover rapid and extensive degeneration of Y-linked genes in a plant species, Silene latifolia, that evolved sex chromosomes de novo in the last 10 million years. Previous transcriptome-based studies of this species missed unexpressed, degenerate Y-linked genes. To identify sex-linked genes, regardless of their expression, we sequenced male and female genomes of S. latifolia and integrated the genomic contigs with a high-density genetic map. This revealed that 45% of Y-linked genes are not expressed, and 23% are interrupted by premature stop codons. This contrasts with X-linked genes, in which only 1.3% of genes contained stop codons and 4.3% of genes were not expressed in males. Loss of functional Y-linked genes is partly compensated for by gene-specific up-regulation of X-linked genes. Our results demonstrate that the rate of genetic degeneration of Y-linked genes in S. latifolia is as fast as in animals, and that the evolutionary trajectories of sex chromosomes are similar in the two kingdoms.
Assuntos
Cromossomos de Plantas , Mecanismo Genético de Compensação de Dose , Plantas/genética , Cromossomos Sexuais , Silene/genética , Ligação GenéticaRESUMO
The value of genome-wide over targeted driver analyses for predicting clinical outcomes of cancer patients is debated. Here, we report the whole-genome sequencing of 485 chronic lymphocytic leukemia patients enrolled in clinical trials as part of the United Kingdom's 100,000 Genomes Project. We identify an extended catalog of recurrent coding and noncoding genetic mutations that represents a source for future studies and provide the most complete high-resolution map of structural variants, copy number changes and global genome features including telomere length, mutational signatures and genomic complexity. We demonstrate the relationship of these features with clinical outcome and show that integration of 186 distinct recurrent genomic alterations defines five genomic subgroups that associate with response to therapy, refining conventional outcome prediction. While requiring independent validation, our findings highlight the potential of whole-genome sequencing to inform future risk stratification in chronic lymphocytic leukemia.
Assuntos
Leucemia Linfocítica Crônica de Células B , Humanos , Leucemia Linfocítica Crônica de Células B/genética , Sequenciamento Completo do Genoma , Mutação , Genômica , PrognósticoRESUMO
Cannabis sativa has long been an important source of fiber extracted from hemp and both medicinal and recreational drugs based on cannabinoid compounds. Here, we investigated its poorly known domestication history using whole-genome resequencing of 110 accessions from worldwide origins. We show that C. sativa was first domesticated in early Neolithic times in East Asia and that all current hemp and drug cultivars diverged from an ancestral gene pool currently represented by feral plants and landraces in China. We identified candidate genes associated with traits differentiating hemp and drug cultivars, including branching pattern and cellulose/lignin biosynthesis. We also found evidence for loss of function of genes involved in the synthesis of the two major biochemically competing cannabinoids during selection for increased fiber production or psychoactive properties. Our results provide a unique global view of the domestication of C. sativa and offer valuable genomic resources for ongoing functional and molecular breeding research.
RESUMO
Suppressed recombination allows divergence between homologous sex chromosomes and the functionality of their genes. Here, we reveal patterns of the earliest stages of sex-chromosome evolution in the diploid dioecious herb Mercurialis annua on the basis of cytological analysis, de novo genome assembly and annotation, genetic mapping, exome resequencing of natural populations, and transcriptome analysis. The genome assembly contained 34,105 expressed genes, of which 10,076 were assigned to linkage groups. Genetic mapping and exome resequencing of individuals across the species range both identified the largest linkage group, LG1, as the sex chromosome. Although the sex chromosomes of M. annua are karyotypically homomorphic, we estimate that about one-third of the Y chromosome, containing 568 transcripts and spanning 22.3 cM in the corresponding female map, has ceased recombining. Nevertheless, we found limited evidence for Y-chromosome degeneration in terms of gene loss and pseudogenization, and most X- and Y-linked genes appear to have diverged in the period subsequent to speciation between M. annua and its sister species M. huetii, which shares the same sex-determining region. Taken together, our results suggest that the M. annua Y chromosome has at least two evolutionary strata: a small old stratum shared with M. huetii, and a more recent larger stratum that is probably unique to M. annua and that stopped recombining â¼1 MYA. Patterns of gene expression within the nonrecombining region are consistent with the idea that sexually antagonistic selection may have played a role in favoring suppressed recombination.
Assuntos
Cromossomos de Plantas/genética , Euphorbiaceae/genética , Evolução Molecular , Diploide , Genes de Plantas , Ligação Genética , TranscriptomaRESUMO
Many aspects of sex chromosome evolution are common to both plants and animals [1], but the process of Y chromosome degeneration, where genes on the Y become non-functional over time, may be much slower in plants due to purifying selection against deleterious mutations in the haploid gametophyte [2, 3]. Testing for differences in Y degeneration between the kingdoms has been hindered by the absence of accurate age estimates for plant sex chromosomes. Here, we used genome resequencing to estimate the spontaneous mutation rate and the age of the sex chromosomes in white campion (Silene latifolia). Screening of single nucleotide polymorphisms (SNPs) in parents and 10 F1 progeny identified 39 de novo mutations and yielded a rate of 7.31 × 10-9 (95% confidence interval: 5.20 × 10-9 - 8.00 × 10-9) mutations per site per haploid genome per generation. Applying this mutation rate to the synonymous divergence between homologous X- and Y-linked genes (gametologs) gave age estimates of 11.00 and 6.32 million years for the old and young strata, respectively. Based on SNP segregation patterns, we inferred which genes were Y-linked and found that at least 47% are already dysfunctional. Applying our new estimates for the age of the sex chromosomes indicates that the rate of Y degeneration in S. latifolia is nearly 2-fold slower when compared to animal sex chromosomes of a similar age. Our revised estimates support Y degeneration taking place more slowly in plants, a discrepancy that may be explained by differences in the life cycles of animals and plants.
Assuntos
Cromossomos de Plantas/genética , Evolução Molecular , Taxa de Mutação , Silene/genética , Cromossomos SexuaisRESUMO
Next-generation sequencing (NGS) efforts have established catalogs of mutations relevant to cancer development. However, the clinical utility of this information remains largely unexplored. Here, we present the results of the first eight patients recruited into a clinical whole-genome sequencing (WGS) program in the United Kingdom. We performed PCR-free WGS of fresh frozen tumors and germline DNA at 75× and 30×, respectively, using the HiSeq2500 HTv4. Subtracted tumor VCFs and paired germlines were subjected to comprehensive analysis of coding and noncoding regions, integration of germline with somatically acquired variants, and global mutation signatures and pathway analyses. Results were classified into tiers and presented to a multidisciplinary tumor board. WGS results helped to clarify an uncertain histopathological diagnosis in one case, led to informed or supported prognosis in two cases, leading to de-escalation of therapy in one, and indicated potential treatments in all eight. Overall 26 different tier 1 potentially clinically actionable findings were identified using WGS compared with six SNVs/indels using routine targeted NGS. These initial results demonstrate the potential of WGS to inform future diagnosis, prognosis, and treatment choice in cancer and justify the systematic evaluation of the clinical utility of WGS in larger cohorts of patients with cancer.
Assuntos
Biomarcadores Tumorais , Mutação , Neoplasias/diagnóstico , Neoplasias/genética , Sequenciamento Completo do Genoma , Adolescente , Adulto , Idoso , Biópsia , Criança , Análise Mutacional de DNA , Feminino , Humanos , Imuno-Histoquímica , Masculino , Pessoa de Meia-Idade , Reino Unido , Adulto JovemRESUMO
Neutral genetic diversity gradients have long been used to infer the colonization history of species [1, 2], but range expansion may also influence the efficacy of natural selection and patterns of non-synonymous polymorphism in different parts of a species' range [3]. Recent theory predicts both an accumulation of deleterious mutations and a reduction in the efficacy of positive selection as a result of range expansion [4-8]. These signatures have been sought in a number of studies of the human range expansion out of Africa, but with contradictory results [9-14]. We analyzed the polymorphism patterns of 578,125 SNPs (17,648 genes) in the European diploid plant Mercurialis annua, which expanded its range from an eastern Mediterranean refugium into western habitats with contrasted climates [15]. Our results confirmed strong signatures of bottlenecks and revealed the accumulation of mildly to strongly deleterious mutations in range-front populations. A significantly higher number of these mutations were homozygous in individuals in range-front populations, pointing to increased genetic load and reduced fitness under a model of recessive deleterious effects. We also inferred a reduction in the number of selective sweeps in range-front versus core populations. These signatures have persisted even in a dioecious herb subject to substantial interpopulation gene flow [15]. Our results extend support from humans to plants for theory on the dynamics of mutations under selection during range expansion, showing that colonization bottlenecks can compromise adaptive potential.
Assuntos
Evolução Biológica , Euphorbia/fisiologia , Fluxo Gênico , Dispersão Vegetal , Polimorfismo de Nucleotídeo Único , Diploide , Euphorbia/genética , Europa (Continente) , Mutação , Polimorfismo GenéticoRESUMO
The presence of large genomic regions with suppressed recombination (SR) is a key shared property of some sex- and mating-type determining (mat) chromosomes identified to date in animals, plants, and fungi. Why such regions form and how they evolve remain central questions in evolutionary genetics. The smut fungus Microbotryum lychnis-dioicae is a basidiomycete fungus in which dimorphic mat chromosomes have been reported, but the size, age, and evolutionary dynamics of the SR region remains unresolved. To identify the SR region in M. lychnis-dioicae and to study its evolution, we sequenced 12 genomes (6 per mating type) of this species and identified the genomic contigs that show fixed sequence differences between the mating types. We report that the SR region spans more than half of the mat chromosome (>2.3 Mbp) and that it is of very recent origin (â¼2 × 10(6) years) as the average sequence divergence between mating types was only 2% in the SR region. This contrasts with a much higher divergence in and around the mating-type determining pheromone receptor locus in the SR, suggesting a recent and massive expansion of the SR region. Our results comprise the first reported case of recent massive SR expansion documented in a basidiomycete fungus.
Assuntos
Basidiomycota/genética , Cromossomos Fúngicos , Genes Fúngicos , Receptores de Fator de Acasalamento/genética , Evolução Molecular , Recombinação GenéticaRESUMO
Different protein secondary structure elements have different physicochemical properties and roles in the protein, which may determine their evolutionary flexibility. However, it is not clear to what extent protein structure affects the way Darwinian selection acts at the amino acid level. Using phylogeny-based likelihood tests for positive selection, we have examined the relationship between protein secondary structure and selection across six species of Drosophila. We find that amino acids that form disordered regions, such as random coils, are far more likely to be under positive selection than expected from their proportion in the proteins, and residues in helices and beta-structures are subject to less positive selection than predicted. In addition, it appears that sites undergoing positive selection are more likely than expected to occur close to one another in the protein sequence. Finally, on a genome-wide scale, we have determined that positively selected sites are found more frequently toward the gene ends. Our results demonstrate that protein structures with a greater degree of organization and strong hydrophobicity, represented here as helices and beta-structures, are less tolerant to molecular adaptation than disordered, hydrophilic regions, across a diverse set of proteins.