Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 3.856
Filtrar
Más filtros

Intervalo de año de publicación
1.
Cell ; 185(11): 1986-2005.e26, 2022 05 26.
Artículo en Inglés | MEDLINE | ID: mdl-35525246

RESUMEN

Unlike copy number variants (CNVs), inversions remain an underexplored genetic variation class. By integrating multiple genomic technologies, we discover 729 inversions in 41 human genomes. Approximately 85% of inversions <2 kbp form by twin-priming during L1 retrotransposition; 80% of the larger inversions are balanced and affect twice as many nucleotides as CNVs. Balanced inversions show an excess of common variants, and 72% are flanked by segmental duplications (SDs) or retrotransposons. Since flanking repeats promote non-allelic homologous recombination, we developed complementary approaches to identify recurrent inversion formation. We describe 40 recurrent inversions encompassing 0.6% of the genome, showing inversion rates up to 2.7 × 10-4 per locus per generation. Recurrent inversions exhibit a sex-chromosomal bias and co-localize with genomic disorder critical regions. We propose that inversion recurrence results in an elevated number of heterozygous carriers and structural SD diversity, which increases mutability in the population and predisposes specific haplotypes to disease-causing CNVs.


Asunto(s)
Inversión Cromosómica , Duplicaciones Segmentarias en el Genoma , Inversión Cromosómica/genética , Variaciones en el Número de Copia de ADN/genética , Genoma Humano , Genómica , Humanos
2.
Cell ; 185(23): 4409-4427.e18, 2022 11 10.
Artículo en Inglés | MEDLINE | ID: mdl-36368308

RESUMEN

Fully understanding autism spectrum disorder (ASD) genetics requires whole-genome sequencing (WGS). We present the latest release of the Autism Speaks MSSNG resource, which includes WGS data from 5,100 individuals with ASD and 6,212 non-ASD parents and siblings (total n = 11,312). Examining a wide variety of genetic variants in MSSNG and the Simons Simplex Collection (SSC; n = 9,205), we identified ASD-associated rare variants in 718/5,100 individuals with ASD from MSSNG (14.1%) and 350/2,419 from SSC (14.5%). Considering genomic architecture, 52% were nuclear sequence-level variants, 46% were nuclear structural variants (including copy-number variants, inversions, large insertions, uniparental isodisomies, and tandem repeat expansions), and 2% were mitochondrial variants. Our study provides a guidebook for exploring genotype-phenotype correlations in families who carry ASD-associated rare variants and serves as an entry point to the expanded studies required to dissect the etiology in the ∼85% of the ASD population that remain idiopathic.


Asunto(s)
Trastorno del Espectro Autista , Trastorno Autístico , Humanos , Trastorno del Espectro Autista/genética , Predisposición Genética a la Enfermedad , Variaciones en el Número de Copia de ADN/genética , Genómica
3.
Cell ; 185(16): 3041-3055.e25, 2022 08 04.
Artículo en Inglés | MEDLINE | ID: mdl-35917817

RESUMEN

Rare copy-number variants (rCNVs) include deletions and duplications that occur infrequently in the global human population and can confer substantial risk for disease. In this study, we aimed to quantify the properties of haploinsufficiency (i.e., deletion intolerance) and triplosensitivity (i.e., duplication intolerance) throughout the human genome. We harmonized and meta-analyzed rCNVs from nearly one million individuals to construct a genome-wide catalog of dosage sensitivity across 54 disorders, which defined 163 dosage sensitive segments associated with at least one disorder. These segments were typically gene dense and often harbored dominant dosage sensitive driver genes, which we were able to prioritize using statistical fine-mapping. Finally, we designed an ensemble machine-learning model to predict probabilities of dosage sensitivity (pHaplo & pTriplo) for all autosomal genes, which identified 2,987 haploinsufficient and 1,559 triplosensitive genes, including 648 that were uniquely triplosensitive. This dosage sensitivity resource will provide broad utility for human disease research and clinical genetics.


Asunto(s)
Variaciones en el Número de Copia de ADN , Genoma Humano , Variaciones en el Número de Copia de ADN/genética , Dosificación de Gen , Haploinsuficiencia/genética , Humanos
4.
Cell ; 184(3): 577-595, 2021 02 04.
Artículo en Inglés | MEDLINE | ID: mdl-33545034

RESUMEN

Biomolecules are in constant motion. To understand how they function, and why malfunctions can cause disease, it is necessary to describe their three-dimensional structures in terms of dynamic conformational ensembles. Here, we demonstrate how nuclear magnetic resonance (NMR) spectroscopy provides an essential, dynamic view of structural biology that captures biomolecular motions at atomic resolution. We focus on examples that emphasize the diversity of biomolecules and biochemical applications that are amenable to NMR, such as elucidating functional dynamics in large molecular machines, characterizing transient conformations implicated in the onset of disease, and obtaining atomic-level descriptions of intrinsically disordered regions that make weak interactions involved in liquid-liquid phase separation. Finally, we discuss the pivotal role that NMR has played in driving forward our understanding of the biomolecular dynamics-function paradigm.


Asunto(s)
Resonancia Magnética Nuclear Biomolecular , Biomarcadores/metabolismo , Variaciones en el Número de Copia de ADN/genética , Humanos , Mutación/genética , Transcriptoma/genética
5.
Cell ; 184(3): 596-614.e14, 2021 02 04.
Artículo en Inglés | MEDLINE | ID: mdl-33508232

RESUMEN

Checkpoint inhibitors (CPIs) augment adaptive immunity. Systematic pan-tumor analyses may reveal the relative importance of tumor-cell-intrinsic and microenvironmental features underpinning CPI sensitization. Here, we collated whole-exome and transcriptomic data for >1,000 CPI-treated patients across seven tumor types, utilizing standardized bioinformatics workflows and clinical outcome criteria to validate multivariable predictors of CPI sensitization. Clonal tumor mutation burden (TMB) was the strongest predictor of CPI response, followed by total TMB and CXCL9 expression. Subclonal TMB, somatic copy alteration burden, and histocompatibility leukocyte antigen (HLA) evolutionary divergence failed to attain pan-cancer significance. Dinucleotide variants were identified as a source of immunogenic epitopes associated with radical amino acid substitutions and enhanced peptide hydrophobicity/immunogenicity. Copy-number analysis revealed two additional determinants of CPI outcome supported by prior functional evidence: 9q34 (TRAF2) loss associated with response and CCND1 amplification associated with resistance. Finally, single-cell RNA sequencing (RNA-seq) of clonal neoantigen-reactive CD8 tumor-infiltrating lymphocytes (TILs), combined with bulk RNA-seq analysis of CPI-responding tumors, identified CCR5 and CXCL13 as T-cell-intrinsic markers of CPI sensitivity.


Asunto(s)
Inhibidores de Puntos de Control Inmunológico/farmacología , Neoplasias/inmunología , Linfocitos T/inmunología , Biomarcadores de Tumor/metabolismo , Antígenos CD8/metabolismo , Quimiocina CXCL13/metabolismo , Cromosomas Humanos Par 9/genética , Estudios de Cohortes , Ciclina D1/genética , Variaciones en el Número de Copia de ADN/genética , Exoma/genética , Amplificación de Genes , Humanos , Evasión Inmune/efectos de los fármacos , Análisis Multivariante , Mutación/genética , Neoplasias/patología , Polimorfismo de Nucleótido Simple/genética , Receptores CCR5/metabolismo , Linfocitos T/efectos de los fármacos , Carga Tumoral/genética
6.
Cell ; 183(1): 197-210.e32, 2020 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-33007263

RESUMEN

Cancer genomes often harbor hundreds of somatic DNA rearrangement junctions, many of which cannot be easily classified into simple (e.g., deletion) or complex (e.g., chromothripsis) structural variant classes. Applying a novel genome graph computational paradigm to analyze the topology of junction copy number (JCN) across 2,778 tumor whole-genome sequences, we uncovered three novel complex rearrangement phenomena: pyrgo, rigma, and tyfonas. Pyrgo are "towers" of low-JCN duplications associated with early-replicating regions, superenhancers, and breast or ovarian cancers. Rigma comprise "chasms" of low-JCN deletions enriched in late-replicating fragile sites and gastrointestinal carcinomas. Tyfonas are "typhoons" of high-JCN junctions and fold-back inversions associated with expressed protein-coding fusions, breakend hypermutation, and acral, but not cutaneous, melanomas. Clustering of tumors according to genome graph-derived features identified subgroups associated with DNA repair defects and poor prognosis.


Asunto(s)
Variación Estructural del Genoma/genética , Genómica/métodos , Neoplasias/genética , Inversión Cromosómica/genética , Cromotripsis , Variaciones en el Número de Copia de ADN/genética , Reordenamiento Génico/genética , Genoma Humano/genética , Humanos , Mutación/genética , Secuenciación Completa del Genoma/métodos
7.
Cell ; 183(7): 1962-1985.e31, 2020 12 23.
Artículo en Inglés | MEDLINE | ID: mdl-33242424

RESUMEN

We report a comprehensive proteogenomics analysis, including whole-genome sequencing, RNA sequencing, and proteomics and phosphoproteomics profiling, of 218 tumors across 7 histological types of childhood brain cancer: low-grade glioma (n = 93), ependymoma (32), high-grade glioma (25), medulloblastoma (22), ganglioglioma (18), craniopharyngioma (16), and atypical teratoid rhabdoid tumor (12). Proteomics data identify common biological themes that span histological boundaries, suggesting that treatments used for one histological type may be applied effectively to other tumors sharing similar proteomics features. Immune landscape characterization reveals diverse tumor microenvironments across and within diagnoses. Proteomics data further reveal functional effects of somatic mutations and copy number variations (CNVs) not evident in transcriptomics data. Kinase-substrate association and co-expression network analysis identify important biological mechanisms of tumorigenesis. This is the first large-scale proteogenomics analysis across traditional histological boundaries to uncover foundational pediatric brain tumor biology and inform rational treatment selection.


Asunto(s)
Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patología , Proteogenómica , Neoplasias Encefálicas/inmunología , Niño , Variaciones en el Número de Copia de ADN/genética , Regulación Neoplásica de la Expresión Génica , Redes Reguladoras de Genes , Genoma Humano , Glioma/genética , Glioma/patología , Humanos , Linfocitos Infiltrantes de Tumor/inmunología , Mutación/genética , Clasificación del Tumor , Recurrencia Local de Neoplasia/patología , Fosfoproteínas/metabolismo , Fosforilación , ARN Mensajero/genética , ARN Mensajero/metabolismo , Transcriptoma/genética
8.
Cell ; 182(1): 200-225.e35, 2020 07 09.
Artículo en Inglés | MEDLINE | ID: mdl-32649874

RESUMEN

To explore the biology of lung adenocarcinoma (LUAD) and identify new therapeutic opportunities, we performed comprehensive proteogenomic characterization of 110 tumors and 101 matched normal adjacent tissues (NATs) incorporating genomics, epigenomics, deep-scale proteomics, phosphoproteomics, and acetylproteomics. Multi-omics clustering revealed four subgroups defined by key driver mutations, country, and gender. Proteomic and phosphoproteomic data illuminated biology downstream of copy number aberrations, somatic mutations, and fusions and identified therapeutic vulnerabilities associated with driver events involving KRAS, EGFR, and ALK. Immune subtyping revealed a complex landscape, reinforced the association of STK11 with immune-cold behavior, and underscored a potential immunosuppressive role of neutrophil degranulation. Smoking-associated LUADs showed correlation with other environmental exposure signatures and a field effect in NATs. Matched NATs allowed identification of differentially expressed proteins with potential diagnostic and therapeutic utility. This proteogenomics dataset represents a unique public resource for researchers and clinicians seeking to better understand and treat lung adenocarcinomas.


Asunto(s)
Adenocarcinoma del Pulmón/tratamiento farmacológico , Adenocarcinoma del Pulmón/genética , Neoplasias Pulmonares/tratamiento farmacológico , Neoplasias Pulmonares/genética , Proteogenómica , Adenocarcinoma del Pulmón/inmunología , Adulto , Anciano , Anciano de 80 o más Años , Biomarcadores de Tumor/metabolismo , Carcinogénesis/genética , Carcinogénesis/patología , Variaciones en el Número de Copia de ADN/genética , Metilación de ADN/genética , Femenino , Humanos , Neoplasias Pulmonares/inmunología , Masculino , Persona de Mediana Edad , Mutación/genética , Proteínas de Fusión Oncogénica , Fenotipo , Fosfoproteínas/metabolismo , Proteoma/metabolismo
9.
Cell ; 166(6): 1397-1410.e16, 2016 Sep 08.
Artículo en Inglés | MEDLINE | ID: mdl-27610566

RESUMEN

Whereas domestication of livestock, pets, and crops is well documented, it is still unclear to what extent microbes associated with the production of food have also undergone human selection and where the plethora of industrial strains originates from. Here, we present the genomes and phenomes of 157 industrial Saccharomyces cerevisiae yeasts. Our analyses reveal that today's industrial yeasts can be divided into five sublineages that are genetically and phenotypically separated from wild strains and originate from only a few ancestors through complex patterns of domestication and local divergence. Large-scale phenotyping and genome analysis further show strong industry-specific selection for stress tolerance, sugar utilization, and flavor production, while the sexual cycle and other phenotypes related to survival in nature show decay, particularly in beer yeasts. Together, these results shed light on the origins, evolutionary history, and phenotypic diversity of industrial yeasts and provide a resource for further selection of superior strains. PAPERCLIP.


Asunto(s)
Cerveza/microbiología , Microbiología Industrial , Filogenia , Saccharomyces cerevisiae/clasificación , Saccharomyces cerevisiae/fisiología , Variaciones en el Número de Copia de ADN/genética , Genes Fúngicos/genética , Variación Genética , Genoma Fúngico/genética , Viabilidad Microbiana/genética , Fenotipo , Ploidias , Saccharomyces cerevisiae/genética , Selección Genética
10.
Nature ; 630(8016): 401-411, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38811727

RESUMEN

Apes possess two sex chromosomes-the male-specific Y chromosome and the X chromosome, which is present in both males and females. The Y chromosome is crucial for male reproduction, with deletions being linked to infertility1. The X chromosome is vital for reproduction and cognition2. Variation in mating patterns and brain function among apes suggests corresponding differences in their sex chromosomes. However, owing to their repetitive nature and incomplete reference assemblies, ape sex chromosomes have been challenging to study. Here, using the methodology developed for the telomere-to-telomere (T2T) human genome, we produced gapless assemblies of the X and Y chromosomes for five great apes (bonobo (Pan paniscus), chimpanzee (Pan troglodytes), western lowland gorilla (Gorilla gorilla gorilla), Bornean orangutan (Pongo pygmaeus) and Sumatran orangutan (Pongo abelii)) and a lesser ape (the siamang gibbon (Symphalangus syndactylus)), and untangled the intricacies of their evolution. Compared with the X chromosomes, the ape Y chromosomes vary greatly in size and have low alignability and high levels of structural rearrangements-owing to the accumulation of lineage-specific ampliconic regions, palindromes, transposable elements and satellites. Many Y chromosome genes expand in multi-copy families and some evolve under purifying selection. Thus, the Y chromosome exhibits dynamic evolution, whereas the X chromosome is more stable. Mapping short-read sequencing data to these assemblies revealed diversity and selection patterns on sex chromosomes of more than 100 individual great apes. These reference assemblies are expected to inform human evolution and conservation genetics of non-human apes, all of which are endangered species.


Asunto(s)
Hominidae , Cromosoma X , Cromosoma Y , Animales , Femenino , Masculino , Gorilla gorilla/genética , Hominidae/genética , Hominidae/clasificación , Hylobatidae/genética , Pan paniscus/genética , Pan troglodytes/genética , Filogenia , Pongo abelii/genética , Pongo pygmaeus/genética , Telómero/genética , Cromosoma X/genética , Cromosoma Y/genética , Evolución Molecular , Variaciones en el Número de Copia de ADN/genética , Humanos , Especies en Peligro de Extinción , Estándares de Referencia
11.
Nature ; 633(8028): 127-136, 2024 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-39112709

RESUMEN

Colorectal carcinoma (CRC) is a common cause of mortality1, but a comprehensive description of its genomic landscape is lacking2-9. Here we perform whole-genome sequencing of 2,023 CRC samples from participants in the UK 100,000 Genomes Project, thereby providing a highly detailed somatic mutational landscape of this cancer. Integrated analyses identify more than 250 putative CRC driver genes, many not previously implicated in CRC or other cancers, including several recurrent changes outside the coding genome. We extend the molecular pathways involved in CRC development, define four new common subgroups of microsatellite-stable CRC based on genomic features and show that these groups have independent prognostic associations. We also characterize several rare molecular CRC subgroups, some with potential clinical relevance, including cancers with both microsatellite and chromosomal instability. We demonstrate a spectrum of mutational profiles across the colorectum, which reflect aetiological differences. These include the role of Escherichia colipks+ colibactin in rectal cancers10 and the importance of the SBS93 signature11-13, which suggests that diet or smoking is a risk factor. Immune-escape driver mutations14 are near-ubiquitous in hypermutant tumours and occur in about half of microsatellite-stable CRCs, often in the form of HLA copy number changes. Many driver mutations are actionable, including those associated with rare subgroups (for example, BRCA1 and IDH1), highlighting the role of whole-genome sequencing in optimizing patient care.


Asunto(s)
Neoplasias Colorrectales , Predisposición Genética a la Enfermedad , Genoma Humano , Genómica , Mutación , Adulto , Anciano , Anciano de 80 o más Años , Femenino , Humanos , Masculino , Persona de Mediana Edad , Adulto Joven , Inestabilidad Cromosómica/genética , Neoplasias Colorrectales/clasificación , Neoplasias Colorrectales/genética , Neoplasias Colorrectales/inmunología , Dieta/efectos adversos , Variaciones en el Número de Copia de ADN/genética , Predisposición Genética a la Enfermedad/genética , Genoma Humano/genética , Antígenos HLA/genética , Inestabilidad de Microsatélites , Pronóstico , Fumar/efectos adversos , Reino Unido/epidemiología , Secuenciación Completa del Genoma
12.
Nature ; 633(8028): 137-146, 2024 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-39112715

RESUMEN

Colorectal cancer is caused by a sequence of somatic genomic alterations affecting driver genes in core cancer pathways1. Here, to understand the functional and prognostic impact of cancer-causing somatic mutations, we analysed the whole genomes and transcriptomes of 1,063 primary colorectal cancers in a population-based cohort with long-term follow-up. From the 96 mutated driver genes, 9 were not previously implicated in colorectal cancer and 24 had not been linked to any cancer. Two distinct patterns of pathway co-mutations were observed, timing analyses identified nine early and three late driver gene mutations, and several signatures of colorectal-cancer-specific mutational processes were identified. Mutations in WNT, EGFR and TGFß pathway genes, the mitochondrial CYB gene and 3 regulatory elements along with 21 copy-number variations and the COSMIC SBS44 signature correlated with survival. Gene expression classification yielded five prognostic subtypes with distinct molecular features, in part explained by underlying genomic alterations. Microsatellite-instable tumours divided into two classes with different levels of hypoxia and infiltration of immune and stromal cells. To our knowledge, this study constitutes the largest integrated genome and transcriptome analysis of colorectal cancer, and interlinks mutations, gene expression and patient outcomes. The identification of prognostic mutations and expression subtypes can guide future efforts to individualize colorectal cancer therapy.


Asunto(s)
Neoplasias Colorrectales , Predisposición Genética a la Enfermedad , Genoma Humano , Transcriptoma , Femenino , Humanos , Masculino , Hipoxia de la Célula , Estudios de Cohortes , Neoplasias Colorrectales/clasificación , Neoplasias Colorrectales/diagnóstico , Neoplasias Colorrectales/genética , Neoplasias Colorrectales/inmunología , Neoplasias Colorrectales/mortalidad , Variaciones en el Número de Copia de ADN/genética , Perfilación de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , Predisposición Genética a la Enfermedad/genética , Genoma Humano/genética , Inestabilidad de Microsatélites , Mutación , Medicina de Precisión , Pronóstico , Células del Estroma/metabolismo , Células del Estroma/patología , Análisis de Supervivencia , Factores de Tiempo , Transcriptoma/genética , Factor de Crecimiento Transformador beta/genética , Vía de Señalización Wnt/genética
13.
Nature ; 620(7975): 839-848, 2023 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-37587338

RESUMEN

Mitochondrial DNA (mtDNA) is a maternally inherited, high-copy-number genome required for oxidative phosphorylation1. Heteroplasmy refers to the presence of a mixture of mtDNA alleles in an individual and has been associated with disease and ageing. Mechanisms underlying common variation in human heteroplasmy, and the influence of the nuclear genome on this variation, remain insufficiently explored. Here we quantify mtDNA copy number (mtCN) and heteroplasmy using blood-derived whole-genome sequences from 274,832 individuals and perform genome-wide association studies to identify associated nuclear loci. Following blood cell composition correction, we find that mtCN declines linearly with age and is associated with variants at 92 nuclear loci. We observe that nearly everyone harbours heteroplasmic mtDNA variants obeying two principles: (1) heteroplasmic single nucleotide variants tend to arise somatically and accumulate sharply after the age of 70 years, whereas (2) heteroplasmic indels are maternally inherited as mixtures with relative levels associated with 42 nuclear loci involved in mtDNA replication, maintenance and novel pathways. These loci may act by conferring a replicative advantage to certain mtDNA alleles. As an illustrative example, we identify a length variant carried by more than 50% of humans at position chrM:302 within a G-quadruplex previously proposed to mediate mtDNA transcription/replication switching2,3. We find that this variant exerts cis-acting genetic control over mtDNA abundance and is itself associated in-trans with nuclear loci encoding machinery for this regulatory switch. Our study suggests that common variation in the nuclear genome can shape variation in mtCN and heteroplasmy dynamics across the human population.


Asunto(s)
Núcleo Celular , Variaciones en el Número de Copia de ADN , ADN Mitocondrial , Heteroplasmia , Mitocondrias , Anciano , Humanos , Variaciones en el Número de Copia de ADN/genética , ADN Mitocondrial/genética , Estudio de Asociación del Genoma Completo , Heteroplasmia/genética , Mitocondrias/genética , Núcleo Celular/genética , Alelos , Polimorfismo de Nucleótido Simple , Mutación INDEL , G-Cuádruplex
14.
Nature ; 619(7971): 793-800, 2023 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-37380777

RESUMEN

Aneuploidies-whole-chromosome or whole-arm imbalances-are the most prevalent alteration in cancer genomes1,2. However, it is still debated whether their prevalence is due to selection or ease of generation as passenger events1,2. Here we developed a method, BISCUT, that identifies loci subject to fitness advantages or disadvantages by interrogating length distributions of telomere- or centromere-bounded copy-number events. These loci were significantly enriched for known cancer driver genes, including genes not detected through analysis of focal copy-number events, and were often lineage specific. BISCUT identified the helicase-encoding gene WRN as a haploinsufficient tumour-suppressor gene on chromosome 8p, which is supported by several lines of evidence. We also formally quantified the role of selection and mechanical biases in driving aneuploidy, finding that rates of arm-level copy-number alterations are most highly correlated with their effects on cellular fitness1,2. These results provide insight into the driving forces behind aneuploidy and its contribution to tumorigenesis.


Asunto(s)
Aneuploidia , Transformación Celular Neoplásica , Neoplasias , Humanos , Transformación Celular Neoplásica/genética , Variaciones en el Número de Copia de ADN/genética , Neoplasias/genética , Neoplasias/patología , Oncogenes/genética , Telómero/genética , Centrómero/genética , Linaje de la Célula , Cromosomas Humanos Par 8/genética , Genes Supresores de Tumor
15.
Nature ; 624(7992): 602-610, 2023 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-38093003

RESUMEN

Indigenous Australians harbour rich and unique genomic diversity. However, Aboriginal and Torres Strait Islander ancestries are historically under-represented in genomics research and almost completely missing from reference datasets1-3. Addressing this representation gap is critical, both to advance our understanding of global human genomic diversity and as a prerequisite for ensuring equitable outcomes in genomic medicine. Here we apply population-scale whole-genome long-read sequencing4 to profile genomic structural variation across four remote Indigenous communities. We uncover an abundance of large insertion-deletion variants (20-49 bp; n = 136,797), structural variants (50 b-50 kb; n = 159,912) and regions of variable copy number (>50 kb; n = 156). The majority of variants are composed of tandem repeat or interspersed mobile element sequences (up to 90%) and have not been previously annotated (up to 62%). A large fraction of structural variants appear to be exclusive to Indigenous Australians (12% lower-bound estimate) and most of these are found in only a single community, underscoring the need for broad and deep sampling to achieve a comprehensive catalogue of genomic structural variation across the Australian continent. Finally, we explore short tandem repeats throughout the genome to characterize allelic diversity at 50 known disease loci5, uncover hundreds of novel repeat expansion sites within protein-coding genes, and identify unique patterns of diversity and constraint among short tandem repeat sequences. Our study sheds new light on the dimensions and dynamics of genomic structural variation within and beyond Australia.


Asunto(s)
Aborigenas Australianos e Isleños del Estrecho de Torres , Genoma Humano , Variación Estructural del Genoma , Humanos , Alelos , Australia/etnología , Aborigenas Australianos e Isleños del Estrecho de Torres/genética , Conjuntos de Datos como Asunto , Variaciones en el Número de Copia de ADN/genética , Sitios Genéticos/genética , Genética Médica , Variación Estructural del Genoma/genética , Genómica , Mutación INDEL/genética , Secuencias Repetitivas Esparcidas/genética , Repeticiones de Microsatélite/genética , Genoma Humano/genética
16.
Nature ; 615(7953): 652-659, 2023 03.
Artículo en Inglés | MEDLINE | ID: mdl-36890232

RESUMEN

Increasing the proportion of locally produced plant protein in currently meat-rich diets could substantially reduce greenhouse gas emissions and loss of biodiversity1. However, plant protein production is hampered by the lack of a cool-season legume equivalent to soybean in agronomic value2. Faba bean (Vicia faba L.) has a high yield potential and is well suited for cultivation in temperate regions, but genomic resources are scarce. Here, we report a high-quality chromosome-scale assembly of the faba bean genome and show that it has expanded to a massive 13 Gb in size through an imbalance between the rates of amplification and elimination of retrotransposons and satellite repeats. Genes and recombination events are evenly dispersed across chromosomes and the gene space is remarkably compact considering the genome size, although with substantial copy number variation driven by tandem duplication. Demonstrating practical application of the genome sequence, we develop a targeted genotyping assay and use high-resolution genome-wide association analysis to dissect the genetic basis of seed size and hilum colour. The resources presented constitute a genomics-based breeding platform for faba bean, enabling breeders and geneticists to accelerate the improvement of sustainable protein production across the Mediterranean, subtropical and northern temperate agroecological zones.


Asunto(s)
Productos Agrícolas , Diploidia , Variación Genética , Genoma de Planta , Genómica , Fitomejoramiento , Proteínas de Plantas , Vicia faba , Cromosomas de las Plantas/genética , Productos Agrícolas/genética , Productos Agrícolas/metabolismo , Variaciones en el Número de Copia de ADN/genética , ADN Satélite/genética , Amplificación de Genes/genética , Genes de Plantas/genética , Variación Genética/genética , Genoma de Planta/genética , Estudio de Asociación del Genoma Completo , Geografía , Fitomejoramiento/métodos , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Recombinación Genética , Retroelementos/genética , Semillas/anatomía & histología , Semillas/genética , Vicia faba/anatomía & histología , Vicia faba/genética , Vicia faba/metabolismo
17.
Annu Rev Neurosci ; 43: 509-533, 2020 07 08.
Artículo en Inglés | MEDLINE | ID: mdl-32640929

RESUMEN

Autism is a common and complex neurologic disorder whose scientific underpinnings have begun to be established in the past decade. The essence of this breakthrough has been a focus on families, where genetic analyses are strongest, versus large-scale, case-control studies. Autism genetics has progressed in parallel with technology, from analyses of copy number variation to whole-exome sequencing (WES) and whole-genome sequencing (WGS). Gene mutations causing complete loss of function account for perhaps one-third of cases, largely detected through WES. This limitation has increased interest in understanding the regulatory variants of genes that contribute in more subtle ways to the disorder. Strategies combining biochemical analysis of gene regulation, WGS analysis of the noncoding genome, and machine learning have begun to succeed. The emerging picture is that careful control of the amounts of transcription, mRNA, and proteins made by key brain genes-stoichiometry-plays a critical role in defining the clinical features of autism.


Asunto(s)
Trastorno del Espectro Autista/genética , Trastorno Autístico/genética , Variaciones en el Número de Copia de ADN/genética , Exoma/genética , Variaciones en el Número de Copia de ADN/fisiología , Humanos , Mutación/genética , Secuenciación del Exoma/métodos
18.
Nature ; 606(7916): 984-991, 2022 06.
Artículo en Inglés | MEDLINE | ID: mdl-35705804

RESUMEN

Gains and losses of DNA are prevalent in cancer and emerge as a consequence of inter-related processes of replication stress, mitotic errors, spindle multipolarity and breakage-fusion-bridge cycles, among others, which may lead to chromosomal instability and aneuploidy1,2. These copy number alterations contribute to cancer initiation, progression and therapeutic resistance3-5. Here we present a conceptual framework to examine the patterns of copy number alterations in human cancer that is widely applicable to diverse data types, including whole-genome sequencing, whole-exome sequencing, reduced representation bisulfite sequencing, single-cell DNA sequencing and SNP6 microarray data. Deploying this framework to 9,873 cancers representing 33 human cancer types from The Cancer Genome Atlas6 revealed a set of 21 copy number signatures that explain the copy number patterns of 97% of samples. Seventeen copy number signatures were attributed to biological phenomena of whole-genome doubling, aneuploidy, loss of heterozygosity, homologous recombination deficiency, chromothripsis and haploidization. The aetiologies of four copy number signatures remain unexplained. Some cancer types harbour amplicon signatures associated with extrachromosomal DNA, disease-specific survival and proto-oncogene gains such as MDM2. In contrast to base-scale mutational signatures, no copy number signature was associated with many known exogenous cancer risk factors. Our results synthesize the global landscape of copy number alterations in human cancer by revealing a diversity of mutational processes that give rise to these alterations.


Asunto(s)
Variaciones en el Número de Copia de ADN , Análisis Mutacional de ADN , Neoplasias , Aneuploidia , Cromotripsis , Variaciones en el Número de Copia de ADN/genética , Haploidia , Recombinación Homóloga/genética , Humanos , Pérdida de Heterocigocidad/genética , Mutación , Neoplasias/genética , Neoplasias/patología , Secuenciación del Exoma
19.
Nature ; 601(7891): 85-91, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-34912115

RESUMEN

The state and behaviour of a cell can be influenced by both genetic and environmental factors. In particular, tumour progression is determined by underlying genetic aberrations1-4 as well as the makeup of the tumour microenvironment5,6. Quantifying the contributions of these factors requires new technologies that can accurately measure the spatial location of genomic sequence together with phenotypic readouts. Here we developed slide-DNA-seq, a method for capturing spatially resolved DNA sequences from intact tissue sections. We demonstrate that this method accurately preserves local tumour architecture and enables the de novo discovery of distinct tumour clones and their copy number alterations. We then apply slide-DNA-seq to a mouse model of metastasis and a primary human cancer, revealing that clonal populations are confined to distinct spatial regions. Moreover, through integration with spatial transcriptomics, we uncover distinct sets of genes that are associated with clone-specific genetic aberrations, the local tumour microenvironment, or both. Together, this multi-modal spatial genomics approach provides a versatile platform for quantifying how cell-intrinsic and cell-extrinsic factors contribute to gene expression, protein abundance and other cellular phenotypes.


Asunto(s)
Células Clonales/metabolismo , Neoplasias Colorrectales/genética , Neoplasias Colorrectales/patología , Genómica/métodos , Animales , Células Clonales/patología , Variaciones en el Número de Copia de ADN/genética , Humanos , Ratones , Fenotipo , RNA-Seq , Análisis de Secuencia de ADN , Transcripción Genética , Transcriptoma
20.
Nature ; 608(7922): 360-367, 2022 08.
Artículo en Inglés | MEDLINE | ID: mdl-35948708

RESUMEN

Defining the transition from benign to malignant tissue is fundamental to improving early diagnosis of cancer1. Here we use a systematic approach to study spatial genome integrity in situ and describe previously unidentified clonal relationships. We used spatially resolved transcriptomics2 to infer spatial copy number variations in >120,000 regions across multiple organs, in benign and malignant tissues. We demonstrate that genome-wide copy number variation reveals distinct clonal patterns within tumours and in nearby benign tissue using an organ-wide approach focused on the prostate. Our results suggest a model for how genomic instability arises in histologically benign tissue that may represent early events in cancer evolution. We highlight the power of capturing the molecular and spatial continuums in a tissue context and challenge the rationale for treatment paradigms, including focal therapy.


Asunto(s)
Células Clonales , Variaciones en el Número de Copia de ADN , Inestabilidad Genómica , Neoplasias , Análisis Espacial , Células Clonales/metabolismo , Células Clonales/patología , Variaciones en el Número de Copia de ADN/genética , Detección Precoz del Cáncer , Genoma Humano , Inestabilidad Genómica/genética , Genómica , Humanos , Masculino , Modelos Biológicos , Neoplasias/genética , Neoplasias/patología , Próstata/metabolismo , Próstata/patología , Neoplasias de la Próstata/genética , Neoplasias de la Próstata/patología , Transcriptoma/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA