Results 1 - 20 of 3,788
1.
Cell ; 187(8): 1955-1970.e23, 2024 Apr 11.
Article in English | MEDLINE | ID: mdl-38503282

ABSTRACT

Characterizing somatic mutations in the brain is important for disentangling the complex mechanisms of aging, yet little is known about mutational patterns in different brain cell types. Here, we performed whole-genome sequencing (WGS) of 86 single oligodendrocytes, 20 mixed glia, and 56 single neurons from neurotypical individuals spanning 0.4-104 years of age and identified >92,000 somatic single-nucleotide variants (sSNVs) and small insertions/deletions (indels). Although both cell types accumulate somatic mutations linearly with age, oligodendrocytes accumulated sSNVs 81% faster than neurons and indels 28% slower than neurons. Correlation of mutations with single-nucleus RNA profiles and chromatin accessibility from the same brains revealed that oligodendrocyte mutations are enriched in inactive genomic regions and are distributed across the genome similarly to mutations in brain cancers. In contrast, neuronal mutations are enriched in open, transcriptionally active chromatin. These stark differences suggest an assortment of active mutagenic processes in oligodendrocytes and neurons.


Subject(s)
Aging , Brain , Neurons , Oligodendroglia , Humans , Aging/genetics , Aging/pathology , Chromatin/genetics , Chromatin/metabolism , Mutation , Neurons/metabolism , Neurons/pathology , Oligodendroglia/metabolism , Oligodendroglia/pathology , Single-Cell Gene Expression Analysis , Whole Genome Sequencing , Brain/metabolism , Brain/pathology , Polymorphism, Single Nucleotide , INDEL Mutation , Biological Specimen Banks , Oligodendrocyte Precursor Cells/metabolism , Oligodendrocyte Precursor Cells/pathology
2.
Cell ; 185(18): 3426-3440.e19, 2022 Sep 01.
Article in English | MEDLINE | ID: mdl-36055201

ABSTRACT

The 1000 Genomes Project (1kGP) is the largest fully open resource of whole-genome sequencing (WGS) data consented for public distribution without access or use restrictions. The final, phase 3 release of the 1kGP included 2,504 unrelated samples from 26 populations and was based primarily on low-coverage WGS. Here, we present a high-coverage 3,202-sample WGS 1kGP resource, which now includes 602 complete trios, sequenced to a depth of 30X using Illumina. We performed single-nucleotide variant (SNV) and short insertion and deletion (INDEL) discovery and generated a comprehensive set of structural variants (SVs) by integrating multiple analytic methods through a machine learning model. We show gains in sensitivity and precision of variant calls compared to phase 3, especially among rare SNVs as well as INDELs and SVs spanning the frequency spectrum. We also generated an improved reference imputation panel, making variants discovered here accessible for association studies.


Subject(s)
Genome, Human , Whole Genome Sequencing , Female , High-Throughput Nucleotide Sequencing/methods , Humans , INDEL Mutation , Male , Polymorphism, Single Nucleotide
3.
Cell ; 184(4): 1047-1063.e23, 2021 Feb 18.
Article in English | MEDLINE | ID: mdl-33539780

ABSTRACT

DNA has not been utilized to record temporal information, although DNA has been used to record biological information and to compute mathematical problems. Here, we found that indel generation by Cas9 and guide RNA can occur at steady rates, in contrast to typical dynamic biological reactions, and the accumulated indel frequency can be a function of time. By measuring indel frequencies, we developed a method for recording and measuring absolute time periods over hours to weeks in mammalian cells. These time-recordings were conducted in several cell types, with different promoters and delivery vectors for Cas9, and in both cultured cells and cells of living mice. As applications, we recorded the duration of chemical exposure and the lengths of elapsed time since the onset of biological events (e.g., heat exposure and inflammation). We propose that our systems could serve as synthetic "DNA clocks."


Subject(s)
CRISPR-Associated Protein 9/metabolism , Animals , Base Sequence , Cellular Microenvironment , Computer Simulation , HEK293 Cells , Half-Life , Humans , INDEL Mutation/genetics , Inflammation/pathology , Integrases/metabolism , Male , Mice, Nude , Promoter Regions, Genetic/genetics , RNA, Guide, Kinetoplastida/genetics , Reproducibility of Results , Time Factors
4.
Cell ; 184(12): 3267-3280.e18, 2021 Jun 10.
Article in English | MEDLINE | ID: mdl-34043941

ABSTRACT

Searching for factors to improve knockin efficiency for therapeutic applications, biotechnology, and generation of non-human primate models of disease, we found that the strand exchange protein RAD51 can significantly increase Cas9-mediated homozygous knockin in mouse embryos through an interhomolog repair (IHR) mechanism. IHR is a hallmark of meiosis but only occurs at low frequencies in somatic cells, and its occurrence in zygotes is controversial. Using multiple approaches, we provide evidence for an endogenous IHR mechanism in the early embryo that can be enhanced by RAD51. This process can be harnessed to generate homozygotes from wild-type zygotes using exogenous donors and to convert heterozygous alleles into homozygous alleles without exogenous templates. Furthermore, we identify additional IHR-promoting factors and describe features of IHR events. Together, our findings show conclusive evidence for IHR in mouse embryos and describe an efficient method for enhanced gene conversion.


Subject(s)
DNA Repair/genetics , Gene Conversion , Rad51 Recombinase/metabolism , Alleles , Animals , Base Sequence , CRISPR-Associated Protein 9/metabolism , Calcium-Binding Proteins/metabolism , Cell Cycle Proteins/metabolism , Chromosomes, Mammalian/genetics , DNA Breaks, Double-Stranded , Embryo, Mammalian , Female , Genetic Loci , Homologous Recombination/genetics , Homozygote , Humans , INDEL Mutation/genetics , Mice, Inbred C57BL , Mosaicism , Nuclear Proteins/metabolism , Polymorphism, Single Nucleotide/genetics , Ribonucleoproteins/metabolism , Zygote/metabolism
5.
Cell ; 183(6): 1650-1664.e15, 2020 Dec 10.
Article in English | MEDLINE | ID: mdl-33125898

ABSTRACT

Correction of disease-causing mutations in human embryos holds the potential to reduce the burden of inherited genetic disorders and improve fertility treatments for couples with disease-causing mutations in lieu of embryo selection. Here, we evaluate repair outcomes of a Cas9-induced double-strand break (DSB) introduced on the paternal chromosome at the EYS locus, which carries a frameshift mutation causing blindness. We show that the most common repair outcome is microhomology-mediated end joining, which occurs during the first cell cycle in the zygote, leading to embryos with non-mosaic restoration of the reading frame. Notably, about half of the breaks remain unrepaired, resulting in an undetectable paternal allele and, after mitosis, loss of one or both chromosomal arms. Correspondingly, Cas9 off-target cleavage results in chromosomal losses and hemizygous indels because of cleavage of both alleles. These results demonstrate the ability to manipulate chromosome content and reveal significant challenges for mutation correction in human embryos.


Subject(s)
Alleles , CRISPR-Associated Protein 9/metabolism , Chromosomes, Human/genetics , Embryo, Mammalian/metabolism , Animals , Base Sequence , Blastocyst/metabolism , Cell Cycle/genetics , Cell Line , Chromosome Deletion , DNA Breaks, Double-Stranded , DNA End-Joining Repair/genetics , Embryo Implantation/genetics , Eye Proteins/genetics , Fertilization , Gene Editing , Gene Rearrangement/genetics , Genetic Loci , Genome, Human , Genotype , Heterozygote , Human Embryonic Stem Cells/metabolism , Humans , INDEL Mutation/genetics , Mice , Mitosis , Open Reading Frames/genetics , Polymorphism, Single Nucleotide/genetics
6.
Cell ; 183(7): 1826-1847.e31, 2020 Dec 23.
Article in English | MEDLINE | ID: mdl-33296702

ABSTRACT

Inborn errors of human interferon gamma (IFN-γ) immunity underlie mycobacterial disease. We report a patient with mycobacterial disease due to inherited deficiency of the transcription factor T-bet. The patient has extremely low counts of circulating Mycobacterium-reactive natural killer (NK), invariant NKT (iNKT), mucosal-associated invariant T (MAIT), and Vδ2+ γδ T lymphocytes, and of Mycobacterium-non-reactive classic TH1 lymphocytes, with the residual populations of these cells also producing abnormally small amounts of IFN-γ. Other lymphocyte subsets develop normally but produce low levels of IFN-γ, with the exception of CD8+ αβ T and non-classic CD4+ αβ TH1* lymphocytes, which produce IFN-γ normally in response to mycobacterial antigens. Human T-bet deficiency thus underlies mycobacterial disease by preventing the development of innate (NK) and innate-like adaptive lymphocytes (iNKT, MAIT, and Vδ2+ γδ T cells) and IFN-γ production by them, with mycobacterium-specific, IFN-γ-producing, purely adaptive CD8+ αβ T, and CD4+ αβ TH1* cells unable to compensate for this deficit.


Subject(s)
Adaptive Immunity , Immunity, Innate , Interferon-gamma/immunology , Mycobacterium/immunology , T-Box Domain Proteins/metabolism , Amino Acid Sequence , Base Sequence , Cell Lineage , Child, Preschool , Chromatin/metabolism , CpG Islands/genetics , DNA Methylation/genetics , Dendritic Cells/metabolism , Epigenesis, Genetic , Female , Homozygote , Humans , INDEL Mutation/genetics , Infant , Interferon-gamma/metabolism , Killer Cells, Natural/cytology , Killer Cells, Natural/metabolism , Loss of Function Mutation/genetics , Male , Mycobacterium Infections/genetics , Mycobacterium Infections/immunology , Mycobacterium Infections/microbiology , Pedigree , T-Box Domain Proteins/chemistry , T-Box Domain Proteins/deficiency , T-Box Domain Proteins/genetics , T-Lymphocytes, Helper-Inducer/immunology , Transcriptome/genetics
7.
Cell ; 182(3): 672-684.e11, 2020 Aug 06.
Article in English | MEDLINE | ID: mdl-32697969

ABSTRACT

Inflammatory bowel disease (IBD) is a chronic inflammatory disease associated with increased risk of gastrointestinal cancers. We whole-genome sequenced 446 colonic crypts from 46 IBD patients and compared these to 412 crypts from 41 non-IBD controls from our previous publication on the mutation landscape of the normal colon. The average mutation rate of affected colonic epithelial cells is 2.4-fold that of healthy colon, and this increase is mostly driven by acceleration of mutational processes ubiquitously observed in normal colon. In contrast to the normal colon, where clonal expansions outside the confines of the crypt are rare, we observed widespread millimeter-scale clonal expansions. We discovered non-synonymous mutations in ARID1A, FBXW7, PIGR, ZC3H12A, and genes in the interleukin 17 and Toll-like receptor pathways, under positive selection in IBD. These results suggest distinct selection mechanisms in the colitis-affected colon and that somatic mutations potentially play a causal role in IBD pathogenesis.


Subject(s)
Clonal Evolution/genetics , Colitis/genetics , Inflammatory Bowel Diseases/genetics , Mutation Rate , Adult , Aged , Aged, 80 and over , Aging/genetics , Clonal Evolution/immunology , Colitis/metabolism , Colitis, Ulcerative/genetics , Colitis, Ulcerative/metabolism , Crohn Disease/genetics , Crohn Disease/metabolism , DNA-Binding Proteins/genetics , Epithelial Cells/metabolism , Epithelial Cells/pathology , F-Box-WD Repeat-Containing Protein 7/genetics , Female , Humans , INDEL Mutation , Inflammatory Bowel Diseases/immunology , Inflammatory Bowel Diseases/metabolism , Inflammatory Bowel Diseases/pathology , Interleukin-17/metabolism , Intestinal Mucosa/metabolism , Intestinal Mucosa/pathology , Male , Middle Aged , Phylogeny , Point Mutation , Receptors, Cell Surface/genetics , Ribonucleases/genetics , Toll-Like Receptors/genetics , Transcription Factors/genetics , Whole Genome Sequencing
8.
Cell ; 176(6): 1310-1324.e10, 2019 Mar 07.
Article in English | MEDLINE | ID: mdl-30827684

ABSTRACT

DNA rearrangements resulting in human genome structural variants (SVs) are caused by diverse mutational mechanisms. We used long- and short-read sequencing technologies to investigate end products of de novo chromosome 17p11.2 rearrangements and query the molecular mechanisms underlying both recurrent and non-recurrent events. Evidence for an increased rate of clustered single-nucleotide variant (SNV) mutation in cis with non-recurrent rearrangements was found. Indel and SNV formation are associated with both copy-number gains and losses of 17p11.2, occur up to ∼1 Mb away from the breakpoint junctions, and favor C > G transversion substitutions; results suggest that single-stranded DNA is formed during the genesis of the SV and provide compelling support for a microhomology-mediated break-induced replication (MMBIR) mechanism for SV formation. Our data show an additional mutational burden of MMBIR consisting of hypermutation confined to the locus and manifesting as SNVs and indels predominantly within genes.


Subject(s)
Chromosomes, Human, Pair 17 , Mutation , Abnormalities, Multiple/genetics , Chromosome Breakpoints , Chromosome Disorders/genetics , Chromosome Duplication/genetics , DNA Copy Number Variations , DNA Repair/genetics , DNA Replication , Gene Rearrangement , Genome, Human , Genomic Structural Variation , Humans , INDEL Mutation , Models, Genetic , Polymorphism, Single Nucleotide , Recombination, Genetic , Sequence Analysis, DNA/methods , Smith-Magenis Syndrome/genetics
9.
Cell ; 177(4): 821-836.e16, 2019 May 02.
Article in English | MEDLINE | ID: mdl-30982602

ABSTRACT

Whole-genome-sequencing (WGS) of human tumors has revealed distinct mutation patterns that hint at the causative origins of cancer. We examined mutational signatures in 324 WGS human-induced pluripotent stem cells exposed to 79 known or suspected environmental carcinogens. Forty-one yielded characteristic substitution mutational signatures. Some were similar to signatures found in human tumors. Additionally, six agents produced double-substitution signatures and eight produced indel signatures. Investigating mutation asymmetries across genome topography revealed fully functional mismatch and transcription-coupled repair pathways. DNA damage induced by environmental mutagens can be resolved by disparate repair and/or replicative pathways, resulting in an assortment of signature outcomes even for a single agent. This compendium of experimentally induced mutational signatures permits further exploration of roles of environmental agents in cancer etiology and underscores how human stem cell DNA is directly vulnerable to environmental agents.


Subject(s)
Carcinogens, Environmental/classification , Neoplasms/genetics , Carcinogens, Environmental/adverse effects , DNA Damage/genetics , DNA Mutational Analysis/methods , DNA Repair/genetics , DNA Replication , Genetic Profile , Genome, Human/genetics , Humans , INDEL Mutation/genetics , Mutagenesis , Mutation/genetics , Pluripotent Stem Cells/metabolism , Whole Genome Sequencing/methods
10.
Cell ; 171(3): 710-722.e12, 2017 Oct 19.
Article in English | MEDLINE | ID: mdl-28965761

ABSTRACT

To further our understanding of the genetic etiology of autism, we generated and analyzed genome sequence data from 516 idiopathic autism families (2,064 individuals). This resource includes >59 million single-nucleotide variants (SNVs) and 9,212 private copy number variants (CNVs), of which 133,992 and 88 are de novo mutations (DNMs), respectively. We estimate a mutation rate of ∼1.5 × 10⁻⁸ SNVs per site per generation with a significantly higher mutation rate in repetitive DNA. Comparing probands and unaffected siblings, we observe several DNM trends. Probands carry more gene-disruptive CNVs and SNVs, resulting in severe missense mutations and mapping to predicted fetal brain promoters and embryonic stem cell enhancers. These differences become more pronounced for autism genes (p = 1.8 × 10⁻³, OR = 2.2). Patients are more likely to carry multiple coding and noncoding DNMs in different genes, which are enriched for expression in striatal neurons (p = 3 × 10⁻³), suggesting a path forward for genetically characterizing more complex cases of autism.


Subject(s)
Autistic Disorder/genetics , DNA Copy Number Variations , Polymorphism, Single Nucleotide , Animals , DNA Mutational Analysis , Female , Genome-Wide Association Study , Humans , INDEL Mutation , Male , Mice
11.
Cell ; 171(5): 1029-1041.e21, 2017 Nov 16.
Article in English | MEDLINE | ID: mdl-29056346

ABSTRACT

Cancer develops as a result of somatic mutation and clonal selection, but quantitative measures of selection in cancer evolution are lacking. We adapted methods from molecular evolution and applied them to 7,664 tumors across 29 cancer types. Unlike species evolution, positive selection outweighs negative selection during cancer development. On average, <1 coding base substitution/tumor is lost through negative selection, with purifying selection almost absent outside homozygous loss of essential genes. This allows exome-wide enumeration of all driver coding mutations, including outside known cancer genes. On average, tumors carry ∼4 coding substitutions under positive selection, ranging from <1/tumor in thyroid and testicular cancers to >10/tumor in endometrial and colorectal cancers. Half of driver substitutions occur in yet-to-be-discovered cancer genes. With increasing mutation burden, numbers of driver mutations increase, but not linearly. We systematically catalog cancer genes and show that genes vary extensively in what proportion of mutations are drivers versus passengers.


Subject(s)
Neoplasms/genetics , Neoplasms/pathology , Humans , INDEL Mutation , Microsatellite Instability , Models, Genetic , Mutation Rate , Neoplasms/immunology , Point Mutation , Polymorphism, Single Nucleotide , Selection, Genetic
12.
Cell ; 168(3): 460-472.e14, 2017 Jan 26.
Article in English | MEDLINE | ID: mdl-28089356

ABSTRACT

Certain cell types function as factories, secreting large quantities of one or more proteins that are central to the physiology of the respective organ. Examples include surfactant proteins in lung alveoli, albumin in liver parenchyma, and lipase in the stomach lining. Whole-genome sequencing analysis of lung adenocarcinomas revealed noncoding somatic mutational hotspots near VMP1/MIR21 and indel hotspots in surfactant protein genes (SFTPA1, SFTPB, and SFTPC). Extrapolation to other solid cancers demonstrated highly recurrent and tumor-type-specific indel hotspots targeting the noncoding regions of highly expressed genes defining certain secretory cellular lineages: albumin (ALB) in liver carcinoma, gastric lipase (LIPF) in stomach carcinoma, and thyroglobulin (TG) in thyroid carcinoma. The sequence contexts of indels targeting lineage-defining genes were significantly enriched in the AATAATD DNA motif and specific chromatin contexts, including H3K27ac and H3K36me3. Our findings illuminate a prevalent and hitherto unrecognized mutational process linking cellular lineage and cancer.


Subject(s)
Cell Lineage , INDEL Mutation , Mutation , Neoplasms/genetics , Neoplasms/pathology , 3' Untranslated Regions , Adult , Aged , Aged, 80 and over , Female , Humans , Male , Membrane Proteins/genetics , MicroRNAs/genetics , Middle Aged , Nucleotide Motifs , Polymorphism, Single Nucleotide , Pulmonary Surfactant-Associated Proteins/genetics
13.
Nature ; 627(8004): 586-593, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38355797

ABSTRACT

Over half of hepatocellular carcinoma (HCC) cases diagnosed worldwide are in China [1-3]. However, whole-genome analysis of hepatitis B virus (HBV)-associated HCC in Chinese individuals is limited [4-8], with current analyses of HCC mainly from non-HBV-enriched populations [9,10]. Here we initiated the Chinese Liver Cancer Atlas (CLCA) project and performed deep whole-genome sequencing (average depth, 120×) of 494 HCC tumours. We identified 6 coding and 28 non-coding previously undescribed driver candidates. Five previously undescribed mutational signatures were found, including aristolochic-acid-associated indel and doublet base signatures, and a single-base-substitution signature that we termed SBS_H8. Pentanucleotide context analysis and experimental validation confirmed that SBS_H8 was distinct to the aristolochic-acid-associated SBS22. Notably, HBV integrations could take the form of extrachromosomal circular DNA, resulting in elevated copy numbers and gene expression. Our high-depth data also enabled us to characterize subclonal clustered alterations, including chromothripsis, chromoplexy and kataegis, suggesting that these catastrophic events could also occur in late stages of hepatocarcinogenesis. Pathway analysis of all classes of alterations further linked non-coding mutations to dysregulation of liver metabolism. Finally, we performed in vitro and in vivo assays to show that fibrinogen alpha chain (FGA), determined as both a candidate coding and non-coding driver, regulates HCC progression and metastasis. Our CLCA study depicts a detailed genomic landscape and evolutionary history of HCC in Chinese individuals, providing important clinical implications.


Subject(s)
Carcinoma, Hepatocellular , Genome, Human , High-Throughput Nucleotide Sequencing , Liver Neoplasms , Mutation , Whole Genome Sequencing , Humans , Aristolochic Acids/metabolism , Carcinogenesis , Carcinoma, Hepatocellular/genetics , Carcinoma, Hepatocellular/virology , China , Chromothripsis , Disease Progression , DNA, Circular/genetics , East Asian People/genetics , Evolution, Molecular , Genome, Human/genetics , Hepatitis B virus/genetics , INDEL Mutation/genetics , Liver/metabolism , Liver Neoplasms/genetics , Liver Neoplasms/virology , Mutation/genetics , Neoplasm Metastasis/genetics , Open Reading Frames/genetics , Reproducibility of Results
14.
Nature ; 620(7975): 839-848, 2023 Aug.
Article in English | MEDLINE | ID: mdl-37587338

ABSTRACT

Mitochondrial DNA (mtDNA) is a maternally inherited, high-copy-number genome required for oxidative phosphorylation [1]. Heteroplasmy refers to the presence of a mixture of mtDNA alleles in an individual and has been associated with disease and ageing. Mechanisms underlying common variation in human heteroplasmy, and the influence of the nuclear genome on this variation, remain insufficiently explored. Here we quantify mtDNA copy number (mtCN) and heteroplasmy using blood-derived whole-genome sequences from 274,832 individuals and perform genome-wide association studies to identify associated nuclear loci. Following blood cell composition correction, we find that mtCN declines linearly with age and is associated with variants at 92 nuclear loci. We observe that nearly everyone harbours heteroplasmic mtDNA variants obeying two principles: (1) heteroplasmic single nucleotide variants tend to arise somatically and accumulate sharply after the age of 70 years, whereas (2) heteroplasmic indels are maternally inherited as mixtures with relative levels associated with 42 nuclear loci involved in mtDNA replication, maintenance and novel pathways. These loci may act by conferring a replicative advantage to certain mtDNA alleles. As an illustrative example, we identify a length variant carried by more than 50% of humans at position chrM:302 within a G-quadruplex previously proposed to mediate mtDNA transcription/replication switching [2,3]. We find that this variant exerts cis-acting genetic control over mtDNA abundance and is itself associated in-trans with nuclear loci encoding machinery for this regulatory switch. Our study suggests that common variation in the nuclear genome can shape variation in mtCN and heteroplasmy dynamics across the human population.


Subject(s)
Cell Nucleus , DNA Copy Number Variations , DNA, Mitochondrial , Heteroplasmy , Mitochondria , Aged , Humans , DNA Copy Number Variations/genetics , DNA, Mitochondrial/genetics , Genome-Wide Association Study , Heteroplasmy/genetics , Mitochondria/genetics , Cell Nucleus/genetics , Alleles , Polymorphism, Single Nucleotide , INDEL Mutation , G-Quadruplexes
15.
Nature ; 624(7992): 602-610, 2023 Dec.
Article in English | MEDLINE | ID: mdl-38093003

ABSTRACT

Indigenous Australians harbour rich and unique genomic diversity. However, Aboriginal and Torres Strait Islander ancestries are historically under-represented in genomics research and almost completely missing from reference datasets [1-3]. Addressing this representation gap is critical, both to advance our understanding of global human genomic diversity and as a prerequisite for ensuring equitable outcomes in genomic medicine. Here we apply population-scale whole-genome long-read sequencing [4] to profile genomic structural variation across four remote Indigenous communities. We uncover an abundance of large insertion-deletion variants (20-49 bp; n = 136,797), structural variants (50 bp-50 kb; n = 159,912) and regions of variable copy number (>50 kb; n = 156). The majority of variants are composed of tandem repeat or interspersed mobile element sequences (up to 90%) and have not been previously annotated (up to 62%). A large fraction of structural variants appear to be exclusive to Indigenous Australians (12% lower-bound estimate) and most of these are found in only a single community, underscoring the need for broad and deep sampling to achieve a comprehensive catalogue of genomic structural variation across the Australian continent. Finally, we explore short tandem repeats throughout the genome to characterize allelic diversity at 50 known disease loci [5], uncover hundreds of novel repeat expansion sites within protein-coding genes, and identify unique patterns of diversity and constraint among short tandem repeat sequences. Our study sheds new light on the dimensions and dynamics of genomic structural variation within and beyond Australia.


Subject(s)
Australian Aboriginal and Torres Strait Islander Peoples , Genome, Human , Genomic Structural Variation , Humans , Alleles , Australia/ethnology , Australian Aboriginal and Torres Strait Islander Peoples/genetics , Datasets as Topic , DNA Copy Number Variations/genetics , Genetic Loci/genetics , Genetics, Medical , Genomic Structural Variation/genetics , Genomics , INDEL Mutation/genetics , Interspersed Repetitive Sequences/genetics , Microsatellite Repeats/genetics , Genome, Human/genetics
16.
Mol Cell ; 81(10): 2216-2230.e10, 2021 May 20.
Article in English | MEDLINE | ID: mdl-33848455

ABSTRACT

DNA double-strand break (DSB) repair is mediated by multiple pathways. It is thought that the local chromatin context affects the pathway choice, but the underlying principles are poorly understood. Using a multiplexed reporter assay in combination with Cas9 cutting, we systematically measure the relative activities of three DSB repair pathways as a function of chromatin context in >1,000 genomic locations. This reveals that non-homologous end-joining (NHEJ) is broadly biased toward euchromatin, while the contribution of microhomology-mediated end-joining (MMEJ) is higher in specific heterochromatin contexts. In H3K27me3-marked heterochromatin, inhibition of the H3K27 methyltransferase EZH2 reverts the balance toward NHEJ. Single-stranded template repair (SSTR), often used for precise CRISPR editing, competes with MMEJ and is moderately linked to chromatin context. These results provide insight into the impact of chromatin on DSB repair pathway balance and guidance for the design of Cas9-mediated genome editing experiments.


Subject(s)
CRISPR-Associated Protein 9/metabolism , Chromatin/metabolism , DNA Breaks, Double-Stranded , DNA Repair , Base Sequence , DNA End-Joining Repair , Euchromatin/metabolism , Gene Rearrangement , Genome, Human , Heterochromatin/metabolism , Humans , INDEL Mutation/genetics , K562 Cells , Kinetics , Protein Binding , Reproducibility of Results
17.
Nature ; 602(7897): 510-517, 2022 Feb.
Article in English | MEDLINE | ID: mdl-35140399

ABSTRACT

Clustered somatic mutations are common in cancer genomes and previous analyses reveal several types of clustered single-base substitutions, which include doublet- and multi-base substitutions [1-5], diffuse hypermutation termed omikli [6], and longer strand-coordinated events termed kataegis [3,7-9]. Here we provide a comprehensive characterization of clustered substitutions and clustered small insertions and deletions (indels) across 2,583 whole-genome-sequenced cancers from 30 types of cancer [10]. Clustered mutations were highly enriched in driver genes and associated with differential gene expression and changes in overall survival. Several distinct mutational processes gave rise to clustered indels, including signatures that were enriched in tobacco smokers and homologous-recombination-deficient cancers. Doublet-base substitutions were caused by at least 12 mutational processes, whereas most multi-base substitutions were generated by either tobacco smoking or exposure to ultraviolet light. Omikli events, which have previously been attributed to APOBEC3 activity [6], accounted for a large proportion of clustered substitutions; however, only 16.2% of omikli matched APOBEC3 patterns. Kataegis was generated by multiple mutational processes, and 76.1% of all kataegic events exhibited mutational patterns that are associated with the activation-induced deaminase (AID) and APOBEC3 family of deaminases. Co-occurrence of APOBEC3 kataegis and extrachromosomal DNA (ecDNA), termed kyklonas (Greek for cyclone), was found in 31% of samples with ecDNA. Multiple distinct kyklonic events were observed on most mutated ecDNA. ecDNA containing known cancer genes exhibited both positive selection and kyklonic hypermutation. Our results reveal the diversity of clustered mutational processes in human cancer and the role of APOBEC3 in recurrently mutating and fuelling the evolution of ecDNA.


Subject(s)
Neoplasms , APOBEC Deaminases/genetics , Genome , Humans , INDEL Mutation , Mutagenesis/genetics , Mutation , Neoplasms/genetics
18.
Nature ; 607(7920): 732-740, 2022 Jul.
Article in English | MEDLINE | ID: mdl-35859178

ABSTRACT

Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data [1,2]. Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank [3]. This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.


Subject(s)
Biological Specimen Banks , Genetic Databases , Genetic Variation , Human Genome , Genomics , Whole Genome Sequencing , Africa/ethnology , Asia/ethnology , Cohort Studies , Conserved Sequence , Exons/genetics , Human Genome/genetics , Haplotypes/genetics , Humans , INDEL Mutation , Ireland/ethnology , Microsatellite Repeats , Single Nucleotide Polymorphism/genetics , United Kingdom
19.
Cell ; 150(6): 1121-34, 2012 Sep 14.
Article in English | MEDLINE | ID: mdl-22980976

ABSTRACT

We report the results of whole-genome and transcriptome sequencing of tumor and adjacent normal tissue samples from 17 patients with non-small cell lung carcinoma (NSCLC). We identified 3,726 point mutations and more than 90 indels in the coding sequence, with an average mutation frequency more than 10-fold higher in smokers than in never-smokers. Novel alterations in genes involved in chromatin modification and DNA repair pathways were identified, along with DACH1, CFTR, RELN, ABCB5, and HGF. Deep digital sequencing revealed diverse clonality patterns in both never-smokers and smokers. All validated EGFR and KRAS mutations were present in the founder clones, suggesting possible roles in cancer initiation. Analysis revealed 14 fusions, including ROS1 and ALK, as well as novel metabolic enzymes. Cell-cycle and JAK-STAT pathways are significantly altered in lung cancer, along with perturbations in 54 genes that are potentially targetable with currently available drugs.


Subject(s)
Non-Small Cell Lung Carcinoma/genetics , Non-Small Cell Lung Carcinoma/pathology , Lung Neoplasms/genetics , Lung Neoplasms/pathology , Smoking/genetics , Smoking/pathology , Non-Small Cell Lung Carcinoma/therapy , Chromosome Aberrations , Female , Gene Expression Profiling , Genome-Wide Association Study , High-Throughput Nucleotide Sequencing , Humans , INDEL Mutation , Lung Neoplasms/therapy , Male , Molecular Targeted Therapy , Point Mutation , Reelin Protein
20.
Nature ; 594(7862): 227-233, 2021 06.
Article in English | MEDLINE | ID: mdl-33910227

ABSTRACT

The accurate and complete assembly of both haplotype sequences of a diploid organism is essential to understanding the role of variation in genome functions, phenotypes and diseases1. Here, using a trio-binning approach, we present a high-quality, diploid reference genome, with both haplotypes assembled independently at the chromosome level, for the common marmoset (Callithrix jacchus), a primate model system that is widely used in biomedical research2,3. The full spectrum of heterozygosity between the two haplotypes involves 1.36% of the genome, much higher than the 0.13% indicated by the standard estimation based on single-nucleotide heterozygosity alone. The de novo mutation rate is 0.43 × 10⁻⁸ per site per generation, and the paternally inherited genome acquired twice as many mutations as the maternal. Our diploid assembly enabled us to discover a recent expansion of the sex-differentiation region and unique evolutionary changes in the marmoset Y chromosome. In addition, we identified many genes with signatures of positive selection that might have contributed to the evolution of Callithrix biological features. Brain-related genes were highly conserved between marmosets and humans, although several genes experienced lineage-specific copy number variations or diversifying selection, with implications for the use of marmosets as a model system.


Subject(s)
Callithrix/genetics , Diploidy , Molecular Evolution , Genome/genetics , Genomics/standards , Animals , Biomedical Research , DNA Copy Number Variations , Female , Germ-Line Mutation/genetics , Haplotypes/genetics , Heterozygote , Humans , INDEL Mutation/genetics , Male , Reference Standards , Genetic Selection , Sex Differentiation/genetics , Y Chromosome/genetics