Pesquisa | Biblioteca Virtual em Saúde Fiocruz

1.

Recurrent inversion polymorphisms in humans associate with genetic instability and genomic disorders.

Porubsky, David; Höps, Wolfram; Ashraf, Hufsah; Hsieh, PingHsun; Rodriguez-Martin, Bernardo; Yilmaz, Feyza; Ebler, Jana; Hallast, Pille; Maria Maggiolini, Flavia Angela; Harvey, William T; Henning, Barbara; Audano, Peter A; Gordon, David S; Ebert, Peter; Hasenfeld, Patrick; Benito, Eva; Zhu, Qihui; Lee, Charles; Antonacci, Francesca; Steinrücken, Matthias; Beck, Christine R; Sanders, Ashley D; Marschall, Tobias; Eichler, Evan E; Korbel, Jan O.

Cell ; 185(11): 1986-2005.e26, 2022 05 26.

Artigo em Inglês | MEDLINE | ID: mdl-35525246

RESUMO

Unlike copy number variants (CNVs), inversions remain an underexplored genetic variation class. By integrating multiple genomic technologies, we discover 729 inversions in 41 human genomes. Approximately 85% of inversions <2 kbp form by twin-priming during L1 retrotransposition; 80% of the larger inversions are balanced and affect twice as many nucleotides as CNVs. Balanced inversions show an excess of common variants, and 72% are flanked by segmental duplications (SDs) or retrotransposons. Since flanking repeats promote non-allelic homologous recombination, we developed complementary approaches to identify recurrent inversion formation. We describe 40 recurrent inversions encompassing 0.6% of the genome, showing inversion rates up to 2.7 × 10-4 per locus per generation. Recurrent inversions exhibit a sex-chromosomal bias and co-localize with genomic disorder critical regions. We propose that inversion recurrence results in an elevated number of heterozygous carriers and structural SD diversity, which increases mutability in the population and predisposes specific haplotypes to disease-causing CNVs.

Assuntos

Inversão Cromossômica , Duplicações Segmentares Genômicas , Inversão Cromossômica/genética , Variações do Número de Cópias de DNA/genética , Genoma Humano , Genômica , Humanos

2.

Assembly of 43 human Y chromosomes reveals extensive complexity and variation.

Hallast, Pille; Ebert, Peter; Loftus, Mark; Yilmaz, Feyza; Audano, Peter A; Logsdon, Glennis A; Bonder, Marc Jan; Zhou, Weichen; Höps, Wolfram; Kim, Kwondo; Li, Chong; Hoyt, Savannah J; Dishuck, Philip C; Porubsky, David; Tsetsos, Fotios; Kwon, Jee Young; Zhu, Qihui; Munson, Katherine M; Hasenfeld, Patrick; Harvey, William T; Lewis, Alexandra P; Kordosky, Jennifer; Hoekzema, Kendra; O'Neill, Rachel J; Korbel, Jan O; Tyler-Smith, Chris; Eichler, Evan E; Shi, Xinghua; Beck, Christine R; Marschall, Tobias; Konkel, Miriam K; Lee, Charles.

Nature ; 621(7978): 355-364, 2023 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-37612510

RESUMO

The prevalence of highly repetitive sequences within the human Y chromosome has prevented its complete assembly to date1 and led to its systematic omission from genomic analyses. Here we present de novo assemblies of 43 Y chromosomes spanning 182,900 years of human evolution and report considerable diversity in size and structure. Half of the male-specific euchromatic region is subject to large inversions with a greater than twofold higher recurrence rate compared with all other chromosomes2. Ampliconic sequences associated with these inversions show differing mutation rates that are sequence context dependent, and some ampliconic genes exhibit evidence for concerted evolution with the acquisition and purging of lineage-specific pseudogenes. The largest heterochromatic region in the human genome, Yq12, is composed of alternating repeat arrays that show extensive variation in the number, size and distribution, but retain a 1:1 copy-number ratio. Finally, our data suggest that the boundary between the recombining pseudoautosomal region 1 and the non-recombining portions of the X and Y chromosomes lies 500 kb away from the currently established1 boundary. The availability of fully sequence-resolved Y chromosomes from multiple individuals provides a unique opportunity for identifying new associations of traits with specific Y-chromosomal variants and garnering insights into the evolution and function of complex regions of the human genome.

Assuntos

Cromossomos Humanos Y , Evolução Molecular , Humanos , Masculino , Cromossomos Humanos Y/genética , Genoma Humano/genética , Genômica , Taxa de Mutação , Fenótipo , Eucromatina/genética , Pseudogenes , Variação Genética/genética , Cromossomos Humanos X/genética , Regiões Pseudoautossômicas/genética

3.

Criteria for inference of chromothripsis in cancer genomes.

Korbel, Jan O; Campbell, Peter J.

Cell ; 152(6): 1226-36, 2013 Mar 14.

Artigo em Inglês | MEDLINE | ID: mdl-23498933

RESUMO

Chromothripsis scars the genome when localized chromosome shattering and repair occurs in a one-off catastrophe. Outcomes of this process are detectable as massive DNA rearrangements affecting one or a few chromosomes. Although recent findings suggest a crucial role of chromothripsis in cancer development, the reproducible inference of this process remains challenging, requiring that cataclysmic one-off rearrangements be distinguished from localized lesions that occur progressively. We describe conceptual criteria for the inference of chromothripsis, based on ruling out the alternative hypothesis that stepwise rearrangements occurred. Robust means of inference may facilitate in-depth studies on the impact of, and the mechanisms underlying, chromothripsis.

Assuntos

Aberrações Cromossômicas , Neoplasias/genética , Animais , Transformação Celular Neoplásica , Rearranjo Gênico , Humanos

4.

Semi-automated assembly of high-quality diploid human reference genomes.

Jarvis, Erich D; Formenti, Giulio; Rhie, Arang; Guarracino, Andrea; Yang, Chentao; Wood, Jonathan; Tracey, Alan; Thibaud-Nissen, Francoise; Vollger, Mitchell R; Porubsky, David; Cheng, Haoyu; Asri, Mobin; Logsdon, Glennis A; Carnevali, Paolo; Chaisson, Mark J P; Chin, Chen-Shan; Cody, Sarah; Collins, Joanna; Ebert, Peter; Escalona, Merly; Fedrigo, Olivier; Fulton, Robert S; Fulton, Lucinda L; Garg, Shilpa; Gerton, Jennifer L; Ghurye, Jay; Granat, Anastasiya; Green, Richard E; Harvey, William; Hasenfeld, Patrick; Hastie, Alex; Haukness, Marina; Jaeger, Erich B; Jain, Miten; Kirsche, Melanie; Kolmogorov, Mikhail; Korbel, Jan O; Koren, Sergey; Korlach, Jonas; Lee, Joyce; Li, Daofeng; Lindsay, Tina; Lucas, Julian; Luo, Feng; Marschall, Tobias; Mitchell, Matthew W; McDaniel, Jennifer; Nie, Fan; Olsen, Hugh E; Olson, Nathan D.

Nature ; 611(7936): 519-531, 2022 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-36261518

RESUMO

The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has benefitted society1,2. However, it still has many gaps and errors, and does not represent a biological genome as it is a blend of multiple individuals3,4. Recently, a high-quality telomere-to-telomere reference, CHM13, was generated with the latest long-read technologies, but it was derived from a hydatidiform mole cell line with a nearly homozygous genome5. To address these limitations, the Human Pangenome Reference Consortium formed with the goal of creating high-quality, cost-effective, diploid genome assemblies for a pangenome reference that represents human genetic diversity6. Here, in our first scientific report, we determined which combination of current genome sequencing and assembly approaches yield the most complete and accurate diploid genome assembly with minimal manual curation. Approaches that used highly accurate long reads and parent-child data with graph-based haplotype phasing during assembly outperformed those that did not. Developing a combination of the top-performing methods, we generated our first high-quality diploid reference assembly, containing only approximately four gaps per chromosome on average, with most chromosomes within ±1% of the length of CHM13. Nearly 48% of protein-coding genes have non-synonymous amino acid changes between haplotypes, and centromeric regions showed the highest diversity. Our findings serve as a foundation for assembling near-complete diploid human genomes at scale for a pangenome reference to capture global genetic variation from single nucleotides to structural rearrangements.

Assuntos

Mapeamento Cromossômico , Diploide , Genoma Humano , Genômica , Humanos , Mapeamento Cromossômico/normas , Genoma Humano/genética , Haplótipos/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento de Nucleotídeos em Larga Escala/normas , Análise de Sequência de DNA/métodos , Análise de Sequência de DNA/normas , Padrões de Referência , Genômica/métodos , Genômica/normas , Cromossomos Humanos/genética , Variação Genética/genética

5.

Gaps and complex structurally variant loci in phased genome assemblies.

Porubsky, David; Vollger, Mitchell R; Harvey, William T; Rozanski, Allison N; Ebert, Peter; Hickey, Glenn; Hasenfeld, Patrick; Sanders, Ashley D; Stober, Catherine; Korbel, Jan O; Paten, Benedict; Marschall, Tobias; Eichler, Evan E.

Genome Res ; 33(4): 496-510, 2023 04.

Artigo em Inglês | MEDLINE | ID: mdl-37164484

RESUMO

There has been tremendous progress in phased genome assembly production by combining long-read data with parental information or linked-read data. Nevertheless, a typical phased genome assembly generated by trio-hifiasm still generates more than 140 gaps. We perform a detailed analysis of gaps, assembly breaks, and misorientations from 182 haploid assemblies obtained from a diversity panel of 77 unique human samples. Although trio-based approaches using HiFi are the current gold standard, chromosome-wide phasing accuracy is comparable when using Strand-seq instead of parental data. Importantly, the majority of assembly gaps cluster near the largest and most identical repeats (including segmental duplications [35.4%], satellite DNA [22.3%], or regions enriched in GA/AT-rich DNA [27.4%]). Consequently, 1513 protein-coding genes overlap assembly gaps in at least one haplotype, and 231 are recurrently disrupted or missing from five or more haplotypes. Furthermore, we estimate that 6-7 Mbp of DNA are misorientated per haplotype irrespective of whether trio-free or trio-based approaches are used. Of these misorientations, 81% correspond to bona fide large inversion polymorphisms in the human species, most of which are flanked by large segmental duplications. We also identify large-scale alignment discontinuities consistent with 11.9 Mbp of deletions and 161.4 Mbp of insertions per haploid genome. Although 99% of this variation corresponds to satellite DNA, we identify 230 regions of euchromatic DNA with frequent expansions and contractions, nearly half of which overlap with 197 protein-coding genes. Such variable and incompletely assembled regions are important targets for future algorithmic development and pangenome representation.

Assuntos

DNA Satélite , Polimorfismo Genético , Humanos , DNA Satélite/genética , Haplótipos , Duplicações Segmentares Genômicas , Análise de Sequência de DNA

6.

Patterns of somatic structural variation in human cancer genomes.

Li, Yilong; Roberts, Nicola D; Wala, Jeremiah A; Shapira, Ofer; Schumacher, Steven E; Kumar, Kiran; Khurana, Ekta; Waszak, Sebastian; Korbel, Jan O; Haber, James E; Imielinski, Marcin; Weischenfeldt, Joachim; Beroukhim, Rameen; Campbell, Peter J.

Nature ; 578(7793): 112-121, 2020 02.

Artigo em Inglês | MEDLINE | ID: mdl-32025012

RESUMO

A key mutational process in cancer is structural variation, in which rearrangements delete, amplify or reorder genomic segments that range in size from kilobases to whole chromosomes1-7. Here we develop methods to group, classify and describe somatic structural variants, using data from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), which aggregated whole-genome sequencing data from 2,658 cancers across 38 tumour types8. Sixteen signatures of structural variation emerged. Deletions have a multimodal size distribution, assort unevenly across tumour types and patients, are enriched in late-replicating regions and correlate with inversions. Tandem duplications also have a multimodal size distribution, but are enriched in early-replicating regions-as are unbalanced translocations. Replication-based mechanisms of rearrangement generate varied chromosomal structures with low-level copy-number gains and frequent inverted rearrangements. One prominent structure consists of 2-7 templates copied from distinct regions of the genome strung together within one locus. Such cycles of templated insertions correlate with tandem duplications, and-in liver cancer-frequently activate the telomerase gene TERT. A wide variety of rearrangement processes are active in cancer, which generate complex configurations of the genome upon which selection can act.

Assuntos

Variação Genética , Genoma Humano/genética , Neoplasias/genética , Rearranjo Gênico/genética , Genômica , Humanos , Mutagênese Insercional , Telomerase/genética

7.

Genomic basis for RNA alterations in cancer.

Calabrese, Claudia; Davidson, Natalie R; Demircioglu, Deniz; Fonseca, Nuno A; He, Yao; Kahles, André; Lehmann, Kjong-Van; Liu, Fenglin; Shiraishi, Yuichi; Soulette, Cameron M; Urban, Lara; Greger, Liliana; Li, Siliang; Liu, Dongbing; Perry, Marc D; Xiang, Qian; Zhang, Fan; Zhang, Junjun; Bailey, Peter; Erkek, Serap; Hoadley, Katherine A; Hou, Yong; Huska, Matthew R; Kilpinen, Helena; Korbel, Jan O; Marin, Maximillian G; Markowski, Julia; Nandi, Tannistha; Pan-Hammarström, Qiang; Pedamallu, Chandra Sekhar; Siebert, Reiner; Stark, Stefan G; Su, Hong; Tan, Patrick; Waszak, Sebastian M; Yung, Christina; Zhu, Shida; Awadalla, Philip; Creighton, Chad J; Meyerson, Matthew; Ouellette, B F Francis; Wu, Kui; Yang, Huanming; Brazma, Alvis; Brooks, Angela N; Göke, Jonathan; Rätsch, Gunnar; Schwarz, Roland F; Stegle, Oliver; Zhang, Zemin.

Nature ; 578(7793): 129-136, 2020 02.

Artigo em Inglês | MEDLINE | ID: mdl-32025019

RESUMO

Transcript alterations often result from somatic changes in cancer genomes1. Various forms of RNA alterations have been described in cancer, including overexpression2, altered splicing3 and gene fusions4; however, it is difficult to attribute these to underlying genomic changes owing to heterogeneity among patients and tumour types, and the relatively small cohorts of patients for whom samples have been analysed by both transcriptome and whole-genome sequencing. Here we present, to our knowledge, the most comprehensive catalogue of cancer-associated gene alterations to date, obtained by characterizing tumour transcriptomes from 1,188 donors of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA)5. Using matched whole-genome sequencing data, we associated several categories of RNA alterations with germline and somatic DNA alterations, and identified probable genetic mechanisms. Somatic copy-number alterations were the major drivers of variations in total gene and allele-specific expression. We identified 649 associations of somatic single-nucleotide variants with gene expression in cis, of which 68.4% involved associations with flanking non-coding regions of the gene. We found 1,900 splicing alterations associated with somatic mutations, including the formation of exons within introns in proximity to Alu elements. In addition, 82% of gene fusions were associated with structural variants, including 75 of a new class, termed 'bridged' fusions, in which a third genomic location bridges two genes. We observed transcriptomic alteration signatures that differ between cancer types and have associations with variations in DNA mutational signatures. This compendium of RNA alterations in the genomic context provides a rich resource for identifying genes and mechanisms that are functionally implicated in cancer.

Assuntos

Regulação Neoplásica da Expressão Gênica , Neoplasias/genética , RNA/genética , Variações do Número de Cópias de DNA , DNA de Neoplasias , Genoma Humano , Genômica , Humanos , Transcriptoma

8.

Structural Variation in Cancer: Role, Prevalence, and Mechanisms.

Cosenza, Marco Raffaele; Rodriguez-Martin, Bernardo; Korbel, Jan O.

Annu Rev Genomics Hum Genet ; 23: 123-152, 2022 08 31.

Artigo em Inglês | MEDLINE | ID: mdl-35655332

RESUMO

Somatic rearrangements resulting in genomic structural variation drive malignant phenotypes by altering the expression or function of cancer genes. Pan-cancer studies have revealed that structural variants (SVs) are the predominant class of driver mutation in most cancer types, but because they are difficult to discover, they remain understudied when compared with point mutations. This review provides an overview of the current knowledge of somatic SVs, discussing their primary roles, prevalence in different contexts, and mutational mechanisms. SVs arise throughout the life history of cancer, and 55% of driver mutations uncovered by the Pan-Cancer Analysis of Whole Genomes project represent SVs. Leveraging the convergence of cell biology and genomics, we propose a mechanistic classification of somatic SVs, from simple to highly complex DNA rearrangement classes. The actions of DNA repair and DNA replication processes together with mitotic errors result in a rich spectrum of SV formation processes, with cascading effects mediating extensive structural diversity after an initiating DNA lesion has formed. Thanks to new sequencing technologies, including the sequencing of single-cell genomes, open questions about the molecular triggers and the biomolecules involved in SV formation as well as their mutational rates can now be addressed.

Assuntos

Variação Estrutural do Genoma , Neoplasias , Genoma Humano , Genômica , Humanos , Mutação , Neoplasias/epidemiologia , Neoplasias/genética , Neoplasias/patologia , Prevalência

9.

Familial long-read sequencing increases yield of de novo mutations.

Noyes, Michelle D; Harvey, William T; Porubsky, David; Sulovari, Arvis; Li, Ruiyang; Rose, Nicholas R; Audano, Peter A; Munson, Katherine M; Lewis, Alexandra P; Hoekzema, Kendra; Mantere, Tuomo; Graves-Lindsay, Tina A; Sanders, Ashley D; Goodwin, Sara; Kramer, Melissa; Mokrab, Younes; Zody, Michael C; Hoischen, Alexander; Korbel, Jan O; McCombie, W Richard; Eichler, Evan E.

Am J Hum Genet ; 109(4): 631-646, 2022 04 07.

Artigo em Inglês | MEDLINE | ID: mdl-35290762

RESUMO

Studies of de novo mutation (DNM) have typically excluded some of the most repetitive and complex regions of the genome because these regions cannot be unambiguously mapped with short-read sequencing data. To better understand the genome-wide pattern of DNM, we generated long-read sequence data from an autism parent-child quad with an affected female where no pathogenic variant had been discovered in short-read Illumina sequence data. We deeply sequenced all four individuals by using three sequencing platforms (Illumina, Oxford Nanopore, and Pacific Biosciences) and three complementary technologies (Strand-seq, optical mapping, and 10X Genomics). Using long-read sequencing, we initially discovered and validated 171 DNMs across two children-a 20% increase in the number of de novo single-nucleotide variants (SNVs) and indels when compared to short-read callsets. The number of DNMs further increased by 5% when considering a more complete human reference (T2T-CHM13) because of the recovery of events in regions absent from GRCh38 (e.g., three DNMs in heterochromatic satellites). In total, we validated 195 de novo germline mutations and 23 potential post-zygotic mosaic mutations across both children; the overall true substitution rate based on this integrated callset is at least 1.41 × 10-8 substitutions per nucleotide per generation. We also identified six de novo insertions and deletions in tandem repeats, two of which represent structural variants. We demonstrate that long-read sequencing and assembly, especially when combined with a more complete reference genome, increases the number of DNMs by >25% compared to previous studies, providing a more complete catalog of DNM compared to short-read data alone.

Assuntos

Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Feminino , Humanos , Mutação/genética , Nucleotídeos , Análise de Sequência de DNA , Software

10.

Somatic structural variant formation is guided by and influences genome architecture.

Sidiropoulos, Nikos; Mardin, Balca R; Rodríguez-González, F Germán; Bochkov, Ivan D; Garg, Shilpa; Stütz, Adrian M; Korbel, Jan O; Aiden, Erez Lieberman; Weischenfeldt, Joachim.

Genome Res ; 32(4): 643-655, 2022 04.

Artigo em Inglês | MEDLINE | ID: mdl-35177558

RESUMO

The occurrence and formation of genomic structural variants (SVs) is known to be influenced by the 3D chromatin architecture, but the extent and magnitude have been challenging to study. Here, we apply Hi-C to study chromatin organization before and after induction of chromothripsis in human cells. We use Hi-C to manually assemble the derivative chromosomes following the occurrence of massive complex rearrangements, which allows us to study the sources of SV formation and their consequences on gene regulation. We observe an action-reaction interplay whereby the 3D chromatin architecture directly impacts the location and formation of SVs. In turn, the SVs reshape the chromatin organization to alter the local topologies, replication timing, and gene regulation in cis We show that SVs have a strong tendency to occur between similar chromatin compartments and replication timing regions. Moreover, we find that SVs frequently occur at 3D loop anchors, that SVs can cause a switch in chromatin compartments and replication timing, and that this is a major source of SV-mediated effects on nearby gene expression changes. Finally, we provide evidence for a general mechanistic bias of the 3D chromatin on SV occurrence using data from more than 2700 patient-derived cancer genomes.

Assuntos

Cromotripsia , Genoma , Cromatina/genética , Cromossomos , Genoma Humano , Variação Estrutural do Genoma , Humanos

11.

A high-resolution map of small-scale inversions in the gibbon genome.

Mercuri, Ludovica; Palmisano, Donato; L'Abbate, Alberto; D'Addabbo, Pietro; Montinaro, Francesco; Catacchio, Claudia Rita; Hasenfeld, Patrick; Ventura, Mario; Korbel, Jan O; Sanders, Ashley D; Maggiolini, Flavia Angela Maria; Antonacci, Francesca.

Genome Res ; 32(10): 1941-1951, 2022 10.

Artigo em Inglês | MEDLINE | ID: mdl-36180231

RESUMO

Gibbons are the most speciose family of living apes, characterized by a diverse chromosome number and rapid rate of large-scale rearrangements. Here we performed single-cell template strand sequencing (Strand-seq), molecular cytogenetics, and deep in silico analysis of a southern white-cheeked gibbon genome, providing the first comprehensive map of 238 previously hidden small-scale inversions. We determined that more than half are gibbon specific, at least fivefold higher than shown for other primate lineage-specific inversions, with a significantly high number of small heterozygous inversions, suggesting that accelerated evolution of inversions may have played a role in the high sympatric diversity of gibbons. Although the precise mechanisms underlying these inversions are not yet understood, it is clear that segmental duplication-mediated NAHR only accounts for a small fraction of events. Several genomic features, including gene density and repeat (e.g., LINE-1) content, might render these regions more break-prone and susceptible to inversion formation. In the attempt to characterize interspecific variation between southern and northern white-cheeked gibbons, we identify several large assembly errors in the current GGSC Nleu3.0/nomLeu3 reference genome comprising more than 49 megabases of DNA. Finally, we provide a list of 182 candidate genes potentially involved in gibbon diversification and speciation.

Assuntos

Hominidae , Hylobates , Animais , Hylobates/genética , Genoma , Primatas/genética , Inversão Cromossômica/genética , Cromossomos , Hominidae/genética

12.

Leveraging European infrastructures to access 1 million human genomes by 2022.

Saunders, Gary; Baudis, Michael; Becker, Regina; Beltran, Sergi; Béroud, Christophe; Birney, Ewan; Brooksbank, Cath; Brunak, Søren; Van den Bulcke, Marc; Drysdale, Rachel; Capella-Gutierrez, Salvador; Flicek, Paul; Florindi, Francesco; Goodhand, Peter; Gut, Ivo; Heringa, Jaap; Holub, Petr; Hooyberghs, Jef; Juty, Nick; Keane, Thomas M; Korbel, Jan O; Lappalainen, Ilkka; Leskosek, Brane; Matthijs, Gert; Mayrhofer, Michaela Th; Metspalu, Andres; Navarro, Arcadi; Newhouse, Steven; Nyrönen, Tommi; Page, Angela; Persson, Bengt; Palotie, Aarno; Parkinson, Helen; Rambla, Jordi; Salgado, David; Steinfelder, Erik; Swertz, Morris A; Valencia, Alfonso; Varma, Susheel; Blomberg, Niklas; Scollen, Serena.

Nat Rev Genet ; 20(11): 693-701, 2019 11.

Artigo em Inglês | MEDLINE | ID: mdl-31455890

RESUMO

Human genomics is undergoing a step change from being a predominantly research-driven activity to one driven through health care as many countries in Europe now have nascent precision medicine programmes. To maximize the value of the genomic data generated, these data will need to be shared between institutions and across countries. In recognition of this challenge, 21 European countries recently signed a declaration to transnationally share data on at least 1 million human genomes by 2022. In this Roadmap, we identify the challenges of data sharing across borders and demonstrate that European research infrastructures are well-positioned to support the rapid implementation of widespread genomic data access.

Assuntos

Pesquisa Biomédica , Genoma Humano , Projeto Genoma Humano , Europa (Continente) , Humanos

13.

Author Correction: Leveraging European infrastructures to access 1 million human genomes by 2022.

Saunders, Gary; Baudis, Michael; Becker, Regina; Beltran, Sergi; Béroud, Christophe; Birney, Ewan; Brooksbank, Cath; Brunak, Søren; Van den Bulcke, Marc; Drysdale, Rachel; Capella-Gutierrez, Salvador; Flicek, Paul; Florindi, Francesco; Goodhand, Peter; Gut, Ivo; Heringa, Jaap; Holub, Petr; Hooyberghs, Jef; Juty, Nick; Keane, Thomas M; Korbel, Jan O; Lappalainen, Ilkka; Leskosek, Brane; Matthijs, Gert; Mayrhofer, Michaela Th; Metspalu, Andres; Navarro, Arcadi; Newhouse, Steven; Nyrönen, Tommi; Page, Angela; Persson, Bengt; Palotie, Aarno; Parkinson, Helen; Rambla, Jordi; Salgado, David; Steinfelder, Erik; Swertz, Morris A; Valencia, Alfonso; Varma, Susheel; Blomberg, Niklas; Scollen, Serena.

Nat Rev Genet ; 20(11): 702, 2019 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-31520075

RESUMO

An amendment to this paper has been published and can be accessed via a link at the top of the paper.

14.

Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies.

Zhao, Xuefang; Collins, Ryan L; Lee, Wan-Ping; Weber, Alexandra M; Jun, Yukyung; Zhu, Qihui; Weisburd, Ben; Huang, Yongqing; Audano, Peter A; Wang, Harold; Walker, Mark; Lowther, Chelsea; Fu, Jack; Gerstein, Mark B; Devine, Scott E; Marschall, Tobias; Korbel, Jan O; Eichler, Evan E; Chaisson, Mark J P; Lee, Charles; Mills, Ryan E; Brand, Harrison; Talkowski, Michael E.

Am J Hum Genet ; 108(5): 919-928, 2021 05 06.

Artigo em Inglês | MEDLINE | ID: mdl-33789087

RESUMO

Virtually all genome sequencing efforts in national biobanks, complex and Mendelian disease programs, and medical genetic initiatives are reliant upon short-read whole-genome sequencing (srWGS), which presents challenges for the detection of structural variants (SVs) relative to emerging long-read WGS (lrWGS) technologies. Given this ubiquity of srWGS in large-scale genomics initiatives, we sought to establish expectations for routine SV detection from this data type by comparison with lrWGS assembly, as well as to quantify the genomic properties and added value of SVs uniquely accessible to each technology. Analyses from the Human Genome Structural Variation Consortium (HGSVC) of three families captured ~11,000 SVs per genome from srWGS and ~25,000 SVs per genome from lrWGS assembly. Detection power and precision for SV discovery varied dramatically by genomic context and variant class: 9.7% of the current GRCh38 reference is defined by segmental duplication (SD) and simple repeat (SR), yet 91.4% of deletions that were specifically discovered by lrWGS localized to these regions. Across the remaining 90.3% of reference sequence, we observed extremely high (93.8%) concordance between technologies for deletions in these datasets. In contrast, lrWGS was superior for detection of insertions across all genomic contexts. Given that non-SD/SR sequences encompass 95.9% of currently annotated disease-associated exons, improved sensitivity from lrWGS to discover novel pathogenic deletions in these currently interpretable genomic regions is likely to be incremental. However, these analyses highlight the considerable added value of assembly-based lrWGS to create new catalogs of insertions and transposable elements, as well as disease-associated repeat expansions in genomic sequences that were previously recalcitrant to routine assessment.

Assuntos

Genoma Humano/genética , Variação Estrutural do Genoma , Genômica/métodos , Objetivos , Sequenciamento Completo do Genoma/métodos , Sequenciamento Completo do Genoma/normas , Variações do Número de Cópias de DNA , Éxons/genética , Humanos , Projetos de Pesquisa , Duplicações Segmentares Genômicas , Alinhamento de Sequência

15.

Author Correction: Patterns of somatic structural variation in human cancer genomes.

Li, Yilong; Roberts, Nicola D; Wala, Jeremiah A; Shapira, Ofer; Schumacher, Steven E; Kumar, Kiran; Khurana, Ekta; Waszak, Sebastian; Korbel, Jan O; Haber, James E; Imielinski, Marcin; Weischenfeldt, Joachim; Beroukhim, Rameen; Campbell, Peter J.

Nature ; 614(7948): E38, 2023 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-36697835

16.

Author Correction: Genomic basis for RNA alterations in cancer.

Calabrese, Claudia; Davidson, Natalie R; Demircioglu, Deniz; Fonseca, Nuno A; He, Yao; Kahles, André; Lehmann, Kjong-Van; Liu, Fenglin; Shiraishi, Yuichi; Soulette, Cameron M; Urban, Lara; Greger, Liliana; Li, Siliang; Liu, Dongbing; Perry, Marc D; Xiang, Qian; Zhang, Fan; Zhang, Junjun; Bailey, Peter; Erkek, Serap; Hoadley, Katherine A; Hou, Yong; Huska, Matthew R; Kilpinen, Helena; Korbel, Jan O; Marin, Maximillian G; Markowski, Julia; Nandi, Tannistha; Pan-Hammarström, Qiang; Pedamallu, Chandra Sekhar; Siebert, Reiner; Stark, Stefan G; Su, Hong; Tan, Patrick; Waszak, Sebastian M; Yung, Christina; Zhu, Shida; Awadalla, Philip; Creighton, Chad J; Meyerson, Matthew; Ouellette, B F Francis; Wu, Kui; Yang, Huanming; Brazma, Alvis; Brooks, Angela N; Göke, Jonathan; Rätsch, Gunnar; Schwarz, Roland F; Stegle, Oliver; Zhang, Zemin.

Nature ; 614(7948): E37, 2023 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-36697831

17.

Single-cell strand sequencing of a macaque genome reveals multiple nested inversions and breakpoint reuse during primate evolution.

Maggiolini, Flavia Angela Maria; Sanders, Ashley D; Shew, Colin James; Sulovari, Arvis; Mao, Yafei; Puig, Marta; Catacchio, Claudia Rita; Dellino, Maria; Palmisano, Donato; Mercuri, Ludovica; Bitonto, Miriana; Porubský, David; Cáceres, Mario; Eichler, Evan E; Ventura, Mario; Dennis, Megan Y; Korbel, Jan O; Antonacci, Francesca.

Genome Res ; 30(11): 1680-1693, 2020 11.

Artigo em Inglês | MEDLINE | ID: mdl-33093070

RESUMO

Rhesus macaque is an Old World monkey that shared a common ancestor with human â¼25 Myr ago and is an important animal model for human disease studies. A deep understanding of its genetics is therefore required for both biomedical and evolutionary studies. Among structural variants, inversions represent a driving force in speciation and play an important role in disease predisposition. Here we generated a genome-wide map of inversions between human and macaque, combining single-cell strand sequencing with cytogenetics. We identified 375 total inversions between 859 bp and 92 Mbp, increasing by eightfold the number of previously reported inversions. Among these, 19 inversions flanked by segmental duplications overlap with recurrent copy number variants associated with neurocognitive disorders. Evolutionary analyses show that in 17 out of 19 cases, the Hominidae orientation of these disease-associated regions is always derived. This suggests that duplicated sequences likely played a fundamental role in generating inversions in humans and great apes, creating architectures that nowadays predispose these regions to disease-associated genetic instability. Finally, we identified 861 genes mapping at 156 inversions breakpoints, with some showing evidence of differential expression in human and macaque cell lines, thus highlighting candidates that might have contributed to the evolution of species-specific features. This study depicts the most accurate fine-scale map of inversions between human and macaque using a two-pronged integrative approach, such as single-cell strand sequencing and cytogenetics, and represents a valuable resource toward understanding of the biology and evolution of primate species.

Assuntos

Pontos de Quebra do Cromossomo , Inversão Cromossômica , Evolução Molecular , Macaca mulatta/genética , Animais , Doença/genética , Regulação da Expressão Gênica , Genoma , Genômica , Heterozigoto , Humanos , Hibridização in Situ Fluorescente , Recombinação Genética , Análise de Sequência de DNA , Análise de Célula Única

18.

Targeted Perturb-seq enables genome-scale genetic screens in single cells.

Schraivogel, Daniel; Gschwind, Andreas R; Milbank, Jennifer H; Leonce, Daniel R; Jakob, Petra; Mathur, Lukas; Korbel, Jan O; Merten, Christoph A; Velten, Lars; Steinmetz, Lars M.

Nat Methods ; 17(6): 629-635, 2020 06.

Artigo em Inglês | MEDLINE | ID: mdl-32483332

RESUMO

The transcriptome contains rich information on molecular, cellular and organismal phenotypes. However, experimental and statistical limitations constrain sensitivity and throughput of genetic screening with single-cell transcriptomics readout. To overcome these limitations, we introduce targeted Perturb-seq (TAP-seq), a sensitive, inexpensive and platform-independent method focusing single-cell RNA-seq coverage on genes of interest, thereby increasing the sensitivity and scale of genetic screens by orders of magnitude. TAP-seq permits routine analysis of thousands of CRISPR-mediated perturbations within a single experiment, detects weak effects and lowly expressed genes, and decreases sequencing requirements by up to 50-fold. We apply TAP-seq to generate perturbation-based enhancer-target gene maps for 1,778 enhancers within 2.5% of the human genome. We thereby show that enhancer-target association is jointly determined by three-dimensional contact frequency and epigenetic states, allowing accurate prediction of enhancer targets throughout the genome. In addition, we demonstrate that TAP-seq can identify cell subtypes with only 100 sequencing reads per cell.

Assuntos

Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas/genética , Genoma Humano , RNA-Seq/métodos , Análise de Célula Única/métodos , Transcriptoma/genética , Humanos

19.

Focal structural variants revealed by whole genome sequencing disrupt the histone demethylase KDM4C in B-cell lymphomas.

Lopez, Cristina; Schleussner, Nikolai; Bernhart, Stephan H; Kleinheinz, Kortine; Sungalee, Stephanie; Sczakiel, Henrike L; Kretzmer, Helene; Toprak, Umut H; Glaser, Selina; Wagener, Rabea; Ammerpohl, Ole; Bens, Susanne; Giefing, Maciej; Sanchez, Juan C Gonzalez; Apic, Gordana; Hubschmann, Daniel; Janz, Martin; Kreuz, Markus; Mottok, Anja; Muller, Judith M; Seufert, Julian; Hoffmann, Steve; Korbel, Jan O; Russell, Robert B; Schule, Roland; Trumper, Lorenz; Klapper, Wolfram; Radlwimmer, Bernhard; Lichter, Peter; Kuppers, Ralf; Schlesner, Matthias; Mathas, Stephan; Siebert, Reiner.

Haematologica ; 108(2): 543-554, 2023 02 01.

Artigo em Inglês | MEDLINE | ID: mdl-35522148

RESUMO

Histone methylation-modifiers, such as EZH2 and KMT2D, are recurrently altered in B-cell lymphomas. To comprehensively describe the landscape of alterations affecting genes encoding histone methylation-modifiers in lymphomagenesis we investigated whole genome and transcriptome data of 186 mature B-cell lymphomas sequenced in the ICGC MMML-Seq project. Besides confirming common alterations of KMT2D (47% of cases), EZH2 (17%), SETD1B (5%), PRDM9 (4%), KMT2C (4%), and SETD2 (4%), also identified by prior exome or RNA-sequencing studies, we here found recurrent alterations to KDM4C in chromosome 9p24, encoding a histone demethylase. Focal structural variation was the main mechanism of KDM4C alterations, and was independent from 9p24 amplification. We also identified KDM4C alterations in lymphoma cell lines including a focal homozygous deletion in a classical Hodgkin lymphoma cell line. By integrating RNA-sequencing and genome sequencing data we predict that KDM4C structural variants result in loss-offunction. By functional reconstitution studies in cell lines, we provide evidence that KDM4C can act as a tumor suppressor. Thus, we show that identification of structural variants in whole genome sequencing data adds to the comprehensive description of the mutational landscape of lymphomas and, moreover, establish KDM4C as a putative tumor suppressive gene recurrently altered in subsets of B-cell derived lymphomas.

Assuntos

Linfoma de Células B , Linfoma , Humanos , Histonas/metabolismo , Histona Desmetilases/genética , Homozigoto , Deleção de Sequência , Linfoma/genética , Linfoma de Células B/genética , Sequenciamento Completo do Genoma , RNA , Histona Desmetilases com o Domínio Jumonji/genética , Histona Desmetilases com o Domínio Jumonji/química , Histona Desmetilases com o Domínio Jumonji/metabolismo , Histona-Lisina N-Metiltransferase/genética

20.

ASHLEYS: automated quality control for single-cell Strand-seq data.

Gros, Christina; Sanders, Ashley D; Korbel, Jan O; Marschall, Tobias; Ebert, Peter.

Bioinformatics ; 37(19): 3356-3357, 2021 Oct 11.

Artigo em Inglês | MEDLINE | ID: mdl-33792647

RESUMO

SUMMARY: Single-cell DNA template strand sequencing (Strand-seq) enables chromosome length haplotype phasing, construction of phased assemblies, mapping sister-chromatid exchange events and structural variant discovery. The initial quality control of potentially thousands of single-cell libraries is still done manually by domain experts. ASHLEYS automates this tedious task, delivers near-expert performance and labels even large datasets in seconds. AVAILABILITY AND IMPLEMENTATION: github.com/friendsofstrandseq/ashleys-qc, MIT license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA