Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 583
Filtrar
1.
Nat Commun ; 15(1): 8007, 2024 Sep 13.
Artigo em Inglês | MEDLINE | ID: mdl-39266513

RESUMO

Modern sequencing technology enables the systematic detection of complex structural variation (SV) across genomes. However, extensive DNA rearrangements arising through a series of mutations, a phenomenon we refer to as serial SV (sSV), remain underexplored, posing a challenge for SV discovery. Here, we present NAHRwhals ( https://github.com/WHops/NAHRwhals ), a method to infer repeat-mediated series of SVs in long-read genomic assemblies. Applying NAHRwhals to haplotype-resolved human genomes from 28 individuals reveals 37 sSV loci of various length and complexity. These sSVs explain otherwise cryptic variation in medically relevant regions such as the TPSAB1 gene, 8p23.1, 22q11 and Sotos syndrome regions. Comparisons with great ape assemblies indicate that most human sSVs formed recently, after the human-ape split, and involved non-repeat-mediated processes in addition to non-allelic homologous recombination. NAHRwhals reliably discovers and characterizes sSVs at scale and independent of species, uncovering their genomic abundance and suggesting broader implications for disease.


Assuntos
Genoma Humano , Variação Estrutural do Genoma , Hominidae , Humanos , Animais , Hominidae/genética , Genoma Humano/genética , Genômica/métodos , Haplótipos
2.
Genome Med ; 16(1): 113, 2024 Sep 19.
Artigo em Inglês | MEDLINE | ID: mdl-39300495

RESUMO

BACKGROUND: Structural variations (SVs) are key genetic contributors to neurodevelopmental disorders (NDDs). Exome sequencing (ES), the current first-line tool for genetic testing of NDDs, falls short in SVs detection. This diagnostic gap is being actively addressed by new methods such as optical genome mapping (OGM). METHODS: This study evaluated the utility of combining OGM and RNA-seq in the detection and interpretation of SVs in ES-negative NDDs. OGM was performed in 43 patients with NDDs with inconclusive ES results. Candidate SVs were selected based on disease association and pathogenicity evaluation, and further validated or reconstructed by alternative methods, including long-read sequencing for a complex rearrangement event. RNA-Seq was performed on blood samples from patients with candidate SVs to facilitate interpretation of pathogenicity. RESULTS: OGM detected four candidate SVs, and RNA-seq confirmed the pathogenicity of three SVs in the patient cohort. This combined approach solved three cases-two cases with de novo SVs in genes associated with autosomal dominant NDDs, including a deletion encompassing the promoter and 5'UTR of MBD5 and an intragenic duplication of PAFAH1B1, and a third case possessing an intragenic duplication in trans with a pathogenic single-nucleotide variant of PLA2G6, associated with autosomal recessive NDDs. The expression alteration of the affected genes and the tandem positioning of two intragenic duplications were confirmed by RNA-seq. In the fourth case, OGM detected a complex rearrangement involving chromosomes 2 and 6, much more complex than the de novo t(2:6)(q13;q15) indicated by conventional cytogenetic analysis. Reconstruction showed that 17 segments of 6q15 spanning 9.3 Mb were disarranged and joined 2q11.2, with four breakpoints detected in the 5' and 3' non-coding region of the NDD-associated gene SYNCRIP. RNA-seq revealed largely preserved SYNCRIP expression, leaving the pathogenicity of this complex rearrangement event uncertain. CONCLUSIONS: SVs in ES-negative NDDs can be identified by OGM, which is particularly useful for SVs in non-coding regions not covered by ES. OGM helps to construct complex SVs and provides information on the location and orientation of duplications, which is crucial for pathogenicity interpretation. The integration of RNA-seq facilitates the interpretation of the functional consequences of SVs at the transcriptional level. These findings demonstrate the utility and feasibility of combining OGM and RNA-seq in ES-negative cases with NDDs.


Assuntos
Mapeamento Cromossômico , Transtornos do Neurodesenvolvimento , RNA-Seq , Humanos , Transtornos do Neurodesenvolvimento/genética , Transtornos do Neurodesenvolvimento/diagnóstico , Masculino , Feminino , Criança , Sequenciamento do Exoma , Variação Estrutural do Genoma , Pré-Escolar
3.
Bioinformatics ; 40(Suppl 2): ii11-ii19, 2024 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-39230689

RESUMO

MOTIVATION: Complex structural variants (SVs) are genomic rearrangements that involve multiple segments of DNA. They contribute to human diversity and have been shown to cause Mendelian disease. Nevertheless, our abilities to analyse complex SVs are very limited. As opposed to deletions and other canonical types of SVs, there are no established tools that have explicitly been designed for analysing complex SVs. RESULTS: Here, we describe a new computational approach that we specifically designed for genotyping complex SVs in short-read sequenced genomes. Given a variant description, our approach computes genotype-specific probability distributions for observing aligned read pairs with a wide range of properties. Subsequently, these distributions can be used to efficiently determine the most likely genotype for any set of aligned read pairs observed in a sequenced genome. In addition, we use these distributions to compute a genotyping difficulty for a given variant, which predicts the amount of data needed to achieve a reliable call. Careful evaluation confirms that our approach outperforms other genotypers by making reliable genotype predictions across both simulated and real data. On up to 7829 human genomes, we achieve high concordance with population-genetic assumptions and expected inheritance patterns. On simulated data, we show that precision correlates well with our prediction of genotyping difficulty. This together with low memory and time requirements makes our approach well-suited for application in biomedical studies involving small to very large numbers of short-read sequenced genomes. AVAILABILITY AND IMPLEMENTATION: Source code is available at https://github.com/kehrlab/Complex-SV-Genotyping.


Assuntos
Genoma Humano , Variação Estrutural do Genoma , Análise de Sequência de DNA , Software , Humanos , Análise de Sequência de DNA/métodos , Genótipo , Técnicas de Genotipagem/métodos , Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Genômica/métodos
4.
Curr Opin Genet Dev ; 88: 102240, 2024 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-39121701

RESUMO

Advances in sequencing technologies have enabled the comparison of high-quality genomes of diverse primate species, revealing vast amounts of divergence due to structural variation. Given their large size, structural variants (SVs) can simultaneously alter the function and regulation of multiple genes. Studies estimate that collectively more than 3.5% of the genome is divergent in humans versus other great apes, impacting thousands of genes. Functional genomics and gene-editing tools in various model systems recently emerged as an exciting frontier - investigating the wide-ranging impacts of SVs on molecular, cellular, and systems-level phenotypes. This review examines existing research and identifies future directions to broaden our understanding of the functional roles of SVs on phenotypic innovations and diversity impacting uniquely human features, ranging from cognition to metabolic adaptations.


Assuntos
Evolução Molecular , Genoma Humano , Humanos , Animais , Genoma Humano/genética , Genômica , Variação Estrutural do Genoma/genética , Fenótipo , Hominidae/genética , Evolução Biológica , Edição de Genes
5.
Nat Commun ; 15(1): 6956, 2024 Aug 13.
Artigo em Inglês | MEDLINE | ID: mdl-39138168

RESUMO

Structural variants (SVs) significantly contribute to human genome diversity and play a crucial role in precision medicine. Although advancements in single-molecule long-read sequencing offer a groundbreaking resource for SV detection, identifying SV breakpoints and sequences accurately and robustly remains challenging. We introduce VolcanoSV, an innovative hybrid SV detection pipeline that utilizes both a reference genome and local de novo assembly to generate a phased diploid assembly. VolcanoSV uses phased SNPs and unique k-mer similarity analysis, enabling precise haplotype-resolved SV discovery. VolcanoSV is adept at constructing comprehensive genetic maps encompassing SNPs, small indels, and all types of SVs, making it well-suited for human genomics studies. Our extensive experiments demonstrate that VolcanoSV surpasses state-of-the-art assembly-based tools in the detection of insertion and deletion SVs, exhibiting superior recall, precision, F1 scores, and genotype accuracy across a diverse range of datasets, including low-coverage (10x) datasets. VolcanoSV outperforms assembly-based tools in the identification of complex SVs, including translocations, duplications, and inversions, in both simulated and real cancer data. Moreover, VolcanoSV is robust to various evaluation parameters and accurately identifies breakpoints and SV sequences.


Assuntos
Diploide , Genoma Humano , Variação Estrutural do Genoma , Polimorfismo de Nucleotídeo Único , Humanos , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Software , Haplótipos
6.
Virulence ; 15(1): 2382762, 2024 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-39092797

RESUMO

African swine fever (ASF) is a rapidly fatal viral haemorrhagic fever in Chinese domestic pigs. Although very high mortality is observed in pig farms after an ASF outbreak, clinically healthy and antibody-positive pigs are found in those farms, and viral detection is rare from these pigs. The ability of pigs to resist ASF viral infection may be modulated by host genetic variations. However, the genetic basis of the resistance of domestic pigs against ASF remains unclear. We generated a comprehensive set of structural variations (SVs) in a Chinese indigenous Xiang pig with ASF-resistant (Xiang-R) and ASF-susceptible (Xiang-S) phenotypes using whole-genome resequencing method. A total of 53,589 nonredundant SVs were identified, with an average of 25,656 SVs per individual in the Xiang pig genome, including insertion, deletion, inversion and duplication variations. The Xiang-R group harboured more SVs than the Xiang-S group. The F-statistics (FST) was carried out to reveal genetic differences between two populations using the resequencing data at each SV locus. We identified 2,414 population-stratified SVs and annotated 1,152 Ensembl genes (including 986 protein-coding genes), in which 1,326 SVs might disturb the structure and expression of the Ensembl genes. Those protein-coding genes were mainly enriched in the Wnt, Hippo, and calcium signalling pathways. Other important pathways associated with the ASF viral infection were also identified, such as the endocytosis, apoptosis, focal adhesion, Fc gamma R-mediated phagocytosis, junction, NOD-like receptor, PI3K-Akt, and c-type lectin receptor signalling pathways. Finally, we identified 135 candidate adaptive genes overlapping 166 SVs that were involved in the virus entry and virus-host cell interactions. The fact that some of population-stratified SVs regions detected as selective sweep signals gave another support for the genetic variations affecting pig resistance against ASF. The research indicates that SVs play an important role in the evolutionary processes of Xiang pig adaptation to ASF infection.


Assuntos
Vírus da Febre Suína Africana , Febre Suína Africana , Animais , Febre Suína Africana/virologia , Febre Suína Africana/genética , Suínos , Vírus da Febre Suína Africana/genética , Resistência à Doença/genética , Variação Genética , Genoma/genética , Sequenciamento Completo do Genoma , Variação Estrutural do Genoma , China , Sus scrofa
7.
Am J Hum Genet ; 111(8): 1524-1543, 2024 Aug 08.
Artigo em Inglês | MEDLINE | ID: mdl-39053458

RESUMO

Gene misexpression is the aberrant transcription of a gene in a context where it is usually inactive. Despite its known pathological consequences in specific rare diseases, we have a limited understanding of its wider prevalence and mechanisms in humans. To address this, we analyzed gene misexpression in 4,568 whole-blood bulk RNA sequencing samples from INTERVAL study blood donors. We found that while individual misexpression events occur rarely, in aggregate they were found in almost all samples and a third of inactive protein-coding genes. Using 2,821 paired whole-genome and RNA sequencing samples, we identified that misexpression events are enriched in cis for rare structural variants. We established putative mechanisms through which a subset of SVs lead to gene misexpression, including transcriptional readthrough, transcript fusions, and gene inversion. Overall, we develop misexpression as a type of transcriptomic outlier analysis and extend our understanding of the variety of mechanisms by which genetic variants can influence gene expression.


Assuntos
Regulação da Expressão Gênica , Humanos , Análise de Sequência de RNA , Variação Genética , Variação Estrutural do Genoma/genética , Transcriptoma/genética , Doadores de Sangue
8.
Curr Opin Genet Dev ; 87: 102233, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-39042999

RESUMO

Structural variants (SVs) account for the majority of base pair differences both within and between primate species. However, our understanding of inter- and intra-species SV has been historically hampered by the quality of draft primate genomes and the absence of genome resources for key taxa. Recently, advances in long-read sequencing and genome assembly have begun to radically reshape our understanding of SVs. Two landmark achievements include the publication of a human telomere-to-telomere (T2T) genome as well as the development of the first human pangenome reference. In this review, we first look back to the major works laying the foundation for these projects. We then examine the ways in which T2T genome assemblies and pangenomes are transforming our understanding of and approach to primate SV. Finally, we discuss what the future of primate SV research may look like in the era of T2T genomes and pangenomics.


Assuntos
Genômica , Primatas , Telômero , Humanos , Animais , Primatas/genética , Telômero/genética , Genômica/métodos , Genoma Humano , Genoma/genética , Evolução Molecular , Variação Estrutural do Genoma/genética
9.
Brief Bioinform ; 25(4)2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38980375

RESUMO

Structural variation (SV) is an important form of genomic variation that influences gene function and expression by altering the structure of the genome. Although long-read data have been proven to better characterize SVs, SVs detected from noisy long-read data still include a considerable portion of false-positive calls. To accurately detect SVs in long-read data, we present SVDF, a method that employs a learning-based noise filtering strategy and an SV signature-adaptive clustering algorithm, for effectively reducing the likelihood of false-positive events. Benchmarking results from multiple orthogonal experiments demonstrate that, across different sequencing platforms and depths, SVDF achieves higher calling accuracy for each sample compared to several existing general SV calling tools. We believe that, with its meticulous and sensitive SV detection capability, SVDF can bring new opportunities and advancements to cutting-edge genomic research.


Assuntos
Algoritmos , Humanos , Análise de Sequência de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Genômica/métodos , Variação Estrutural do Genoma , Software
10.
Genes (Basel) ; 15(7)2024 Jul 16.
Artigo em Inglês | MEDLINE | ID: mdl-39062704

RESUMO

The identification of structural variants (SVs) in genomic data represents an ongoing challenge because of difficulties in reliable SV calling leading to reduced sensitivity and specificity. We prepared high-quality DNA from 9 parent-child trios, who had previously undergone short-read whole-genome sequencing (Illumina platform) as part of the Genomics England 100,000 Genomes Project. We reanalysed the genomes using both Bionano optical genome mapping (OGM; 8 probands and one trio) and Nanopore long-read sequencing (Oxford Nanopore Technologies [ONT] platform; all samples). To establish a "truth" dataset, we asked whether rare proband SV calls (n = 234) made by the Bionano Access (version 1.6.1)/Solve software (version 3.6.1_11162020) could be verified by individual visualisation using the Integrative Genomics Viewer with either or both of the Illumina and ONT raw sequence. Of these, 222 calls were verified, indicating that Bionano OGM calls have high precision (positive predictive value 95%). We then asked what proportion of the 222 true Bionano SVs had been identified by SV callers in the other two datasets. In the Illumina dataset, sensitivity varied according to variant type, being high for deletions (115/134; 86%) but poor for insertions (13/58; 22%). In the ONT dataset, sensitivity was generally poor using the original Sniffles variant caller (48% overall) but improved substantially with use of Sniffles2 (36/40; 90% and 17/23; 74% for deletions and insertions, respectively). In summary, we show that the precision of OGM is very high. In addition, when applying the Sniffles2 caller, the sensitivity of SV calling using ONT long-read sequence data outperforms Illumina sequencing for most SV types.


Assuntos
Benchmarking , Sequenciamento por Nanoporos , Sequenciamento Completo do Genoma , Humanos , Sequenciamento Completo do Genoma/métodos , Sequenciamento Completo do Genoma/normas , Sequenciamento por Nanoporos/métodos , Benchmarking/métodos , Variação Estrutural do Genoma/genética , Mapeamento Cromossômico/métodos , Genoma Humano/genética , Genômica/métodos , Software , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento de Nucleotídeos em Larga Escala/normas , Feminino , Nanoporos , Masculino , Análise de Sequência de DNA/métodos , Análise de Sequência de DNA/normas
11.
Genome Biol ; 25(1): 188, 2024 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-39010145

RESUMO

BACKGROUND: Structural variation (SV) detection methods using third-generation sequencing data are widely employed, yet accurately detecting SVs remains challenging. Different methods often yield inconsistent results for certain SV types, complicating tool selection and revealing biases in detection. RESULTS: This study comprehensively evaluates 53 SV detection pipelines using simulated and real data from PacBio (CLR: Continuous Long Read, CCS: Circular Consensus Sequencing) and Nanopore (ONT) platforms. We assess their performance in detecting various sizes and types of SVs, breakpoint biases, and genotyping accuracy with various sequencing depths. Notably, pipelines such as Minimap2-cuteSV2, NGMLR-SVIM, PBMM2-pbsv, Winnowmap-Sniffles2, and Winnowmap-SVision exhibit comparatively higher recall and precision. Our findings also show that combining multiple pipelines with the same aligner, like pbmm2 or winnowmap, can significantly enhance performance. The individual pipelines' detailed ranking and performance metrics can be viewed in a dynamic table: http://pmglab.top/SVPipelinesRanking . CONCLUSIONS: This study comprehensively characterizes the strengths and weaknesses of numerous pipelines, providing valuable insights that can improve SV detection in third-generation sequencing data and inform SV annotation and function prediction.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Variação Estrutural do Genoma , Software , Análise de Sequência de DNA/métodos
12.
Gigascience ; 132024 01 02.
Artigo em Inglês | MEDLINE | ID: mdl-38869149

RESUMO

Structural variations (SVs) play a significant role in speciation and adaptation in many species, yet few studies have explored the prevalence and impact of different categories of SVs. We conducted a comparative analysis of long-read assembled reference genomes of closely related Eucalyptus species to identify candidate SVs potentially influencing speciation and adaptation. Interspecies SVs can be either fixed differences or polymorphic in one or both species. To describe SV patterns, we employed short-read whole-genome sequencing on over 600 individuals of Eucalyptus melliodora and Eucalyptus sideroxylon, along with recent high-quality genome assemblies. We aligned reads and genotyped interspecies SVs predicted between species reference genomes. Our results revealed that 49,756 of 58,025 and 39,536 of 47,064 interspecies SVs could be typed with short reads in E. melliodora and E. sideroxylon, respectively. Focusing on inversions and translocations, symmetric SVs that are readily genotyped within both populations, 24 were found to be structural divergences, 2,623 structural polymorphisms, and 928 shared structural polymorphisms. We assessed the functional significance of fixed interspecies SVs by examining differences in estimated recombination rates and genetic differentiation between species, revealing a complex history of natural selection. Shared structural polymorphisms displayed enrichment of potentially adaptive genes. Understanding how different classes of genetic mutations contribute to genetic diversity and reproductive barriers is essential for understanding how organisms enhance fitness, adapt to changing environments, and diversify. Our findings reveal the prevalence of interspecies SVs and elucidate their role in genetic differentiation, adaptive evolution, and species divergence within and between populations.


Assuntos
Eucalyptus , Genoma de Planta , Isolamento Reprodutivo , Eucalyptus/genética , Variação Estrutural do Genoma , Polimorfismo Genético , Evolução Molecular , Adaptação Fisiológica/genética , Especiação Genética , Sequenciamento Completo do Genoma/métodos , Genótipo
13.
Genome Biol ; 25(1): 148, 2024 06 06.
Artigo em Inglês | MEDLINE | ID: mdl-38845023

RESUMO

BACKGROUND: Sheep and goats have undergone domestication and improvement to produce similar phenotypes, which have been greatly impacted by structural variants (SVs). Here, we report a high-quality chromosome-level reference genome of Asiatic mouflon, and implement a comprehensive analysis of SVs in 897 genomes of worldwide wild and domestic populations of sheep and goats to reveal genetic signatures underlying convergent evolution. RESULTS: We characterize the SV landscapes in terms of genetic diversity, chromosomal distribution and their links with genes, QTLs and transposable elements, and examine their impacts on regulatory elements. We identify several novel SVs and annotate corresponding genes (e.g., BMPR1B, BMPR2, RALYL, COL21A1, and LRP1B) associated with important production traits such as fertility, meat and milk production, and wool/hair fineness. We detect signatures of selection involving the parallel evolution of orthologous SV-associated genes during domestication, local environmental adaptation, and improvement. In particular, we find that fecundity traits experienced convergent selection targeting the gene BMPR1B, with the DEL00067921 deletion explaining ~10.4% of the phenotypic variation observed in goats. CONCLUSIONS: Our results provide new insights into the convergent evolution of SVs and serve as a rich resource for the future improvement of sheep, goats, and related livestock.


Assuntos
Cabras , Animais , Cabras/genética , Ovinos/genética , Evolução Molecular , Variação Estrutural do Genoma , Locos de Características Quantitativas , Genoma , Variação Genética , Domesticação , Fenótipo , Seleção Genética , Receptores de Proteínas Morfogenéticas Ósseas Tipo I/genética
14.
Genome Biol ; 25(1): 155, 2024 06 13.
Artigo em Inglês | MEDLINE | ID: mdl-38872200

RESUMO

Advances in sequencing technology have facilitated population-scale long-read structural variant (SV) detection. Arguably, one of the main challenges in population-scale analysis is developing effective computational pipelines. Here, we present a new filter-based pipeline for population-scale long-read SV detection. It better captures SV signals at an early stage than conventional assembly-based or alignment-based pipelines. Assessments in this work suggest that the filter-based pipeline helps better resolve intra-read rearrangements. Moreover, it is also more computationally efficient than conventional pipelines and thus may facilitate population-scale long-read applications.


Assuntos
Software , Humanos , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA/métodos , Algoritmos , Variação Estrutural do Genoma
15.
G3 (Bethesda) ; 14(8)2024 Aug 07.
Artigo em Inglês | MEDLINE | ID: mdl-38934850

RESUMO

Advancements in genome sequencing and assembly techniques have increased the documentation of structural variants in wild organisms. Of these variants, chromosomal inversions are especially prominent due to their large size and active recombination suppression between alternative homokaryotypes. This suppression enables the 2 forms of the inversion to be maintained and allows the preservation of locally adapted alleles. The Barramundi Perch (BP; Lates calcarifer) is a widespread species complex with 3 main genetic lineages located in the biogeographic regions of Australia and New Guinea (AUS + NG), Southeast Asia (SEA), and the Indian Subcontinent (IND). BP are typically considered to be a protandrous sequential hermaphrodite species that exhibits catadromy. Freshwater occupancy and intraspecific variation in life history (e.g. partially migratory populations) exist and provide opportunities for strongly divergent selection associated with, for example, salinity tolerance, swimming ability, and marine dispersal. Herein, we utilize genomic data generated from all 3 genetic lineages to identify and describe 3 polymorphic candidate chromosomal inversions. These candidate chromosomal inversions appear to be fixed for ancestral variants in the IND lineage and for inverted versions in the AUS + NG lineage and exhibit variation in all 3 inversions in the SEA lineage. BP have a diverse portfolio of life history options that includes migratory strategy as well as sexual system (i.e. hermaphroditism and gonochorism). We propose that the some of the life history variabilities observed in BP may be linked to inversions and, in doing so, we present genetic data that might be useful in enhancing aquaculture production and population management.


Assuntos
Inversão Cromossômica , Especiação Genética , Percas , Animais , Percas/genética , Variação Estrutural do Genoma , Adaptação Fisiológica/genética , Genoma , Filogenia , Genômica/métodos
16.
Cell Genom ; 4(7): 100590, 2024 Jul 10.
Artigo em Inglês | MEDLINE | ID: mdl-38908378

RESUMO

The duplication-triplication/inverted-duplication (DUP-TRP/INV-DUP) structure is a complex genomic rearrangement (CGR). Although it has been identified as an important pathogenic DNA mutation signature in genomic disorders and cancer genomes, its architecture remains unresolved. Here, we studied the genomic architecture of DUP-TRP/INV-DUP by investigating the DNA of 24 patients identified by array comparative genomic hybridization (aCGH) on whom we found evidence for the existence of 4 out of 4 predicted structural variant (SV) haplotypes. Using a combination of short-read genome sequencing (GS), long-read GS, optical genome mapping, and single-cell DNA template strand sequencing (strand-seq), the haplotype structure was resolved in 18 samples. The point of template switching in 4 samples was shown to be a segment of ∼2.2-5.5 kb of 100% nucleotide similarity within inverted repeat pairs. These data provide experimental evidence that inverted low-copy repeats act as recombinant substrates. This type of CGR can result in multiple conformers generating diverse SV haplotypes in susceptible dosage-sensitive loci.


Assuntos
Haplótipos , Humanos , Haplótipos/genética , Hibridização Genômica Comparativa , Variação Estrutural do Genoma/genética , Genoma Humano/genética , Duplicação Gênica/genética
17.
Nat Commun ; 15(1): 5377, 2024 Jun 25.
Artigo em Inglês | MEDLINE | ID: mdl-38918389

RESUMO

Polyploidy, the result of whole-genome duplication (WGD), is a major driver of eukaryote evolution. Yet WGDs are hugely disruptive mutations, and we still lack a clear understanding of their fitness consequences. Here, we study whether WGDs result in greater diversity of genomic structural variants (SVs) and how they influence evolutionary dynamics in a plant genus, Cochlearia (Brassicaceae). By using long-read sequencing and a graph-based pangenome, we find both negative and positive interactions between WGDs and SVs. Masking of recessive mutations due to WGDs leads to a progressive accumulation of deleterious SVs across four ploidal levels (from diploids to octoploids), likely reducing the adaptive potential of polyploid populations. However, we also discover putative benefits arising from SV accumulation, as more ploidy-specific SVs harbor signals of local adaptation in polyploids than in diploids. Together, our results suggest that SVs play diverse and contrasting roles in the evolutionary trajectories of young polyploids.


Assuntos
Evolução Molecular , Duplicação Gênica , Genoma de Planta , Poliploidia , Genoma de Planta/genética , Variação Estrutural do Genoma/genética , Mutação
18.
Methods Mol Biol ; 2825: 39-65, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38913302

RESUMO

Based on classical karyotyping, structural genome variations (SVs) have generally been considered to be either "simple" (with one or two breakpoints) or "complex" (with more than two breakpoints). Studying the breakpoints of SVs at nucleotide resolution revealed additional, subtle structural variations, such that even "simple" SVs turned out to be "complex." Genome-wide sequencing methods, such as fosmid and paired-end mapping, short-read and long-read whole genome sequencing, and single-molecule optical mapping, also indicated that the number of SVs per individual was considerably larger than expected from karyotyping and high-resolution chromosomal array-based studies. Interestingly, SVs were detected in studies of cohorts of individuals without clinical phenotypes. The common denominator of all SVs appears to be a failure to accurately repair DNA double-strand breaks (DSBs) or to halt cell cycle progression if DSBs persist. This review discusses the various DSB response mechanisms during the mitotic cell cycle and during meiosis and their regulation. Emphasis is given to the molecular mechanisms involved in the formation of translocations, deletions, duplications, and inversions during or shortly after meiosis I. Recently, CRISPR-Cas9 studies have provided unexpected insights into the formation of translocations and chromothripsis by both breakage-fusion-bridge and micronucleus-dependent mechanisms.


Assuntos
Quebras de DNA de Cadeia Dupla , Variação Estrutural do Genoma , Humanos , Meiose/genética , Cariotipagem/métodos , Sistemas CRISPR-Cas , Animais
19.
Mol Cancer ; 23(1): 126, 2024 Jun 11.
Artigo em Inglês | MEDLINE | ID: mdl-38862995

RESUMO

BACKGROUND: In an extensive genomic analysis of lung adenocarcinomas (LUADs), driver mutations have been recognized as potential targets for molecular therapy. However, there remain cases where target genes are not identified. Super-enhancers and structural variants are frequently identified in several hundred loci per case. Despite this, most cancer research has approached the analysis of these data sets separately, without merging and comparing the data, and there are no examples of integrated analysis in LUAD. METHODS: We performed an integrated analysis of super-enhancers and structural variants in a cohort of 174 LUAD cases that lacked clinically actionable genetic alterations. To achieve this, we conducted both WGS and H3K27Ac ChIP-seq analyses using samples with driver gene mutations and those without, allowing for a comprehensive investigation of the potential roles of super-enhancer in LUAD cases. RESULTS: We demonstrate that most genes situated in these overlapped regions were associated with known and previously unknown driver genes and aberrant expression resulting from the formation of super-enhancers accompanied by genomic structural abnormalities. Hi-C and long-read sequencing data further corroborated this insight. When we employed CRISPR-Cas9 to induce structural abnormalities that mimicked cases with outlier ERBB2 gene expression, we observed an elevation in ERBB2 expression. These abnormalities are associated with a higher risk of recurrence after surgery, irrespective of the presence or absence of driver mutations. CONCLUSIONS: Our findings suggest that aberrant gene expression linked to structural polymorphisms can significantly impact personalized cancer treatment by facilitating the identification of driver mutations and prognostic factors, contributing to a more comprehensive understanding of LUAD pathogenesis.


Assuntos
Adenocarcinoma de Pulmão , Elementos Facilitadores Genéticos , Regulação Neoplásica da Expressão Gênica , Neoplasias Pulmonares , Receptor ErbB-2 , Humanos , Receptor ErbB-2/genética , Receptor ErbB-2/metabolismo , Adenocarcinoma de Pulmão/genética , Adenocarcinoma de Pulmão/patologia , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/patologia , Neoplasias Pulmonares/metabolismo , Mutação , Biomarcadores Tumorais/genética , Feminino , Masculino , Variação Estrutural do Genoma , Genômica/métodos , Pessoa de Meia-Idade , Prognóstico , Idoso
20.
Proc Natl Acad Sci U S A ; 121(27): e2322291121, 2024 Jul 02.
Artigo em Inglês | MEDLINE | ID: mdl-38913905

RESUMO

Tibetan sheep were introduced to the Qinghai Tibet plateau roughly 3,000 B.P., making this species a good model for investigating genetic mechanisms of high-altitude adaptation over a relatively short timescale. Here, we characterize genomic structural variants (SVs) that distinguish Tibetan sheep from closely related, low-altitude Hu sheep, and we examine associated changes in tissue-specific gene expression. We document differentiation between the two sheep breeds in frequencies of SVs associated with genes involved in cardiac function and circulation. In Tibetan sheep, we identified high-frequency SVs in a total of 462 genes, including EPAS1, PAPSS2, and PTPRD. Single-cell RNA-Seq data and luciferase reporter assays revealed that the SVs had cis-acting effects on the expression levels of these three genes in specific tissues and cell types. In Tibetan sheep, we identified a high-frequency chromosomal inversion that exhibited modified chromatin architectures relative to the noninverted allele that predominates in Hu sheep. The inversion harbors several genes with altered expression patterns related to heart protection, brown adipocyte proliferation, angiogenesis, and DNA repair. These findings indicate that SVs represent an important source of genetic variation in gene expression and may have contributed to high-altitude adaptation in Tibetan sheep.


Assuntos
Altitude , Animais , Ovinos/genética , Tibet , Variação Estrutural do Genoma , Fatores de Transcrição Hélice-Alça-Hélice Básicos/genética , Regulação da Expressão Gênica , Genoma , Aclimatação/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...