Results 1 - 20 of 44
1.
Nat Commun ; 15(1): 6297, 2024 Aug 01.
Article in English | MEDLINE | ID: mdl-39090106

ABSTRACT

Krill are vital as food for many marine animals but also impacted by global warming. To learn how they and other zooplankton may adapt to a warmer world we studied local adaptation in the widespread Northern krill (Meganyctiphanes norvegica). We assemble and characterize its large genome and compare genome-scale variation among 74 specimens from the colder Atlantic Ocean and warmer Mediterranean Sea. The 19 Gb genome likely evolved through proliferation of retrotransposons, now targeted for inactivation by extensive DNA methylation, and contains many duplicated genes associated with molting and vision. Analysis of 760 million SNPs indicates extensive homogenizing gene-flow among populations. Nevertheless, we detect signatures of adaptive divergence across hundreds of genes, implicated in photoreception, circadian regulation, reproduction and thermal tolerance, indicating polygenic adaptation to light and temperature. The top gene candidate for ecological adaptation was nrf-6, a lipid transporter with a Mediterranean variant that may contribute to early spring reproduction. Such variation could become increasingly important for fitness in Atlantic stocks. Our study underscores the widespread but uneven distribution of adaptive variation, necessitating characterization of genetic variation among natural zooplankton populations to understand their adaptive potential, predict risks and support ocean conservation in the face of climate change.


Subject(s)
Adaptation, Physiological , Euphausiacea , Genomics , Animals , Euphausiacea/genetics , Atlantic Ocean , Adaptation, Physiological/genetics , Mediterranean Sea , Polymorphism, Single Nucleotide , Genome , Zooplankton/genetics , Gene Flow , Genetic Variation
2.
Sci Adv ; 10(33): eadn0597, 2024 Aug 16.
Article in English | MEDLINE | ID: mdl-39141739

ABSTRACT

Spiders produce nature's toughest fiber using renewable components at ambient temperatures and with water as solvent, making it highly interesting to replicate for the materials industry. Despite this, much remains to be understood about the bioprocessing and composition of spider silk fibers. Here, we identify 18 proteins that make up the spiders' strongest silk type, the major ampullate fiber. Single-cell RNA sequencing and spatial transcriptomics revealed that the secretory epithelium of the gland harbors six cell types. These cell types are confined to three distinct glandular zones that produce specific combinations of silk proteins. Image analysis of histological sections showed that the secretions from the three zones do not mix, and proteomics analysis revealed that these secretions form layers in the final fiber. Using a multi-omics approach, we provide substantial advancements in the understanding of the structure and function of the major ampullate silk gland as well as of the architecture and composition of the fiber it produces.


Subject(s)
Genomics , Proteomics , Silk , Single-Cell Analysis , Spiders , Transcriptome , Spiders/metabolism , Spiders/genetics , Animals , Silk/metabolism , Silk/chemistry , Silk/genetics , Proteomics/methods , Genomics/methods , Single-Cell Analysis/methods , Gene Expression Profiling/methods
3.
Genome Biol Evol ; 16(7)2024 Jul 03.
Article in English | MEDLINE | ID: mdl-38918882

ABSTRACT

The European sprat is a small plankton-feeding clupeid present in the northeastern Atlantic Ocean, in the Mediterranean Sea, and in the brackish Baltic Sea and Black Sea. This species is the target of a major fishery and, therefore, an accurate characterization of its genetic population structure is crucial to delineate proper stock assessments that aid ensuring the fishery's sustainability. Here, we present (i) a draft genome assembly, (ii) pooled whole genome sequencing of 19 population samples covering most of the species' distribution range, and (iii) the design and test of a single nucleotide polymorphism (SNP)-chip resource and use this to validate the population structure inferred from pooled sequencing. These approaches revealed, using the populations sampled here, three major groups of European sprat: Oceanic, Coastal, and Brackish with limited differentiation within groups even over wide geographical stretches. Genetic structure is largely driven by six large putative inversions that differentiate Oceanic and Brackish sprats, while Coastal populations display intermediate frequencies of haplotypes at each locus. Interestingly, populations from the Baltic and the Black Seas share similar frequencies of haplotypes at these putative inversions despite their distant geographic location. The closely related clupeids European sprat and Atlantic herring both show genetic adaptation to the brackish Baltic Sea, providing an opportunity to explore the extent of genetic parallelism. This analysis revealed limited parallelism because out of 125 independent loci detected in the Atlantic herring, three showed sharp signals of selection that overlapped between the two species and contained single genes such as PRLRA, which encodes the receptor for prolactin, a freshwater-adapting hormone in euryhaline species, and THRB, a receptor for thyroid hormones, important both for metabolic regulation and the development of red cone photoreceptors.


Subject(s)
Fishes , Polymorphism, Single Nucleotide , Animals , Fishes/genetics , Atlantic Ocean , Adaptation, Physiological/genetics , Saline Waters
4.
Life Sci Alliance ; 7(8)2024 Aug.
Article in English | MEDLINE | ID: mdl-38777370

ABSTRACT

The B-cell acute lymphoblastic leukemia (ALL) cell line REH, with the t(12;21) ETV6::RUNX1 translocation, is known to have a complex karyotype defined by a series of large-scale chromosomal rearrangements. Taken from a 15-yr-old at relapse, the cell line offers a practical model for the study of pediatric B-ALL. In recent years, short- and long-read DNA and RNA sequencing have emerged as a complement to karyotyping techniques in the resolution of structural variants in an oncological context. Here, we explore the integration of long-read PacBio and Oxford Nanopore whole-genome sequencing, IsoSeq RNA sequencing, and short-read Illumina sequencing to create a detailed genomic and transcriptomic characterization of the REH cell line. Whole-genome sequencing clarified the molecular traits of disrupted ALL-associated genes including CDKN2A, PAX5, BTG1, VPREB1, and TBL1XR1, as well as the glucocorticoid receptor NR3C1. Meanwhile, transcriptome sequencing identified seven fusion genes within the genomic breakpoints. Together, our extensive whole-genome investigation makes high-quality open-source data available to the leukemia genomics community.


Subject(s)
Whole Genome Sequencing , Humans , Cell Line, Tumor , Whole Genome Sequencing/methods , High-Throughput Nucleotide Sequencing/methods , Translocation, Genetic/genetics , Oncogene Proteins, Fusion/genetics , Genomics/methods , Precursor Cell Lymphoblastic Leukemia-Lymphoma/genetics , Transcriptome/genetics , Gene Expression Profiling/methods , Core Binding Factor Alpha 2 Subunit/genetics , Karyotyping/methods , Sequence Analysis, RNA/methods
5.
Mol Biol Evol ; 41(5)2024 May 03.
Article in English | MEDLINE | ID: mdl-38709782

ABSTRACT

Distyly is an iconic floral polymorphism governed by a supergene, which promotes efficient pollen transfer and outcrossing through reciprocal differences in the position of sexual organs in flowers, often coupled with heteromorphic self-incompatibility. Distyly has evolved convergently in multiple flowering plant lineages, but has also broken down repeatedly, often resulting in homostylous, self-compatible populations with elevated rates of self-fertilization. Here, we aimed to study the genetic causes and genomic consequences of the shift to homostyly in Linum trigynum, which is closely related to distylous Linum tenue. Building on a high-quality genome assembly, we show that L. trigynum harbors a genomic region homologous to the dominant haplotype of the distyly supergene conferring long stamens and short styles in L. tenue, suggesting that loss of distyly first occurred in a short-styled individual. In contrast to homostylous Primula and Fagopyrum, L. trigynum harbors no fixed loss-of-function mutations in coding sequences of S-linked distyly candidate genes. Instead, floral gene expression analyses and controlled crosses suggest that mutations downregulating the S-linked LtWDR-44 candidate gene for male self-incompatibility and/or anther height could underlie homostyly and self-compatibility in L. trigynum. Population genomic analyses of 224 whole-genome sequences further demonstrate that L. trigynum is highly self-fertilizing, exhibits significantly lower genetic diversity genome-wide, and is experiencing relaxed purifying selection and less frequent positive selection on nonsynonymous mutations relative to L. tenue. Our analyses shed light on the loss of distyly in L. trigynum, and advance our understanding of a common evolutionary transition in flowering plants.


Subject(s)
Flowers , Genome, Plant , Flowers/genetics
6.
Sci Adv ; 10(14): eadk9315, 2024 Apr 05.
Article in English | MEDLINE | ID: mdl-38569035

ABSTRACT

The joint expression of particular colors, morphologies, and behaviors is a common feature of adaptation, but the genetic basis for such "phenotypic syndromes" remains poorly understood. Here, we identified a complex genetic architecture associated with a sexually selected syndrome in common wall lizards, by capitalizing on the adaptive introgression of coloration and morphology into a distantly related lineage. Consistent with the hypothesis that the evolution of phenotypic syndromes in vertebrates is facilitated by developmental linkage through neural crest cells, most of the genes associated with the syndrome are involved in neural crest cell regulation. A major locus was a ~400-kb region, characterized by standing structural genetic variation and previously implied in the evolutionary innovation of coloration and beak size in birds. We conclude that features of the developmental and genetic architecture contribute to maintaining trait integration, facilitating the extensive and rapid introgressive spread of suites of sexually selected characters.


Subject(s)
Lizards , Selection, Genetic , Animals , Phenotype , Lizards/genetics
7.
Sci Rep ; 14(1): 9000, 2024 Apr 18.
Article in English | MEDLINE | ID: mdl-38637641

ABSTRACT

Long-read genome sequencing (lrGS) is a promising method in genetic diagnostics. Here we investigate the potential of lrGS to detect a disease-associated chromosomal translocation between 17p13 and the chromosome 19 centromere. We constructed two sets of phased and non-phased de novo assemblies: (i) based on lrGS only and (ii) hybrid assemblies combining lrGS with optical mapping, using lrGS reads with a median coverage of 34X. Variant calling detected both structural variants (SVs) and small variants, and the accuracy of the small variant calling was compared with that of short-read genome sequencing (srGS). The de novo and hybrid assemblies had high quality and contiguity with N50 of 62.85 Mb, enabling a near telomere-to-telomere assembly with fewer than 100 contigs per haplotype. Notably, we successfully identified the centromeric breakpoint of the translocation. A concordance of 92% was observed when comparing small variant calling between srGS and lrGS. In summary, our findings underscore the remarkable potential of lrGS as a comprehensive and accurate solution for the analysis of SVs and small variants. Thus, lrGS could replace a large battery of genetic tests that were used for the diagnosis of a single symptomatic translocation carrier, highlighting the potential of lrGS in the realm of digital karyotyping.


Subject(s)
High-Throughput Nucleotide Sequencing , Translocation, Genetic , Humans , Sequence Analysis, DNA/methods , High-Throughput Nucleotide Sequencing/methods , Base Sequence , Centromere/genetics
8.
Genome Biol Evol ; 16(2)2024 Feb 01.
Article in English | MEDLINE | ID: mdl-38368625

ABSTRACT

The clouded apollo (Parnassius mnemosyne) is a palearctic butterfly distributed over a large part of western Eurasia, but population declines and fragmentation have been observed in many parts of the range. The development of genomic tools can help to shed light on the genetic consequences of the decline and to make informed decisions about direct conservation actions. Here, we present a high-contiguity, chromosome-level genome assembly of a female clouded apollo butterfly and provide detailed annotations of genes and transposable elements. We find that the large genome (1.5 Gb) of the clouded apollo is extraordinarily repeat rich (73%). Despite that, the combination of sequencing techniques allowed us to assemble all chromosomes (nc = 29) to a high degree of completeness. The annotation resulted in a relatively high number of protein-coding genes (22,854) compared with other Lepidoptera, of which a large proportion (21,635) could be assigned functions based on homology with other species. A comparative analysis indicates that overall genome structure has been largely conserved, both within the genus and compared with the ancestral lepidopteran karyotype. The high-quality genome assembly and detailed annotation presented here will constitute an important tool for forthcoming efforts aimed at understanding the genetic consequences of fragmentation and decline, as well as for assessments of genetic diversity, population structure, inbreeding, and genetic load in the clouded apollo butterfly.


Subject(s)
Butterflies , Animals , Female , Butterflies/genetics , Conservation of Natural Resources , Genomics , DNA Transposable Elements , Chromosomes , Molecular Sequence Annotation
9.
BMC Res Notes ; 16(1): 265, 2023 Oct 10.
Article in English | MEDLINE | ID: mdl-37817248

ABSTRACT

OBJECTIVES: The aim of this data paper is to describe a collection of 33 genomic, transcriptomic and epigenomic sequencing datasets of the B-cell acute lymphoblastic leukemia (ALL) cell line REH. REH is one of the most frequently used cell lines for functional studies of pediatric ALL, and these data provide a multi-faceted characterization of its molecular features. The datasets described herein, generated with short- and long-read sequencing technologies, can both provide insights into the complex aberrant karyotype of REH, and be used as reference datasets for sequencing data quality assessment or for methods development. DATA DESCRIPTION: This paper describes 33 datasets corresponding to 867 gigabases of raw sequencing data generated from the REH cell line. These datasets include five different approaches for whole genome sequencing (WGS) on four sequencing platforms, two RNA sequencing (RNA-seq) techniques on two different sequencing platforms, DNA methylation sequencing, and single-cell ATAC-sequencing.


Subject(s)
Leukemia, B-Cell , Leukemia, Lymphocytic, Chronic, B-Cell , Child , Humans , Cell Line , Epigenomics/methods , Genomics , Leukemia, B-Cell/genetics , Leukemia, Lymphocytic, Chronic, B-Cell/genetics , Transcriptome , Cell Line, Tumor
10.
Nat Commun ; 14(1): 5164, 2023 Aug 24.
Article in English | MEDLINE | ID: mdl-37620373

ABSTRACT

Long-read sequencing has dramatically increased our understanding of human genome variation. Here, we demonstrate that long-read technology can give new insights into the genomic architecture of individual cells. Clonally expanded CD8+ T-cells from a human donor were subjected to droplet-based multiple displacement amplification (dMDA) to generate long molecules with reduced bias. PacBio sequencing generated up to 40% genome coverage per single-cell, enabling detection of single nucleotide variants (SNVs), structural variants (SVs), and tandem repeats, also in regions inaccessible by short reads. 28 somatic SNVs were detected, including one case of mitochondrial heteroplasmy. 5473 high-confidence SVs/cell were discovered, a sixteen-fold increase compared to Illumina-based results from clonally related cells. Single-cell de novo assembly generated a genome size of up to 598 Mb and 1762 (12.8%) complete gene models. In summary, our work shows the promise of long-read sequencing toward characterization of the full spectrum of genetic variation in single cells.


Subject(s)
Genome, Human , Genomics , Humans , Genome Size , Genome, Human/genetics , CD8-Positive T-Lymphocytes , Cell Cycle
11.
Neuro Oncol ; 25(1): 97-107, 2023 Jan 05.
Article in English | MEDLINE | ID: mdl-35738865

ABSTRACT

BACKGROUND: Malignant gliomas, the most common malignant brain tumors in adults, represent a heterogeneous group of diseases with poor prognosis. Retroviruses can cause permanent genetic alterations that modify genes close to the viral integration site. METHODS: Here we describe the use of a high-throughput pipeline coupled to the commonly used tissue-specific retroviral RCAS-TVA mouse tumor model system. Utilizing next-generation sequencing, we show that retroviral integration sites can be reproducibly detected in malignant stem cell lines generated from RCAS-PDGFB-driven glioma biopsies. RESULTS: A large fraction of common integration sites contained genes that have been dysregulated or misexpressed in glioma. Others overlapped with loci identified in previous glioma-related forward genetic screens, but several novel putative cancer-causing genes were also found. Integrating retroviral tagging and clinical data, Ppfibp1 was highlighted as a frequently tagged novel glioma-causing gene. Retroviral integrations into the locus resulted in Ppfibp1 upregulation, and Ppfibp1-tagged cells generated tumors with shorter latency on orthotopic transplantation. In human gliomas, increased PPFIBP1 expression was significantly linked to poor prognosis and PDGF treatment resistance. CONCLUSIONS: Altogether, the current study has demonstrated a novel approach to tagging glioma genes via forward genetics, validating previous results, and identifying PPFIBP1 as a putative oncogene in gliomagenesis.


Subject(s)
Brain Neoplasms , Glioma , Animals , Humans , Mice , Brain Neoplasms/pathology , Genetic Association Studies , Glioma/pathology , Oncogenes , Proto-Oncogene Proteins c-sis/genetics
12.
Curr Biol ; 32(20): 4360-4371.e6, 2022 Oct 24.
Article in English | MEDLINE | ID: mdl-36087578

ABSTRACT

Supergenes govern multi-trait-balanced polymorphisms in a wide range of systems; however, our understanding of their origins and evolution remains incomplete. The reciprocal placement of stigmas and anthers in pin and thrum floral morphs of distylous species constitutes an iconic example of a balanced polymorphism governed by a supergene, the distyly S-locus. Recent studies have shown that the Primula and Turnera distyly supergenes are both hemizygous in thrums, but it remains unknown whether hemizygosity is pervasive among distyly S-loci. As hemizygosity has major consequences for supergene evolution and loss, clarifying whether this genetic architecture is shared among distylous species is critical. Here, we have characterized the genetic architecture and evolution of the distyly supergene in Linum by generating a chromosome-level genome assembly of Linum tenue, followed by the identification of the S-locus using population genomic data. We show that hemizygosity and thrum-specific expression of S-linked genes, including a pistil-expressed candidate gene for style length, are major features of the Linum S-locus. Structural variation is likely instrumental for recombination suppression, and although the non-recombining dominant haplotype has accumulated transposable elements, S-linked genes are not under relaxed purifying selection. Our findings reveal remarkable convergence in the genetic architecture and evolution of independently derived distyly supergenes, provide a counterexample to classic inversion-based supergenes, and shed new light on the origin and maintenance of an iconic floral polymorphism.


Subject(s)
Flax , Flax/genetics , DNA Transposable Elements , Flowers/genetics , Genomics , Genetic Loci , Evolution, Molecular
13.
Insects ; 13(5)2022 Apr 22.
Article in English | MEDLINE | ID: mdl-35621738

ABSTRACT

The complete mitochondrial genome of the Carniolan honeybee (Apis mellifera carnica) from Slovenia, a homeland of this subspecies, was acquired in two contigs from WGS data and annotated. The newly obtained mitochondrial genome is a circular closed loop of 16,447 bp. It comprises 37 genes (13 protein-coding genes, 22 tRNA genes, and 2 rRNA genes) and an AT-rich control region. The order of the tRNA genes resembles the order characteristic of A. mellifera. The mitogenomic sequence of A. m. carnica from Slovenia contains 44 uniquely coded sites in comparison to the closely related subspecies A. m. ligustica and to A. m. carnica from Austria. Furthermore, 24 differences were recognised between the A. m. carnica and A. m. ligustica subspecies. Among them, there are three SNPs that affect translation in the nd2, nd4, and cox2 genes, respectively. The phylogenetic placement of A. m. carnica from Slovenia within the C lineage deviates from the expected position and changes the perspective on the relationship between the C and O lineages. The results of this study represent a valuable addition to the information available in the phylogenomic studies of A. mellifera, a pollinator species of worldwide importance. Such genomic information is essential for this local subspecies' conservation and preservation as well as its breeding and selection.

14.
Mol Ecol Resour ; 22(6): 2379-2395, 2022 Aug.
Article in English | MEDLINE | ID: mdl-35348299

ABSTRACT

The major histocompatibility complex (MHC) is of central importance to the immune system, and an optimal MHC diversity is believed to maximize pathogen elimination. Birds show substantial variation in MHC diversity, ranging from few genes in most bird orders to very many genes in passerines. Our understanding of the evolutionary trajectories of the MHC in passerines is hampered by lack of data on genomic organization. Therefore, we assembled and annotated the MHC genomic region of the great reed warbler (Acrocephalus arundinaceus), using long-read sequencing and optical mapping. The MHC region is large (>5.5 Mb), characterized by structural changes compared to hitherto investigated bird orders and shows higher repeat content than the genome average. These features were supported by analyses in three additional passerines. MHC genes in passerines are found in two different chromosomal arrangements, either as single copy MHC genes located among non-MHC genes, or as tandemly duplicated tightly linked MHC genes. Some single copy MHC genes are old and putative orthologues among species. In contrast tandemly duplicated MHC genes are monophyletic within species and have evolved by simultaneous gene duplication of several MHC genes. Structural differences in the MHC genomic region among bird orders seem substantial compared to mammals and have possibly been fuelled by clade-specific immune system adaptations. Our study provides methodological guidance in characterizing complex genomic regions, constitutes a resource for MHC research in birds, and calls for a revision of the general belief that avian MHC has a conserved gene order and small size compared to mammals.


Subject(s)
Major Histocompatibility Complex , Passeriformes , Animals , Biological Evolution , Genome , Genomics , Major Histocompatibility Complex/genetics , Mammals/genetics , Passeriformes/genetics , Phylogeny
15.
Mol Ecol ; 31(4): 1111-1127, 2022 Feb.
Article in English | MEDLINE | ID: mdl-34837435

ABSTRACT

Over the last six decades, populations of the bumblebees Bombus sylvicola and Bombus balteatus in Colorado have experienced decreases in tongue length, a trait important for plant-pollinator mutualisms. It has been hypothesized that this observation reflects selection resulting from shifts in floral composition under climate change. Here we used morphometrics and population genomics to determine whether morphological change is ongoing, investigate the genetic basis of morphological variation, and analyse population structure in these populations. We generated a genome assembly of B. balteatus. We then analysed whole-genome sequencing data and morphometric measurements of 580 samples of both species from seven high-altitude localities. Out of 281 samples originally identified as B. sylvicola, 67 formed a separate genetic cluster comprising a newly discovered cryptic species ("incognitus"). However, an absence of genetic structure within species suggests that gene flow is common between mountains. We found a significant decrease in tongue length between bees collected in 2012-2014 and those collected in 2017, indicating that morphological shifts are ongoing. We did not discover any genetic associations with tongue length, but a SNP related to production of a proteolytic digestive enzyme was implicated in body size variation. We identified evidence of covariance between kinship and both tongue length and body size, which is suggestive of a genetic component of these traits, although it is possible that shared environmental effects between colonies are responsible. Our results provide evidence for ongoing modification of a morphological trait important for pollination and indicate that this trait probably has a complex genetic and environmental basis.


Subject(s)
Gene Flow , Pollination , Animals , Bees/genetics , Gene Flow/genetics , Genomics , Phenotype , Tongue
16.
Mol Biol Evol ; 38(12): 5275-5291, 2021 Dec 09.
Article in English | MEDLINE | ID: mdl-34542640

ABSTRACT

How the avian sex chromosomes first evolved from autosomes remains elusive as 100 million years (My) of divergence and degeneration obscure their evolutionary history. The Sylvioidea group of songbirds is interesting for understanding avian sex chromosome evolution because a chromosome fusion event ∼24 Ma formed "neo-sex chromosomes" consisting of an added (new) and an ancestral (old) part. Here, we report the complete female genome (ZW) of one Sylvioidea species, the great reed warbler (Acrocephalus arundinaceus). Our long-read assembly shows that the added region has been translocated to both Z and W, and whereas the added-Z has retained its gene order, the added-W part has been heavily rearranged. Phylogenetic analyses show that recombination between the homologous added-Z and -W regions continued after the fusion event, and that recombination suppression across this region took several million years to be completed. Moreover, recombination suppression was initiated across multiple positions over the added-Z, which is not consistent with a simple linear progression starting from the fusion point. As expected following recombination suppression, the added-W shows signs of degeneration including repeat accumulation and gene loss. Finally, we present evidence for nonrandom maintenance of slowly evolving and dosage-sensitive genes on both ancestral- and added-W, a process causing correlated evolution among orthologous genes across broad taxonomic groups, regardless of sex linkage.


Subject(s)
Passeriformes , Songbirds , Animals , Evolution, Molecular , Female , Passeriformes/genetics , Phylogeny , Recombination, Genetic , Sex Chromosomes/genetics , Songbirds/genetics
18.
Mol Biol Evol ; 38(8): 3126-3143, 2021 Jul 29.
Article in English | MEDLINE | ID: mdl-33823537

ABSTRACT

Evidence is accumulating that gene flow commonly occurs between recently diverged species, despite the existence of barriers to gene flow in their genomes. However, we still know little about what regions of the genome become barriers to gene flow and how such barriers form. Here, we compare genetic differentiation across the genomes of bumblebee species living in sympatry and allopatry to reveal the potential impact of gene flow during species divergence and uncover genetic barrier loci. We first compared the genomes of the alpine bumblebee Bombus sylvicola and a previously unidentified sister species living in sympatry in the Rocky Mountains, revealing prominent islands of elevated genetic divergence in the genome that colocalize with centromeres and regions of low recombination. This same pattern is observed between the genomes of another pair of closely related species living in allopatry (B. bifarius and B. vancouverensis). Strikingly, however, the genomic islands exhibit significantly elevated absolute divergence (dXY) in the sympatric, but not the allopatric, comparison, indicating that they contain loci that have acted as barriers to historical gene flow in sympatry. Our results suggest that intrinsic barriers to gene flow between species may often accumulate in regions of low recombination and near centromeres through processes such as genetic hitchhiking, and that divergence in these regions is accentuated in the presence of gene flow.


Subject(s)
Bees/genetics , Gene Flow , Genome, Insect , Reproductive Isolation , Sympatry , Animals , Evolution, Molecular , Recombination, Genetic
19.
Mol Ecol Resour ; 21(1): 263-286, 2021 Jan.
Article in English | MEDLINE | ID: mdl-32937018

ABSTRACT

Genome assemblies are currently being produced at an impressive rate by consortia and individual laboratories. The low costs and increasing efficiency of sequencing technologies now enable assembling genomes at unprecedented quality and contiguity. However, the difficulty in assembling repeat-rich and GC-rich regions (genomic "dark matter") limits insights into the evolution of genome structure and regulatory networks. Here, we compare the efficiency of currently available sequencing technologies (short/linked/long reads and proximity ligation maps) and combinations thereof in assembling genomic dark matter. By adopting different de novo assembly strategies, we compare individual draft assemblies to a curated multiplatform reference assembly and identify the genomic features that cause gaps within each assembly. We show that a multiplatform assembly implementing long-read, linked-read and proximity sequencing technologies performs best at recovering transposable elements, multicopy MHC genes, GC-rich microchromosomes and the repeat-rich W chromosome. Telomere-to-telomere assemblies are not a reality yet for most organisms, but by leveraging technology choice it is now possible to minimize genome assembly gaps for downstream analysis. We provide a roadmap to tailor sequencing projects for optimized completeness of both the coding and noncoding parts of nonmodel genomes.


Subject(s)
Genome, Plant , Genomics , High-Throughput Nucleotide Sequencing , Sequence Analysis, DNA , Strelitziaceae/genetics , DNA Transposable Elements , Genomics/methods
20.
Genome Biol ; 21(1): 290, 2020 Dec 01.
Article in English | MEDLINE | ID: mdl-33261648

ABSTRACT

BACKGROUND: One ongoing concern about CRISPR-Cas9 genome editing is that unspecific guide RNA (gRNA) binding may induce off-target mutations. However, accurate prediction of CRISPR-Cas9 off-target activity is challenging. Here, we present SMRT-OTS and Nano-OTS, two novel, amplification-free, long-read sequencing protocols for detection of gRNA-driven digestion of genomic DNA by Cas9 in vitro. RESULTS: The methods are assessed using the human cell line HEK293, re-sequenced at 18x coverage using highly accurate HiFi SMRT reads. SMRT-OTS and Nano-OTS are first applied to three different gRNAs targeting HEK293 genomic DNA, resulting in a set of 55 high-confidence gRNA cleavage sites identified by both methods. Twenty-five of these sites are not reported by off-target prediction software, either because they contain four or more single nucleotide mismatches or insertion/deletion mismatches, as compared with the human reference. Additional experiments reveal that 85% of Cas9 cleavage sites are also found by other in vitro-based methods and that on- and off-target sites are detectable in gene bodies where short-reads fail to uniquely align. Even though SMRT-OTS and Nano-OTS identify several sites with previously validated off-target editing activity in cells, our own CRISPR-Cas9 editing experiments in human fibroblasts do not give rise to detectable off-target mutations at the in vitro-predicted sites. However, indel and structural variation events are enriched at the on-target sites. CONCLUSIONS: Amplification-free long-read sequencing reveals Cas9 cleavage sites in vitro that would have been difficult to predict using computational tools, including in dark genomic regions inaccessible by short-read sequencing.


Subject(s)
Base Sequence , CRISPR-Cas Systems , Computational Biology/methods , Gene Editing/methods , DNA , Genetic Variation , Genomics , HEK293 Cells , Humans , Mutation , Nanopore Sequencing , RNA, Guide, Kinetoplastida , Sequence Analysis, DNA , Software