Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
Genome Res ; 27(5): 778-786, 2017 05.
Artigo em Inglês | MEDLINE | ID: mdl-28159771

RESUMO

Long-read sequencing can overcome the weaknesses of short reads in the assembly of eukaryotic genomes; however, at present additional scaffolding is needed to achieve chromosome-level assemblies. We generated Pacific Biosciences (PacBio) long-read data of the genomes of three relatives of the model plant Arabidopsis thaliana and assembled all three genomes into only a few hundred contigs. To improve the contiguities of these assemblies, we generated BioNano Genomics optical mapping and Dovetail Genomics chromosome conformation capture data for genome scaffolding. Despite their technical differences, optical mapping and chromosome conformation capture performed similarly and doubled N50 values. After improving both integration methods, assembly contiguity reached chromosome-arm-levels. We rigorously assessed the quality of contigs and scaffolds using Illumina mate-pair libraries and genetic map information. This showed that PacBio assemblies have high sequence accuracy but can contain several misassemblies, which join unlinked regions of the genome. Most, but not all, of these misjoints were removed during the integration of the optical mapping and chromosome conformation capture data. Even though none of the centromeres were fully assembled, the scaffolds revealed large parts of some centromeric regions, even including some of the heterochromatic regions, which are not present in gold standard reference sequences.


Assuntos
Cromossomos de Plantas/química , Mapeamento de Sequências Contíguas/métodos , Genoma de Planta , Genômica/métodos , Software , Arabidopsis/genética , Cromossomos de Plantas/genética , Mapeamento de Sequências Contíguas/normas , Genômica/normas
2.
Plant J ; 93(3): 515-533, 2018 02.
Artigo em Inglês | MEDLINE | ID: mdl-29237241

RESUMO

The draft genome of the moss model, Physcomitrella patens, comprised approximately 2000 unordered scaffolds. In order to enable analyses of genome structure and evolution we generated a chromosome-scale genome assembly using genetic linkage as well as (end) sequencing of long DNA fragments. We find that 57% of the genome comprises transposable elements (TEs), some of which may be actively transposing during the life cycle. Unlike in flowering plant genomes, gene- and TE-rich regions show an overall even distribution along the chromosomes. However, the chromosomes are mono-centric with peaks of a class of Copia elements potentially coinciding with centromeres. Gene body methylation is evident in 5.7% of the protein-coding genes, typically coinciding with low GC and low expression. Some giant virus insertions are transcriptionally active and might protect gametes from viral infection via siRNA mediated silencing. Structure-based detection methods show that the genome evolved via two rounds of whole genome duplications (WGDs), apparently common in mosses but not in liverworts and hornworts. Several hundred genes are present in colinear regions conserved since the last common ancestor of plants. These syntenic regions are enriched for functions related to plant-specific cell growth and tissue organization. The P. patens genome lacks the TE-rich pericentromeric and gene-rich distal regions typical for most flowering plant genomes. More non-seed plant genomes are needed to unravel how plant genomes evolve, and to understand whether the P. patens genome structure is typical for mosses or bryophytes.


Assuntos
Evolução Biológica , Bryopsida/genética , Cromossomos de Plantas , Genoma de Planta , Centrômero , Cromatina/genética , Metilação de DNA , Elementos de DNA Transponíveis , Variação Genética , Polimorfismo de Nucleotídeo Único , Recombinação Genética , Sintenia
3.
Bioinformatics ; 34(4): 550-557, 2018 02 15.
Artigo em Inglês | MEDLINE | ID: mdl-29444236

RESUMO

Motivation: Analyzing k-mer frequencies in whole-genome sequencing data is becoming a common method for estimating genome size (GS). However, it remains uninvestigated how accurate the method is, especially if it can capture intra-species GS variation. Results: We present findGSE, which fits skew normal distributions to k-mer frequencies to estimate GS. findGSE outperformed existing tools in an extensive simulation study. Estimating GSs of 89 Arabidopsis thaliana accessions, findGSE showed the highest capability in capturing GS variations. In an application with 71 female and 71 male human individuals, findGSE delivered an average of 3039 Mb as haploid human GS, while female genomes were on average 41 Mb larger than male genomes, in astonishing agreement with size difference of the X and Y chromosomes. Further analysis showed that human GS variations link to geographical patterns and significant differences between populations, which can be explained by variable abundances of LINE-1 retrotransposons. Availability and implementation: R package of findGSE is freely available at https://github.com/schneebergerlab/findGSE and supported on linux and Mac systems. Contact: schneeberger@mpipz.mpg.de. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Tamanho do Genoma , Genoma Humano , Genoma de Planta , Análise de Sequência de DNA/métodos , Software , Arabidopsis/genética , Feminino , Variação Genética , Genômica/métodos , Humanos , Elementos Nucleotídeos Longos e Dispersos/genética , Masculino , Grupos Populacionais/genética
4.
BMC Genomics ; 19(1): 821, 2018 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-30442098

RESUMO

BACKGROUND: The three superfamilies of Long Terminal Repeat (LTR) retrotransposons are a widespread kind of transposable element and a major factor in eukaryotic genome evolution. In metazoans, recent studies suggested that Copia LTR-retrotransposons display specific dynamic compared to the more abundant and diverse Gypsy elements. Indeed, Copia elements show a relative scarcity and the prevalence of only a few clades in specific hosts. Thus, BEL/Pao seems to be the second most abundant superfamily. However, the generality of these assumptions remains to be assessed. Therefore, we carried out the first large-scale comparative genomic analysis of LTR-retrotransposons in molluscs. The aim of this study was to analyse the diversity, copy numbers, genomic proportions and distribution of LTR-retrotransposons in a large host phylum. RESULTS: We compare nine genomes of molluscs and further added LTR-retrotransposons sequences detected in databases for 47 additional species. We identified 1709 families, which enabled us to define 31 clades. We show that clade richness was highly dependent on the considered superfamily. We found only three Copia clades, including GalEa and Hydra which appear to be widely distributed and highly dominant as they account for 96% of the characterised Copia elements. Among the seven BEL/Pao clades identified, Sparrow and Surcouf are characterised for the first time. We find no BEL or Pao elements, but the rare clades Dan and Flow are present in molluscs. Finally, we characterised 21 Gypsy clades, only five of which had been previously described, the C-clade being the most abundant one. Even if they are found in the same number of host species, Copia elements are clearly less abundant than BEL/Pao elements in copy number or genomic proportions, while Gypsy elements are always the most abundant ones whatever the parameter considered. CONCLUSIONS: Our analysis confirms the contrasting dynamics of Copia and Gypsy elements in metazoans and indicates that BEL/Pao represents the second most abundant superfamily, probably reflecting an intermediate dynamic. Altogether, the data obtained in several taxa highly suggest that these patterns can be generalised for most metazoans. Finally, we highlight the importance of using database information in complement of genome analyses when analyzing transposable element diversity.


Assuntos
Variação Genética , Genoma/genética , Moluscos/genética , Retroelementos/genética , Sequências Repetidas Terminais/genética , Animais , Evolução Molecular , Genômica/métodos , Moluscos/classificação , Filogenia , Especificidade da Espécie
5.
BMC Genomics ; 18(1): 410, 2017 05 25.
Artigo em Inglês | MEDLINE | ID: mdl-28545447

RESUMO

BACKGROUND: The relative scarcity of Copia retrotransposons has been recently characterized in metazoans in comparison with the other superfamilies of LTR elements. Furthermore, Copia retrotransposons have often a particular dynamics that results in a highly predominant single clade of elements within a large host taxon, such as the GalEa-like retrotransposons in crustaceans. Taking advantage of the skyrocketing amount of genomic data available for fungi, we carried out the first large-scale comparative genomic analysis of the Copia clades in filamentous ascomycetes. RESULTS: Screening 30 completely sequenced genomes allowed us to identify more than 2500 Copia copies with conserved LTR, which are distributed in 138 families. Their characterization revealed that fungal Copia diversity is much broader than previously thought with at least 27 clades, 23 of which likely correspond to new ones. While the Copia copy number is low in most species, the two clades GalEa and FunCo1 are widely distributed and highly dominate Copia content as they both account for 80% of the detected sequences. CONCLUSIONS: In Fungi, GalEa retrotransposons are restricted to Pezizomycotina in which they can make up an outstandingly high proportion of the genome (up to 10% in Cenococcum geophilum). At last, we revealed that fungal GalEa elements structurally differ from all other Copia elements with an absence of Primer Binding Site. These elements however harbor a Conserved Hairpin Site which is probably essential for their transposition.


Assuntos
Ascomicetos/genética , Variação Genética , Genômica , Retroelementos/genética , Sequência Conservada/genética , Evolução Molecular , Genoma Fúngico/genética
6.
Syst Biol ; 64(1): 112-26, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25261464

RESUMO

A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution.


Assuntos
Genoma/genética , Filogenia , Animais , Análise por Conglomerados , DNA de Plantas/genética , Drosophila/classificação , Drosophila/genética , Genes de Insetos/genética , Magnoliopsida/genética , Sequências Repetitivas de Ácido Nucleico/genética
7.
Plant J ; 75(4): 699-709, 2013 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-23663083

RESUMO

Comparisons of closely related species are needed to understand the fine-scale dynamics of retrotransposon evolution in flowering plants. Towards this goal, we classified the long terminal repeat (LTR) retrotransposons from six diploid and one tetraploid species of Orobanchaceae. The study species are the autotrophic, non-parasitic Lindenbergia philippensis (as an out-group) and six closely related holoparasitic species of Orobanche [O. crenata, O. cumana, O. gracilis (tetraploid) and O. pancicii] and Phelipanche (P. lavandulacea and P. ramosa). All major plant LTR retrotransposon clades could be identified, and appear to be inherited from a common ancestor. Species of Orobanche, but not Phelipanche, are enriched in Ty3/Gypsy retrotransposons due to a diversification of elements, especially chromoviruses. This is particularly striking in O. gracilis, where tetraploidization seems to have contributed to the Ty3/Gypsy enrichment and led to the emergence of seven large species-specific families of chromoviruses. The preferential insertion of chromoviruses in heterochromatin via their chromodomains might have favored their diversification and enrichment. Our phylogenetic analyses of LTR retrotransposons from Orobanchaceae also revealed that the Bianca clade of Ty1/Copia and the SMART-related elements are much more widely distributed among angiosperms than previously known.


Assuntos
Genoma de Planta/genética , Orobanchaceae/genética , Retroelementos/genética , Sequências Repetidas Terminais/genética , DNA de Plantas/química , DNA de Plantas/genética , Diploide , Evolução Molecular , Sequenciamento de Nucleotídeos em Larga Escala , Dados de Sequência Molecular , Filogenia , Análise de Sequência de DNA , Especificidade da Espécie , Tetraploidia
8.
Mol Biol Evol ; 29(11): 3601-11, 2012 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-22723303

RESUMO

We used next-generation sequencing to characterize the genomes of nine species of Orobanchaceae of known phylogenetic relationships, different life forms, and including a polyploid species. The study species are the autotrophic, nonparasitic Lindenbergia philippensis, the hemiparasitic Schwalbea americana, and seven nonphotosynthetic parasitic species of Orobanche (Orobanche crenata, Orobanche cumana, Orobanche gracilis (tetraploid), and Orobanche pancicii) and Phelipanche (Phelipanche lavandulacea, Phelipanche purpurea, and Phelipanche ramosa). Ty3/Gypsy elements comprise 1.93%-28.34% of the nine genomes and Ty1/Copia elements comprise 8.09%-22.83%. When compared with L. philippensis and S. americana, the nonphotosynthetic species contain higher proportions of repetitive DNA sequences, perhaps reflecting relaxed selection on genome size in parasitic organisms. Among the parasitic species, those in the genus Orobanche have smaller genomes but higher proportions of repetitive DNA than those in Phelipanche, mostly due to a diversification of repeats and an accumulation of Ty3/Gypsy elements. Genome downsizing in the tetraploid O. gracilis probably led to sequence loss across most repeat types.


Assuntos
DNA de Plantas/genética , Genoma de Planta/genética , Orobanchaceae/genética , Filogenia , Sequências Repetitivas de Ácido Nucleico/genética , Análise de Sequência de DNA/métodos , Análise por Conglomerados , Tamanho do Genoma/genética , Sequenciamento de Nucleotídeos em Larga Escala , Dados de Sequência Molecular , Especificidade da Espécie
9.
BMC Genomics ; 12: 621, 2011 Dec 20.
Artigo em Inglês | MEDLINE | ID: mdl-22185659

RESUMO

BACKGROUND: DIRS1-like elements compose one superfamily of tyrosine recombinase-encoding retrotransposons. They have been previously reported in only a few diverse eukaryote species, describing a patchy distribution, and little is known about their origin and dynamics. Recently, we have shown that these retrotransposons are common among decapods, which calls into question the distribution of DIRS1-like retrotransposons among eukaryotes. RESULTS: To determine the distribution of DIRS1-like retrotransposons, we developed a new computational tool, ReDoSt, which allows us to identify well-conserved DIRS1-like elements. By screening 274 completely sequenced genomes, we identified more than 4000 DIRS1-like copies distributed among 30 diverse species which can be clustered into roughly 300 families. While the diversity in most species appears restricted to a low copy number, a few bursts of transposition are strongly suggested in certain species, such as Danio rerio and Saccoglossus kowalevskii. CONCLUSION: In this study, we report 14 new species and 8 new higher taxa that were not previously known to harbor DIRS1-like retrotransposons. Now reported in 61 species, these elements appear widely distributed among eukaryotes, even if they remain undetected in streptophytes and mammals. Especially in unikonts, a broad range of taxa from Cnidaria to Sauropsida harbors such elements. Both the distribution and the similarities between the DIRS1-like element phylogeny and conventional phylogenies of the host species suggest that DIRS1-like retrotransposons emerged early during the radiation of eukaryotes.


Assuntos
Células Eucarióticas/metabolismo , Retroelementos , Genoma
10.
BMC Evol Biol ; 9: 86, 2009 Apr 28.
Artigo em Inglês | MEDLINE | ID: mdl-19400949

RESUMO

BACKGROUND: Transposable elements are major constituents of eukaryote genomes and have a great impact on genome structure and stability. Considering their mutational abilities, TEs can contribute to the genetic diversity and evolution of organisms. Knowledge of their distribution among several genomes is an essential condition to study their dynamics and to better understand their role in species evolution. DIRS1-like retrotransposons are a particular group of retrotransposons according to their mode of transposition that implies a tyrosine recombinase. To date, they have been described in a restricted number of species in comparison with the LTR retrotransposons. In this paper, we determine the distribution of DIRS1-like elements among 25 decapod species, 10 of them living in hydrothermal vents that correspond to particularly unstable environments. RESULTS: Using PCR approaches, we have identified 15 new DIRS1-like families in 15 diverse decapod species (shrimps, lobsters, crabs and galatheid crabs). Hydrothermal organisms show a particularly great diversity of DIRS1-like elements with 5 families characterized among Alvinocarididae shrimps and 3 in the galatheid crab Munidopsis recta. Phylogenic analyses show that these elements are divergent toward the DIRS1-like families previously described in other crustaceans and arthropods and form a new clade called AlDIRS1. At larger scale, the distribution of DIRS1-like retrotransposons appears more or less patchy depending on the taxa considered. Indeed, a scattered distribution can be observed in the infraorder Brachyura whereas all the species tested in infraorders Caridea and Astacidea harbor some DIRS1-like elements. CONCLUSION: Our results lead to nearly double both the number of DIRS1-like elements described to date, and the number of species known to harbor these ones. In this study, we provide the first degenerate primers designed to look specifically for DIRS1-like retrotransposons. They allowed for revealing for the first time a widespread distribution of these elements among a large phylum, here the order Decapoda. They also suggest some peculiar features of these retrotransposons in hydrothermal organisms where a great diversity of elements is already observed. Finally, this paper constitutes the first essential step which allows for considering further studies based on the dynamics of the DIRS1-like retrotransposons among several genomes.


Assuntos
Decápodes/genética , Retroelementos , Animais , Decápodes/classificação , Ecossistema , Evolução Molecular , Filogenia , Alinhamento de Sequência , Análise de Sequência de DNA , Especificidade da Espécie
11.
Nat Plants ; 5(8): 846-855, 2019 08.
Artigo em Inglês | MEDLINE | ID: mdl-31358959

RESUMO

Comparative genomics can unravel the genetic basis of species differences; however, successful reports on quantitative traits are still scarce. Here we present genome assemblies of 31 so-far unassembled Brassicaceae plant species and combine them with 16 previously published assemblies to establish the Brassicaceae Diversity Panel. Using a new interspecies association strategy for quantitative traits, we found a so-far unknown association between the unexpectedly high variation in CG to TG substitution rates in genes and the absence of CHROMOMETHYLASE3 (CMT3) orthologues. Low substitution rates were associated with the loss of CMT3, while species with conserved CMT3 orthologues showed high substitution rates. Species without CMT3 also lacked gene-body methylation (gbM), suggesting an evolutionary trade-off between the unknown function of gbM and low substitution rates in Brassicaceae, possibly due to low mutability of non-methylated cytosines.


Assuntos
Brassicaceae/genética , Genoma de Planta , Nucleotídeos/genética , Brassicaceae/classificação , Brassicaceae/metabolismo , Mapeamento Cromossômico , Citosina , Estudos de Associação Genética , Genômica , Guanina , Metilação , Filogenia , Locos de Características Quantitativas , Timina
12.
Mob DNA ; 6: 5, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25834645

RESUMO

BACKGROUND: Polyploidization can activate specific transposable elements, leading to their accumulation. At the same time, the preferential loss of repetitive elements in polyploids may be central to diploidization. The paucity of studies of transposable element (TE) dynamics in closely related diploid and polyploid species, however, prevents generalizations about these patterns. Here, we use low-coverage Illumina sequencing data for a clade of three tetraploid Orobanche species and a diploid relative to quantify the abundance and relative frequencies of different types of TEs. We confirmed tetraploidy in the sequenced individuals using standard cytogenetic methods and inferred the time of origin of the tetraploid clade with a rate-calibrated molecular clock. FINDINGS: The sequenced individuals of Orobanche austrohispanica, Orobanche densiflora, and Orobanche gracilis have 2n = 76 chromosomes, are tetraploid, and shared a most recent common ancestor some 6.7 Ma ago. Comparison of TE classifications from the Illumina data with classification from 454 data for one of the species revealed strong effects of sequencing technology on the detection of certain types of repetitive DNA. The three tetraploids show repeat enrichment especially of Gypsy TE families compared to eight previously analyzed Orobanchaceae. However, the diploid Orobanche rapum-genistae genome also has a very high proportion (30%) of Gypsy elements. CONCLUSIONS: We had earlier suggested that tetraploidization might have contributed to an amplification of Gypsy elements, particularly of the Tekay clade, and that O. gracilis underwent genome downsizing following polyploidization. The new data reveal that Gypsy amplification in Orobanchaceae does not consistently relate to tetraploidy and that more species sampling is required to generalize about Tekay accumulation patterns.

13.
Nat Plants ; 1: 14023, 2015 Feb 02.
Artigo em Inglês | MEDLINE | ID: mdl-27246759

RESUMO

Despite evolutionary conserved mechanisms to silence transposable element activity, there are drastic differences in the abundance of transposable elements even among closely related plant species. We conducted a de novo assembly for the 375 Mb genome of the perennial model plant, Arabis alpina. Analysing this genome revealed long-lasting and recent transposable element activity predominately driven by Gypsy long terminal repeat retrotransposons, which extended the low-recombining pericentromeres and transformed large formerly euchromatic regions into repeat-rich pericentromeric regions. This reduced capacity for long terminal repeat retrotransposon silencing and removal in A. alpina co-occurs with unexpectedly low levels of DNA methylation. Most remarkably, the striking reduction of symmetrical CG and CHG methylation suggests weakened DNA methylation maintenance in A. alpina compared with Arabidopsis thaliana. Phylogenetic analyses indicate a highly dynamic evolution of some components of methylation maintenance machinery that might be related to the unique methylation in A. alpina.

14.
PLoS One ; 8(3): e57675, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23469217

RESUMO

Transposable elements are major constituents of eukaryote genomes and have a great impact on genome structure and stability. They can contribute to the genetic diversity and evolution of organisms. Knowledge of their distribution among several genomes is an essential condition to study their dynamics and to better understand their role in species evolution. LTR-retrotransposons have been reported in many diverse eukaryote species, describing a ubiquitous distribution. Given their abundance, diversity and their extended ranges in C-values, environment and life styles, crustaceans are a great taxon to investigate the genomic component of adaptation and its possible relationships with TEs. However, crustaceans have been greatly underrepresented in transposable element studies. Using both degenerate PCR and in silico approaches, we have identified 35 Copia and 46 Gypsy families in 15 and 18 crustacean species, respectively. In particular, we characterized several full-length elements from the shrimp Rimicaris exoculata that is listed as a model organism from hydrothermal vents. Phylogenic analyses show that Copia and Gypsy retrotransposons likely present two opposite dynamics within crustaceans. The Gypsy elements appear relatively frequent and diverse whereas Copia are much more homogeneous, as 29 of them belong to the single GalEa clade, and species- or lineage-dependent. Our results also support the hypothesis of the Copia retrotransposon scarcity in metazoans compared to Gypsy elements. In such a context, the GalEa-like elements present an outstanding wide distribution among eukaryotes, from fishes to red algae, and can be even highly predominant within a large taxon, such as Malacostraca. Their distribution among crustaceans suggests a dynamics that follows a "domino days spreading" branching process in which successive amplifications may interact positively.


Assuntos
Adaptação Biológica/genética , Decápodes/classificação , Decápodes/genética , Filogenia , Retroelementos/genética , Transcriptoma , Sequência de Aminoácidos , Animais , Evolução Biológica , Mineração de Dados , Dados de Sequência Molecular
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA