Results 1 - 20 of 77
1.
Nature ; 541(7638): 536-540, 2017 01 26.
Article in English | MEDLINE | ID: mdl-28092920

ABSTRACT

The Southern Ocean houses a diverse and productive community of organisms. Unicellular eukaryotic diatoms are the main primary producers in this environment, where photosynthesis is limited by low concentrations of dissolved iron and large seasonal fluctuations in light, temperature and the extent of sea ice. How diatoms have adapted to this extreme environment is largely unknown. Here we present insights into the genome evolution of a cold-adapted diatom from the Southern Ocean, Fragilariopsis cylindrus, based on a comparison with temperate diatoms. We find that approximately 24.7 per cent of the diploid F. cylindrus genome consists of genetic loci with alleles that are highly divergent (15.1 megabases of the total genome size of 61.1 megabases). These divergent alleles were differentially expressed across environmental conditions, including darkness, low iron, freezing, elevated temperature and increased CO2. Alleles with the largest ratio of non-synonymous to synonymous nucleotide substitutions also show the most pronounced condition-dependent expression, suggesting a correlation between diversifying selection and allelic differentiation. Divergent alleles may be involved in adaptation to environmental fluctuations in the Southern Ocean.
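The percentage quoted in this abstract can be checked against the two megabase figures it gives. The snippet below is only an arithmetic check; the variable names are illustrative and not taken from the paper.

```python
# Figures as stated in the abstract (megabases).
divergent_mb = 15.1  # loci with highly divergent alleles
genome_mb = 61.1     # total genome size

# Share of the genome made up of divergent loci, in percent.
share_percent = 100 * divergent_mb / genome_mb
print(f"{share_percent:.1f}%")  # prints "24.7%"
```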


Subject(s)
Acclimatization/genetics; Cold Temperature; Diatoms/genetics; Evolution, Molecular; Genome/genetics; Genomics; Alleles; Carbon Dioxide/metabolism; Darkness; Diatoms/metabolism; Freezing; Gene Expression Profiling; Genetic Drift; Ice Cover; Iron/metabolism; Mutation Rate; Oceans and Seas; Phylogeny; Recombination, Genetic; Transcriptome/genetics
2.
PLoS Genet ; 15(2): e1007900, 2019 02.
Article in English | MEDLINE | ID: mdl-30753202

ABSTRACT

Most of the current knowledge on the genetic basis of adaptive evolution is based on the analysis of single nucleotide polymorphisms (SNPs). Despite increasing evidence for their causal role, the contribution of structural variants to adaptive evolution remains largely unexplored. In this work, we analyzed the population frequencies of 1,615 Transposable Element (TE) insertions annotated in the reference genome of Drosophila melanogaster, in 91 samples from 60 worldwide natural populations. We identified a set of 300 polymorphic TEs that are present at high population frequencies, and located in genomic regions with high recombination rate, where the efficiency of natural selection is high. The age and the length of these 300 TEs are consistent with relatively young and long insertions reaching high frequencies due to the action of positive selection. In addition, we identified a set of 21 fixed TEs also likely to be adaptive. Indeed, we, and others, found evidence of selection for 84 of these reference TE insertions. The analysis of the genes located near these 84 candidate adaptive insertions suggested that the functional response to selection is related to the GO categories of response to stimulus, behavior, and development. We further showed that a subset of the candidate adaptive TEs affects expression of nearby genes, and five of them have already been linked to an ecologically relevant phenotypic effect. Our results provide a more complete understanding of the genetic variation and the fitness-related traits relevant for adaptive evolution. Similar studies should help uncover the importance of TE-induced adaptive mutations in other species as well.


Subject(s)
Behavior, Animal/physiology; DNA Transposable Elements/genetics; Drosophila melanogaster/genetics; Gene Expression Regulation, Developmental/genetics; Genome, Insect/genetics; Mutation/genetics; Stress, Physiological/genetics; Animals; Evolution, Molecular; Gene Frequency/genetics; Polymorphism, Single Nucleotide/genetics; Selection, Genetic/genetics
3.
Brief Bioinform ; 20(2): 565-571, 2019 03 25.
Article in English | MEDLINE | ID: mdl-29659709

ABSTRACT

Improving productivity of the staple crops wheat and rice is essential to feed the growing global population, particularly in the context of a changing climate. However, current rates of yield gain are insufficient to support the predicted population growth. New approaches are required to accelerate the breeding process, and many of these are driven by the application of large-scale crop data. To leverage the substantial volumes and types of data that can be applied for precision breeding, the wheat and rice research communities are working towards the development of integrated systems to access and standardize the dispersed, heterogeneous available data. Here, we outline the initiatives of the International Wheat Information System (WheatIS) and the International Rice Informatics Consortium (IRIC) to establish Web-based single-access systems and data mining tools to make the available resources more accessible, drive discovery and accelerate the production of new crop varieties. We discuss the progress of WheatIS and IRIC towards unifying specialized wheat and rice databases and building custom software platforms to manage and interrogate these data. Single-access crop information systems will strengthen scientific collaboration, optimize the use of public research funds and help achieve the required yield gains in the two most important global food crops.


Subject(s)
Crops, Agricultural/growth & development; Information Systems; Oryza/growth & development; Triticum/growth & development
4.
Plant Cell ; 28(9): 2197-2211, 2016 Sep.
Article in English | MEDLINE | ID: mdl-27495811

ABSTRACT

LHP1-INTERACTING FACTOR2 (LIF2), a heterogeneous nuclear ribonucleoprotein involved in Arabidopsis thaliana cell fate and stress responses, interacts with LIKE HETEROCHROMATIN PROTEIN1 (LHP1), a Polycomb Repressive Complex1 subunit. To investigate LIF2-LHP1 functional interplay, we mapped their genome-wide distributions in wild-type, lif2, and lhp1 backgrounds, under standard and stress conditions. Interestingly, LHP1-targeted regions form local clusters, suggesting an underlying functional organization of the plant genome. Regions targeted by both LIF2 and LHP1 were enriched in stress-responsive genes, the H2A.Z histone variant, and antagonistic histone marks. We identified specific motifs within the targeted regions, including a G-box-like motif, a GAGA motif, and a telo-box. LIF2 and LHP1 can operate both antagonistically and synergistically. In response to methyl jasmonate treatment, LIF2 was rapidly recruited to chromatin, where it mediated transcriptional gene activation. Thus, LIF2 and LHP1 participate in transcriptional switches in stress-response pathways.

5.
New Phytol ; 213(3): 1477-1486, 2017 Feb.
Article in English | MEDLINE | ID: mdl-27551821

ABSTRACT

The origin of bread wheat (Triticum aestivum; AABBDD) has been a subject of controversy and of intense debate in the scientific community over the last few decades. In 2015, three articles published in New Phytologist discussed the origin of hexaploid bread wheat (AABBDD) from the diploid progenitors Triticum urartu (AA), a relative of Aegilops speltoides (BB) and Triticum tauschii (DD). Access to new genomic resources since 2013 has offered the opportunity to gain novel insights into the paleohistory of modern bread wheat, allowing characterization of its origin from its diploid progenitors at unprecedented resolution. We propose a reconciled evolutionary scenario for the modern bread wheat genome based on the complementary investigation of transposable element and mutation dynamics between diploid, tetraploid and hexaploid wheat. In this scenario, the structural asymmetry observed between the A, B and D subgenomes in hexaploid bread wheat derives from the cumulative effect of diploid progenitor divergence, the hybrid origin of the D subgenome, and subgenome partitioning following the polyploidization events.


Subject(s)
Biological Evolution; Bread; Triticum/genetics; DNA Transposable Elements/genetics; Genome, Plant; Models, Genetic; Mutagenesis, Insertional/genetics; Mutation/genetics; Polymorphism, Single Nucleotide/genetics; Synteny/genetics
6.
BMC Genomics ; 17(1): 624, 2016 08 12.
Article in English | MEDLINE | ID: mdl-27519651

ABSTRACT

BACKGROUND: Plant genomes are populated by different types of repetitive elements including transposable elements (TEs) and simple sequence repeats (SSRs) that can have a strong impact on genome size and dynamics as well as on the regulation of gene transcription. At least two-thirds of the tomato genome is composed of repeats. While their bulk impact on genome organization has been recently revealed by whole genome assembly, their influence on tomato biology and phenotype remains largely unaddressed. More specifically, the effects and roles of DNA repeats on the maturation of fleshy fruits, which is a complex process of key agro-economic interest, still need to be investigated comprehensively, and tomato is arguably an excellent model for such a study. RESULTS: We have performed a comprehensive annotation of the tomato repeatome to explore its potential impact on tomato genome composition and gene transcription. Our results show that the tomato genome can be fractioned into three compartments with different gene and repeat density, each compartment presenting contrasting repeat and gene composition, repeat-gene associations and different gene transcriptional levels. In the context of fruit ripening, we found that repeats are present in the majority of differentially methylated regions (DMRs) and thousands of repeat-associated DMRs are found in gene proximity including hundreds that are differentially regulated. Furthermore, we found that repeats are also present in the proximity of binding sites of the key ripening protein RIN. We also observed that some repeat families are present at unexpectedly high frequency in the proximity of genes that are differentially expressed during tomato ripening. CONCLUSION: Altogether, our study emphasizes the fractionation as defined by repeat content in the tomato genome and enables further characterization of the specificities of each genomic compartment. Additionally, our results present strong associations between differentially regulated genes, differentially methylated regions and repeats, suggesting a potential adaptive function of repeats in tomato ripening. Our work therefore provides significant perspectives for the understanding of the impact of repeats on the maturation of fleshy fruits.


Subject(s)
Repetitive Sequences, Nucleic Acid/genetics; Solanum lycopersicum/genetics; Chromosomes, Plant/genetics; Chromosomes, Plant/metabolism; DNA Methylation; DNA, Plant/metabolism; Fruit/genetics; Gene Expression Regulation, Plant; Solanum lycopersicum/growth & development; Plant Leaves/genetics; Plant Leaves/metabolism; Plant Proteins/genetics; Plant Proteins/metabolism
7.
Nature ; 464(7291): 1033-8, 2010 Apr 15.
Article in English | MEDLINE | ID: mdl-20348908

ABSTRACT

The Périgord black truffle (Tuber melanosporum Vittad.) and the Piedmont white truffle dominate today's truffle market. The hypogeous fruiting body of T. melanosporum is a gastronomic delicacy produced by an ectomycorrhizal symbiont endemic to calcareous soils in southern Europe. The worldwide demand for this truffle has fuelled intense efforts at cultivation. Identification of processes that condition and trigger fruit body and symbiosis formation, ultimately leading to efficient crop production, will be facilitated by a thorough analysis of truffle genomic traits. In the ectomycorrhizal Laccaria bicolor, the expansion of gene families may have acted as a 'symbiosis toolbox'. This feature may however reflect evolution of this particular taxon and not a general trait shared by all ectomycorrhizal species. To get a better understanding of the biology and evolution of the ectomycorrhizal symbiosis, we report here the sequence of the haploid genome of T. melanosporum, which at approximately 125 megabases is the largest and most complex fungal genome sequenced so far. This expansion results from a proliferation of transposable elements accounting for approximately 58% of the genome. In contrast, this genome only contains approximately 7,500 protein-coding genes with very rare multigene families. It lacks large sets of carbohydrate cleaving enzymes, but a few of them involved in degradation of plant cell walls are induced in symbiotic tissues. The latter feature and the upregulation of genes encoding lipases and multicopper oxidases suggest that T. melanosporum degrades its host cell walls during colonization. Symbiosis induces an increased expression of carbohydrate and amino acid transporters in both L. bicolor and T. melanosporum, but the comparison of genomic traits in the two ectomycorrhizal fungi showed that genetic predispositions for symbiosis ('the symbiosis toolbox') evolved along different ways in ascomycetes and basidiomycetes.


Subject(s)
Ascomycota/genetics; Evolution, Molecular; Genome, Fungal/genetics; Symbiosis/genetics; Carbohydrates; DNA Transposable Elements/genetics; Fruiting Bodies, Fungal/metabolism; Genes, Fungal/genetics; Genomics; Haploidy; Molecular Sequence Data; Sequence Analysis, DNA; Sulfur/metabolism
8.
Proc Natl Acad Sci U S A ; 110(49): 19842-7, 2013 Dec 03.
Article in English | MEDLINE | ID: mdl-24248389

ABSTRACT

Most of our understanding of Drosophila heterochromatin structure and evolution has come from the annotation of heterochromatin from the isogenic y; cn bw sp strain. However, almost nothing is known about the heterochromatin's structural dynamics and evolution. Here, we focus on a 180-kb heterochromatic locus producing Piwi-interacting RNAs (piRNA cluster), the flamenco (flam) locus, known to be responsible for the control of at least three transposable elements (TEs). We report its detailed structure in three different Drosophila lines chosen according to their capacity to repress or not to repress the expression of two retrotransposons named ZAM and Idefix, and we show that they display high structural diversity. Numerous rearrangements due to homologous and nonhomologous recombination, deletions and segmental duplications, and loss and gain of TEs are diverse sources of active genomic variation at this locus. Notably, we provide evidence of a correlation between the presence of ZAM and Idefix in this piRNA cluster and their silencing. They are absent from flam in the strain where they are derepressed. We show that, unexpectedly, more than half of the flam locus results from recent TE insertions and that most of the elements concerned are prone to horizontal transfer between species of the melanogaster subgroup. We build a model showing how such high and constant dynamics of a piRNA master locus open the way to continual emergence of new patterns of piRNA biogenesis leading to changes in the level of transposition control.


Subject(s)
Cadherins/genetics; Drosophila Proteins/genetics; Drosophila melanogaster/genetics; Evolution, Molecular; Genetic Variation; Heterochromatin/genetics; RNA, Small Interfering/genetics; Retroelements/genetics; Animals; Base Sequence; Computational Biology; Gene Transfer, Horizontal/genetics; Molecular Sequence Data; Oligonucleotides/genetics; RNA Interference; Sequence Alignment; Sequence Analysis, DNA
9.
BMC Genomics ; 16: 141, 2015 Feb 28.
Article in English | MEDLINE | ID: mdl-25766680

ABSTRACT

BACKGROUND: Transposable Elements (TEs) are key components that shape the organization and evolution of genomes. Fungi have developed defense mechanisms against TE invasion such as RIP (Repeat-Induced Point mutation), MIP (Methylation Induced Premeiotically) and Quelling (RNA interference). RIP inactivates repeated sequences by promoting Cytosine to Thymine mutations, whereas MIP only methylates TEs at C residues. Both mechanisms require specific cytosine DNA Methyltransferases (RID1/Masc1) of the Dnmt1 superfamily. RESULTS: We annotated TE sequences from 10 fungal genomes with different TE content (1-70%). We then used these TE sequences to carry out a genome-wide analysis of C to T mutation biases. Genomes from either Ascomycota or Basidiomycota that were massively invaded by TEs (Blumeria, Melampsora, Puccinia) were characterized by a low frequency of C to T mutation bias (10-20%), whereas other genomes displayed intermediate to high frequencies (25-75%). We identified several dinucleotide signatures at these C to T mutation sites (CpA, CpT, and CpG). Phylogenomic analysis of fungal Dnmt1 MTases revealed a previously unreported association between these dinucleotide signatures and the presence/absence of sub-classes of Dnmt1. CONCLUSIONS: We identified fungal genomes containing large numbers of TEs with many C to T mutations associated with species-specific dinucleotide signatures. This bias suggests that a basic defense mechanism against TE invasion similar to RIP is widespread in fungi, although the efficiency and specificity of this mechanism differ between species. Our analysis revealed that dinucleotide signatures are associated with the presence/absence of specific Dnmt1 subfamilies. In particular, an RID1-dependent RIP mechanism was found only in Ascomycota.
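To make the idea of a dinucleotide signature at C to T sites concrete, here is a minimal Python sketch that tallies C-to-T differences between two pre-aligned sequences by the base that follows the cytosine. It is not the authors' pipeline. The function name `c_to_t_by_dinucleotide`, the use of a reference sequence (for example a TE consensus) against a single copy, the equal-length alignment with `-` for gaps, and the forward-strand-only view are all assumptions made for illustration.

```python
from collections import Counter


def c_to_t_by_dinucleotide(reference: str, copy: str) -> Counter:
    """Count sites where the reference has C and the copy has T,
    grouped by the reference base immediately 3' of that C."""
    if len(reference) != len(copy):
        raise ValueError("sequences must be aligned to the same length")
    reference, copy = reference.upper(), copy.upper()

    counts: Counter = Counter()
    # Stop one base early: the last position has no 3' neighbour.
    for i in range(len(reference) - 1):
        if reference[i] == "C" and copy[i] == "T":
            neighbour = reference[i + 1]
            if neighbour in "ACGT":  # skip gaps and ambiguous bases
                counts["Cp" + neighbour] += 1
    return counts


# Toy alignment (made-up sequences, not data from the study).
signature = c_to_t_by_dinucleotide("CATCGCTA", "TATTGCTA")
print(dict(signature))
```

A real analysis would also need to consider the reverse strand (G to A changes) and normalise by how often each dinucleotide occurs, which this sketch leaves out.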


Subject(s)
DNA Transposable Elements/genetics; Genome, Fungal; Mutation/genetics; Ascomycota/genetics; Basidiomycota/genetics; Molecular Sequence Annotation; Species Specificity
10.
BMC Genomics ; 16: 917, 2015 Nov 10.
Article in English | MEDLINE | ID: mdl-26556056

ABSTRACT

BACKGROUND: The Avrk1 and Avra10 avirulence (AVR) genes encode effectors that increase the pathogenicity of the fungus Blumeria graminis f.sp. hordei (Bgh), the powdery mildew pathogen, in susceptible barley plants. In resistant barley, MLK1 and MLA10 resistance proteins recognize the presence of AVRK1 and AVRA10, eliciting the hypersensitive response typical of gene-for-gene interactions. Avrk1 and Avra10 have more than 1350 homologues in the Bgh genome, forming the EKA (Effectors homologous to Avr k 1 and Avr a 10) gene family. RESULTS: We tested the hypothesis that the EKA family originated from degenerate copies of Class I LINE retrotransposons by analysing the EKA family in the genome of Bgh isolate DH14 with bioinformatic tools specially developed for the analysis of Transposable Elements (TE) in genomes. The Class I LINE retrotransposon copies homologous to Avrk1 and Avra10 represent 6.5% of the Bgh annotated genome and, among them, we identified 293 AVR/effector candidate genes. We also experimentally identified peptides that indicated the translation of several predicted proteins from EKA family members, which had higher relative abundance in haustoria than in hyphae. CONCLUSIONS: Our analyses indicate that Avrk1 and Avra10 have evolved from part of the ORF1 gene of Class I LINE retrotransposons. The co-option of Avra10 and Avrk1 as effectors from truncated copies of retrotransposons explains the huge number of homologues in the Bgh genome that could act as dynamic reservoirs from which new effector genes may evolve. These data provide further evidence for recruitment of retrotransposons in the evolution of new biological functions.


Subject(s)
Ascomycota/genetics; Fungal Proteins/genetics; Hordeum/microbiology; Long Interspersed Nucleotide Elements; Multigene Family; Plant Diseases/microbiology; Ascomycota/classification; Ascomycota/metabolism; Computational Biology; Consensus Sequence; Genome, Fungal; Open Reading Frames; Phylogeny; Proteomics
11.
BMC Genomics ; 16: 461, 2015 Jun 16.
Article in English | MEDLINE | ID: mdl-26076695

ABSTRACT

BACKGROUND: The genus Microbotryum includes plant pathogenic fungi afflicting a wide variety of hosts with anther smut disease. Microbotryum lychnidis-dioicae infects Silene latifolia and replaces host pollen with fungal spores, exhibiting biotrophy and necrosis associated with altering plant development. RESULTS: We determined the haploid genome sequence for M. lychnidis-dioicae and analyzed whole transcriptome data from plant infections and other stages of the fungal lifecycle, revealing the inventory and expression level of genes that facilitate pathogenic growth. Compared to related fungi, an expanded number of major facilitator superfamily transporters and secretory lipases were detected; lipase gene expression was found to be altered by exposure to lipid compounds, which signaled a switch to dikaryotic, pathogenic growth. In addition, while enzymes to digest cellulose, xylan, xyloglucan, and highly substituted forms of pectin were absent, along with depletion of peroxidases and superoxide dismutases that protect the fungus from oxidative stress, the repertoire of glycosyltransferases and of enzymes that could manipulate host development has expanded. A total of 14% of the genome was categorized as repetitive sequences. Transposable elements have accumulated in mating-type chromosomal regions and were also associated across the genome with gene clusters of small secreted proteins, which may mediate host interactions. CONCLUSIONS: The unique absence of enzyme classes for plant cell wall degradation and maintenance of enzymes that break down components of pollen tubes and flowers provides a striking example of biotrophic host adaptation.


Subject(s)
Fungi/genetics; Genome, Fungal/genetics; Parasites/genetics; Plant Diseases/microbiology; Plants/microbiology; Silene/microbiology; Transcriptome/genetics; Animals; Chromosome Mapping/methods; Gene Expression Profiling/methods; Host-Parasite Interactions/genetics; Lipase/genetics; Peroxidases/genetics; Superoxide Dismutase/genetics
12.
BMC Genomics ; 16: 112, 2015 Feb 21.
Article in English | MEDLINE | ID: mdl-25765701

ABSTRACT

BACKGROUND: Many northern-hemisphere forests are dominated by oaks. These species extend over diverse environmental conditions and are thus interesting models for studies of plant adaptation and speciation. The genomic toolbox is an important asset for exploring the functional variation associated with natural selection. RESULTS: The assembly of previously available and newly developed long and short sequence reads for two sympatric oak species, Quercus robur and Quercus petraea, generated a comprehensive catalog of transcripts for oak. The functional annotation of 91 k contigs demonstrated the presence of a large proportion of plant genes in this unigene set. Comparisons with SwissProt accessions and five plant gene models revealed orthologous relationships, making it possible to decipher the evolution of the oak genome. In particular, it was possible to align 9.5 thousand oak coding sequences with the equivalent sequences on peach chromosomes. Finally, RNA-seq data shed new light on the gene networks underlying vegetative bud dormancy release, a key stage in development allowing plants to adapt their phenology to the environment. CONCLUSION: In addition to providing a vast array of expressed genes, this study generated essential information about oak genome evolution and the regulation of genes associated with vegetative bud phenology, an important adaptive trait in trees. This resource contributes to the annotation of the oak genome sequence and will provide support for forward genetics approaches aiming to link genotypes with adaptive phenotypes.


Subject(s)
Adaptation, Physiological/genetics; Gene Expression Regulation, Plant; Plant Dormancy/genetics; Transcriptome/genetics; Base Sequence; Chromosome Mapping; Genetic Speciation; Genome, Plant; Quercus/genetics; Quercus/growth & development; Sequence Analysis, RNA
13.
Bioinformatics ; 30(18): 2656-8, 2014 Sep 15.
Article in English | MEDLINE | ID: mdl-24894500

ABSTRACT

MOTIVATION: Recent technological advances are allowing many laboratories to sequence their research organisms. Available de novo assemblers leave repetitive portions of the genome poorly assembled. Some genomes contain high proportions of transposable elements, and transposable elements appear to be a major force behind diversity and adaptation. Few de novo assemblers for transposable elements exist, and most have either been designed for small genomes or 454 reads. RESULTS: In this article, we present a new transposable element de novo assembler, Tedna, which assembles a set of transposable elements directly from the reads. Tedna uses Illumina paired-end reads, the most widely used sequencing technology for de novo assembly, and forms full-length transposable elements. AVAILABILITY AND IMPLEMENTATION: Tedna is available at http://urgi.versailles.inra.fr/Tools/Tedna, under the GPLv3 license. It is written in C++11 and only requires the Sparsehash Package, freely available under the New BSD License. Tedna can be used on standard computers with limited RAM resources, although it may also use large memory for better results. Most of the code is parallelized and thus ready for large infrastructures.


Subject(s)
DNA Transposable Elements/genetics; Genomics/methods; Sequence Analysis, DNA/methods; Arabidopsis/genetics; Repetitive Sequences, Nucleic Acid; Triticum/genetics
14.
Plant J ; 76(6): 1030-44, 2013 Dec.
Article in English | MEDLINE | ID: mdl-24164652

ABSTRACT

Bread wheat derives from a grass ancestor structured in seven protochromosomes, followed by a paleotetraploidization to reach a 12-chromosome intermediate and a neohexaploidization event (involving subgenomes A, B and D) that finally shaped the 21 modern chromosomes. Insights into the wheat syntenome gained by sequencing conserved orthologous set (COS) genes revealed differences in genomic structure (such as gene conservation and diversity) and genetic landscape (such as recombination pattern) between ancestral as well as recently duplicated blocks. Contrasting evolutionary plasticity is observed: the B subgenome appears more sensitive (i.e. plastic), in contrast to A as dominant (i.e. stable), in response to the neotetraploidization, and the D subgenome appears supra-dominant (i.e. pivotal) in response to the neohexaploidization event. Finally, the wheat syntenome, delivered through a public web interface PlantSyntenyViewer at http://urgi.versailles.inra.fr/synteny-wheat, can be considered as a guide for accelerated dissection of major agronomical traits in wheat.


Subject(s)
Chromosomes, Plant/genetics; Evolution, Molecular; Genome, Plant/genetics; Genomics; Synteny/genetics; Triticum/genetics; Conserved Sequence; DNA, Plant/chemistry; DNA, Plant/genetics; Genes, Dominant; Genetic Markers; Models, Biological; Polymorphism, Single Nucleotide; Polyploidy; Sequence Analysis, DNA
15.
Bioinformatics ; 29(7): 933-9, 2013 Apr 01.
Article in English | MEDLINE | ID: mdl-23413438

ABSTRACT

MOTIVATION: High-throughput sequencing produces a large amount of data in a short time, and these data are usually difficult to analyze. Mapping the reads to the transcripts they originate from, to quantify the expression of the genes, is a simple, yet time-demanding, example of analysis. Fast genomic comparison algorithms are thus crucial for the analysis of the ever-expanding number of reads sequenced. RESULTS: We used NC-lists to implement an algorithm that compares a set of query intervals with a set of reference intervals in two steps. The first step, a pre-processing done once and for all, requires time O(#R log(#R) + #Q log(#Q)), where Q and R are the sets of query and reference intervals. The search phase requires constant space, and time O(#R + #Q + #M), where M is the set of overlaps. We showed that our algorithm compares favorably with five other algorithms, especially when several comparisons are performed. AVAILABILITY: The algorithm has been included in S-MART, a versatile tool box for RNA-Seq analysis, freely available at http://urgi.versailles.inra.fr/Tools/S-Mart. The algorithm can be used for many kinds of data (sequencing reads, annotations, etc.) in many formats (GFF3, BED, SAM, etc.), on any operating system. It is thus readily useable for the analysis of next-generation sequencing data. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
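The two-step shape described in this abstract (sort both interval sets once, then scan them together to report overlaps) can be illustrated with a short Python sketch. This is not the NC-list algorithm from the paper and not S-MART code: it is a plain sort-and-sweep substitute, and it does not achieve the constant-space, O(#R + #Q + #M) search bound claimed for the published method. The function name `find_overlaps` and the convention of closed integer intervals `(start, end)` are assumptions made for illustration.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # (start, end), both ends inclusive (assumed convention)


def find_overlaps(
    queries: List[Interval], references: List[Interval]
) -> List[Tuple[Interval, Interval]]:
    """Return every (query, reference) pair sharing at least one position."""
    # Step 1, pre-processing: sort both sets by start coordinate.
    qs = sorted(queries)
    rs = sorted(references)

    overlaps: List[Tuple[Interval, Interval]] = []
    active: List[Interval] = []  # references already started that may still overlap
    next_ref = 0

    # Step 2, search: sweep the queries in order of start coordinate.
    for q_start, q_end in qs:
        # Admit every reference that starts at or before this query's end.
        while next_ref < len(rs) and rs[next_ref][0] <= q_end:
            active.append(rs[next_ref])
            next_ref += 1
        # Discard references ending before this query starts; later queries
        # start no earlier, so those references cannot overlap them either.
        active = [r for r in active if r[1] >= q_start]
        # An earlier, longer query may have admitted references that start
        # after this query ends, so the start is checked again here.
        for r in active:
            if r[0] <= q_end:
                overlaps.append(((q_start, q_end), r))
    return overlaps


# Toy coordinates (made up for illustration).
pairs = find_overlaps([(1, 5), (10, 12)], [(4, 8), (6, 9), (11, 20)])
print(pairs)
```

Sorting is paid once per interval set, so repeated comparisons against the same references can reuse the sorted list, which is the situation in which the abstract reports the largest benefit.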


Subject(s)
Algorithms; High-Throughput Nucleotide Sequencing/methods; Genome; Genomics; Sequence Analysis, RNA
16.
Proc Natl Acad Sci U S A ; 108(38): 16116-21, 2011 Sep 20.
Article in English | MEDLINE | ID: mdl-21876149

ABSTRACT

In bacteria, genes with related functions often are grouped together in operons and are cotranscribed as a single polycistronic mRNA. In eukaryotes, functionally related genes generally are scattered across the genome. Notable exceptions include gene clusters for catabolic pathways in yeast, synthesis of secondary metabolites in filamentous fungi, and the major histocompatibility complex in animals. Until quite recently it was thought that gene clusters in plants were restricted to tandem duplicates (for example, arrays of leucine-rich repeat disease-resistance genes). However, operon-like clusters of coregulated nonhomologous genes are an emerging theme in plant biology, where they may be involved in the synthesis of certain defense compounds. These clusters are unlikely to have arisen by horizontal gene transfer, and the mechanisms behind their formation are poorly understood. Previously in thale cress (Arabidopsis thaliana) we identified an operon-like gene cluster that is required for the synthesis and modification of the triterpene thalianol. Here we characterize a second operon-like triterpene cluster (the marneral cluster) from A. thaliana, compare the features of these two clusters, and investigate the evolutionary events that have led to cluster formation. We conclude that common mechanisms are likely to underlie the assembly and control of operon-like gene clusters in plants.


Subject(s)
Arabidopsis/genetics; Arabidopsis/metabolism; Chromosomes, Plant/genetics; Multigene Family; Acyltransferases/genetics; Acyltransferases/metabolism; Arabidopsis Proteins/classification; Arabidopsis Proteins/genetics; Arabidopsis Proteins/metabolism; Chromosome Mapping; Cytochrome P-450 Enzyme System/genetics; Cytochrome P-450 Enzyme System/metabolism; Gas Chromatography-Mass Spectrometry; Gene Duplication; Gene Expression Regulation, Plant; Genome, Plant/genetics; Intramolecular Transferases/genetics; Intramolecular Transferases/metabolism; Models, Genetic; Molecular Structure; Mutation; Phylogeny; Plant Leaves/genetics; Plant Leaves/metabolism; Plant Roots/genetics; Plant Roots/metabolism; Plants, Genetically Modified; Triterpenes/analysis; Triterpenes/chemistry; Triterpenes/metabolism
17.
Proc Natl Acad Sci U S A ; 108(22): 9166-71, 2011 May 31.
Article in English | MEDLINE | ID: mdl-21536894

ABSTRACT

Rust fungi are some of the most devastating pathogens of crop plants. They are obligate biotrophs, which extract nutrients only from living plant tissues and cannot grow apart from their hosts. Their lifestyle has slowed the dissection of molecular mechanisms underlying host invasion and avoidance or suppression of plant innate immunity. We sequenced the 101-Mb genome of Melampsora larici-populina, the causal agent of poplar leaf rust, and the 89-Mb genome of Puccinia graminis f. sp. tritici, the causal agent of wheat and barley stem rust. We then compared the 16,399 predicted proteins of M. larici-populina with the 17,773 predicted proteins of P. graminis f. sp. tritici. Genomic features related to their obligate biotrophic lifestyle include expanded lineage-specific gene families, a large repertoire of effector-like small secreted proteins, impaired nitrogen and sulfur assimilation pathways, and expanded families of amino acid and oligopeptide membrane transporters. The dramatic up-regulation of transcripts coding for small secreted proteins, secreted hydrolytic enzymes, and transporters in planta suggests that they play a role in host infection and nutrient acquisition. Some of these genomic hallmarks are mirrored in the genomes of other microbial eukaryotes that have independently evolved to infect plants, indicating convergent adaptation to a biotrophic existence inside plant cells.


Subject(s)
Basidiomycota/genetics , Fungi/genetics , Triticum/microbiology , Gene Expression Profiling , Genes, Fungal , Genome , Genome, Fungal , Models, Genetic , Nitrates/chemistry , Oligonucleotide Array Sequence Analysis , Phylogeny , Plant Diseases/microbiology , Plant Leaves/microbiology , Sequence Analysis, DNA , Sulfates/chemistry
18.
Nucleic Acids Res ; 39(16): 6919-31, 2011 Sep 01.
Article in English | MEDLINE | ID: mdl-21586580

ABSTRACT

Transposable elements (TEs) and their relics play major roles in genome evolution. However, mobilization of TEs is usually deleterious and strongly repressed. In plants and mammals, this repression is typically associated with DNA methylation, but the relationship between this epigenetic mark and TE sequences has not been investigated systematically. Here, we present an improved annotation of TE sequences and use it to analyze genome-wide DNA methylation maps obtained at single-nucleotide resolution in Arabidopsis. We show that although the majority of TE sequences are methylated, ∼26% are not. Moreover, a significant fraction of TE sequences densely methylated at CG, CHG and CHH sites (where H = A, T or C) have no or few matching small interfering RNAs (siRNAs) and are therefore unlikely to be targeted by the RNA-directed DNA methylation (RdDM) machinery. We provide evidence that these TE sequences acquire DNA methylation through spreading from adjacent siRNA-targeted regions. Further, we show that although both methylated and unmethylated TE sequences located in euchromatin tend to be more abundant closer to genes, this trend is least pronounced for methylated, siRNA-targeted TE sequences located 5' to genes. Based on these and other findings, we propose that spreading of DNA methylation through promoter regions explains at least in part the negative impact of siRNA-targeted TE sequences on neighboring gene expression.


Subject(s)
Arabidopsis/genetics , DNA Methylation , Interspersed Repetitive Sequences , RNA, Small Interfering/chemistry , Base Sequence , Genes, Plant , Genome, Plant , Molecular Sequence Annotation , RNA, Plant/chemistry
19.
Database (Oxford) ; 2023, 2023 05 09.
Article in English | MEDLINE | ID: mdl-37159239

ABSTRACT

SyntenyViewer is a public web-based tool relying on a relational database, available at https://urgi.versailles.inrae.fr/synteny, that delivers comparative genomics data and an associated reservoir of conserved genes between angiosperm species for both fundamental (evolutionary studies) and applied (translational research) applications. SyntenyViewer is made available for (i) providing comparative genomics data for seven major botanical families of flowering plants, (ii) delivering a robust catalog of 103 465 conserved genes between 44 species and inferred ancestral genomes, (iii) allowing us to investigate the evolutionary fate of ancestral genes and genomic regions in modern species through duplications, inversions, deletions, fusions, fissions and translocations, (iv) serving as a tool to conduct translational research of key trait-related genes from model species to crops and (v) offering to host any comparative genomics data following simplified procedures and formats. Database URL: https://urgi.versailles.inrae.fr/synteny.


Subject(s)
Magnoliopsida , Translational Research, Biomedical , Genomics , Crops, Agricultural , Databases, Factual
20.
Nat Commun ; 13(1): 1948, 2022 04 12.
Article in English | MEDLINE | ID: mdl-35413957

ABSTRACT

High-quality reference genomes are crucial to understanding genome function, structure and evolution. The availability of reference genomes has allowed us to start inferring the role of genetic variation in biology, disease, and biodiversity conservation. However, analyses across organisms demonstrate that a single reference genome is not enough to capture the global genetic diversity present in populations. In this work, we generate 32 high-quality reference genomes for the well-known model species D. melanogaster and focus on the identification and analysis of transposable element variation, as transposable elements are the most common type of structural variant. We show that integrating the genetic variation across natural populations from five climatic regions increases the number of detected insertions by 58%. Moreover, 26% to 57% of the insertions identified using long reads were missed by short-read methods. We also identify hundreds of transposable elements associated with gene expression variation and new TE variants likely to contribute to adaptive evolution in this species. Our results highlight the importance of incorporating the genetic variation present in natural populations into genomic studies, which is essential if we are to understand how genomes function and evolve.


Subject(s)
DNA Transposable Elements , Drosophila , Animals , DNA Transposable Elements/genetics , Drosophila/genetics , Drosophila melanogaster/genetics , Evolution, Molecular , Gene Expression , Sequence Analysis, DNA