Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 48
Filtrar
1.
Plant J ; 108(2): 492-508, 2021 10.
Artículo en Inglés | MEDLINE | ID: mdl-34382706

RESUMEN

Oryza sativa (rice) plays an essential food security role for more than half of the world's population. Obtaining crops with high levels of disease resistance is a major challenge for breeders, especially today, given the urgent need for agriculture to be more sustainable. Plant resistance genes are mainly encoded by three large leucine-rich repeat (LRR)-containing receptor (LRR-CR) families: the LRR-receptor-like kinase (LRR-RLK), LRR-receptor-like protein (LRR-RLP) and nucleotide-binding LRR receptor (NLR). Using lrrprofiler, a pipeline that we developed to annotate and classify these proteins, we compared three publicly available annotations of the rice Nipponbare reference genome. The extended discrepancies that we observed for LRR-CR gene models led us to perform an in-depth manual curation of their annotations while paying special attention to nonsense mutations. We then transferred this manually curated annotation to Kitaake, a cultivar that is closely related to Nipponbare, using an optimized strategy. Here, we discuss the breakthrough achieved by manual curation when comparing genomes and, in addition to 'functional' and 'structural' annotations, we propose that the community adopts this approach, which we call 'comprehensive' annotation. The resulting data are crucial for further studies on the natural variability and evolution of LRR-CR genes in order to promote their use in breeding future resilient varieties.


Asunto(s)
Anotación de Secuencia Molecular , Oryza/genética , Proteínas de Plantas/genética , Secuencias Repetitivas de Aminoácido , Genoma de Planta , Genotipo , Anotación de Secuencia Molecular/métodos , Oryza/química , Proteínas de Plantas/química
2.
Ann Bot ; 127(6): 827-840, 2021 05 07.
Artículo en Inglés | MEDLINE | ID: mdl-33637991

RESUMEN

BACKGROUND AND AIMS: Modern sugarcane cultivars (Saccharum spp.) are high polyploids, aneuploids (2n = ~12x = ~120) derived from interspecific hybridizations between the domesticated sweet species Saccharum officinarum and the wild species S. spontaneum. METHODS: To analyse the architecture and origin of such a complex genome, we analysed the sequences of all 12 hom(oe)ologous haplotypes (BAC clones) from two distinct genomic regions of a typical modern cultivar, as well as the corresponding sequence in Miscanthus sinense and Sorghum bicolor, and monitored their distribution among representatives of the Saccharum genus. KEY RESULTS: The diversity observed among haplotypes suggested the existence of three founding genomes (A, B, C) in modern cultivars, which diverged between 0.8 and 1.3 Mya. Two genomes (A, B) were contributed by S. officinarum; these were also found in its wild presumed ancestor S. robustum, and one genome (C) was contributed by S. spontaneum. These results suggest that S. officinarum and S. robustum are derived from interspecific hybridization between two unknown ancestors (A and B genomes). The A genome contributed most haplotypes (nine or ten) while the B and C genomes contributed one or two haplotypes in the regions analysed of this typical modern cultivar. Interspecific hybridizations likely involved accessions or gametes with distinct ploidy levels and/or were followed by a series of backcrosses with the A genome. The three founding genomes were found in all S. barberi, S. sinense and modern cultivars analysed. None of the analysed accessions contained only the A genome or the B genome, suggesting that representatives of these founding genomes remain to be discovered. CONCLUSIONS: This evolutionary model, which combines interspecificity and high polyploidy, can explain the variable chromosome pairing affinity observed in Saccharum. It represents a major revision of the understanding of Saccharum diversity.


Asunto(s)
Saccharum , Genoma de Planta/genética , Genómica , Haplotipos/genética , Poliploidía , Saccharum/genética
3.
Int J Mol Sci ; 21(12)2020 Jun 13.
Artículo en Inglés | MEDLINE | ID: mdl-32545790

RESUMEN

The natural rubber biosynthetic pathway is well described in Hevea, although the final stages of rubber elongation are still poorly understood. Small Rubber Particle Proteins and Rubber Elongation Factors (SRPPs and REFs) are proteins with major function in rubber particle formation and stabilization. Their corresponding genes are clustered on a scaffold1222 of the reference genomic sequence of the Hevea brasiliensis genome. Apart from gene expression by transcriptomic analyses, to date, no deep analyses have been carried out for the genomic environment of SRPPs and REFs loci. By integrative analyses on transposable element annotation, small RNAs production and gene expression, we analysed their role in the control of the transcription of rubber biosynthetic genes. The first in-depth annotation of TEs (Transposable Elements) and their capacity to produce TE-derived siRNAs (small interfering RNAs) is presented, only possible in the Hevea brasiliensis clone PB 260 for which all data are available. We observed that 11% of genes are located near TEs and their presence may interfere in their transcription at both genetic and epigenetic level. We hypothesized that the genomic environment of rubber biosynthesis genes has been shaped by TE and TE-derived siRNAs with possible transcriptional interference on their gene expression. We discussed possible functionalization of TEs as enhancers and as donors of alternative transcription start sites in promoter sequences, possibly through the modelling of genetic and epigenetic landscapes.


Asunto(s)
Vías Biosintéticas , Perfilación de la Expresión Génica/métodos , Hevea/metabolismo , Goma/metabolismo , Elementos Transponibles de ADN , Regulación de la Expresión Génica de las Plantas , Hevea/genética , Anotación de Secuencia Molecular , Filogenia , Proteínas de Plantas/genética , Regiones Promotoras Genéticas , ARN Interferente Pequeño/genética , Análisis de Secuencia de ARN
4.
Nature ; 488(7410): 213-7, 2012 Aug 09.
Artículo en Inglés | MEDLINE | ID: mdl-22801500

RESUMEN

Bananas (Musa spp.), including dessert and cooking types, are giant perennial monocotyledonous herbs of the order Zingiberales, a sister group to the well-studied Poales, which include cereals. Bananas are vital for food security in many tropical and subtropical countries and the most popular fruit in industrialized countries. The Musa domestication process started some 7,000 years ago in Southeast Asia. It involved hybridizations between diverse species and subspecies, fostered by human migrations, and selection of diploid and triploid seedless, parthenocarpic hybrids thereafter widely dispersed by vegetative propagation. Half of the current production relies on somaclones derived from a single triploid genotype (Cavendish). Pests and diseases have gradually become adapted, representing an imminent danger for global banana production. Here we describe the draft sequence of the 523-megabase genome of a Musa acuminata doubled-haploid genotype, providing a crucial stepping-stone for genetic improvement of banana. We detected three rounds of whole-genome duplications in the Musa lineage, independently of those previously described in the Poales lineage and the one we detected in the Arecales lineage. This first monocotyledon high-continuity whole-genome sequence reported outside Poales represents an essential bridge for comparative genome analysis in plants. As such, it clarifies commelinid-monocotyledon phylogenetic relationships, reveals Poaceae-specific features and has led to the discovery of conserved non-coding sequences predating monocotyledon-eudicotyledon divergence.


Asunto(s)
Evolución Molecular , Genoma de Planta/genética , Musa/genética , Secuencia Conservada/genética , Elementos Transponibles de ADN/genética , Duplicación de Gen/genética , Genes de Plantas/genética , Genotipo , Haploidia , Datos de Secuencia Molecular , Musa/clasificación , Filogenia
6.
Plant Physiol ; 170(3): 1595-610, 2016 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-26773008

RESUMEN

Gene duplications are an important factor in plant evolution, and lineage-specific expanded (LSE) genes are of particular interest. Receptor-like kinases expanded massively in land plants, and leucine-rich repeat receptor-like kinases (LRR-RLK) constitute the largest receptor-like kinases family. Based on the phylogeny of 7,554 LRR-RLK genes from 31 fully sequenced flowering plant genomes, the complex evolutionary dynamics of this family was characterized in depth. We studied the involvement of selection during the expansion of this family among angiosperms. LRR-RLK subgroups harbor extremely contrasting rates of duplication, retention, or loss, and LSE copies are predominantly found in subgroups involved in environmental interactions. Expansion rates also differ significantly depending on the time when rounds of expansion or loss occurred on the angiosperm phylogenetic tree. Finally, using a dN/dS-based test in a phylogenetic framework, we searched for selection footprints on LSE and single-copy LRR-RLK genes. Selective constraint appeared to be globally relaxed at LSE genes, and codons under positive selection were detected in 50% of them. Moreover, the leucine-rich repeat domains, and specifically four amino acids in them, were found to be the main targets of positive selection. Here, we provide an extensive overview of the expansion and evolution of this very large gene family.


Asunto(s)
Evolución Molecular , Magnoliopsida/genética , Familia de Multigenes , Proteínas de Plantas/genética , Proteínas Tirosina Quinasas Receptoras/genética , Secuencias Repetitivas de Aminoácido , Secuencias de Aminoácidos , Duplicación de Gen , Variación Genética , Magnoliopsida/clasificación , Modelos Genéticos , Filogenia , Proteínas de Plantas/clasificación , Proteínas Tirosina Quinasas Receptoras/clasificación , Selección Genética , Especificidad de la Especie , Factores de Tiempo
7.
Nucleic Acids Res ; 43(Database issue): D1028-35, 2015 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-25392413

RESUMEN

The whole genome sequence of Coffea canephora, the perennial diploid species known as Robusta, has been recently released. In the context of the C. canephora genome sequencing project and to support post-genomics efforts, we developed the Coffee Genome Hub (http://coffee-genome.org/), an integrative genome information system that allows centralized access to genomics and genetics data and analysis tools to facilitate translational and applied research in coffee. We provide the complete genome sequence of C. canephora along with gene structure, gene product information, metabolism, gene families, transcriptomics, syntenic blocks, genetic markers and genetic maps. The hub relies on generic software (e.g. GMOD tools) for easy querying, visualizing and downloading research data. It includes a Genome Browser enhanced by a Community Annotation System, enabling the improvement of automatic gene annotation through an annotation editor. In addition, the hub aims at developing interoperability among other existing South Green tools managing coffee data (phylogenomics resources, SNPs) and/or supporting data analyses with the Galaxy workflow manager.


Asunto(s)
Coffea/genética , Bases de Datos de Ácidos Nucleicos , Genoma de Planta , Coffea/metabolismo , Perfilación de la Expresión Génica , Genómica , Redes y Vías Metabólicas/genética , Polimorfismo de Nucleótido Simple , Programas Informáticos , Sintenía
8.
BMC Genomics ; 17: 243, 2016 Mar 16.
Artículo en Inglés | MEDLINE | ID: mdl-26984673

RESUMEN

BACKGROUND: Recent advances in genomics indicate functional significance of a majority of genome sequences and their long range interactions. As a detailed examination of genome organization and function requires very high quality genome sequence, the objective of this study was to improve reference genome assembly of banana (Musa acuminata). RESULTS: We have developed a modular bioinformatics pipeline to improve genome sequence assemblies, which can handle various types of data. The pipeline comprises several semi-automated tools. However, unlike classical automated tools that are based on global parameters, the semi-automated tools proposed an expert mode for a user who can decide on suggested improvements through local compromises. The pipeline was used to improve the draft genome sequence of Musa acuminata. Genotyping by sequencing (GBS) of a segregating population and paired-end sequencing were used to detect and correct scaffold misassemblies. Long insert size paired-end reads identified scaffold junctions and fusions missed by automated assembly methods. GBS markers were used to anchor scaffolds to pseudo-molecules with a new bioinformatics approach that avoids the tedious step of marker ordering during genetic map construction. Furthermore, a genome map was constructed and used to assemble scaffolds into super scaffolds. Finally, a consensus gene annotation was projected on the new assembly from two pre-existing annotations. This approach reduced the total Musa scaffold number from 7513 to 1532 (i.e. by 80%), with an N50 that increased from 1.3 Mb (65 scaffolds) to 3.0 Mb (26 scaffolds). 89.5% of the assembly was anchored to the 11 Musa chromosomes compared to the previous 70%. Unknown sites (N) were reduced from 17.3 to 10.0%. CONCLUSION: The release of the Musa acuminata reference genome version 2 provides a platform for detailed analysis of banana genome variation, function and evolution. Bioinformatics tools developed in this work can be used to improve genome sequence assemblies in other species.


Asunto(s)
Biología Computacional/métodos , Genoma de Planta , Musa/genética , Mapeo Contig , Marcadores Genéticos , Secuenciación de Nucleótidos de Alto Rendimiento , Anotación de Secuencia Molecular , Análisis de Secuencia de ADN
9.
Mol Biol Evol ; 32(7): 1815-32, 2015 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-25761766

RESUMEN

Transposable elements comprise more than 45% of the human genome and long interspersed nuclear element 1 (LINE-1 or L1) is the only autonomous mobile element remaining active. Since its identification, it has been proposed that L1 contributes to the mobilization and amplification of other cellular RNAs and more recently, experimental demonstrations of this function has been described for many transcripts such as Alu, a nonautonomous mobile element, cellular mRNAs, or small noncoding RNAs. Detailed examination of the mobilization of various cellular RNAs revealed distinct pathways by which they could be recruited during retrotransposition; template choice or template switching. Here, by analyzing genomic structures and retrotransposition signatures associated with small nuclear RNA (snRNA) sequences, we identified distinct recruiting steps during the L1 retrotransposition cycle for the formation of snRNA-processed pseudogenes. Interestingly, some of the identified recruiting steps take place in the nucleus. Moreover, after comparison to other vertebrate genomes, we established that snRNA amplification by template switching is common to many LINE families from several LINE clades. Finally, we suggest that U6 snRNA copies can serve as markers of L1 retrotransposition dynamics in mammalian genomes.


Asunto(s)
Mamíferos/genética , Seudogenes/genética , ARN Nuclear Pequeño/genética , Retroelementos/genética , Animales , Secuencia de Bases , Genoma Humano , Humanos , Elementos de Nucleótido Esparcido Largo/genética , Datos de Secuencia Molecular , Poliadenilación/genética , Moldes Genéticos
10.
Mol Biol Evol ; 30(3): 569-72, 2013 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-23112232

RESUMEN

Paleobotanical studies suggest that roots evolved at least twice independently during land plant diversification, once in lycophytes and once in euphyllophytes. Auxin promotes postembryonic root initiation in both groups but from different cell types. In several euphyllophytes, such as Arabidopsis, rice, and maize, AS2/LOB-domain (ASL/LBD) proteins act directly downstream of auxin and are conserved elements necessary for root initiation. It is currently unknown whether similar or different genetic mechanisms act downstream of auxin for root initiation in lycophytes and euphyllophytes. We searched for ASL/LBD proteins in genome sequences spanning the tree of life to retrace their evolutionary history. We performed a phylogenetic analysis of ASL/LBD proteins and mapped the functions of all characterized ASL/LBD onto the phylogenetic trees. We identified a clade specifically associated with root development, which includes no lycophyte sequence. This points toward the existence of distinct genetic mechanisms downstream of auxin for root initiation in lycophytes and euphyllophytes.


Asunto(s)
Ácidos Indolacéticos/metabolismo , Proteínas de Plantas/genética , Raíces de Plantas/genética , Secuencia de Aminoácidos , Arabidopsis/genética , Arabidopsis/crecimiento & desarrollo , Bryopsida/genética , Bryopsida/crecimiento & desarrollo , Secuencia Conservada , Evolución Molecular , Oryza/genética , Oryza/crecimiento & desarrollo , Filogenia , Reguladores del Crecimiento de las Plantas/metabolismo , Proteínas de Plantas/química , Raíces de Plantas/crecimiento & desarrollo , Estructura Terciaria de Proteína , Vitis/genética , Vitis/crecimiento & desarrollo
11.
BMC Plant Biol ; 14: 371, 2014 Dec 19.
Artículo en Inglés | MEDLINE | ID: mdl-25524444

RESUMEN

BACKGROUND: The development of genome-wide association studies (GWAS) in crops has made it possible to mine interesting alleles hidden in gene bank resources. However, only a small fraction of the rice genetic diversity of any given country has been exploited in the studies with worldwide sampling conducted to date. This study presents the development of a panel of rice varieties from Vietnam for GWAS purposes. RESULTS: The panel, initially composed of 270 accessions, was characterized for simple agronomic traits (maturity class, grain shape and endosperm type) commonly used to classify rice varieties. We first genotyped the panel using Diversity Array Technology (DArT) markers. We analyzed the panel structure, identified two subpanels corresponding to the indica and japonica sub-species and selected 182 non-redundant accessions. However, the number of usable DArT markers (241 for an initial library of 6444 clones) was too small for GWAS purposes. Therefore, we characterized the panel of 182 accessions with 25,971 markers using genotyping by sequencing. The same indica and japonica subpanels were identified. The indica subpanel was further divided into six populations (I1 to I6) using a model-based approach. The japonica subpanel, which was more highly differentiated, was divided into 4 populations (J1 to J4), including a temperate type (J2). Passport data and phenotypic traits were used to characterize these populations. Some populations were exclusively composed of glutinous types (I3 and J2). Some of the upland rice varieties appeared to belong to indica populations, which is uncommon in this region of the world. Linkage disequilibrium decayed faster in the indica subpanel (r2 below 0.2 at 101 kb) than in the japonica subpanel (r2 below 0.2 at 425 kb), likely because of the strongest differentiation of the japonica subpanel. A matrix adapted for GWAS was built by eliminating the markers with a minor allele frequency below 5% and imputing the missing data. This matrix contained 21,814 markers. A GWAS was conducted on time to flowering to prove the utility of this panel. CONCLUSIONS: This publicly available panel constitutes an important resource giving access to original allelic diversity. It will be used for GWAS on root and panicle traits.


Asunto(s)
Marcadores Genéticos/genética , Estudio de Asociación del Genoma Completo , Desequilibrio de Ligamiento , Oryza/genética , Polimorfismo de Nucleótido Simple , Mapeo Cromosómico , Datos de Secuencia Molecular , Filogenia , Vietnam
12.
Bioinformatics ; 28(7): 1054-6, 2012 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-22285827

RESUMEN

SUMMARY: We developed a controller that is compliant with the Chado database schema, GBrowse and genome annotation-editing tools such as Artemis and Apollo. It enables the management of public and private data, monitors manual annotation (with controlled vocabularies, structural and functional annotation controls) and stores versions of annotation for all modified features. The Chado controller uses PostgreSQL and Perl. AVAILABILITY: The Chado Controller package is available for download at http://www.gnpannot.org/content/chado-controller and runs on any Unix-like operating system, and documentation is available at http://www.gnpannot.org/content/chado-controller-doc The system can be tested using the GNPAnnot Sandbox at http://www.gnpannot.org/content/gnpannot-sandbox-form CONTACT: valentin.guignon@cirad.fr; stephanie.sidibe-bocs@cirad.fr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Biología Computacional/métodos , Sistemas de Administración de Bases de Datos , Anotación de Secuencia Molecular/métodos , Programas Informáticos , Genómica/métodos , Vocabulario Controlado
13.
Nucleic Acids Res ; 39(Database issue): D1095-102, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-20864446

RESUMEN

GreenPhylDB is a database designed for comparative and functional genomics based on complete genomes. Version 2 now contains sixteen full genomes of members of the plantae kingdom, ranging from algae to angiosperms, automatically clustered into gene families. Gene families are manually annotated and then analyzed phylogenetically in order to elucidate orthologous and paralogous relationships. The database offers various lists of gene families including plant, phylum and species specific gene families. For each gene cluster or gene family, easy access to gene composition, protein domains, publications, external links and orthologous gene predictions is provided. Web interfaces have been further developed to improve the navigation through information related to gene families. New analysis tools are also available, such as a gene family ontology browser that facilitates exploration. GreenPhylDB is a component of the South Green Bioinformatics Platform (http://southgreen.cirad.fr/) and is accessible at http://greenphyl.cirad.fr. It enables comparative genomics in a broad taxonomy context to enhance the understanding of evolutionary processes and thus tends to speed up gene discovery.


Asunto(s)
Bases de Datos Genéticas , Genoma de Planta , Genes de Plantas , Genómica , Anotación de Secuencia Molecular , Filogenia , Proteínas de Plantas/química , Proteínas de Plantas/genética , Plantas/clasificación , Plantas/genética , Programas Informáticos
14.
Plant Environ Interact ; 4(5): 229-257, 2023 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-37822730

RESUMEN

Rice is more vulnerable to drought than maize, wheat, and sorghum because its water requirements remain high throughout the rice life cycle. The effects of drought vary depending on the timing, intensity, and duration of the events, as well as on the rice genotype and developmental stage. It can affect all levels of organization, from genes to the cells, tissues, and/or organs. In this study, a moderate water deficit was applied to two contrasting rice genotypes, IAC 25 and CIRAD 409, during their reproductive stage. Multi-level transcriptomic, metabolomic, physiological, and morphological analyses were performed to investigate the complex traits involved in their response to drought. Weighted gene network correlation analysis was used to identify the specific molecular mechanisms regulated by each genotype, and the correlations between gene networks and phenotypic traits. A holistic analysis of all the data provided a deeper understanding of the specific mechanisms regulated by each genotype, and enabled the identification of gene markers. Under non-limiting water conditions, CIRAD 409 had a denser shoot, but shoot growth was slower despite better photosynthetic performance. Under water deficit, CIRAD 409 was weakly affected regardless of the plant level analyzed. In contrast, IAC 25 had reduced growth and reproductive development. It regulated transcriptomic and metabolic activities at a high level, and activated a complex gene regulatory network involved in growth-limiting processes. By comparing two contrasting genotypes, the present study identified the regulation of some fundamental processes and gene markers, that drive rice development, and influence its response to water deficit, in particular, the importance of the biosynthetic and regulatory pathways for cell wall metabolism. These key processes determine the biological and mechanical properties of the cell wall and thus influence plant development, organ expansion, and turgor maintenance under water deficit. Our results also question the genericity of the antagonism between morphogenesis and organogenesis observed in the two genotypes.

15.
Plant Biotechnol J ; 10(5): 555-68, 2012 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-22369597

RESUMEN

We report here the molecular and phenotypic features of a library of 31,562 insertion lines generated in the model japonica cultivar Nipponbare of rice (Oryza sativa L.), called Oryza Tag Line (OTL). Sixteen thousand eight hundred and fourteen T-DNA and 12,410 Tos17 discrete insertion sites have been characterized in these lines. We estimate that 8686 predicted gene intervals--i.e. one-fourth to one-fifth of the estimated rice nontransposable element gene complement--are interrupted by sequence-indexed T-DNA (6563 genes) and/or Tos17 (2755 genes) inserts. Six hundred and forty-three genes are interrupted by both T-DNA and Tos17 inserts. High quality of the sequence indexation of the T2 seed samples was ascertained by several approaches. Field evaluation under agronomic conditions of 27,832 OTL has revealed that 18.2% exhibit at least one morphophysiological alteration in the T1 progeny plants. Screening 10,000 lines for altered response to inoculation by the fungal pathogen Magnaporthe oryzae allowed to observe 71 lines (0.7%) developing spontaneous lesions simulating disease mutants and 43 lines (0.4%) exhibiting an enhanced disease resistance or susceptibility. We show here that at least 3.5% (four of 114) of these alterations are tagged by the mutagens. The presence of allelic series of sequence-indexed mutations in a gene among OTL that exhibit a convergent phenotype clearly increases the chance of establishing a linkage between alterations and inserts. This convergence approach is illustrated by the identification of the rice ortholog of AtPHO2, the disruption of which causes a lesion-mimic phenotype owing to an over-accumulation of phosphate, in nine lines bearing allelic insertions.


Asunto(s)
ADN Bacteriano , Biblioteca de Genes , Mutagénesis Insercional , Oryza/genética , ADN de Plantas/genética , Genes de Plantas , Magnaporthe/patogenicidad , Fenotipo , Enfermedades de las Plantas/genética , Plásmidos , Análisis de Secuencia de ADN , Transformación Genética
16.
Database (Oxford) ; 20222022 07 14.
Artículo en Inglés | MEDLINE | ID: mdl-35849014

RESUMEN

In silico chromosome painting is a technique by which contributions of distinct genetic groups are represented along chromosomes of hybrid individuals. This type of analysis is used to study the mechanisms by which these individuals were formed. Such techniques are well adapted to identify genetic groups contributing to these individuals as well as hybridization events. It can also be used to follow chromosomal recombinations that occurred naturally or were generated by selective breeding. Here, we present GeMo, a novel interactive web-based and user-oriented interface to visualize in a linear-based fashion results of in silico chromosome painting. To facilitate data input generation, a script to execute analytical commands is provided and an interactive data curation mode is supported to ensure consistency of the automated procedure. GeMo contains preloaded datasets from published studies on crop domestication but can be applied to other purposes, such as breeding programs Although only applied so far on plants, GeMo can handle data from animals as well. Database URL: https://gemo.southgreen.fr/.


Asunto(s)
Curaduría de Datos , Interfaz Usuario-Computador , Animales , Cromosomas , Bases de Datos Factuales , Internet
17.
Gigascience ; 122022 12 28.
Artículo en Inglés | MEDLINE | ID: mdl-36807539

RESUMEN

BACKGROUND: Musa beccarii (Musaceae) is a banana species native to Borneo, sometimes grown as an ornamental plant. The basic chromosome number of Musa species is x = 7, 10, or 11; however, M. beccarii has a basic chromosome number of x = 9 (2n = 2x = 18), which is the same basic chromosome number of species in the sister genera Ensete and Musella. Musa beccarii is in the section Callimusa, which is sister to the section Musa. We generated a high-quality chromosome-scale genome assembly of M. beccarii to better understand the evolution and diversity of genomes within the family Musaceae. FINDINGS: The M. beccarii genome was assembled by long-read and Hi-C sequencing, and genes were annotated using both long Iso-seq and short RNA-seq reads. The size of M. beccarii was the largest among all known Musaceae assemblies (∼570 Mbp) due to the expansion of transposable elements and increased 45S ribosomal DNA sites. By synteny analysis, we detected extensive genome-wide chromosome fusions and fissions between M. beccarii and the other Musa and Ensete species, far beyond those expected from differences in chromosome number. Within Musaceae, M. beccarii showed a reduced number of terpenoid synthase genes, which are related to chemical defense, and enrichment in lipid metabolism genes linked to the physical defense of the cell wall. Furthermore, type III polyketide synthase was the most abundant biosynthetic gene cluster (BGC) in M. beccarii. BGCs were not conserved in Musaceae genomes. CONCLUSIONS: The genome assembly of M. beccarii is the first chromosome-scale genome assembly in the Callimusa section in Musa, which provides an important genetic resource that aids our understanding of the evolution of Musaceae genomes and enhances our knowledge of the pangenome.


Asunto(s)
Musa , Musaceae , Musa/genética , Musaceae/genética , Genoma de Planta , Cromosomas , ADN Ribosómico , Filogenia
18.
Gigascience ; 112022 04 30.
Artículo en Inglés | MEDLINE | ID: mdl-35488861

RESUMEN

BACKGROUND: Ensete glaucum (2n = 2x = 18) is a giant herbaceous monocotyledonous plant in the small Musaceae family along with banana (Musa). A high-quality reference genome sequence assembly of E. glaucum is a resource for functional and evolutionary studies of Ensete, Musaceae, and the Zingiberales. FINDINGS: Using Oxford Nanopore Technologies, chromosome conformation capture (Hi-C), Illumina and RNA survey sequence, supported by molecular cytogenetics, we report a high-quality 481.5 Mb genome assembly with 9 pseudo-chromosomes and 36,836 genes. A total of 55% of the genome is composed of repetitive sequences with predominantly LTR-retroelements (37%) and DNA transposons (7%). The single 5S ribosomal DNA locus had an exceptionally long monomer length of 1,056 bp, more than twice that of the monomers at multiple loci in Musa. A tandemly repeated satellite (1.1% of the genome, with no similar sequence in Musa) was present around all centromeres, together with a few copies of a long interspersed nuclear element (LINE) retroelement. The assembly enabled us to characterize in detail the chromosomal rearrangements occurring between E. glaucum and the x = 11 species of Musa. One E. glaucum chromosome has the same gene content as Musa acuminata, while others show multiple, complex, but clearly defined evolutionary rearrangements in the change between x= 9 and 11. CONCLUSIONS: The advance towards a Musaceae pangenome including E. glaucum, tolerant of extreme environments, makes a complete set of gene alleles, copy number variation, and a reference for structural variation available for crop breeding and understanding environmental responses. The chromosome-scale genome assembly shows the nature of chromosomal fusion and translocation events during speciation, and features of rapid repetitive DNA change in terms of copy number, sequence, and genomic location, critical to understanding its role in diversity and evolution.


Asunto(s)
Musa , Musaceae , Cromosomas , Variaciones en el Número de Copia de ADN , Elementos Transponibles de ADN , Musa/genética , Musaceae/genética , Fitomejoramiento , Retroelementos , Análisis de Secuencia de ADN
19.
Plant Commun ; 3(5): 100330, 2022 09 12.
Artículo en Inglés | MEDLINE | ID: mdl-35617961

RESUMEN

Vanilla planifolia, the species cultivated to produce one of the world's most popular flavors, is highly prone to partial genome endoreplication, which leads to highly unbalanced DNA content in cells. We report here the first molecular evidence of partial endoreplication at the chromosome scale by the assembly and annotation of an accurate haplotype-phased genome of V. planifolia. Cytogenetic data demonstrated that the diploid genome size is 4.09 Gb, with 16 chromosome pairs, although aneuploid cells are frequently observed. Using PacBio HiFi and optical mapping, we assembled and phased a diploid genome of 3.4 Gb with a scaffold N50 of 1.2 Mb and 59 128 predicted protein-coding genes. The atypical k-mer frequencies and the uneven sequencing depth observed agreed with our expectation of unbalanced genome representation. Sixty-seven percent of the genes were scattered over only 30% of the genome, putatively linking gene-rich regions and the endoreplication phenomenon. By contrast, low-coverage regions (non-endoreplicated) were rich in repeated elements but also contained 33% of the annotated genes. Furthermore, this assembly showed distinct haplotype-specific sequencing depth variation patterns, suggesting complex molecular regulation of endoreplication along the chromosomes. This high-quality, anchored assembly represents 83% of the estimated V. planifolia genome. It provides a significant step toward the elucidation of this complex genome. To support post-genomics efforts, we developed the Vanilla Genome Hub, a user-friendly integrated web portal that enables centralized access to high-throughput genomic and other omics data and interoperable use of bioinformatics tools.


Asunto(s)
Vanilla , Cromosomas , Endorreduplicación , Tamaño del Genoma , Haplotipos , Vanilla/genética
20.
Hortic Res ; 9: uhac221, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36479579

RESUMEN

The Banana Genome Hub provides centralized access for genome assemblies, annotations, and the extensive related omics resources available for bananas and banana relatives. A series of tools and unique interfaces are implemented to harness the potential of genomics in bananas, leveraging the power of comparative analysis, while recognizing the differences between datasets. Besides effective genomic tools like BLAST and the JBrowse genome browser, additional interfaces enable advanced gene search and gene family analyses including multiple alignments and phylogenies. A synteny viewer enables the comparison of genome structures between chromosome-scale assemblies. Interfaces for differential expression analyses, metabolic pathways and GO enrichment were also added. A catalogue of variants spanning the banana diversity is made available for exploration, filtering, and export to a wide variety of software. Furthermore, we implemented new ways to graphically explore gene presence-absence in pangenomes as well as genome ancestry mosaics for cultivated bananas. Besides, to guide the community in future sequencing efforts, we provide recommendations for nomenclature of locus tags and a curated list of public genomic resources (assemblies, resequencing, high density genotyping) and upcoming resources-planned, ongoing or not yet public. The Banana Genome Hub aims at supporting the banana scientific community for basic, translational, and applied research and can be accessed at https://banana-genome-hub.southgreen.fr.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA